Getting Data In

Why Cluster Peer (Indexer) takes a long time to start splunkweb when Cluster Master is down?

Splunk Employee
Splunk Employee

Why Cluster Peer (Indexer) takes long time to start splunkweb when Cluster Master is down

In my test environment, I was cleaning up all data in Indexer Clustering Peers.
Because there is no splunk utility to clean up all index db in Indexer Clustering environment in the current version ( v6.2.2 ), I did the following steps.

Version: 6.2.2
CM: Cluster Master
CP: Cluster Peer

  1. Stop CM
  2. Stop CPs
  3. Run "splunk clean eventdata -f" in the all CPs
  4. Start CPs

    • Each CP took over five minutes to return a command prompt.

    • What it took time is at "Waiting for web server at to be available...."

    • Sometimes it ended up with splunkweb timeout "WARNING: web interface does not seem to be available! "

  5. Start CM

    • CM shows CPs are okay and I can login to Splnkweb of the CPs

If CM is up, starting a CP does not get does not take time at "Waiting for web server at to be available...." and starts within five sec.
It seems like splunkweb got stuck until CP is connected to CM.

0 Karma

Splunk Employee
Splunk Employee

So the best practices here are not to stop the cluster master, but to put it into maintenance mode. This will prevent bucket fixup and cluster rebalancing processes from being run.

After this, you can stop the individual peers, clean the indexes and restart them. Once this is completed across the cluster, you should take the master out of maintenance mode.

When the peers start, they are attempting to reach out to the cluster master, register, and get peer information in order to meet the CLuster's Search and Replication factors. So yes, with out the cluster master up and running, it will take time for the peers to start.

During the start process, you can monitor the splunkd.log file (in $splunk_home$/var/log/splunk/splunkd.log) and see its connection attempts to the Cluster Master.

Splunk Employee
Splunk Employee

Thanks for your advice.
I wanted to make sure when CM is down this slow start-up happens as expected.

0 Karma
Don’t Miss Global Splunk
User Groups Week!

Free LIVE events worldwide 2/8-2/12
Connect, learn, and collect rad prizes
and swag!