Getting Data In

Why Cluster Peer (Indexer) takes a long time to start splunkweb when Cluster Master is down?

Masa
Splunk Employee
Splunk Employee

Why Cluster Peer (Indexer) takes long time to start splunkweb when Cluster Master is down

In my test environment, I was cleaning up all data in Indexer Clustering Peers.
Because there is no splunk utility to clean up all index db in Indexer Clustering environment in the current version ( v6.2.2 ), I did the following steps.

Version: 6.2.2
CM: Cluster Master
CP: Cluster Peer

  1. Stop CM
  2. Stop CPs
  3. Run "splunk clean eventdata -f" in the all CPs
  4. Start CPs


    • Each CP took over five minutes to return a command prompt.

    • What it took time is at "Waiting for web server at http://127.0.0.1:55110 to be available...."

    • Sometimes it ended up with splunkweb timeout "WARNING: web interface does not seem to be available! "

  5. Start CM

    • CM shows CPs are okay and I can login to Splnkweb of the CPs

If CM is up, starting a CP does not get does not take time at "Waiting for web server at http://127.0.0.1:55110 to be available...." and starts within five sec.
It seems like splunkweb got stuck until CP is connected to CM.

0 Karma

esix_splunk
Splunk Employee
Splunk Employee

So the best practices here are not to stop the cluster master, but to put it into maintenance mode. This will prevent bucket fixup and cluster rebalancing processes from being run.

After this, you can stop the individual peers, clean the indexes and restart them. Once this is completed across the cluster, you should take the master out of maintenance mode.

When the peers start, they are attempting to reach out to the cluster master, register, and get peer information in order to meet the CLuster's Search and Replication factors. So yes, with out the cluster master up and running, it will take time for the peers to start.

During the start process, you can monitor the splunkd.log file (in $splunk_home$/var/log/splunk/splunkd.log) and see its connection attempts to the Cluster Master.

Masa
Splunk Employee
Splunk Employee

Thanks for your advice.
I wanted to make sure when CM is down this slow start-up happens as expected.

0 Karma
Get Updates on the Splunk Community!

Announcing Scheduled Export GA for Dashboard Studio

We're excited to announce the general availability of Scheduled Export for Dashboard Studio. Starting in ...

Extending Observability Content to Splunk Cloud

Watch Now!   In this Extending Observability Content to Splunk Cloud Tech Talk, you'll see how to leverage ...

More Control Over Your Monitoring Costs with Archived Metrics GA in US-AWS!

What if there was a way you could keep all the metrics data you need while saving on storage costs?This is now ...