Hello all,
We are trying to build new indexer cluster with new cluster master. We installed splunk on all the servers and integrated indexers with the cluster master. After all the process we are getting search and replication factor errors with below warning messages. We check all the ports connectivity between all the indexers and the cluster master everything is connected but we still getting this warning. We tried cleaning up the eventdata as suggested in one of the posts but that did not work either. Please let me know if anyone faced this type of issue and resolved it that would be very helpful. Let me know if you need any more info.
We have search and replication factor = 2 with three indexers
Search peer abcd.com has the following message: Too many bucket replication errors to target peer=xx.xx.xx.xx:8080. Will stop streaming data from hot buckets to this target while errors persist. Check for network connectivity from the cluster peer reporting this issue to the replication port of target peer. If this condition persists, you can temporarily put that peer in manual detention.
Thanks.
Hello all,
The problem is the MTU setting on the 1 Gb bonded network interface is set to 9,000 on our new indexes. We changed it to 1500 and that fixed the search and replication factor.
Thanks.
Hello all,
The problem is the MTU setting on the 1 Gb bonded network interface is set to 9,000 on our new indexes. We changed it to 1500 and that fixed the search and replication factor.
Thanks.
@sathwik067 How to check the MTU seeting?
Is it something that needs to be done at splunk end or network end?
Hi @DarshanBK, if you are running Linux, your Linux team can make this change, there is nothing you can do on the Splunk end. To see the MTU value you can run "ifconfig" command and it will give you the output of this MTU value along with some other information about the server and the mounts it has.
Have you checked the connectivity among the individual indexers? Replication is direct from indexer to indexer - not via the CM - so it's critical for an indexer to be able to connect to all other indexers and not just the CM.
Hello,
Thanks for the response. Yes, we have checked the connectivity between the indexers as well and the ports are connected between the indexers.
Hello,
thanks for the response. below are some of the errors we are seeing on the indexers
Waiting 'target_wait_time' before search factor fixup Cannot fix search count as the bucket hasn't rolled yet. |
Search peer abcd.com has the following message: Too many bucket replication errors to target peer=xx.xx.xx.xx:8080. Will stop streaming data from hot buckets to this target while errors persist. Check for network connectivity from the cluster peer reporting this issue to the replication port of target peer. If this condition persists, you can temporarily put that peer in manual detention
It is getting connected
curl -v telnet://xx.xx.xx.xx:8080
* About to connect() to xx.xx.xx.xx port 8080 (#0)
* Trying xx.xx.xx.xx...
* Connected to xx.xx.xx.xx (xx.xx.xx.xx) port 8080 (#0)