I have a DNS entry set up for my 12 indexers. Recently I noticed a large consumer was throwing my traffic balance out of whack. I did some checking around metrics and see that all his servers (and some others to boot) are only cycling through 3 of the servers in the DNS list. Running host myindexers.private returns the entire list on the Forwarders' systems. But they are sending to only 3 of the returned IPs.
I have 8000 forwarders. About 3% of them are impacted by this weirdness. I'm still trying to narrow down what the commonalities are. It is always exactly 3 out of the 12.
EDIT: with a few dozen different versions in play, interesting that only 6.6.0 - 6.6.3 is showing this. (I don't have anything newer than 6.6.3 in service.)
EDIT2: Confirmed, no other versions are experiencing the indexer drop. Only 6.6.1 and 6.6.3. Turns out I have no 6.6.0 nor 6.6.2.
EDIT3: There appears to be an issue with 6.6.1 and 6.6.3, at least, where the longer the forwarder runs, the fewer indexers it will even attempt to talk to. I can reliably reproduce this issue with fresh, very basic, minimalistic installs. I cannot reproduce on versions before 6.6.x. This is true whether i list all indexers in outputs.conf, or use a DNS A list. Ticket open, diags exchanged, waiting...
... View more