According to this blog post: http://blogs.splunk.com/2014/03/18/time-based-load-balancing/
Using this setting Splunk can break
the data stream and reconstruct the
event properly on the indexer.
That's pretty awesome. How exactly does this work when autoLB breaks the event and sends each segment to a different indexer? How do the indexers figure out where the other portion of the event is for reconstruction?
The forwarder sends 64KB chunks of data to the indexer (no matter the events on it). When its forced to send data to a second indexer, it resends the last 64KB chunk again to the second indexer. So:
1 - The first indexer it will index everything before the last event break and ignore the rest (to avoid indexing an incomplete event).
2 - The second indexer will index the data after the last event break which will be complete by the following 64KB chunks data.
This works ok if the events are smaller that 64KB.
We're seeing broken events so I'm not sure if it's working as expected.
whats the size of your events? can you post a sample?
I'm also interested in the behavior of this setting. Any update for us?