Why do we see large variations of the data age for a certain index?

danielbb
Motivator

We see the following for one index in the cluster master:

[Screenshot not preserved]

Why do we see these fluctuations for the data age among the eight indexers?


solarboyz1
Builder

Buckets get rolled to frozen based on the youngest (most recent) event in the bucket.

Buckets can span minutes, hours, days, etc., depending on how much data is received into the index and on your index configuration.
Additionally, if you turn up a new data source and it starts reading in old logs, those old events will be mixed in with your new events.
Finally, if the clock settings on the source systems are incorrect, events can end up with timestamps in the past or in the future.

Long story short, when old events get mixed into a bucket with newer events, the old events have to wait until ALL events in the bucket age out.
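
One way to spot late-arriving or mis-timestamped data is to compare each event's timestamp with the time it was indexed. A rough sketch, assuming the default _time and _indextime fields; yourindex is a placeholder and lagSeconds, minLag and maxLag are just illustrative field names:

```spl
index=yourindex
| eval lagSeconds = _indextime - _time
| stats count min(lagSeconds) as minLag max(lagSeconds) as maxLag by host sourcetype
| sort - maxLag
```

A large positive maxLag suggests old logs were read in long after they were written. A negative minLag suggests events stamped in the future, which usually points at a clock or timestamp-parsing problem.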

danielbb
Motivator

Makes sense - how can I check these buckets?

solarboyz1
Builder
| dbinspect index=yourindex

It will show the startEpoch and endEpoch for each bucket, among other things.
https://docs.splunk.com/Documentation/Splunk/7.3.1/SearchReference/Dbinspect
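
To compare the indexers side by side, the dbinspect output can be summarized per splunk_server. This is only a sketch: startEpoch, endEpoch and splunk_server come from dbinspect, while spanDays, buckets, oldestEvent and widestSpanDays are names made up for this example:

```spl
| dbinspect index=yourindex
| eval spanDays = round((endEpoch - startEpoch) / 86400, 1)
| stats count as buckets min(startEpoch) as oldestEvent max(spanDays) as widestSpanDays by splunk_server
| eval oldestEvent = strftime(oldestEvent, "%Y-%m-%d %H:%M:%S")
| sort - widestSpanDays
```

An indexer with a much older oldestEvent or a much larger widestSpanDays than its peers is probably holding a bucket where old and new events got mixed.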

If you want to drill into the events in a specific bucket:

index=yourindex | where _bkt="summary~22~4F582768-7B38-4768-95EA-EC3D491A8A23"

Where _bkt matches the bucketId from dbinspect results.

https://docs.splunk.com/Documentation/Splunk/7.3.1/Knowledge/Usedefaultfields
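
If you would rather see the time range of every bucket from the event side, something like the following should work. It is a sketch that assumes _bkt and splunk_server are available on the events; bucket, earliest, latest and spanDays are illustrative names, and the search can be expensive on a large index, so narrow the time range first:

```spl
index=yourindex
| eval bucket = _bkt
| stats count min(_time) as earliest max(_time) as latest by bucket splunk_server
| eval spanDays = round((latest - earliest) / 86400, 1)
| eval earliest = strftime(earliest, "%Y-%m-%d %H:%M:%S"), latest = strftime(latest, "%Y-%m-%d %H:%M:%S")
| sort - spanDays
```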
