Knowledge Management

Why do we see large variations of the data age for a certain index?

danielbb
Motivator

We see the following for one index in the cluster master -

alt text

Why do we see these fluctuations for the data age among the eight indexers?

Tags (1)
0 Karma

solarboyz1
Builder

Buckets get rolled based on the youngest event in the bucket.

Buckets can span minutes, hours, days, etc.. depending on how much data is received into the index and your index configuration.
Additionally, if you turn up a new datasource and its starts reading in old logs. Those old events will now be mixed with your new events.
Finally, if the clock settings on the systems are incorrect...this can lead to events getting old or future dates.

Long story short, when old events get mixed in a bucket with newer events, the old events have to wait till ALL events in the bucket age out.

danielbb
Motivator

Makes sense - how can I check these buckets?

0 Karma

solarboyz1
Builder
|  dbinspect index=yourindex 

It will show the startEpoch and endEpoch for each bucket, among other things.
https://docs.splunk.com/Documentation/Splunk/7.3.1/SearchReference/Dbinspect

If you want to drill into the events in a specific bucket:

index=yourindex | where _bkt="summary~22~4F582768-7B38-4768-95EA-EC3D491A8A23"

Where _bkt matches the bucketId from dbinspect results.

https://docs.splunk.com/Documentation/Splunk/7.3.1/Knowledge/Usedefaultfields

Get Updates on the Splunk Community!

Introducing the Splunk Community Dashboard Challenge!

Welcome to Splunk Community Dashboard Challenge! This is your chance to showcase your skills in creating ...

Get the T-shirt to Prove You Survived Splunk University Bootcamp

As if Splunk University, in Las Vegas, in-person, with three days of bootcamps and labs weren’t enough, now ...

Wondering How to Build Resiliency in the Cloud?

IT leaders are choosing Splunk Cloud as an ideal cloud transformation platform to drive business resilience,  ...