len(_raw) vs |dbinspect rawSize

Hoekb03
Explorer

I use a simple query to determine how much data I've sent to Splunk:

index=x
|eval esize=len(_raw)
|timechart sum(esize) span=1h

This is pretty expensive when run over long time ranges. I also tried this:

|dbinspect index=x
|eval date=strftime(startEpoch,"%F")
|chart sum(rawSize) over date
|rename sum(*) AS *

The results are different: dbinspect reports lower values than len(_raw).

Any ideas on a cheap way to get the right results?

FrankVl
Ultra Champion

I usually get that sort of info from the license usage events in _internal.

For example:

index="_internal" source="*license_usage.log" type=Usage 
| bin _time span=1d 
| stats sum(b) AS bytes by _time,idx 
| eval DailyGB=bytes/1024/1024/1024 
| timechart sum(DailyGB) by idx span=1d
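
A variant closer to the hourly chart in the question might look like the sketch below. It assumes the index is literally named x, as in the question, and that the search runs where the license usage events in _internal are searchable. The field names b and idx are the same ones used in the query above; the MB column is added only for readability.

```
index="_internal" source="*license_usage.log" type=Usage idx="x"
| timechart span=1h sum(b) AS bytes
| eval MB=round(bytes/1024/1024, 2)
```

This only reaches back as far as _internal is retained (30 days by default), so longer time ranges would need a longer retention period or a summary of these events.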