Getting Data In

len(_raw) vs |dbinspect rawSize

Hoekb03
Explorer

I use a simple query to determine the amount of data I've sent to splunk:

index=x
|eval esize=len(_raw)
|timechart sum(esize) span=1h

This is pretty expensive when ran over long timeranges. I also tried this:

|dbinspect index=x
|eval date=strftime(startEpoch,"%F")
|chart sum(rawSize) over date
|rename sum(*) -> *

The results are different, dbinspect reporting lower values than len(_raw).

Any ideas on a cheap way to get the right results?

0 Karma

FrankVl
Ultra Champion

I usually get that sort of info from the license usage events in _internal.

Eg:

index="_internal" source="*license_usage.log" type=Usage 
| bin _time span=1d 
| stats sum(b) AS bytes by _time,idx 
| eval DailyGB=bytes/1024/1024/1024 
| timechart sum(DailyGB) by idx span=1d
Get Updates on the Splunk Community!

Observability | How to Think About Instrumentation Overhead (White Paper)

Novice observability practitioners are often overly obsessed with performance. They might approach ...

Cloud Platform | Get Resiliency in the Cloud Event (Register Now!)

IDC Report: Enterprises Gain Higher Efficiency and Resiliency With Migration to Cloud  Today many enterprises ...

The Great Resilience Quest: 10th Leaderboard Update

The tenth leaderboard update (11.23-12.05) for The Great Resilience Quest is out >> As our brave ...