Knowledge Management

How to find out size of sistats based summaries

knielsen
Contributor

Hello,

We're having about 200 daily summaries, and about a third of them are sistats based.

For resizing efforts, I wanted to get an overview of how much data each summary consumes on a daily base.

Naively, I thought this would give me the result:

index=summary_1d | stats sum(eval(len(_raw))) as SizeMB by source | eval SizeMB=round(SizeMB/1024/1024,2) | sort - SizeMB | addcoltotals

This only works for the summaries that were not created with the si- commands though. Due to the nature of sistats, even this will give no answer for a single source:

index=summary_1d source="sistats 1d foobar" | eval bytes=len(_raw)  | stats sum(bytes)

While this would work:

index=summary_1d source="sistats 1d foobar" | eval bytes=len(_raw)  | table bytes | stats sum(bytes)

I don't want to do this manually per source and then bring the results together though. Can anyone think of a more convenient way to get the result for all sources in one go?

Thanks,
-Kai.

0 Karma
1 Solution

knielsen
Contributor

I figured it out myself. I can use the "table trick" for everything, put the source in the table as well, and do stats on that. Surely not the most efficient way, but it works to get the overview I wanted.

index=summary_1d | table source _raw | eval bytes=len(_raw) | stats sum(bytes) as SizeMB by source | eval SizeMB=round(SizeMB/1024/1024,2) | sort - SizeMB | addcoltotals

If there is a better solution, feel free to add that as well... 🙂

View solution in original post

0 Karma

knielsen
Contributor

I figured it out myself. I can use the "table trick" for everything, put the source in the table as well, and do stats on that. Surely not the most efficient way, but it works to get the overview I wanted.

index=summary_1d | table source _raw | eval bytes=len(_raw) | stats sum(bytes) as SizeMB by source | eval SizeMB=round(SizeMB/1024/1024,2) | sort - SizeMB | addcoltotals

If there is a better solution, feel free to add that as well... 🙂

0 Karma
Get Updates on the Splunk Community!

Routing logs with Splunk OTel Collector for Kubernetes

The Splunk Distribution of the OpenTelemetry (OTel) Collector is a product that provides a way to ingest ...

Welcome to the Splunk Community!

(view in My Videos) We're so glad you're here! The Splunk Community is place to connect, learn, give back, and ...

Tech Talk | Elevating Digital Service Excellence: The Synergy of Splunk RUM & APM

Elevating Digital Service Excellence: The Synergy of Real User Monitoring and Application Performance ...