Splunk Search

To identify unused/unsearches data in Splunk

rahulhoney
New Member

Is there a way to find unused/unsearched data in Splunk?

Example:
In an Index=XYZ we are ingesting 100GB of data on a daily basis.

Out of that 100 GB when we run queries we are retrieving 60GB of logs and the remaining 40GB never retrieved or never searched upon.
And using this scenario we can send those events to the NULL queue.

0 Karma

somesoni2
Revered Legend

I don't think there is an easy way to do that. It would be easier if you could talk to the consumer of the data (Splunk users who runs those queries) and find out what type of data they are interested in and what's ok to drop. E.g. there could be some healthcheck type of INFO events that they might not use, so you could drop those.

0 Karma

rahulhoney
New Member

Thanks, @somesoni2 but we are trying to figure it out without asking consumer and sender too.

0 Karma

somesoni2
Revered Legend

There is no way in Splunk to track which specific data is being used. Again, data that is not queried till today, doesn't mean that it won't be queried tomorrow (or even 5 mins from now), so it's a risky business removing/filtering data. Your consumers and senders could only give you 100% accurate details about this.

Other thing you could do (again this is not 100% accurate) is to look at current adhoc and scheduled searches running in your environment, look at their search string to find which sourcetype/sources from that indexes are being used. Other sourcetypes can be a candidate for removal.

0 Karma
Got questions? Get answers!

Join the Splunk Community Slack to learn, troubleshoot, and make connections with fellow Splunk practitioners in real time!

Meet up IRL or virtually!

Join Splunk User Groups to connect and learn in-person by region or remotely by topic or industry.

Get Updates on the Splunk Community!

[Puzzles] Solve, Learn, Repeat: Character substitutions with Regular Expressions

This challenge was first posted on Slack #puzzles channelFor BORE at .conf23, we had a puzzle question which ...

Splunk Community Badges!

  Hey everyone! Ready to earn some serious bragging rights in the community? Along with our existing badges ...

[Puzzles] Solve, Learn, Repeat: Matching cron expressions

This puzzle (first published here) is based on matching timestamps to cron expressions.All the timestamps ...