Getting Data In

How to check for files that are indexed multiple times

lakshman237
Path Finder

Greetings.,

I have universal forwarders reading files from a common directory, with whitelists. I have reviewed the monitor stanza and couldn't see the likelyhood of samefile being indexed twice.

Is there a way to find same files being indexed more than once from the indexer?

thanks
laks

Tags (2)
0 Karma

kristian_kolb
Ultra Champion

Depending on how unique your messages are, you could always try

...| stats count last(source) AS dup_file by _raw | search count > 1 | table dup_file 

if you get any results, you will likely have indexed the same event/file more than once.
The results will be rather simplistic, but...it might help.

_raw is the whole event, including its timestamp, which should be fairly unique in combination with the rest of the event text.

Please not that it may take some time, since splunk will have to go through all events. Some further qualification (sourcetype, host etc) may be necessary to speed things up.

/k

Get Updates on the Splunk Community!

Adoption of RUM and APM at Splunk

    Unleash the power of Splunk Observability   Watch Now In this can't miss Tech Talk! The Splunk Growth ...

Routing logs with Splunk OTel Collector for Kubernetes

The Splunk Distribution of the OpenTelemetry (OTel) Collector is a product that provides a way to ingest ...

Welcome to the Splunk Community!

(view in My Videos) We're so glad you're here! The Splunk Community is place to connect, learn, give back, and ...