Getting Data In

How do you prevent Splunk from indexing duplicated events?

djime
New Member

How do you prevent Splunk from indexing duplicate events forwarded from different forwarders? The monitored log files record the same events, but on different servers. The duplication is required to keep the monitored events available even when one of the servers is powered off.

Thank you.

1 Solution

gjanders
SplunkTrust
SplunkTrust

Effectively, no: universal forwarders are not aware of other universal forwarders.
In fact, Splunk Enterprise instances are not aware of each other, so each heavy forwarder would also be standalone.

Therefore you would have to build a script, or find some other way to only monitor the file when that instance should be the one running it... (or use another trick)

At the Splunk indexing tier it is also impossible to de-duplicate data on the way in, at least up to 7.2.x so far.
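One way to sketch the "only monitor on the active server" idea (this is not a built-in feature): put the same monitor stanza on both servers, but keep it disabled on the standby, and flip it via a deployment app or a failover script. A hypothetical inputs.conf, with placeholder path, index, and sourcetype:

```ini
# Hypothetical example: identical stanza on both servers.
# Only the currently active server has disabled = false;
# a failover script or deployment server flips this value.
[monitor:///var/log/app/app.log]
sourcetype = app_log
index = main
disabled = true
```

The trade-off is that your failover mechanism now decides which copy gets indexed, so a slow or failed flip can still lose (or duplicate) events around the switchover.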



djime
New Member

Ok, thank you for the help


gjanders
SplunkTrust
SplunkTrust

Please click "Accept" on the answer so this question is marked as answered when you are ready (feel free to wait for more answers)... thanks!


rashi83
Path Finder

@gjanders - Can we do some config change on forwarder end to stop sending duplicate data?


gjanders
SplunkTrust
SplunkTrust

@rashi83 it would depend on what is causing it! The UF does not de-duplicate data, so if multiple monitored files have some level of duplicate content, you may get duplicates in Splunk...

If you monitor unique files on the UF, you should not see duplicates in Splunk, outside of performance issues and the useACK setting...
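For reference, the useACK setting mentioned above lives in outputs.conf on the forwarder. With indexer acknowledgement enabled, a chunk that was actually indexed but never acknowledged (e.g. the connection dropped first) gets resent, which is one way duplicates can appear even from a single forwarder. A minimal sketch, with placeholder host names:

```ini
# outputs.conf on the universal forwarder (hypothetical indexers)
[tcpout]
defaultGroup = primary_indexers

[tcpout:primary_indexers]
server = idx1.example.com:9997, idx2.example.com:9997
# Indexer acknowledgement: protects against data loss, but
# unacknowledged chunks are resent, which can duplicate events
# that were indexed just before a connection failure.
useACK = true
```

So useACK trades a small chance of duplicates for protection against silent data loss; with it off, the failure mode is loss rather than duplication.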


richgalloway
SplunkTrust
SplunkTrust

To prevent data loss, you probably want to index the duplicate events and remove the duplicates at search time.
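A sketch of that search-time approach, assuming the two copies of each event have identical raw text and differ only in host metadata (index and sourcetype here are placeholders):

```spl
index=app_logs sourcetype=app_log
| dedup _raw
```

Because the copies arrive from different hosts, dedup on `_raw` (optionally together with `_time`) rather than on any host-specific field, or the duplicates will survive.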

---
If this reply helps you, Karma would be appreciated.

djime
New Member

Thank you, but the goal is to not index the duplicated events in the first place. Any other ideas?
