Splunk Search

How does dedup work in splunk ?

nibinabr
Communicator

How does dedup work in splunk ? My concern is about the performance.
If my search is over 500K -1M events out of which 2K events are duplicates, is using dedup going to be expensive ? Or should I find a way way to delete those 2K events and avoid using dedup ?

Can someone give me suggestions on this or direct me to a discussion where I can find the answer to this question.

Tags (2)
0 Karma
1 Solution

musskopf
Builder

It can be expensive, yes, as it needs to save the every unique entry in a temporary place to keep comparing with every following event. To see how expensive it is, just use the Job inspector, it'll show how long each command takes to run.

Also, remember that deleting the record, doesn't actually delete anything, just mark it so won't show up again... but still very handy in your situation as you won't need to re-run dedup every time.

Cheers

View solution in original post

musskopf
Builder

It can be expensive, yes, as it needs to save the every unique entry in a temporary place to keep comparing with every following event. To see how expensive it is, just use the Job inspector, it'll show how long each command takes to run.

Also, remember that deleting the record, doesn't actually delete anything, just mark it so won't show up again... but still very handy in your situation as you won't need to re-run dedup every time.

Cheers

Got questions? Get answers!

Join the Splunk Community Slack to learn, troubleshoot, and make connections with fellow Splunk practitioners in real time!

Meet up IRL or virtually!

Join Splunk User Groups to connect and learn in-person by region or remotely by topic or industry.

Get Updates on the Splunk Community!

Developer Spotlight with Mika Borner

From Hackathon Winner to Enterprise Leader    Mika Borner, CEO and Founder of Datapunctum AG, has been ...

Continue Your Federation Journey: Join Session 3 of the Bootcamp Series

To help practitioners build a stronger foundation, we launched the Data Management & Federation ...

Announcing Modern Navigation: A New Era of Splunk User Experience

We are excited to introduce the Modern Navigation feature in the Splunk Platform, available to both cloud and ...