Getting Data In

Indexed data twice! Suggestions to remove data from being searched?

ben_leung
Builder

Let's say we have forwarded events that are exactly the same and show up in Splunk as duplicates. Running | dedup _raw resolves the duplicate events at search time. Would it make sense to run index=main | dedup _raw | delete so that we don't have to run a dedup every single time we search that time range of events?

the_wolverine
Champion

I wouldn't advise scheduling a delete. For one, delete is expensive to run. Second, it is potentially dangerous, since you may wind up deleting something by accident. Third, it is better to fix the cause of the duplicate events instead.

ben_leung
Builder

The reason I ask is that | delete removes the events returned by the preceding search. I would assume it would "delete" the duplicate AND the original events. Does anyone know how it behaves in this scenario?
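
To reason about this scenario, here is a rough Python model of the pipeline, not Splunk itself. It assumes that dedup keeps the first event returned for each distinct _raw value and that delete hides exactly the events the search hands to it. The event dictionaries, the id field, and the helper names dedup_raw and delete are hypothetical, made up for illustration; whether Splunk accepts delete after dedup in a given version should be checked on a test index first.

```python
from collections import Counter


def dedup_raw(events):
    """Model of `| dedup _raw`: keep the first event seen per distinct _raw."""
    seen = set()
    kept = []
    for event in events:
        if event["_raw"] not in seen:
            seen.add(event["_raw"])
            kept.append(event)
    return kept


def delete(index, results):
    """Model of `| delete`: drop from the index every event the search returned."""
    doomed = {event["id"] for event in results}
    return [event for event in index if event["id"] not in doomed]


# Hypothetical index: events A and B were indexed twice, C only once.
index = [
    {"id": 1, "_raw": "A"},
    {"id": 2, "_raw": "A"},
    {"id": 3, "_raw": "B"},
    {"id": 4, "_raw": "B"},
    {"id": 5, "_raw": "C"},
]

# What `index=main | dedup _raw | delete` would leave behind under this model.
remaining = delete(index, dedup_raw(index))
copies_left = Counter(event["_raw"] for event in remaining)
```

Under these assumptions the pipeline removes the one copy of each event that dedup kept, not the extra copies. Events indexed exactly twice end up with a single copy, but an event that was never duplicated (C here) disappears entirely, and an event indexed three times would still have two copies left.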
