Splunk Search

How to exclude data that is already duplicated?

Hong_TP
Engager

Hi ,

I have somthing data need to deduplicate.

I got some data from two database and save in different indexes . I use the following SPL to merge the data as

 

 

 

index="data1"  sourcetype="data1" | append [search index="data2" sourcetype="data2"]
|rename data1DATA as 1data
|eval dataall=coalesce(1data,2data)
|table dataall sourcetype

 

 

 

and I got results like this

 

 

 

dataall      sourcetype
------       ----------
abc,1        data1
abc,1        data2
def,2        data1
abc,3        data2

 

 

 

Now, I need to compare the data and exclude duplicate data . The result is like the following

 

 

 

dataall      sourcetype
------       ----------
def,2        data1
dbc,3        data2

 

 

 

Any suggestions ?

Greetings and thanks!

Tags (3)
0 Karma
1 Solution

ITWhisperer
SplunkTrust
SplunkTrust
| eventstats count by dataall
| where count == 1

View solution in original post

0 Karma

ITWhisperer
SplunkTrust
SplunkTrust
| eventstats count by dataall
| where count == 1
0 Karma
Got questions? Get answers!

Join the Splunk Community Slack to learn, troubleshoot, and make connections with fellow Splunk practitioners in real time!

Meet up IRL or virtually!

Join Splunk User Groups to connect and learn in-person by region or remotely by topic or industry.

Get Updates on the Splunk Community!

Announcing Modern Navigation: A New Era of Splunk User Experience

We are excited to introduce the Modern Navigation feature in the Splunk Platform, available to both cloud and ...

Modernize your Splunk Apps – Introducing Python 3.13 in Splunk

We are excited to announce that the upcoming releases of Splunk Enterprise 10.2.x and Splunk Cloud Platform ...

Step into “Hunt the Insider: An Splunk ES Premier Mystery” to catch a cybercriminal ...

After a whole week of being on call, you fell asleep on your keyboard, and you hit a sequence of buttons that ...