Splunk Search

How to exclude data that is already duplicated?

Hong_TP
Engager

Hi,

I have some data that I need to deduplicate.

I pull data from two databases and store it in different indexes. I use the following SPL to merge the data:

index="data1" sourcetype="data1" | append [search index="data2" sourcetype="data2"]
| rename data1DATA as 1data
| eval dataall=coalesce('1data','2data')
| table dataall sourcetype

This gives results like the following:

dataall      sourcetype
------       ----------
abc,1        data1
abc,1        data2
def,2        data1
abc,3        data2

Now I need to compare the data and exclude any values that are duplicated. The result should look like this:

dataall      sourcetype
------       ----------
def,2        data1
abc,3        data2

Any suggestions?

Greetings and thanks!

1 Solution

ITWhisperer
SplunkTrust
| eventstats count by dataall
| where count == 1
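
To spell out why this works: eventstats count by dataall adds a count field to every row, holding the number of rows that share the same dataall value, and where count == 1 then keeps only the values that occur exactly once. Below is a minimal sketch that can be run on its own, assuming the four sample rows from the question are generated with makeresults. The raw field and the ";" and "|" delimiters are hypothetical helpers used only to build the sample data; they are not part of the original search.

```spl
| makeresults
| eval raw="abc,1|data1;abc,1|data2;def,2|data1;abc,3|data2"
| makemv delim=";" raw
| mvexpand raw
| eval dataall=mvindex(split(raw,"|"),0), sourcetype=mvindex(split(raw,"|"),1)
| table dataall sourcetype
| eventstats count by dataall
| where count == 1
| fields - count
```

With these sample rows, only def,2 (data1) and abc,3 (data2) should remain. In the original search, the eventstats and where lines would go after the table command. Note that this also drops a value that appears twice within the same sourcetype; if only duplicates across the two sources should be excluded, something like | eventstats dc(sourcetype) as sources by dataall | where sources == 1 may be closer to the intent.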
