<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How to remove duplicate events in search results without using DEDUP in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/How-to-remove-duplicate-events-in-search-results-without-using/m-p/44368#M8287</link>
    <description>&lt;P&gt;I'm a VERY green splunker, but when I try this command I get this error...&lt;/P&gt;

&lt;P&gt;Error in 'delete' command: This command cannot be invoked after the non-streaming command 'streamstats'.&lt;/P&gt;

&lt;P&gt;Am I missing something? Thanks for your post!&lt;/P&gt;</description>
    <pubDate>Sat, 01 Dec 2012 00:33:55 GMT</pubDate>
    <dc:creator>BStodd</dc:creator>
    <dc:date>2012-12-01T00:33:55Z</dc:date>
    <item>
      <title>How to remove duplicate events in search results without using DEDUP</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-remove-duplicate-events-in-search-results-without-using/m-p/44363#M8282</link>
      <description>&lt;P&gt;I'm using *NIX app 4.6, and for auditd logs I have a duplication problem of events. I also checked the raw logs and they are unique. &lt;BR /&gt;
Is it possible to remove this problem at the source (i.e. with a script or cli) without use the dedup filter in the console at the analysis phase?&lt;/P&gt;</description>
      <pubDate>Sun, 25 Nov 2012 11:27:12 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-remove-duplicate-events-in-search-results-without-using/m-p/44363#M8282</guid>
      <dc:creator>horizonsecurity</dc:creator>
      <dc:date>2012-11-25T11:27:12Z</dc:date>
    </item>
    <item>
      <title>Re: How to remove duplicate events in search results without using DEDUP</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-remove-duplicate-events-in-search-results-without-using/m-p/44364#M8283</link>
      <description>&lt;P&gt;Is the problem that you have only a single copy of an event in the raw log but you have more than one copy in Splunk?  It's a bit unclear from your question.&lt;/P&gt;</description>
      <pubDate>Sun, 25 Nov 2012 14:30:28 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-remove-duplicate-events-in-search-results-without-using/m-p/44364#M8283</guid>
      <dc:creator>sbrant_splunk</dc:creator>
      <dc:date>2012-11-25T14:30:28Z</dc:date>
    </item>
    <item>
      <title>Re: How to remove duplicate events in search results without using DEDUP</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-remove-duplicate-events-in-search-results-without-using/m-p/44365#M8284</link>
      <description>&lt;P&gt;We had an issue with Splunk re-indexing the gzipped versions of the log files. Find a pair of duplicate events and see if the "source" field is the same.&lt;/P&gt;</description>
      <pubDate>Sun, 25 Nov 2012 15:25:15 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-remove-duplicate-events-in-search-results-without-using/m-p/44365#M8284</guid>
      <dc:creator>reed_kelly</dc:creator>
      <dc:date>2012-11-25T15:25:15Z</dc:date>
    </item>
    <item>
      <title>Re: How to remove duplicate events in search results without using DEDUP</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-remove-duplicate-events-in-search-results-without-using/m-p/44366#M8285</link>
      <description>&lt;BLOCKQUOTE&gt;
&lt;P&gt;Is the problem that you have only a single copy of an event in the raw log but you have more than one copy in Splunk?&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;

&lt;P&gt;Yes, this is the issue; 1 raw event &amp;lt;-&amp;gt; 10 splunk console events (more or less)&lt;/P&gt;</description>
      <pubDate>Sun, 25 Nov 2012 17:22:37 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-remove-duplicate-events-in-search-results-without-using/m-p/44366#M8285</guid>
      <dc:creator>horizonsecurity</dc:creator>
      <dc:date>2012-11-25T17:22:37Z</dc:date>
    </item>
    <item>
      <title>Re: How to remove duplicate events in search results without using DEDUP</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-remove-duplicate-events-in-search-results-without-using/m-p/44367#M8286</link>
      <description>&lt;P&gt;Is it still happening ?  I wouldn't bother cleaning up until you've fixed it at source.&lt;/P&gt;

&lt;P&gt;Run this :&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;index=os | streamstats count by _raw _time source sourcetype host | table count _time host source sourcetype _raw
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;This will show you what is duplicated.&lt;/P&gt;

&lt;P&gt;Example :&lt;/P&gt;

&lt;P&gt;If you have 3 identical events like this:&lt;/P&gt;

&lt;P&gt;2012-11-25 13:01:00 This is a message&lt;/P&gt;

&lt;P&gt;You'll see this in the output :&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;Count  ... .. .. ..   _raw
3      ... .. .. ..   2012-11-25 13:01:00 This is a message
2      ... .. .. ..   2012-11-25 13:01:00 This is a message
1      ... .. .. ..   2012-11-25 13:01:00 This is a message
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;From here its pretty easy to delete the duplicates&lt;/P&gt;

&lt;P&gt;But 1st a word from safety pig :&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;    _._ _..._ .-',     _.._('))
   '-. '     '  /-._.-'    ',/
      )         \            '.
     / _    _    |             \
    |  a    a    /              |
    \   .-.                     ;  
     '-('' ).-'       ,'       ;
        '-;           |      .'
           \           \    /
           | 7  .__  _.-\   \
           | |  |  ''/  /'  /
          /,_|  |   /,_/   /
             /,_/      ''-'
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;Backup your index first, then delete from the copy to make sure that this works. There is no 'undo'. Only then run it on your live data.&lt;/P&gt;

&lt;P&gt;This will delete your duplicates provided that "_raw _time source sourcetype host" are the fields that should make an event unique:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;index=os | streamstats count by _raw _time source sourcetype host | where count &amp;gt; 1 | delete
&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Mon, 26 Nov 2012 13:14:17 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-remove-duplicate-events-in-search-results-without-using/m-p/44367#M8286</guid>
      <dc:creator>jonuwz</dc:creator>
      <dc:date>2012-11-26T13:14:17Z</dc:date>
    </item>
    <item>
      <title>Re: How to remove duplicate events in search results without using DEDUP</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-remove-duplicate-events-in-search-results-without-using/m-p/44368#M8287</link>
      <description>&lt;P&gt;I'm a VERY green splunker, but when I try this command I get this error...&lt;/P&gt;

&lt;P&gt;Error in 'delete' command: This command cannot be invoked after the non-streaming command 'streamstats'.&lt;/P&gt;

&lt;P&gt;Am I missing something? Thanks for your post!&lt;/P&gt;</description>
      <pubDate>Sat, 01 Dec 2012 00:33:55 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-remove-duplicate-events-in-search-results-without-using/m-p/44368#M8287</guid>
      <dc:creator>BStodd</dc:creator>
      <dc:date>2012-12-01T00:33:55Z</dc:date>
    </item>
    <item>
      <title>Re: How to remove duplicate events in search results without using DEDUP</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-remove-duplicate-events-in-search-results-without-using/m-p/44369#M8288</link>
      <description>&lt;P&gt;I'm also having same problem, anyway know how to solve it&lt;/P&gt;</description>
      <pubDate>Thu, 30 Jan 2014 00:06:35 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-remove-duplicate-events-in-search-results-without-using/m-p/44369#M8288</guid>
      <dc:creator>fabiocaldas</dc:creator>
      <dc:date>2014-01-30T00:06:35Z</dc:date>
    </item>
    <item>
      <title>Re: How to remove duplicate events in search results without using DEDUP</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-remove-duplicate-events-in-search-results-without-using/m-p/44370#M8289</link>
      <description>&lt;P&gt;Yes, I get the same error!&lt;/P&gt;</description>
      <pubDate>Fri, 31 Mar 2017 20:36:57 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-remove-duplicate-events-in-search-results-without-using/m-p/44370#M8289</guid>
      <dc:creator>jagadeeshm</dc:creator>
      <dc:date>2017-03-31T20:36:57Z</dc:date>
    </item>
    <item>
      <title>Re: How to remove duplicate events in search results without using DEDUP</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-remove-duplicate-events-in-search-results-without-using/m-p/44371#M8290</link>
      <description>&lt;P&gt;Error in 'delete' command: Missing or malformed messages.conf stanza for DISPATCHCOMM:PREVSTREAM_ERROR__simpleresultcombiner&lt;/P&gt;</description>
      <pubDate>Tue, 29 Sep 2020 23:28:46 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-remove-duplicate-events-in-search-results-without-using/m-p/44371#M8290</guid>
      <dc:creator>orion44</dc:creator>
      <dc:date>2020-09-29T23:28:46Z</dc:date>
    </item>
  </channel>
</rss>

