<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How to remove duplicate event data from index? in Splunk Dev</title>
    <link>https://community.splunk.com/t5/Splunk-Dev/How-to-remove-duplicate-event-data-from-index/m-p/86878#M1244</link>
    <description>&lt;P&gt;Using dedup on just the _raw data could remove events that are not duplicates.  You would need to include the other indexed fields to ensure uniqueness.  For example, if you have an error message being logged multiple times and then used dedup just on the _raw data, you would only see one occurrence of that error in Splunk.  Including _time would help, but only for that server, so you would also need to include host.  Then you run into issues if there are two of the same errors on the same host at the same time, etc. ad infinitum.&lt;/P&gt;</description>
    <pubDate>Wed, 06 Jun 2012 23:17:27 GMT</pubDate>
    <dc:creator>richprescott</dc:creator>
    <dc:date>2012-06-06T23:17:27Z</dc:date>
    <item>
      <title>How to remove duplicate event data from index?</title>
      <link>https://community.splunk.com/t5/Splunk-Dev/How-to-remove-duplicate-event-data-from-index/m-p/86870#M1236</link>
      <description>&lt;P&gt;First of all, I need ask a question because I don't have enough karma points for upload an app. &lt;/P&gt;

&lt;P&gt;I had some little problems with input script in Splunk which created duplicate event data logs in indexes. This is a reason because I wrote &lt;A href="http://pastebin.com/DVKR9hLs"&gt;python script&lt;/A&gt; to remove that duplicate events.&lt;/P&gt;

&lt;P&gt;&lt;A href="http://pastebin.com/DVKR9hLs"&gt;Script&lt;/A&gt; works for me! &lt;STRONG&gt;Enjoy!&lt;/STRONG&gt; &lt;span class="lia-unicode-emoji" title=":winking_face:"&gt;😉&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 21 Apr 2011 19:32:24 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Dev/How-to-remove-duplicate-event-data-from-index/m-p/86870#M1236</guid>
      <dc:creator>zpavic</dc:creator>
      <dc:date>2011-04-21T19:32:24Z</dc:date>
    </item>
    <item>
      <title>Re: How to remove duplicate event data from index?</title>
      <link>https://community.splunk.com/t5/Splunk-Dev/How-to-remove-duplicate-event-data-from-index/m-p/86871#M1237</link>
      <description>&lt;P&gt;Wait - the system isn't supposed to prevent you from uploading an app. That's a bug &lt;span class="lia-unicode-emoji" title=":disappointed_face:"&gt;😞&lt;/span&gt;&lt;/P&gt;

&lt;P&gt;Let me get on that. &lt;/P&gt;</description>
      <pubDate>Thu, 21 Apr 2011 20:11:39 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Dev/How-to-remove-duplicate-event-data-from-index/m-p/86871#M1237</guid>
      <dc:creator>John_Mark</dc:creator>
      <dc:date>2011-04-21T20:11:39Z</dc:date>
    </item>
    <item>
      <title>Re: How to remove duplicate event data from index?</title>
      <link>https://community.splunk.com/t5/Splunk-Dev/How-to-remove-duplicate-event-data-from-index/m-p/86872#M1238</link>
      <description>&lt;P&gt;I logout - login again and then I could upload an app but when I uploaded my app disappeared &lt;span class="lia-unicode-emoji" title=":disappointed_face:"&gt;😞&lt;/span&gt; &lt;/P&gt;

&lt;P&gt;I will try tomorrow!&lt;/P&gt;</description>
      <pubDate>Thu, 21 Apr 2011 21:26:26 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Dev/How-to-remove-duplicate-event-data-from-index/m-p/86872#M1238</guid>
      <dc:creator>zpavic</dc:creator>
      <dc:date>2011-04-21T21:26:26Z</dc:date>
    </item>
    <item>
      <title>Re: How to remove duplicate event data from index?</title>
      <link>https://community.splunk.com/t5/Splunk-Dev/How-to-remove-duplicate-event-data-from-index/m-p/86873#M1239</link>
      <description>&lt;P&gt;Hi - I saw the app you uploaded and approved it. It will appear in a few minutes.&lt;/P&gt;

&lt;P&gt;Sounds like we need to improve the process for uploading apps &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 21 Apr 2011 21:29:15 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Dev/How-to-remove-duplicate-event-data-from-index/m-p/86873#M1239</guid>
      <dc:creator>John_Mark</dc:creator>
      <dc:date>2011-04-21T21:29:15Z</dc:date>
    </item>
    <item>
      <title>Re: How to remove duplicate event data from index?</title>
      <link>https://community.splunk.com/t5/Splunk-Dev/How-to-remove-duplicate-event-data-from-index/m-p/86874#M1240</link>
      <description>&lt;P&gt;Thanks for approving! &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 22 Apr 2011 04:18:07 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Dev/How-to-remove-duplicate-event-data-from-index/m-p/86874#M1240</guid>
      <dc:creator>zpavic</dc:creator>
      <dc:date>2011-04-22T04:18:07Z</dc:date>
    </item>
    <item>
      <title>Re: How to remove duplicate event data from index?</title>
      <link>https://community.splunk.com/t5/Splunk-Dev/How-to-remove-duplicate-event-data-from-index/m-p/86875#M1241</link>
      <description>&lt;P&gt;Worked well, thanks.&lt;BR /&gt;
It can take a long time to run if your data is large, some time restrictions may be useful.&lt;/P&gt;

&lt;P&gt;To generalize to any duplicate (not only in a short time range) I modified the search1 line 59&lt;/P&gt;

&lt;P&gt;search1 = 'search index=' + index + ' | transaction _raw  keepevicted=true | where eventcount&amp;gt;1'&lt;/P&gt;</description>
      <pubDate>Wed, 12 Oct 2011 23:58:06 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Dev/How-to-remove-duplicate-event-data-from-index/m-p/86875#M1241</guid>
      <dc:creator>yannK</dc:creator>
      <dc:date>2011-10-12T23:58:06Z</dc:date>
    </item>
    <item>
      <title>Re: How to remove duplicate event data from index?</title>
      <link>https://community.splunk.com/t5/Splunk-Dev/How-to-remove-duplicate-event-data-from-index/m-p/86876#M1242</link>
      <description>&lt;P&gt;Unfortunately this script is only correct if you only have a maximum of one valid event per second in the targetted index.  In other words, any other events in that second are removed by the script, even if they are not duplicates.&lt;/P&gt;</description>
      <pubDate>Thu, 08 Dec 2011 00:41:22 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Dev/How-to-remove-duplicate-event-data-from-index/m-p/86876#M1242</guid>
      <dc:creator>jrodman</dc:creator>
      <dc:date>2011-12-08T00:41:22Z</dc:date>
    </item>
    <item>
      <title>Re: How to remove duplicate event data from index?</title>
      <link>https://community.splunk.com/t5/Splunk-Dev/How-to-remove-duplicate-event-data-from-index/m-p/86877#M1243</link>
      <description>&lt;P&gt;You could probably use the dedup command at search time: &lt;CODE&gt;| dedup _raw&lt;/CODE&gt;&lt;/P&gt;

&lt;P&gt;This will remove all duplicate data from your index&lt;/P&gt;

&lt;P&gt;&lt;A href="http://docs.splunk.com/Documentation/Splunk/latest/SearchReference/dedup"&gt;Dedup Command&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 05 Apr 2012 00:58:32 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Dev/How-to-remove-duplicate-event-data-from-index/m-p/86877#M1243</guid>
      <dc:creator>Dark_Ichigo</dc:creator>
      <dc:date>2012-04-05T00:58:32Z</dc:date>
    </item>
    <item>
      <title>Re: How to remove duplicate event data from index?</title>
      <link>https://community.splunk.com/t5/Splunk-Dev/How-to-remove-duplicate-event-data-from-index/m-p/86878#M1244</link>
      <description>&lt;P&gt;Using dedup on just the _raw data could remove events that are not duplicates.  You would need to include the other indexed fields to ensure uniqueness.  For example, if you have an error message being logged multiple times and then used dedup just on the _raw data, you would only see one occurrence of that error in Splunk.  Including _time would help, but only for that server, so you would also need to include host.  Then you run into issues if there are two of the same errors on the same host at the same time, etc. ad infinitum.&lt;/P&gt;</description>
      <pubDate>Wed, 06 Jun 2012 23:17:27 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Dev/How-to-remove-duplicate-event-data-from-index/m-p/86878#M1244</guid>
      <dc:creator>richprescott</dc:creator>
      <dc:date>2012-06-06T23:17:27Z</dc:date>
    </item>
    <item>
      <title>Re: How to remove duplicate event data from index?</title>
      <link>https://community.splunk.com/t5/Splunk-Dev/How-to-remove-duplicate-event-data-from-index/m-p/86879#M1245</link>
      <description>&lt;P&gt;Yep, already been through that, I did realize however that splunk was indexing some files more than once which was the main cause of the problem.&lt;/P&gt;

&lt;P&gt;I stopped using dedup after that.&lt;/P&gt;

&lt;P&gt;Thanks &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 07 Jun 2012 00:10:39 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Dev/How-to-remove-duplicate-event-data-from-index/m-p/86879#M1245</guid>
      <dc:creator>Dark_Ichigo</dc:creator>
      <dc:date>2012-06-07T00:10:39Z</dc:date>
    </item>
    <item>
      <title>Re: How to remove duplicate event data from index?</title>
      <link>https://community.splunk.com/t5/Splunk-Dev/How-to-remove-duplicate-event-data-from-index/m-p/86880#M1246</link>
      <description>&lt;P&gt;I downvoted this post because dedup dont remove from index, remove from search...&lt;/P&gt;</description>
      <pubDate>Mon, 18 Sep 2017 22:28:37 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Dev/How-to-remove-duplicate-event-data-from-index/m-p/86880#M1246</guid>
      <dc:creator>jmorais</dc:creator>
      <dc:date>2017-09-18T22:28:37Z</dc:date>
    </item>
  </channel>
</rss>

