<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How to Delete Duplicate Events?? in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/How-to-delete-duplicate-events/m-p/591182#M103552</link>
    <description>&lt;P&gt;i tried, it is not working&lt;/P&gt;</description>
    <pubDate>Tue, 29 Mar 2022 06:12:38 GMT</pubDate>
    <dc:creator>jerjer95119</dc:creator>
    <dc:date>2022-03-29T06:12:38Z</dc:date>
    <item>
      <title>How to delete duplicate events?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-delete-duplicate-events/m-p/591168#M103547</link>
      <description>&lt;P&gt;Each event has been ingested twice with the same uuid.&lt;/P&gt;
&lt;P&gt;i want to keep one event only for each uuid.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;How to delete one event only for each uuid?&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;for searching index="okta*" | dedup uuid, it will show events with the unique uuid only&lt;BR /&gt;it will show half of total events that i want.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;then i run index="okta*" | dedup uuid | delete , but this operation is not allowed&amp;nbsp;&lt;/P&gt;
&lt;P&gt;it will show "this command cannot be invoked after the command simpleresultcombiner"&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Anyone have suggestion?&lt;/P&gt;</description>
      <pubDate>Thu, 31 Mar 2022 19:39:57 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-delete-duplicate-events/m-p/591168#M103547</guid>
      <dc:creator>jerjer95119</dc:creator>
      <dc:date>2022-03-31T19:39:57Z</dc:date>
    </item>
    <item>
      <title>Re: How to Delete Duplicate Events??</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-delete-duplicate-events/m-p/591181#M103551</link>
      <description>&lt;P&gt;Please go through this link i hope&amp;nbsp; you may find the solution to it.&lt;BR /&gt;&lt;BR /&gt;&lt;A href="https://community.splunk.com/t5/Splunk-Search/How-to-delete-duplicate-events/m-p/70656/highlight/true#M17694" target="_blank"&gt;https://community.splunk.com/t5/Splunk-Search/How-to-delete-duplicate-events/m-p/70656/highlight/true#M17694&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 29 Mar 2022 06:09:06 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-delete-duplicate-events/m-p/591181#M103551</guid>
      <dc:creator>vinod743374</dc:creator>
      <dc:date>2022-03-29T06:09:06Z</dc:date>
    </item>
    <item>
      <title>Re: How to Delete Duplicate Events??</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-delete-duplicate-events/m-p/591182#M103552</link>
      <description>&lt;P&gt;i tried, it is not working&lt;/P&gt;</description>
      <pubDate>Tue, 29 Mar 2022 06:12:38 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-delete-duplicate-events/m-p/591182#M103552</guid>
      <dc:creator>jerjer95119</dc:creator>
      <dc:date>2022-03-29T06:12:38Z</dc:date>
    </item>
    <item>
      <title>Re: How to Delete Duplicate Events??</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-delete-duplicate-events/m-p/591185#M103553</link>
      <description>&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="jerjer95119_0-1648534425946.png" style="width: 400px;"&gt;&lt;img src="https://community.splunk.com/t5/image/serverpage/image-id/18804iE4D1B65B4BAE9100/image-size/medium?v=v2&amp;amp;px=400" role="button" title="jerjer95119_0-1648534425946.png" alt="jerjer95119_0-1648534425946.png" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Tue, 29 Mar 2022 06:13:53 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-delete-duplicate-events/m-p/591185#M103553</guid>
      <dc:creator>jerjer95119</dc:creator>
      <dc:date>2022-03-29T06:13:53Z</dc:date>
    </item>
    <item>
      <title>Re: How to Delete Duplicate Events??</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-delete-duplicate-events/m-p/591187#M103554</link>
      <description>&lt;P&gt;Firstly, there is not much point in "deleting" data - you already "paid" for it with your license if you're using volume-based licensing so it might as well just stay there &lt;span class="lia-unicode-emoji" title=":winking_face:"&gt;😉&lt;/span&gt;&lt;/P&gt;&lt;P&gt;But seriously, I'd start with improving the data onbooarding process so you can identify the duplicates and prevent them from being ingested in the first place. There is no point in indexing data only to have it filtered out all the time.&lt;/P&gt;</description>
      <pubDate>Tue, 29 Mar 2022 06:19:30 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-delete-duplicate-events/m-p/591187#M103554</guid>
      <dc:creator>PickleRick</dc:creator>
      <dc:date>2022-03-29T06:19:30Z</dc:date>
    </item>
    <item>
      <title>Re: How to Delete Duplicate Events??</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-delete-duplicate-events/m-p/591210#M103557</link>
      <description>&lt;P&gt;Please, don't do it. If you can't copy-paste properly, don't do it! Especially that it's a potentially destructive command that can mean that you can lose your data.&lt;/P&gt;&lt;P&gt;It's not to make fun of you or anything it's just that if you don't understand what this search does and you're just copying it blindly, you can cause much harm!&lt;/P&gt;</description>
      <pubDate>Tue, 29 Mar 2022 09:08:13 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-delete-duplicate-events/m-p/591210#M103557</guid>
      <dc:creator>PickleRick</dc:creator>
      <dc:date>2022-03-29T09:08:13Z</dc:date>
    </item>
    <item>
      <title>Re: How to Delete Duplicate Events??</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-delete-duplicate-events/m-p/591648#M103607</link>
      <description>&lt;P&gt;No. This is not a delete command.&lt;/P&gt;&lt;P&gt;Please suggest a way to dedup the data.&lt;/P&gt;</description>
      <pubDate>Thu, 31 Mar 2022 02:20:21 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-delete-duplicate-events/m-p/591648#M103607</guid>
      <dc:creator>jerjer95119</dc:creator>
      <dc:date>2022-03-31T02:20:21Z</dc:date>
    </item>
    <item>
      <title>Re: How to Delete Duplicate Events??</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-delete-duplicate-events/m-p/591649#M103608</link>
      <description>&lt;P&gt;And the subsearch maxout is 10000. Please suggest a way that can dedup all data from index without limit.&lt;/P&gt;</description>
      <pubDate>Thu, 31 Mar 2022 02:23:00 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-delete-duplicate-events/m-p/591649#M103608</guid>
      <dc:creator>jerjer95119</dc:creator>
      <dc:date>2022-03-31T02:23:00Z</dc:date>
    </item>
    <item>
      <title>Re: How to Delete Duplicate Events??</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-delete-duplicate-events/m-p/591659#M103611</link>
      <description>&lt;P&gt;You can sort your events and then do streamstats with a relatively short window (like 3 or so) to count events by raw event value giving you effectively each event with a value of either 1 or 2 as count. Then you would only need to filter to see the events with the count=1&lt;/P&gt;</description>
      <pubDate>Thu, 31 Mar 2022 04:38:27 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-delete-duplicate-events/m-p/591659#M103611</guid>
      <dc:creator>PickleRick</dc:creator>
      <dc:date>2022-03-31T04:38:27Z</dc:date>
    </item>
    <item>
      <title>Re: How to Delete Duplicate Events??</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-delete-duplicate-events/m-p/591700#M103614</link>
      <description>&lt;P&gt;i mentioned all the data are with uuid count=2.&lt;BR /&gt;so i want to cut it as half, do you have solution? thanks&lt;/P&gt;</description>
      <pubDate>Thu, 31 Mar 2022 07:02:32 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-delete-duplicate-events/m-p/591700#M103614</guid>
      <dc:creator>jerjer95119</dc:creator>
      <dc:date>2022-03-31T07:02:32Z</dc:date>
    </item>
    <item>
      <title>Re: How to Delete Duplicate Events??</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-delete-duplicate-events/m-p/591723#M103615</link>
      <description>&lt;P&gt;As Per my Understanding I gave you this Solution,&lt;BR /&gt;You Can Try if it Solves your Issue.&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Capture.PNG" style="width: 788px;"&gt;&lt;img src="https://community.splunk.com/t5/image/serverpage/image-id/18871iD13F0AC45B78F21E/image-dimensions/788x337?v=v2" width="788" height="337" role="button" title="Capture.PNG" alt="Capture.PNG" /&gt;&lt;/span&gt;&lt;/P&gt;&lt;P&gt; I hope Like this you have the Data, there are some Duplicates in the index in UID field.&lt;BR /&gt;I removed the duplicates with the below Query&lt;BR /&gt;&lt;BR /&gt;&lt;STRONG&gt;index="testing" sourcetype="csv" | table UID "Reason Code" Shift "From Date" "To Date" | streamstats count by UID | where count = 1 | fields - count&lt;BR /&gt;&lt;BR /&gt;&lt;BR /&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="Capture2.PNG" style="width: 793px;"&gt;&lt;img src="https://community.splunk.com/t5/image/serverpage/image-id/18872i96A48F780A5A733B/image-dimensions/793x331?v=v2" width="793" height="331" role="button" title="Capture2.PNG" alt="Capture2.PNG" /&gt;&lt;/span&gt;You can see the duplicates is being removed.&lt;BR /&gt;I don't know whether there is any limit of 10000 or not , You can try ,Hope it works.&lt;BR /&gt;&lt;BR /&gt;Thankyou;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 31 Mar 2022 09:40:43 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-delete-duplicate-events/m-p/591723#M103615</guid>
      <dc:creator>vinod743374</dc:creator>
      <dc:date>2022-03-31T09:40:43Z</dc:date>
    </item>
    <item>
      <title>Re: How to Delete Duplicate Events??</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-delete-duplicate-events/m-p/591783#M103618</link>
      <description>&lt;P&gt;i will try this query thank you.&lt;/P&gt;&lt;P&gt;but can this query add "| delete" to delete data rather than just dedup when searching?&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;index="testing" sourcetype="csv" | table UID "Reason Code" Shift "From Date" "To Date" | streamstats count by UID | where count = 1 | fields - count | delete&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;because if i just dedup in searching , i just did "&lt;SPAN&gt;index="okta*" | dedup uuid" and fine.&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/P&gt;&lt;P&gt;&lt;STRONG&gt;&lt;SPAN&gt;but i want to delete from index rather than just dedup in searching every time.&lt;/SPAN&gt;&lt;/STRONG&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 31 Mar 2022 13:45:14 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-delete-duplicate-events/m-p/591783#M103618</guid>
      <dc:creator>jerjer95119</dc:creator>
      <dc:date>2022-03-31T13:45:14Z</dc:date>
    </item>
    <item>
      <title>Re: How to Delete Duplicate Events??</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-delete-duplicate-events/m-p/591864#M103626</link>
      <description>&lt;P&gt;No, you cannot. It's kinda complicated and it's because of how splunk works and where and how various commands are performed. Streamstats is _not_ a distributable streaming command so after that command you cannot do | delete.&lt;/P&gt;</description>
      <pubDate>Thu, 31 Mar 2022 19:08:36 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-delete-duplicate-events/m-p/591864#M103626</guid>
      <dc:creator>PickleRick</dc:creator>
      <dc:date>2022-03-31T19:08:36Z</dc:date>
    </item>
  </channel>
</rss>

