<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How does dedup work in splunk ? in Splunk Search</title>
    <link>https://community.splunk.com/t5/Splunk-Search/How-does-dedup-work-in-splunk/m-p/161824#M45739</link>
    <description>&lt;P&gt;It can be expensive, yes, as it needs to save the every unique entry in a temporary place to keep comparing with every following event. To see how expensive it is, just use the Job inspector, it'll show how long each command takes to run.&lt;/P&gt;

&lt;P&gt;Also, remember that deleting the record, doesn't actually delete anything, just mark it so won't show up again... but still very handy in your situation as you won't need to re-run dedup every time.&lt;/P&gt;

&lt;P&gt;Cheers&lt;/P&gt;</description>
    <pubDate>Mon, 02 Mar 2015 23:06:19 GMT</pubDate>
    <dc:creator>musskopf</dc:creator>
    <dc:date>2015-03-02T23:06:19Z</dc:date>
    <item>
      <title>How does dedup work in splunk ?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/How-does-dedup-work-in-splunk/m-p/161823#M45738</link>
      <description>&lt;P&gt;How does dedup work in splunk ? My concern is about the performance.&lt;BR /&gt;
If my search is over 500K -1M events out of which 2K events are duplicates, is using dedup going to be expensive ? Or should I find a way way to delete those 2K events and avoid using dedup ?&lt;/P&gt;

&lt;P&gt;Can someone give me suggestions on this or direct me to a discussion where I can find the answer to this question.&lt;/P&gt;</description>
      <pubDate>Mon, 02 Mar 2015 17:14:56 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/How-does-dedup-work-in-splunk/m-p/161823#M45738</guid>
      <dc:creator>nibinabr</dc:creator>
      <dc:date>2015-03-02T17:14:56Z</dc:date>
    </item>
    <item>
      <title>Re: How does dedup work in splunk ?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/How-does-dedup-work-in-splunk/m-p/161824#M45739</link>
      <description>&lt;P&gt;It can be expensive, yes, as it needs to save the every unique entry in a temporary place to keep comparing with every following event. To see how expensive it is, just use the Job inspector, it'll show how long each command takes to run.&lt;/P&gt;

&lt;P&gt;Also, remember that deleting the record, doesn't actually delete anything, just mark it so won't show up again... but still very handy in your situation as you won't need to re-run dedup every time.&lt;/P&gt;

&lt;P&gt;Cheers&lt;/P&gt;</description>
      <pubDate>Mon, 02 Mar 2015 23:06:19 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/How-does-dedup-work-in-splunk/m-p/161824#M45739</guid>
      <dc:creator>musskopf</dc:creator>
      <dc:date>2015-03-02T23:06:19Z</dc:date>
    </item>
  </channel>
</rss>

