<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Seeing duplicate events in Search Results ? in Splunk Search</title>
    <link>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313621#M160956</link>
    <description>&lt;P&gt;The logs will update during the night hours. So once the logs are updated I need to check still duplicates are coming up or not.&lt;/P&gt;</description>
    <pubDate>Sun, 28 May 2017 21:54:23 GMT</pubDate>
    <dc:creator>arunsony</dc:creator>
    <dc:date>2017-05-28T21:54:23Z</dc:date>
    <item>
      <title>Seeing duplicate events in Search Results ?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313607#M160942</link>
      <description>&lt;P&gt;I have a source as ///application.log in my inputs.conf.On the servers the application.log will be rolled when it fill up with 10Mb by creating the file name as application.12-13-2014.log and new file with application.log will be creating after rolling . At some point because of this roll over we are missing some events in splunk. So in order to not to miss the events we changed the source as application* (used wild card) in inputs.conf and now we see all the logs getting indexed and showing events in search. But the problem is that we are getting duplicate logs with the same time stamp. The duplicate logs appears with the source as one with application.log and the other with application.12-12-2014.log. So can anyone help me on this issue. Thanks in advance !&lt;/P&gt;</description>
      <pubDate>Sun, 28 May 2017 03:02:49 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313607#M160942</guid>
      <dc:creator>arunsony</dc:creator>
      <dc:date>2017-05-28T03:02:49Z</dc:date>
    </item>
    <item>
      <title>Re: Seeing duplicate events in Search Results ?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313608#M160943</link>
      <description>&lt;P&gt;In the short term, you can add &lt;CODE&gt;| dedup _raw&lt;/CODE&gt; to your searches but this degrades performance significantly.  If you are using &lt;CODE&gt;crcSalt = &amp;lt;SOURCE&amp;gt;&lt;/CODE&gt;, make sure that you remove this.  It should be that simply changing the &lt;CODE&gt;whitelist&lt;/CODE&gt; the way that you did fixes the original problem without creating the new problem.  if you are not using that setting, then file a bug with the developers because they appear to be writing to the file after it is rotated/renamed, which they should not be doing.&lt;/P&gt;</description>
      <pubDate>Sun, 28 May 2017 14:05:36 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313608#M160943</guid>
      <dc:creator>woodcock</dc:creator>
      <dc:date>2017-05-28T14:05:36Z</dc:date>
    </item>
    <item>
      <title>Re: Seeing duplicate events in Search Results ?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313609#M160944</link>
      <description>&lt;P&gt;Is there any way of writing props or transforms  for the source to drop the duplicate events. I am not sure about the whitelist you are saying ? Can you give an example ? &lt;/P&gt;</description>
      <pubDate>Sun, 28 May 2017 15:47:55 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313609#M160944</guid>
      <dc:creator>arunsony</dc:creator>
      <dc:date>2017-05-28T15:47:55Z</dc:date>
    </item>
    <item>
      <title>Re: Seeing duplicate events in Search Results ?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313610#M160945</link>
      <description>&lt;P&gt;Can you post please the &lt;CODE&gt;inputs.conf&lt;/CODE&gt; file?&lt;/P&gt;

&lt;P&gt;The documentation supports (obviously) what @woodcock said - &lt;A href="https://docs.splunk.com/Documentation/Splunk/6.6.0/Data/Howlogfilerotationishandled"&gt;How Splunk Enterprise handles log file rotation&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;It says -&lt;/P&gt;

&lt;P&gt;-- Do not use &lt;CODE&gt;crcSalt = &amp;lt;SOURCE&amp;gt;&lt;/CODE&gt; with rolling log files, or any other scenario in which logfiles get renamed or moved to another monitored location. Doing so prevents Splunk Enterprise from recognizing log files across the roll or rename, which results in the data being reindexed. &lt;/P&gt;

&lt;P&gt;I would look at either &lt;CODE&gt;initCrcLength&lt;/CODE&gt; and make it larger than the default 256 or/and &lt;CODE&gt;ignoreOlderThan&lt;/CODE&gt;. If you have &lt;CODE&gt;ignoreOlderThan = 2h&lt;/CODE&gt; for example, files which were not touched in the past two hours won't be read... it's a problematic situation, when the forwarder goes down for any reason...&lt;/P&gt;</description>
      <pubDate>Sun, 28 May 2017 18:16:38 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313610#M160945</guid>
      <dc:creator>ddrillic</dc:creator>
      <dc:date>2017-05-28T18:16:38Z</dc:date>
    </item>
    <item>
      <title>Re: Seeing duplicate events in Search Results ?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313611#M160946</link>
      <description>&lt;P&gt;inputs.conf : It looks as below&lt;BR /&gt;
[monitor: ///usr/apps444/test_application*]&lt;BR /&gt;
sourcetype = test_application&lt;BR /&gt;
index = Application&lt;/P&gt;

&lt;P&gt;All looks in the same format.. Just 4 hours back removed the crcSalt from the inputs.conf.&lt;/P&gt;</description>
      <pubDate>Tue, 29 Sep 2020 14:12:54 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313611#M160946</guid>
      <dc:creator>arunsony</dc:creator>
      <dc:date>2020-09-29T14:12:54Z</dc:date>
    </item>
    <item>
      <title>Re: Seeing duplicate events in Search Results ?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313612#M160947</link>
      <description>&lt;P&gt;If you removed the &lt;CODE&gt;crcSalt&lt;/CODE&gt;, you should not see duplicates any more...&lt;/P&gt;

&lt;P&gt;Without &lt;CODE&gt;crcSalt&lt;/CODE&gt; people experience sometimes the opposite problem, in which files are &lt;STRONG&gt;not&lt;/STRONG&gt; being indexed when the first 256 bytes of the files are identical. &lt;/P&gt;</description>
      <pubDate>Sun, 28 May 2017 19:50:20 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313612#M160947</guid>
      <dc:creator>ddrillic</dc:creator>
      <dc:date>2017-05-28T19:50:20Z</dc:date>
    </item>
    <item>
      <title>Re: Seeing duplicate events in Search Results ?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313613#M160948</link>
      <description>&lt;P&gt;When I search index= Application for last 1 hour I see around 1000 events and when I use the same search by adding dedup _raw I see the count falled to 750 events. Is the difference is the duplicate events .. ? I am not sure ? Is there any way to find the duplicates for an index ? How do delete the duplicate events which are already indexed in search ?&lt;/P&gt;</description>
      <pubDate>Sun, 28 May 2017 19:57:30 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313613#M160948</guid>
      <dc:creator>arunsony</dc:creator>
      <dc:date>2017-05-28T19:57:30Z</dc:date>
    </item>
    <item>
      <title>Re: Seeing duplicate events in Search Results ?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313614#M160949</link>
      <description>&lt;P&gt;If you are in a situation in which you can delete the index and start from scratch, it's the easiest. In these cases, I re-install the forwarder and start fresh. &lt;/P&gt;</description>
      <pubDate>Sun, 28 May 2017 20:01:10 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313614#M160949</guid>
      <dc:creator>ddrillic</dc:creator>
      <dc:date>2017-05-28T20:01:10Z</dc:date>
    </item>
    <item>
      <title>Re: Seeing duplicate events in Search Results ?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313615#M160950</link>
      <description>&lt;P&gt;No I cannot start from scratch. But one thing is the difference in number of events is the duplicate events or not ?&lt;/P&gt;</description>
      <pubDate>Sun, 28 May 2017 20:14:01 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313615#M160950</guid>
      <dc:creator>arunsony</dc:creator>
      <dc:date>2017-05-28T20:14:01Z</dc:date>
    </item>
    <item>
      <title>Re: Seeing duplicate events in Search Results ?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313616#M160951</link>
      <description>&lt;P&gt;Right, it definitely seems that you still have duplicates.  &lt;/P&gt;

&lt;P&gt;If you use -&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[monitor: ///usr/apps444/test_application*]
sourcetype = test_application
index = Application
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;and you still see duplicates, I would first double-check that no duplicates exist in the files themselves. &lt;/P&gt;</description>
      <pubDate>Sun, 28 May 2017 20:17:38 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313616#M160951</guid>
      <dc:creator>ddrillic</dc:creator>
      <dc:date>2017-05-28T20:17:38Z</dc:date>
    </item>
    <item>
      <title>Re: Seeing duplicate events in Search Results ?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313617#M160952</link>
      <description>&lt;P&gt;I check in the file there are no duplicates. After removing crcSalt the file is not updated with logs. Need to wait to check still duplicates are coming up or not ?&lt;/P&gt;</description>
      <pubDate>Sun, 28 May 2017 20:25:02 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313617#M160952</guid>
      <dc:creator>arunsony</dc:creator>
      <dc:date>2017-05-28T20:25:02Z</dc:date>
    </item>
    <item>
      <title>Re: Seeing duplicate events in Search Results ?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313618#M160953</link>
      <description>&lt;P&gt;use &lt;CODE&gt;_index_earliest = -5m&lt;/CODE&gt; in your testing search to make sure that you are looking ONLY at recently indexed events.  Events from before the fix will stay duplicated/wrong.&lt;/P&gt;</description>
      <pubDate>Sun, 28 May 2017 21:23:39 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313618#M160953</guid>
      <dc:creator>woodcock</dc:creator>
      <dc:date>2017-05-28T21:23:39Z</dc:date>
    </item>
    <item>
      <title>Re: Seeing duplicate events in Search Results ?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313619#M160954</link>
      <description>&lt;P&gt;No but you can schedule a search to run hourly like this which removes duplicates from search results:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;index=YourIndexHere sourcetype=YourSourcetypeHere earliest=-5h latest=now
| streamstats count AS deleteme BY _raw
| search deleteme&amp;gt;1
| delete
&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Sun, 28 May 2017 21:27:48 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313619#M160954</guid>
      <dc:creator>woodcock</dc:creator>
      <dc:date>2017-05-28T21:27:48Z</dc:date>
    </item>
    <item>
      <title>Re: Seeing duplicate events in Search Results ?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313620#M160955</link>
      <description>&lt;P&gt;My index name is Application and Sourcetype is prod_application . Can you just write the search for 1 hour ? One more thing the duplicates will be deleted at the search level or indexer level ?&lt;/P&gt;</description>
      <pubDate>Sun, 28 May 2017 21:49:42 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313620#M160955</guid>
      <dc:creator>arunsony</dc:creator>
      <dc:date>2017-05-28T21:49:42Z</dc:date>
    </item>
    <item>
      <title>Re: Seeing duplicate events in Search Results ?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313621#M160956</link>
      <description>&lt;P&gt;The logs will update during the night hours. So once the logs are updated I need to check still duplicates are coming up or not.&lt;/P&gt;</description>
      <pubDate>Sun, 28 May 2017 21:54:23 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313621#M160956</guid>
      <dc:creator>arunsony</dc:creator>
      <dc:date>2017-05-28T21:54:23Z</dc:date>
    </item>
    <item>
      <title>Re: Seeing duplicate events in Search Results ?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313622#M160957</link>
      <description>&lt;P&gt;I still see the duplicate events in the search results. Can anyone suggest the solution for it ? One more thing how can we identify the duplicate events ?&lt;BR /&gt;
Just typing the index name and checking the events it shows around 10000 events and when I use dedup _raw it shows 7000 events So these are the duplicate events or not ? Is there any way to find the duplicate events ?&lt;/P&gt;</description>
      <pubDate>Mon, 29 May 2017 14:40:22 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313622#M160957</guid>
      <dc:creator>arunsony</dc:creator>
      <dc:date>2017-05-29T14:40:22Z</dc:date>
    </item>
    <item>
      <title>Re: Seeing duplicate events in Search Results ?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313623#M160958</link>
      <description>&lt;P&gt;The logs are updated but still I see duplicate events in search result  ? Anyother suggestions ?&lt;/P&gt;</description>
      <pubDate>Mon, 29 May 2017 16:49:44 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313623#M160958</guid>
      <dc:creator>arunsony</dc:creator>
      <dc:date>2017-05-29T16:49:44Z</dc:date>
    </item>
    <item>
      <title>Re: Seeing duplicate events in Search Results ?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313624#M160959</link>
      <description>&lt;P&gt;Use &lt;CODE&gt;btool&lt;/CODE&gt; and show us the settings for &lt;CODE&gt;inputs&lt;/CODE&gt; and &lt;CODE&gt;props&lt;/CODE&gt; for this source/type.&lt;/P&gt;</description>
      <pubDate>Mon, 29 May 2017 19:34:14 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313624#M160959</guid>
      <dc:creator>woodcock</dc:creator>
      <dc:date>2017-05-29T19:34:14Z</dc:date>
    </item>
    <item>
      <title>Re: Seeing duplicate events in Search Results ?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313625#M160960</link>
      <description>&lt;P&gt;my sourcetype is test_application.&lt;BR /&gt;
The splunk is installed in windows.&lt;BR /&gt;
Can you tell me what do I need to write at the bin directory ?&lt;BR /&gt;
for both inputs and props ?&lt;/P&gt;</description>
      <pubDate>Mon, 29 May 2017 20:37:57 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313625#M160960</guid>
      <dc:creator>arunsony</dc:creator>
      <dc:date>2017-05-29T20:37:57Z</dc:date>
    </item>
    <item>
      <title>Re: Seeing duplicate events in Search Results ?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313626#M160961</link>
      <description>&lt;P&gt;Do this:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;./splunk cmd btool inputs list --debug
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;Then skip to the section for your input and make note of all the settings.  Then do this:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;./splunk cmd btool props list --debug
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;Then skip to the &lt;CODE&gt;test_application&lt;/CODE&gt; section and make note of all the settings.&lt;/P&gt;

&lt;P&gt;Then post a &lt;CODE&gt;comment&lt;/CODE&gt; to your Question with the details.&lt;/P&gt;</description>
      <pubDate>Mon, 29 May 2017 21:02:01 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Seeing-duplicate-events-in-Search-Results/m-p/313626#M160961</guid>
      <dc:creator>woodcock</dc:creator>
      <dc:date>2017-05-29T21:02:01Z</dc:date>
    </item>
  </channel>
</rss>

