<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: duplicate index entries in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42029#M7804</link>
    <description>&lt;P&gt;That's correct.&lt;/P&gt;</description>
    <pubDate>Fri, 03 Sep 2010 03:12:22 GMT</pubDate>
    <dc:creator>Branden</dc:creator>
    <dc:date>2010-09-03T03:12:22Z</dc:date>
    <item>
      <title>duplicate index entries</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42021#M7796</link>
      <description>&lt;P&gt;I'm having what appears to be a logic problem, but it could be something else.&lt;/P&gt;

&lt;P&gt;I have an app that displays the output of an error log. Output looks something like this (may be familiar to those who have been helping me with other issues):&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;AA8AB241   0901122910 T O OPERATOR       OPERATOR NOTIFICATION
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;It parses the timestamp correctly. The error log script outputs any errors that have occurred in the past 60 seconds, and Splunk runs that every 60 seconds. &lt;/P&gt;

&lt;P&gt;Unfortunately, it appears to be indexing it 3-5 times. In other words, for each one of those entries, I get 3-5 identical entries in the index. &lt;/P&gt;

&lt;P&gt;I'm wondering if it has to do with my "every 60 seconds, produce results from the past 60 seconds" logic. Or could it be something else?&lt;/P&gt;

&lt;P&gt;Any feedback is appreciated.&lt;/P&gt;

&lt;P&gt;Thanks!&lt;/P&gt;

&lt;P&gt;Edit:&lt;/P&gt;

&lt;P&gt;Here is the inputs.conf file (note: this is on the forwarder):&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[script://splunk/etc/apps/all/bin/errptsplunk.sh]
interval = 60  # Run every minute
sourcetype = errpt
source = script://./bin/errptsplunk.sh
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;And props.conf (on the indexer):&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[errpt]
SHOULD_LINEMERGE = false
TIME_PREFIX = ^\S+\s+
TIME_FORMAT=%m%d%H%M%y
MAX_TIMESTAMP_LOOKAHEAD = 25
&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Wed, 01 Sep 2010 23:37:22 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42021#M7796</guid>
      <dc:creator>Branden</dc:creator>
      <dc:date>2010-09-01T23:37:22Z</dc:date>
    </item>
    <item>
      <title>Re: duplicate index entries</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42022#M7797</link>
      <description>&lt;P&gt;Branden, is this error log very small?&lt;/P&gt;</description>
      <pubDate>Thu, 02 Sep 2010 00:31:07 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42022#M7797</guid>
      <dc:creator>hulahoop</dc:creator>
      <dc:date>2010-09-02T00:31:07Z</dc:date>
    </item>
    <item>
      <title>Re: duplicate index entries</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42023#M7798</link>
      <description>&lt;P&gt;They're one line long, just like above. &lt;BR /&gt;
It's possible they could be multiple lines if there were multiple errors within that one minute.&lt;/P&gt;</description>
      <pubDate>Thu, 02 Sep 2010 01:00:05 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42023#M7798</guid>
      <dc:creator>Branden</dc:creator>
      <dc:date>2010-09-02T01:00:05Z</dc:date>
    </item>
    <item>
      <title>Re: duplicate index entries</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42024#M7799</link>
      <description>&lt;P&gt;Can you have your script temporarily write entries to a file as well as stdout?  Then you could verify if the script was duplicating output between runs.&lt;/P&gt;</description>
      <pubDate>Thu, 02 Sep 2010 04:28:39 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42024#M7799</guid>
      <dc:creator>Jeremiah</dc:creator>
      <dc:date>2010-09-02T04:28:39Z</dc:date>
    </item>
    <item>
      <title>Re: duplicate index entries</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42025#M7800</link>
      <description>&lt;P&gt;There is a current problem under investigation where events in very small files are getting duplicated.  I'll post here when we have more details, but we've been able to reproduce this and will hopefully have a workaround soon if not a fix.  It's not you or your logic.  &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 02 Sep 2010 06:42:05 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42025#M7800</guid>
      <dc:creator>hulahoop</dc:creator>
      <dc:date>2010-09-02T06:42:05Z</dc:date>
    </item>
    <item>
      <title>Re: duplicate index entries</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42026#M7801</link>
      <description>&lt;P&gt;This isn't in a file, though, is it? You're just using a scripted input that calls the &lt;CODE&gt;errpt&lt;/CODE&gt; shell command, and you're certain that it only gives you back new items from the past 60 seconds? And that you don't have multiple instances of the script input running in different apps or something like that?&lt;/P&gt;</description>
      <pubDate>Thu, 02 Sep 2010 12:05:40 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42026#M7801</guid>
      <dc:creator>gkanapathy</dc:creator>
      <dc:date>2010-09-02T12:05:40Z</dc:date>
    </item>
    <item>
      <title>Re: duplicate index entries</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42027#M7802</link>
      <description>&lt;P&gt;Correct, it is not a file; it's capturing the output of the errpt command. It should only be giving back items from the past 60 seconds... even if there was a duplicate because of my logic, I could see it indexing twice, but not 3-5 times. I'm certain the script is only running in one place. &lt;BR /&gt;
Actually, this all started after fixing the previous issue I wrote about with the errpt command (splitting it up into individual lines). I'm not sure if that's related or not...&lt;/P&gt;</description>
      <pubDate>Fri, 03 Sep 2010 00:01:44 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42027#M7802</guid>
      <dc:creator>Branden</dc:creator>
      <dc:date>2010-09-03T00:01:44Z</dc:date>
    </item>
    <item>
      <title>Re: duplicate index entries</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42028#M7803</link>
      <description>&lt;P&gt;Interesting. So it was okay &lt;EM&gt;before&lt;/EM&gt; you did the config change to the sourcetype to index it as individual lines, i.e., there no repeats then?&lt;/P&gt;</description>
      <pubDate>Fri, 03 Sep 2010 01:07:30 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42028#M7803</guid>
      <dc:creator>gkanapathy</dc:creator>
      <dc:date>2010-09-03T01:07:30Z</dc:date>
    </item>
    <item>
      <title>Re: duplicate index entries</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42029#M7804</link>
      <description>&lt;P&gt;That's correct.&lt;/P&gt;</description>
      <pubDate>Fri, 03 Sep 2010 03:12:22 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42029#M7804</guid>
      <dc:creator>Branden</dc:creator>
      <dc:date>2010-09-03T03:12:22Z</dc:date>
    </item>
    <item>
      <title>Re: duplicate index entries</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42030#M7805</link>
      <description>&lt;P&gt;sounds a lot like a bug in Splunk scripted input, but maybe if you update the question with the complete inputs.conf stanza and props.conf config for the source/sourcetype, we can see if there's anything.&lt;/P&gt;</description>
      <pubDate>Fri, 03 Sep 2010 03:40:30 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42030#M7805</guid>
      <dc:creator>gkanapathy</dc:creator>
      <dc:date>2010-09-03T03:40:30Z</dc:date>
    </item>
    <item>
      <title>Re: duplicate index entries</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42031#M7806</link>
      <description>&lt;P&gt;Question edited with inputs.conf and props.conf errpt stanzas. Thanks!&lt;/P&gt;</description>
      <pubDate>Fri, 03 Sep 2010 20:08:23 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42031#M7806</guid>
      <dc:creator>Branden</dc:creator>
      <dc:date>2010-09-03T20:08:23Z</dc:date>
    </item>
    <item>
      <title>Re: duplicate index entries</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42032#M7807</link>
      <description>&lt;P&gt;I think I'm going to open a case with Splunk tech support on this one. Two of the errpt entries just got indexed over 10,000 times. Oops!&lt;/P&gt;</description>
      <pubDate>Fri, 03 Sep 2010 21:54:54 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42032#M7807</guid>
      <dc:creator>Branden</dc:creator>
      <dc:date>2010-09-03T21:54:54Z</dc:date>
    </item>
    <item>
      <title>Re: duplicate index entries</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42033#M7808</link>
      <description>&lt;P&gt;Yeah, that sounds like a bug.&lt;/P&gt;</description>
      <pubDate>Sat, 04 Sep 2010 00:17:34 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42033#M7808</guid>
      <dc:creator>gkanapathy</dc:creator>
      <dc:date>2010-09-04T00:17:34Z</dc:date>
    </item>
    <item>
      <title>Re: duplicate index entries</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42034#M7809</link>
      <description>&lt;P&gt;Splunk support got back to me. My indexer is running 4.1.4, but my forwarders are running 4.0.9. 4.0.X and 3.X versions do not handle files/input over less than 256 bytes very well. Each errpt entry is 64 bytes. I will upgrade the forwarders next week and see what happens.&lt;/P&gt;</description>
      <pubDate>Sat, 04 Sep 2010 02:30:52 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42034#M7809</guid>
      <dc:creator>Branden</dc:creator>
      <dc:date>2010-09-04T02:30:52Z</dc:date>
    </item>
    <item>
      <title>Re: duplicate index entries</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42035#M7810</link>
      <description>&lt;P&gt;Upgraded one of the forwarders just now. Problem still exists. &lt;span class="lia-unicode-emoji" title=":disappointed_face:"&gt;😞&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Sat, 04 Sep 2010 02:57:29 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42035#M7810</guid>
      <dc:creator>Branden</dc:creator>
      <dc:date>2010-09-04T02:57:29Z</dc:date>
    </item>
    <item>
      <title>Re: duplicate index entries</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42036#M7811</link>
      <description>&lt;P&gt;Hmm, the 256 byte problem really only affects file inputs, it seems like something else entirely for script inputs.&lt;/P&gt;</description>
      <pubDate>Sat, 04 Sep 2010 23:19:40 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42036#M7811</guid>
      <dc:creator>gkanapathy</dc:creator>
      <dc:date>2010-09-04T23:19:40Z</dc:date>
    </item>
    <item>
      <title>Re: duplicate index entries</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42037#M7812</link>
      <description>&lt;P&gt;Interesting point. I mentioned that to the support engineer. I'll see what they come back with. Thank you again for all your help.&lt;/P&gt;</description>
      <pubDate>Tue, 07 Sep 2010 20:32:28 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/duplicate-index-entries/m-p/42037#M7812</guid>
      <dc:creator>Branden</dc:creator>
      <dc:date>2010-09-07T20:32:28Z</dc:date>
    </item>
  </channel>
</rss>

