<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Splunk data duplicated when reading offset decreases in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Splunk-data-duplicated-when-reading-offset-decreases/m-p/47043#M8903</link>
    <description>&lt;P&gt;could you provide the details of how the input is configured?&lt;/P&gt;</description>
    <pubDate>Thu, 28 Feb 2013 16:11:15 GMT</pubDate>
    <dc:creator>Kate_Lawrence-G</dc:creator>
    <dc:date>2013-02-28T16:11:15Z</dc:date>
    <item>
      <title>Splunk data duplicated when reading offset decreases</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Splunk-data-duplicated-when-reading-offset-decreases/m-p/47042#M8902</link>
      <description>&lt;P&gt;We have a forwarder monitoring a log file and are seeing duplicated data indexed from that file (by a number of indexers within the autoLB group)&lt;/P&gt;

&lt;P&gt;I'm seeing the following in the splunkd.log file on the forwarder:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;splunkd.log:02-28-2013 13:42:34.044 +0000 INFO  WatchedFile - Will begin reading at offset=13142179 for file='&amp;lt;filename removed&amp;gt;'.
splunkd.log:02-28-2013 13:43:34.043 +0000 INFO  WatchedFile - Will begin reading at offset=13161047 for file='&amp;lt;filename removed&amp;gt;'.
splunkd.log:02-28-2013 13:43:44.092 +0000 INFO  WatchedFile - Will begin reading at offset=13138930 for file='&amp;lt;filename removed&amp;gt;'.
splunkd.log:02-28-2013 13:49:44.297 +0000 INFO  WatchedFile - Will begin reading at offset=13274923 for file='&amp;lt;filename removed&amp;gt;'.
splunkd.log:02-28-2013 13:50:34.333 +0000 INFO  WatchedFile - Will begin reading at offset=13329736 for file='&amp;lt;filename removed&amp;gt;'.
splunkd.log:02-28-2013 13:50:54.349 +0000 INFO  WatchedFile - Will begin reading at offset=13281747 for file='&amp;lt;filename removed&amp;gt;'.
splunkd.log:02-28-2013 13:51:04.367 +0000 INFO  WatchedFile - Will begin reading at offset=13281747 for file='&amp;lt;filename removed&amp;gt;'.
splunkd.log:02-28-2013 13:54:14.523 +0000 INFO  WatchedFile - Will begin reading at offset=13320589 for file='&amp;lt;filename removed&amp;gt;'.
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;As you can see the offset position for where to start reading the file decreases occasionally.&lt;/P&gt;

&lt;P&gt;Any suggestions as to what the issue may be? (I know our indexers are  a bit overloaded at present but I'm not seeing many failed ACKs in the log file)&lt;/P&gt;

&lt;P&gt;Edit:&lt;BR /&gt;
inputs.conf:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[monitor://&amp;lt;path to file removed&amp;gt;/access_vap*.log]
sourcetype = jboss-access-proxy
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;outputs.conf&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[tcpout]
defaultGroup = default-autolb-group
disabled = false
maxQueueSize = 6MB

[tcpout:default-autolb-group]
autoLB = true
disabled = false
server = &amp;lt;servernames&amp;gt;
useACK = true
&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Thu, 28 Feb 2013 15:17:30 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Splunk-data-duplicated-when-reading-offset-decreases/m-p/47042#M8902</guid>
      <dc:creator>samhughe</dc:creator>
      <dc:date>2013-02-28T15:17:30Z</dc:date>
    </item>
    <item>
      <title>Re: Splunk data duplicated when reading offset decreases</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Splunk-data-duplicated-when-reading-offset-decreases/m-p/47043#M8903</link>
      <description>&lt;P&gt;could you provide the details of how the input is configured?&lt;/P&gt;</description>
      <pubDate>Thu, 28 Feb 2013 16:11:15 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Splunk-data-duplicated-when-reading-offset-decreases/m-p/47043#M8903</guid>
      <dc:creator>Kate_Lawrence-G</dc:creator>
      <dc:date>2013-02-28T16:11:15Z</dc:date>
    </item>
    <item>
      <title>Re: Splunk data duplicated when reading offset decreases</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Splunk-data-duplicated-when-reading-offset-decreases/m-p/47044#M8904</link>
      <description>&lt;P&gt;Thanks, details added to the original post&lt;/P&gt;</description>
      <pubDate>Thu, 28 Feb 2013 16:36:06 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Splunk-data-duplicated-when-reading-offset-decreases/m-p/47044#M8904</guid>
      <dc:creator>samhughe</dc:creator>
      <dc:date>2013-02-28T16:36:06Z</dc:date>
    </item>
    <item>
      <title>Re: Splunk data duplicated when reading offset decreases</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Splunk-data-duplicated-when-reading-offset-decreases/m-p/47045#M8905</link>
      <description>&lt;P&gt;while not strictly necessary you may want to add &lt;BR /&gt;
followTail = 1 &lt;BR /&gt;
to your inputs.conf to ignore the older data which may be causing the issue.&lt;/P&gt;

&lt;P&gt;from the doc:&lt;/P&gt;

&lt;P&gt;followTail =1&lt;BR /&gt;
 Can be used to force splunk to skip past all current data for a given stanza. &lt;BR /&gt;
  * In more detail: this is intended to mean that if you start up splunk with a&lt;BR /&gt;
    stanza configured this way, all data in the file at the time it is first&lt;BR /&gt;
    encountered will not be read.  Only data arriving after that first&lt;BR /&gt;
    encounter time will be read.&lt;BR /&gt;
  * This can be used to "skip over" data from old log files, or old portions of&lt;BR /&gt;
    log files, to get started on current data right away&lt;/P&gt;</description>
      <pubDate>Fri, 01 Mar 2013 18:44:09 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Splunk-data-duplicated-when-reading-offset-decreases/m-p/47045#M8905</guid>
      <dc:creator>Kate_Lawrence-G</dc:creator>
      <dc:date>2013-03-01T18:44:09Z</dc:date>
    </item>
  </channel>
</rss>

