<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: avoid duplicate indexing in splunk in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/avoid-duplicate-indexing-in-splunk/m-p/144459#M29501</link>
    <description>&lt;P&gt;Hello,&lt;/P&gt;

&lt;P&gt;How are you loading the data? Via a Splunk file monitor input in inputs.conf?&lt;/P&gt;

&lt;P&gt;When monitoring files, Splunk forwarders and indexers remember where they stopped reading each file, that is, up to which line it has already been captured. The place where this is recorded is called the "fishbucket".&lt;/P&gt;

&lt;P&gt;You can read more here: &lt;BR /&gt;
&lt;A href="http://blogs.splunk.com/2008/08/14/what-is-this-fishbucket-thing/"&gt;http://blogs.splunk.com/2008/08/14/what-is-this-fishbucket-thing/&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;Br&lt;BR /&gt;
Matthias&lt;/P&gt;</description>
    <pubDate>Mon, 28 Apr 2014 10:53:16 GMT</pubDate>
    <dc:creator>Matthias_BY</dc:creator>
    <dc:date>2014-04-28T10:53:16Z</dc:date>
    <item>
      <title>avoid duplicate indexing in splunk</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/avoid-duplicate-indexing-in-splunk/m-p/144458#M29500</link>
      <description>&lt;P&gt;I have a scheduler that writes data to a log file every hour, and I use that log in Splunk. Each time the scheduler runs it adds some rows to the log, but when I query Splunk the result contains the existing rows twice plus the added rows. How can I avoid this duplicate indexing? The example below illustrates the problem.&lt;/P&gt;

&lt;P&gt;Before the scheduler runs, the log has 3 rows. After the run it adds 1 more row, so the log has 4 rows in total, but a Splunk query returns 7 rows (3*2+1). How can I avoid this? Please help.&lt;/P&gt;</description>
      <pubDate>Mon, 28 Apr 2014 09:00:13 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/avoid-duplicate-indexing-in-splunk/m-p/144458#M29500</guid>
      <dc:creator>c_sahil</dc:creator>
      <dc:date>2014-04-28T09:00:13Z</dc:date>
    </item>
    <item>
      <title>Re: avoid duplicate indexing in splunk</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/avoid-duplicate-indexing-in-splunk/m-p/144459#M29501</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;

&lt;P&gt;How are you loading the data? Via a Splunk file monitor input in inputs.conf?&lt;/P&gt;

&lt;P&gt;When monitoring files, Splunk forwarders and indexers remember where they stopped reading each file, that is, up to which line it has already been captured. The place where this is recorded is called the "fishbucket".&lt;/P&gt;
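
&lt;P&gt;To illustrate the idea, here is a rough Python sketch, not Splunk's actual implementation. It assumes the monitor identifies a file by a checksum of its first bytes and stores how far it has read; the names FishBucket, HEAD_LEN and read_new are hypothetical, and the 16-byte head is an illustrative value only. It shows why appending to a file yields only the new rows, while changing the start of the file makes the whole file look new.&lt;/P&gt;

&lt;PRE&gt;
```python
import zlib

HEAD_LEN = 16  # bytes of the file head used as its identity (illustrative value)


class FishBucket:
    """Toy checkpoint store: checksum of the file head -> bytes already read."""

    def __init__(self):
        self.offsets = {}

    def read_new(self, data: bytes) -> list:
        """Return only the lines that have not been read before."""
        key = zlib.crc32(data[:HEAD_LEN])
        # An unknown head checksum means the file is treated as brand new.
        start = self.offsets.get(key, 0)
        self.offsets[key] = len(data)
        return data[start:].decode().splitlines()


log = (
    b"2014-04-28 09:00 row1\n"
    b"2014-04-28 10:00 row2\n"
    b"2014-04-28 11:00 row3\n"
)
new_row = b"2014-04-28 12:00 row4\n"

# Case 1: the new row is appended, so the head of the file is unchanged.
bucket = FishBucket()
events_append = bucket.read_new(log) + bucket.read_new(log + new_row)

# Case 2: the new row is written at the top, so the head checksum changes
# and every row is read again.
bucket = FishBucket()
events_prepend = bucket.read_new(log) + bucket.read_new(new_row + log)

print(len(events_append))   # 4
print(len(events_prepend))  # 7
```
&lt;/PRE&gt;

&lt;P&gt;In this sketch the second case produces 3 + 4 = 7 events for a 4-row file, because the checkpoint is tied to the start of the file and no longer matches once that start changes.&lt;/P&gt;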

&lt;P&gt;You can read more here: &lt;BR /&gt;
&lt;A href="http://blogs.splunk.com/2008/08/14/what-is-this-fishbucket-thing/"&gt;http://blogs.splunk.com/2008/08/14/what-is-this-fishbucket-thing/&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;Br&lt;BR /&gt;
Matthias&lt;/P&gt;</description>
      <pubDate>Mon, 28 Apr 2014 10:53:16 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/avoid-duplicate-indexing-in-splunk/m-p/144459#M29501</guid>
      <dc:creator>Matthias_BY</dc:creator>
      <dc:date>2014-04-28T10:53:16Z</dc:date>
    </item>
    <item>
      <title>Re: avoid duplicate indexing in splunk</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/avoid-duplicate-indexing-in-splunk/m-p/144460#M29502</link>
      <description>&lt;P&gt;Thanks Matthias...&lt;BR /&gt;
I fixed the issue after reading the blog: I was adding new data at the start of the log instead of appending it at the bottom.&lt;BR /&gt;
Thanks a lot.&lt;/P&gt;</description>
      <pubDate>Mon, 28 Apr 2014 13:27:06 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/avoid-duplicate-indexing-in-splunk/m-p/144460#M29502</guid>
      <dc:creator>c_sahil</dc:creator>
      <dc:date>2014-04-28T13:27:06Z</dc:date>
    </item>
    <item>
      <title>Re: avoid duplicate indexing in splunk</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/avoid-duplicate-indexing-in-splunk/m-p/144461#M29503</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;

&lt;P&gt;Great. Just to be clear: the application writing the log that Splunk monitors was adding new lines at the top of the file instead of appending them at the bottom, right?&lt;/P&gt;</description>
      <pubDate>Mon, 28 Apr 2014 14:16:12 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/avoid-duplicate-indexing-in-splunk/m-p/144461#M29503</guid>
      <dc:creator>Matthias_BY</dc:creator>
      <dc:date>2014-04-28T14:16:12Z</dc:date>
    </item>
  </channel>
</rss>

