<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Does batch input ignore files already read? in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Does-batch-input-ignore-files-already-read/m-p/95691#M19948</link>
    <description>&lt;P&gt;I have a file in a folder that is being monitored by Splunk. Its contents have been indexed. If I move that file into a &lt;CODE&gt;batch&lt;/CODE&gt; input (read-and-delete), will the file be re-indexed, or will Splunk know it has already indexed this data already?&lt;/P&gt;

&lt;P&gt;(The specific scenario is I am changing a folder from &lt;CODE&gt;monitor://&lt;/CODE&gt; to &lt;CODE&gt;batch://&lt;/CODE&gt; and need to know if I need to remove all the files first to avoid data duplication in Splunk.)&lt;/P&gt;</description>
    <pubDate>Thu, 05 May 2011 17:49:22 GMT</pubDate>
    <dc:creator>Jason</dc:creator>
    <dc:date>2011-05-05T17:49:22Z</dc:date>
    <item>
      <title>Does batch input ignore files already read?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Does-batch-input-ignore-files-already-read/m-p/95691#M19948</link>
      <description>&lt;P&gt;I have a file in a folder that is being monitored by Splunk. Its contents have been indexed. If I move that file into a &lt;CODE&gt;batch&lt;/CODE&gt; input (read-and-delete), will the file be re-indexed, or will Splunk know it has already indexed this data already?&lt;/P&gt;

&lt;P&gt;(The specific scenario is I am changing a folder from &lt;CODE&gt;monitor://&lt;/CODE&gt; to &lt;CODE&gt;batch://&lt;/CODE&gt; and need to know if I need to remove all the files first to avoid data duplication in Splunk.)&lt;/P&gt;</description>
      <pubDate>Thu, 05 May 2011 17:49:22 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Does-batch-input-ignore-files-already-read/m-p/95691#M19948</guid>
      <dc:creator>Jason</dc:creator>
      <dc:date>2011-05-05T17:49:22Z</dc:date>
    </item>
    <item>
      <title>Re: Does batch input ignore files already read?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Does-batch-input-ignore-files-already-read/m-p/95692#M19949</link>
      <description>&lt;P&gt;so long as the policy is set to move to the sinkhole, Splunk should eat the file again.&lt;BR /&gt;
The input could look something like this: &lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[batch:///some/path/some_file]
move_policy = sinkhole
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;without the move_policy = sinkhole setting, it won't load the files destructively and will keep track of them. &lt;/P&gt;

&lt;P&gt;Hope this helps!&lt;/P&gt;</description>
      <pubDate>Thu, 05 May 2011 18:05:33 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Does-batch-input-ignore-files-already-read/m-p/95692#M19949</guid>
      <dc:creator>jbsplunk</dc:creator>
      <dc:date>2011-05-05T18:05:33Z</dc:date>
    </item>
    <item>
      <title>Re: Does batch input ignore files already read?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Does-batch-input-ignore-files-already-read/m-p/95693#M19950</link>
      <description>&lt;P&gt;Confirmed - Splunk does no checking and will re-index the file.&lt;/P&gt;</description>
      <pubDate>Thu, 05 May 2011 18:14:57 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Does-batch-input-ignore-files-already-read/m-p/95693#M19950</guid>
      <dc:creator>Jason</dc:creator>
      <dc:date>2011-05-05T18:14:57Z</dc:date>
    </item>
  </channel>
</rss>

