<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic issue with Splunk batch input in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/issue-with-Splunk-batch-input/m-p/494841#M84448</link>
    <description>&lt;P&gt;Hello All,&lt;/P&gt;

&lt;P&gt;I am ingesting compressed(.gz) log files into Splunk by putting it in $SPLUNK_HOME/var/spool/splunk folder. (i.e. when I put the file in this location, Splunk's default batch input will automatically ingest it in Splunk).&lt;/P&gt;

&lt;P&gt;when I put a file in this location, Splunk will calculate and maintain it's CRC value to identify the same file in the future.&lt;/P&gt;

&lt;P&gt;BUT, &lt;/P&gt;

&lt;P&gt;when I put a file with the same name but newer content appended at the end of the file, it prints the logs in splunkd.log like:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;03-10-2020 21:10:03.588 +0530 INFO  WatchedFile - **Will begin reading at offset=63969** for file='/opt/splunk8/splunk/var/spool/splunk/transaction-events-bfe8ae9a4041c5eaeea1663c583cbd54-72000-79200_0.gz'.
03-10-2020 21:10:13.589 +0530 INFO  TailReader - Archive file='/opt/splunk8/splunk/var/spool/splunk/transaction-events-bfe8ae9a4041c5eaeea1663c583cbd54-72000-79200_0.gz' has stopped changing, will read it now. 
03-10-2020 21:10:13.589 +0530 INFO  ArchiveProcessor - Handling file=/opt/splunk8/splunk/var/spool/splunk/transaction-events-bfe8ae9a4041c5eaeea1663c583cbd54-72000-79200_0.gz 
03-10-2020 21:10:13.590 +0530 INFO  ArchiveProcessor - reading path=/opt/splunk8/splunk/var/spool/splunk/transaction-events-bfe8ae9a4041c5eaeea1663c583cbd54-72000-79200_0.gz (seek=63969 len=77924)
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;So, According to the logs, Splunk should ingest only newer content of that file.&lt;BR /&gt;
But, when I search in Splunk, It is ingesting the whole file again instead of ingesting only newer content.&lt;/P&gt;

&lt;P&gt;Does anyone have any idea about this?&lt;/P&gt;</description>
    <pubDate>Thu, 12 Mar 2020 05:15:22 GMT</pubDate>
    <dc:creator>bharat097</dc:creator>
    <dc:date>2020-03-12T05:15:22Z</dc:date>
    <item>
      <title>issue with Splunk batch input</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/issue-with-Splunk-batch-input/m-p/494841#M84448</link>
      <description>&lt;P&gt;Hello All,&lt;/P&gt;

&lt;P&gt;I am ingesting compressed(.gz) log files into Splunk by putting it in $SPLUNK_HOME/var/spool/splunk folder. (i.e. when I put the file in this location, Splunk's default batch input will automatically ingest it in Splunk).&lt;/P&gt;

&lt;P&gt;when I put a file in this location, Splunk will calculate and maintain it's CRC value to identify the same file in the future.&lt;/P&gt;

&lt;P&gt;BUT, &lt;/P&gt;

&lt;P&gt;when I put a file with the same name but newer content appended at the end of the file, it prints the logs in splunkd.log like:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;03-10-2020 21:10:03.588 +0530 INFO  WatchedFile - **Will begin reading at offset=63969** for file='/opt/splunk8/splunk/var/spool/splunk/transaction-events-bfe8ae9a4041c5eaeea1663c583cbd54-72000-79200_0.gz'.
03-10-2020 21:10:13.589 +0530 INFO  TailReader - Archive file='/opt/splunk8/splunk/var/spool/splunk/transaction-events-bfe8ae9a4041c5eaeea1663c583cbd54-72000-79200_0.gz' has stopped changing, will read it now. 
03-10-2020 21:10:13.589 +0530 INFO  ArchiveProcessor - Handling file=/opt/splunk8/splunk/var/spool/splunk/transaction-events-bfe8ae9a4041c5eaeea1663c583cbd54-72000-79200_0.gz 
03-10-2020 21:10:13.590 +0530 INFO  ArchiveProcessor - reading path=/opt/splunk8/splunk/var/spool/splunk/transaction-events-bfe8ae9a4041c5eaeea1663c583cbd54-72000-79200_0.gz (seek=63969 len=77924)
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;So, According to the logs, Splunk should ingest only newer content of that file.&lt;BR /&gt;
But, when I search in Splunk, It is ingesting the whole file again instead of ingesting only newer content.&lt;/P&gt;

&lt;P&gt;Does anyone have any idea about this?&lt;/P&gt;</description>
      <pubDate>Thu, 12 Mar 2020 05:15:22 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/issue-with-Splunk-batch-input/m-p/494841#M84448</guid>
      <dc:creator>bharat097</dc:creator>
      <dc:date>2020-03-12T05:15:22Z</dc:date>
    </item>
    <item>
      <title>Re: issue with Splunk batch input</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/issue-with-Splunk-batch-input/m-p/494842#M84449</link>
      <description>&lt;P&gt;Open a support case for sure.  This does not smell right at all.&lt;/P&gt;</description>
      <pubDate>Sat, 14 Mar 2020 20:39:25 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/issue-with-Splunk-batch-input/m-p/494842#M84449</guid>
      <dc:creator>woodcock</dc:creator>
      <dc:date>2020-03-14T20:39:25Z</dc:date>
    </item>
  </channel>
</rss>

