<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic How to manage indexing rolling log files without duplicating data in the Index in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/How-to-manage-indexing-rolling-log-files-without-duplicating/m-p/59475#M11723</link>
    <description>&lt;P&gt;We are testing in a high throughput environment capturing logs that grow to 251MB in ~ 4-6 minutes at which time the logs are rolled to a dated log file.&lt;/P&gt;

&lt;P&gt;e.g. test.log -&amp;gt; test.log.20110315042946&lt;/P&gt;

&lt;P&gt;The problems is that Splunk thinks we have already indexed one or more of the rolled log files, and results in us missing data from the performance run. I have read about using the crcSalt but to avoid using that on rotating log files.&lt;/P&gt;

&lt;BLOCKQUOTE&gt;
  &lt;P&gt;03-15-2011 09:38:04.028 ERROR
  TailingProcessor - Ignoring path due
  to: File will not be read, seekptr
  checksum did not match
  (file=/opt/perf/gett/log/test.log.20110315091120).
  Last time we saw this initcrc,
  filename was different.  You may wish
  to use a CRC salt on this source. 
  Consult the documentation or contact
  Splunk Support for more info.&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;

&lt;P&gt;Can someone suggest how this problem can be managed?&lt;/P&gt;</description>
    <pubDate>Tue, 15 Mar 2011 23:51:16 GMT</pubDate>
    <dc:creator>ericrobinson</dc:creator>
    <dc:date>2011-03-15T23:51:16Z</dc:date>
    <item>
      <title>How to manage indexing rolling log files without duplicating data in the Index</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-manage-indexing-rolling-log-files-without-duplicating/m-p/59475#M11723</link>
      <description>&lt;P&gt;We are testing in a high throughput environment capturing logs that grow to 251MB in ~ 4-6 minutes at which time the logs are rolled to a dated log file.&lt;/P&gt;

&lt;P&gt;e.g. test.log -&amp;gt; test.log.20110315042946&lt;/P&gt;

&lt;P&gt;The problems is that Splunk thinks we have already indexed one or more of the rolled log files, and results in us missing data from the performance run. I have read about using the crcSalt but to avoid using that on rotating log files.&lt;/P&gt;

&lt;BLOCKQUOTE&gt;
  &lt;P&gt;03-15-2011 09:38:04.028 ERROR
  TailingProcessor - Ignoring path due
  to: File will not be read, seekptr
  checksum did not match
  (file=/opt/perf/gett/log/test.log.20110315091120).
  Last time we saw this initcrc,
  filename was different.  You may wish
  to use a CRC salt on this source. 
  Consult the documentation or contact
  Splunk Support for more info.&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;

&lt;P&gt;Can someone suggest how this problem can be managed?&lt;/P&gt;</description>
      <pubDate>Tue, 15 Mar 2011 23:51:16 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-manage-indexing-rolling-log-files-without-duplicating/m-p/59475#M11723</guid>
      <dc:creator>ericrobinson</dc:creator>
      <dc:date>2011-03-15T23:51:16Z</dc:date>
    </item>
    <item>
      <title>Re: How to manage indexing rolling log files without duplicating data in the Index</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-manage-indexing-rolling-log-files-without-duplicating/m-p/59476#M11724</link>
      <description>&lt;P&gt;Could you name the log file with the associated date / time value at the beginning rather than changing it afterwards?&lt;/P&gt;</description>
      <pubDate>Wed, 16 Mar 2011 03:07:32 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-manage-indexing-rolling-log-files-without-duplicating/m-p/59476#M11724</guid>
      <dc:creator>netwrkr</dc:creator>
      <dc:date>2011-03-16T03:07:32Z</dc:date>
    </item>
    <item>
      <title>Re: How to manage indexing rolling log files without duplicating data in the Index</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-manage-indexing-rolling-log-files-without-duplicating/m-p/59477#M11725</link>
      <description>&lt;P&gt;Are the files simply renamed when they are rolled? What is the inputs.conf stanza that you are using to monitor the files? &lt;/P&gt;</description>
      <pubDate>Wed, 16 Mar 2011 04:33:05 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-manage-indexing-rolling-log-files-without-duplicating/m-p/59477#M11725</guid>
      <dc:creator>gkanapathy</dc:creator>
      <dc:date>2011-03-16T04:33:05Z</dc:date>
    </item>
    <item>
      <title>Re: How to manage indexing rolling log files without duplicating data in the Index</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-manage-indexing-rolling-log-files-without-duplicating/m-p/59478#M11726</link>
      <description>&lt;P&gt;Hi All.. Thanks for the help. We found that the rolling log file was also being renamed by another log archiving process. &lt;/P&gt;

&lt;P&gt;What was happenning was the log would be rolled to test.log.1&lt;/P&gt;

&lt;P&gt;Then, the archving process would rename it to test.log.20110316&lt;/P&gt;

&lt;P&gt;We think that Splunk was seeing the log in the .1 format and when the file name changed to .2011*, the CRC had issues.&lt;/P&gt;

&lt;P&gt;After changing our inputs.conf, we are not seeing the issue..&lt;/P&gt;

&lt;P&gt;We were monitoring test.log* and now only monitor test.log and test.log.2011*&lt;/P&gt;</description>
      <pubDate>Thu, 17 Mar 2011 00:12:07 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-manage-indexing-rolling-log-files-without-duplicating/m-p/59478#M11726</guid>
      <dc:creator>ericrobinson</dc:creator>
      <dc:date>2011-03-17T00:12:07Z</dc:date>
    </item>
  </channel>
</rss>

