<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Host name extraction via regex on indexing - Only indexing a single file? in Splunk Search</title>
    <link>https://community.splunk.com/t5/Splunk-Search/Host-name-extraction-via-regex-on-indexing-Only-indexing-a/m-p/48238#M11519</link>
    <description>&lt;P&gt;Right... a bit of digging around has turned up the goods &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;

&lt;P&gt;Checked in &lt;B&gt;/var/splunk/logs/splunk/splunkd.log&lt;/B&gt; which had multiple instances of the following:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;02-23-2011 11:51:25.673 ERROR TailingProcessor - Ignoring path due to: File will not be read, seekptr checksum did not match (file=/var/splunk/input/mms_logs/mms_10.152.58.196_20110206_211500_28809).  Last time we saw this initcrc, filename was different.  You may wish to use a CRC salt on this source.  Consult the documentation or file a support case online at &lt;A href="http://www.splunk.com/page/submit_issue" target="test_blank"&gt;http://www.splunk.com/page/submit_issue&lt;/A&gt; for more info.
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;Now one thing I neglected to mention (and had no idea would be an issue) is that this data had previously been indexed, however I cleaned out the index with:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;/opt/splunk/bin/splunk clean eventdata -index main
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;After searching for "&lt;EM&gt;ERROR TailingProcessor - Ignoring path due to: File will not be read, seekptr checksum did not match&lt;/EM&gt;", I was directed to this question on answers.splunk.com:&lt;/P&gt;

&lt;P&gt;&lt;A href="http://answers.splunk.com/questions/1568/windows-dhcp-log-files-too-small-to-match-seekptr-checksum" rel="nofollow"&gt;http://answers.splunk.com/questions/1568/windows-dhcp-log-files-too-small-to-match-seekptr-checksum&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;Adding &lt;STRONG&gt;crcSalt = &amp;lt;SOURCE&amp;gt;&lt;/STRONG&gt; to the bottom of my inputs.conf and restarting Splunk solved the issue. Now to get my head around why this was needed...&lt;/P&gt;</description>
    <pubDate>Wed, 23 Feb 2011 10:10:06 GMT</pubDate>
    <dc:creator>rturk</dc:creator>
    <dc:date>2011-02-23T10:10:06Z</dc:date>
    <item>
      <title>Host name extraction via regex on indexing - Only indexing a single file?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Host-name-extraction-via-regex-on-indexing-Only-indexing-a/m-p/48236#M11517</link>
      <description>&lt;P&gt;Greetings fellow Splunkers,&lt;/P&gt;

&lt;P&gt;I'm having some issues with extracting the correct host name from log file names on indexing.&lt;/P&gt;

&lt;P&gt;I keep my log files in the following directory:&lt;/P&gt;

&lt;P&gt;&lt;B&gt;/var/splunk/input/mms_logs/&lt;/B&gt;&lt;/P&gt;

&lt;P&gt;The filename structure is:&lt;/P&gt;

&lt;P&gt;&lt;B&gt;mms_HOST-IP-ADDRESS_TIMESTAMP.log&lt;/B&gt;&lt;/P&gt;

&lt;P&gt;examples:&lt;/P&gt;

&lt;P&gt;&lt;B&gt;mms_10.152.58.100_20110101_004000_06137.log&lt;BR /&gt;
mms_10.152.58.194_20110121_120000_70656.log&lt;/B&gt;&lt;/P&gt;

&lt;P&gt;All log files have identical file permissions: &lt;B&gt;(rw-r--r--)&lt;/B&gt;.&lt;/P&gt;

&lt;P&gt;Now to extract the IP address portion of filename as a host, I used the following regex:&lt;/P&gt;

&lt;P&gt;&lt;B&gt;/var/splunk/input/mms_logs/mms_(\d+.\d+.\d+.\d+)_\d+&lt;/B&gt;&lt;/P&gt;

&lt;P&gt;Now while that works... it only seems to extract the hostname &amp;amp; event data from a single file. I have a single source, sourcetype, and host.&lt;/P&gt;

&lt;P&gt;This is despite 10,000+ files being in the directory, and the list of data inputs showing it has detected 12055 files in the directory :confused face:&lt;/P&gt;

&lt;P&gt;If someone can shed some light on this it would be greatly appreciated &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 22 Feb 2011 11:25:29 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Host-name-extraction-via-regex-on-indexing-Only-indexing-a/m-p/48236#M11517</guid>
      <dc:creator>rturk</dc:creator>
      <dc:date>2011-02-22T11:25:29Z</dc:date>
    </item>
    <item>
      <title>Re: Host name extraction via regex on indexing - Only indexing a single file?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Host-name-extraction-via-regex-on-indexing-Only-indexing-a/m-p/48237#M11518</link>
      <description>&lt;P&gt;I would like to see your inputs.conf, props.conf, transforms.conf, if possible &lt;span class="lia-unicode-emoji" title=":winking_face:"&gt;😉&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Tue, 22 Feb 2011 23:54:10 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Host-name-extraction-via-regex-on-indexing-Only-indexing-a/m-p/48237#M11518</guid>
      <dc:creator>meno</dc:creator>
      <dc:date>2011-02-22T23:54:10Z</dc:date>
    </item>
    <item>
      <title>Re: Host name extraction via regex on indexing - Only indexing a single file?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Host-name-extraction-via-regex-on-indexing-Only-indexing-a/m-p/48238#M11519</link>
      <description>&lt;P&gt;Right... a bit of digging around has turned up the goods &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;

&lt;P&gt;Checked in &lt;B&gt;/var/splunk/logs/splunk/splunkd.log&lt;/B&gt; which had multiple instances of the following:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;02-23-2011 11:51:25.673 ERROR TailingProcessor - Ignoring path due to: File will not be read, seekptr checksum did not match (file=/var/splunk/input/mms_logs/mms_10.152.58.196_20110206_211500_28809).  Last time we saw this initcrc, filename was different.  You may wish to use a CRC salt on this source.  Consult the documentation or file a support case online at &lt;A href="http://www.splunk.com/page/submit_issue" target="test_blank"&gt;http://www.splunk.com/page/submit_issue&lt;/A&gt; for more info.
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;Now one thing I neglected to mention (and had no idea would be an issue) is that this data had previously been indexed, however I cleaned out the index with:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;/opt/splunk/bin/splunk clean eventdata -index main
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;After searching for "&lt;EM&gt;ERROR TailingProcessor - Ignoring path due to: File will not be read, seekptr checksum did not match&lt;/EM&gt;", I was directed to this question on answers.splunk.com:&lt;/P&gt;

&lt;P&gt;&lt;A href="http://answers.splunk.com/questions/1568/windows-dhcp-log-files-too-small-to-match-seekptr-checksum" rel="nofollow"&gt;http://answers.splunk.com/questions/1568/windows-dhcp-log-files-too-small-to-match-seekptr-checksum&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;Adding &lt;STRONG&gt;crcSalt = &amp;lt;SOURCE&amp;gt;&lt;/STRONG&gt; to the bottom of my inputs.conf and restarting Splunk solved the issue. Now to get my head around why this was needed...&lt;/P&gt;</description>
      <pubDate>Wed, 23 Feb 2011 10:10:06 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Host-name-extraction-via-regex-on-indexing-Only-indexing-a/m-p/48238#M11519</guid>
      <dc:creator>rturk</dc:creator>
      <dc:date>2011-02-23T10:10:06Z</dc:date>
    </item>
    <item>
      <title>Re: Host name extraction via regex on indexing - Only indexing a single file?</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Host-name-extraction-via-regex-on-indexing-Only-indexing-a/m-p/48239#M11520</link>
      <description>&lt;P&gt;Hey meno &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt; Sorry for not getting back to you earlier, but I didn't notice the small text below my question. I managed to find the answer elsewhere and I have some information to get further clarification.&lt;/P&gt;

&lt;P&gt;Thanks &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 23 Feb 2011 10:11:38 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Host-name-extraction-via-regex-on-indexing-Only-indexing-a/m-p/48239#M11520</guid>
      <dc:creator>rturk</dc:creator>
      <dc:date>2011-02-23T10:11:38Z</dc:date>
    </item>
  </channel>
</rss>

