<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Splunk initial indexing not behaving as expected in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Splunk-initial-indexing-not-behaving-as-expected/m-p/45480#M8526</link>
    <description>&lt;P&gt;A different configuration (&lt;CODE&gt;inputs.conf&lt;/CODE&gt;) is looking at the same files but with a more specific path/file declaration.  Try using &lt;CODE&gt;btool&lt;/CODE&gt; to list out all &lt;CODE&gt;inputs.conf&lt;/CODE&gt; settings.&lt;/P&gt;</description>
    <pubDate>Fri, 29 May 2015 05:55:45 GMT</pubDate>
    <dc:creator>woodcock</dc:creator>
    <dc:date>2015-05-29T05:55:45Z</dc:date>
    <item>
      <title>Splunk initial indexing not behaving as expected</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Splunk-initial-indexing-not-behaving-as-expected/m-p/45479#M8525</link>
      <description>&lt;P&gt;I'm setting up a fresh new Splunk server and am re-indexing my data from scratch.&lt;/P&gt;

&lt;P&gt;Syslog data is being sent to my syslog-ng server/Splunk indexer via UDP 514. Rather than being sent directly to Splunk, I have the syslog data distributed to a file system/directory structure that I instruct Splunk to "monitor". (i.e. /logs/&lt;EM&gt;hostname&lt;/EM&gt;/&lt;EM&gt;year&lt;/EM&gt;/&lt;EM&gt;month&lt;/EM&gt;/&lt;EM&gt;year&lt;/EM&gt;/&lt;EM&gt;day&lt;/EM&gt;/&lt;EM&gt;logfile&lt;/EM&gt;)&lt;/P&gt;

&lt;P&gt;My expectation was that the host name would be set to the &lt;EM&gt;hostname&lt;/EM&gt; set in the path of the file directory structure, and that everything coming in from the syslog would be set to sourcetype "syslog". Accordingly, here is my inputs.conf:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt; [monitor:///logs]
 disabled=false
 sourcetype=syslog
 host_segment=2
 blacklist=\.(bz2|gz)$
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;And 95% of my events are indexed correctly. &lt;/P&gt;

&lt;P&gt;Unfortunately, a few of my events aren't setting the host name correctly; it's using the non-FQDN as indicated in the syslog event itself for some older events (legacy reasons) rather than the name specified in the /logs/&lt;EM&gt;hostname&lt;/EM&gt; segment. &lt;/P&gt;

&lt;P&gt;Also, most events are set to "syslog" as instructed in inputs.conf except for dhcp events which are being set to sourcetype "dhcpd". While technically accurate, it's not what I instructed Splunk to do in inputs.conf. I would have expected everything coming in from the /logs monitor to be set to sourcetype="syslog".&lt;/P&gt;

&lt;P&gt;Is there a reason Splunk is over-riding my settings?&lt;/P&gt;

&lt;P&gt;Thanks!&lt;/P&gt;</description>
      <pubDate>Wed, 20 Jul 2011 14:05:04 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Splunk-initial-indexing-not-behaving-as-expected/m-p/45479#M8525</guid>
      <dc:creator>Branden</dc:creator>
      <dc:date>2011-07-20T14:05:04Z</dc:date>
    </item>
    <item>
      <title>Re: Splunk initial indexing not behaving as expected</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Splunk-initial-indexing-not-behaving-as-expected/m-p/45480#M8526</link>
      <description>&lt;P&gt;A different configuration (&lt;CODE&gt;inputs.conf&lt;/CODE&gt;) is looking at the same files but with a more specific path/file declaration.  Try using &lt;CODE&gt;btool&lt;/CODE&gt; to list out all &lt;CODE&gt;inputs.conf&lt;/CODE&gt; settings.&lt;/P&gt;</description>
      <pubDate>Fri, 29 May 2015 05:55:45 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Splunk-initial-indexing-not-behaving-as-expected/m-p/45480#M8526</guid>
      <dc:creator>woodcock</dc:creator>
      <dc:date>2015-05-29T05:55:45Z</dc:date>
    </item>
  </channel>
</rss>

