<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: splunk's internal logs for indexing process in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/splunk-s-internal-logs-for-indexing-process/m-p/18973#M2638</link>
    <description>&lt;P&gt;What is the other sourcetype.  is it maybe of the form &lt;CODE&gt;&amp;lt;filename&amp;gt;_too_small&lt;/CODE&gt;?   If so, then every now and then one of the files being auto-sourcetyped is just too short for the auto-sourcetyping to work correctly.   &lt;/P&gt;</description>
    <pubDate>Wed, 23 Nov 2011 19:53:47 GMT</pubDate>
    <dc:creator>sideview</dc:creator>
    <dc:date>2011-11-23T19:53:47Z</dc:date>
    <item>
      <title>splunk's internal logs for indexing process</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/splunk-s-internal-logs-for-indexing-process/m-p/18970#M2635</link>
      <description>&lt;P&gt;hi...&lt;BR /&gt;
I have huge amount of data to be indexed and all of them are being indexed with 1 log format (sourcetype) but suddenly, in few lines splunk suddenly switches to another sourcetype for that particular number of lines only and when I looked through the log data they had exactly same format that other had.&lt;BR /&gt;
so now, I need to access the internal logs of splunk to identify what happened during indexing data into splunk's database. i know that splunk stores its own log files in $splunk_home/var/log/splunk but what i cant find is the log file specifically related to indexing process.&lt;/P&gt;</description>
      <pubDate>Fri, 18 Nov 2011 02:59:32 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/splunk-s-internal-logs-for-indexing-process/m-p/18970#M2635</guid>
      <dc:creator>nina15</dc:creator>
      <dc:date>2011-11-18T02:59:32Z</dc:date>
    </item>
    <item>
      <title>Re: splunk's internal logs for indexing process</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/splunk-s-internal-logs-for-indexing-process/m-p/18971#M2636</link>
      <description>&lt;P&gt;Hi, &lt;/P&gt;

&lt;P&gt;most likely, Splunks own logs are also indexed in the &lt;CODE&gt;_internal&lt;/CODE&gt; index. You can search it just like the other indexes. However, there is a possibility that whatever happened to your log parsing/indexing has not been logged by Splunk. &lt;/P&gt;

&lt;P&gt;You may have to change the logging level in order to see this, e.g. from WARN to INFO or DEBUG. This is done in Manager -&amp;gt; System Settings -&amp;gt; System Logging. Unfortunately I don't know just which of the 400+ items should be changed. &lt;/P&gt;

&lt;P&gt;On a side note, did you specify a sourcetype in your inputs.conf (or via the GUI), or did Splunk auto-assign it? &lt;/P&gt;

&lt;P&gt;Also, a bit more information regarding the sourcetypes involved, along with some sample data would be good.&lt;/P&gt;

&lt;P&gt;UPDATE:&lt;/P&gt;

&lt;P&gt;As nick points out, if the new sourcetype is ...-too-small, then the file in question is too short for Splunks auto-sourcetyping to work properly. &lt;/P&gt;

&lt;P&gt;If the new sourcetype is a "numbered" version of the original sourcetype, e.g. iis-2 or iis-3, means that Splunk thinks that it's the same format, but slightly different. This can happen for CSV log files where the header row changes. By default, I believe that Splunk expects a header row for CSV files.&lt;/P&gt;

&lt;P&gt;I guess that this problem of yours only occurs on a per-file basis and not in the middle of a file, i.e. some of your files get indexed as the "wrong" sourcetype, but most do not.&lt;/P&gt;

&lt;P&gt;Please provide the first three rows of &lt;BR /&gt;
a) a correctly sourcetyped file and &lt;BR /&gt;
b) an incorrectly sourcetyped file.&lt;/P&gt;

&lt;P&gt;Don't forget to mask IP/usernames/hostnames as needed.&lt;/P&gt;

&lt;P&gt;Hope this helps,&lt;/P&gt;

&lt;P&gt;Kristian&lt;/P&gt;</description>
      <pubDate>Fri, 18 Nov 2011 08:56:07 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/splunk-s-internal-logs-for-indexing-process/m-p/18971#M2636</guid>
      <dc:creator>kristian_kolb</dc:creator>
      <dc:date>2011-11-18T08:56:07Z</dc:date>
    </item>
    <item>
      <title>Re: splunk's internal logs for indexing process</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/splunk-s-internal-logs-for-indexing-process/m-p/18972#M2637</link>
      <description>&lt;P&gt;thanks for your comprehensive answer. the sourcetype is auto-assigned actually...&lt;BR /&gt;
ok I have to try your says and see what happens. but for now, all i can say is that the log files are some comma separated events, and there are absolutely no difference in format between those in first sourcetype and those in second...&lt;/P&gt;</description>
      <pubDate>Mon, 21 Nov 2011 04:39:09 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/splunk-s-internal-logs-for-indexing-process/m-p/18972#M2637</guid>
      <dc:creator>nina15</dc:creator>
      <dc:date>2011-11-21T04:39:09Z</dc:date>
    </item>
    <item>
      <title>Re: splunk's internal logs for indexing process</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/splunk-s-internal-logs-for-indexing-process/m-p/18973#M2638</link>
      <description>&lt;P&gt;What is the other sourcetype.  is it maybe of the form &lt;CODE&gt;&amp;lt;filename&amp;gt;_too_small&lt;/CODE&gt;?   If so, then every now and then one of the files being auto-sourcetyped is just too short for the auto-sourcetyping to work correctly.   &lt;/P&gt;</description>
      <pubDate>Wed, 23 Nov 2011 19:53:47 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/splunk-s-internal-logs-for-indexing-process/m-p/18973#M2638</guid>
      <dc:creator>sideview</dc:creator>
      <dc:date>2011-11-23T19:53:47Z</dc:date>
    </item>
    <item>
      <title>Re: splunk's internal logs for indexing process</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/splunk-s-internal-logs-for-indexing-process/m-p/18974#M2639</link>
      <description>&lt;P&gt;great clarifications... following nick's answer then kristian's update, yes the sourcetype is actually same name followed by a number (not -too-small since is not small).&lt;BR /&gt;
but weired part is that that happens actually from middle of 2 files only (from tail of one file until head of another file is just the other sourcetype)&lt;BR /&gt;
and it is CSV but I recently had to change all "," to tab-separated delimiters, and im trying to re-index, will give u real examples if it happenes again in this indexing...&lt;/P&gt;</description>
      <pubDate>Tue, 29 Nov 2011 03:58:52 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/splunk-s-internal-logs-for-indexing-process/m-p/18974#M2639</guid>
      <dc:creator>nina15</dc:creator>
      <dc:date>2011-11-29T03:58:52Z</dc:date>
    </item>
  </channel>
</rss>

