<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Log file is not getting indexed in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Log-file-is-not-getting-indexed/m-p/90495#M18773</link>
    <description>&lt;P&gt;You should probably read the following for guidance on how to skip indexing of some events.&lt;/P&gt;

&lt;P&gt;&lt;A href="http://docs.splunk.com/Documentation/Splunk/5.0.2/Deploy/Routeandfilterdatad#Keep_specific_events_and_discard_the_rest" target="_blank"&gt;http://docs.splunk.com/Documentation/Splunk/5.0.2/Deploy/Routeandfilterdatad#Keep_specific_events_and_discard_the_rest&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;In props.conf: &lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[your_sourcetype]
TRANSFORMS-set= setnull,setparsing
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;In transforms.conf: &lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[setnull]
REGEX = .
DEST_KEY = queue
FORMAT = nullQueue

[setparsing]
REGEX = some_string
DEST_KEY = queue
FORMAT = indexQueue
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;You'll have to replace 'some_sting' with a something that distinguishes the lines you want to keep, e.g. in your example "&lt;CODE&gt;phase&lt;/CODE&gt;" or "&lt;CODE&gt;curr_date&lt;/CODE&gt;" occur in the events you want to keep.&lt;/P&gt;

&lt;P&gt;When these transforms are called from props.conf, the order is important; first ALL events are set to be thrown away (setnull), then followed by the second transform (setparsing) that re-set the destination for the matching events from the &lt;CODE&gt;nullQueue&lt;/CODE&gt; back to the &lt;CODE&gt;indexQueue&lt;/CODE&gt;.&lt;/P&gt;

&lt;P&gt;Hope this helps,&lt;/P&gt;

&lt;P&gt;Kristian&lt;/P&gt;</description>
    <pubDate>Mon, 28 Sep 2020 13:42:55 GMT</pubDate>
    <dc:creator>kristian_kolb</dc:creator>
    <dc:date>2020-09-28T13:42:55Z</dc:date>
    <item>
      <title>Log file is not getting indexed</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Log-file-is-not-getting-indexed/m-p/90494#M18772</link>
      <description>&lt;P&gt;We have a custom application &lt;BR /&gt;
log file which looks something like below, this file is not getting indexed with the 1st 4 lines in it.&lt;BR /&gt;
These logs are generated for a number of similar programs the entries in "XXXXs..." will vary based on this&lt;/P&gt;

&lt;P&gt;=============================================================================================================================================================================================&lt;/P&gt;

&lt;P&gt;Application XXXXX XXXXX - YYYY BC and XX XXXXXX XXXXXX XXXXXX XX (XXX XX)&lt;BR /&gt;&lt;BR /&gt;
Application ---------------------------------------------------------------&lt;BR /&gt;&lt;BR /&gt;
Application XXXXX XXXXX - YYYY BC and XX XXXXXX XXXXXX XXXXXX XX (XXX XX)&lt;BR /&gt;&lt;BR /&gt;
Application ---------------------------------------------------------------&lt;BR /&gt;&lt;BR /&gt;
Application prg="XXXXX XXXXX- Receipt Creation Program" phase=Running status=Normal start_date="09-APR-2013 17:01:02" end_date="N/A" requestid=37541696 curr_time="09-APR-2013 17:15:13"&lt;BR /&gt;&lt;BR /&gt;
Application prg="XXXXX XXXXX- Receipt Creation Program" phase=Running status=Normal start_date="09-APR-2013 17:01:03" end_date="N/A" requestid=37541697 curr_time="09-APR-2013 17:15:13" &lt;/P&gt;

&lt;P&gt;==============================================================================================================================================================================================&lt;/P&gt;

&lt;P&gt;Same file (below) with the discarded lines works fine. &lt;/P&gt;

&lt;P&gt;==============================================================================================================================================================================================&lt;/P&gt;

&lt;P&gt;Application prg="XXXXX XXXXXX - Receipt Creation Program" phase=Running status=Normal start_date="09-APR-2013 17:01:02" end_date="N/A" requestid=37541696 curr_time="09-APR-2013 17:15:13"&lt;BR /&gt;&lt;BR /&gt;
Application prg="XXXXX XXXXXX- Receipt Creation Program" phase=Running status=Normal start_date="09-APR-2013 17:01:03" end_date="N/A" requestid=37541697 curr_time="09-APR-2013 17:15:13" &lt;/P&gt;

&lt;P&gt;===============================================================================================================================================================================================&lt;/P&gt;

&lt;P&gt;Is it possible to ignore or omit the 4 lines for the file to be indexed since it is not going to be possible to remove the entries from the applciation side to remove these lines.&lt;BR /&gt;
Thanks&lt;/P&gt;</description>
      <pubDate>Mon, 28 Sep 2020 13:42:52 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Log-file-is-not-getting-indexed/m-p/90494#M18772</guid>
      <dc:creator>yogonline</dc:creator>
      <dc:date>2020-09-28T13:42:52Z</dc:date>
    </item>
    <item>
      <title>Re: Log file is not getting indexed</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Log-file-is-not-getting-indexed/m-p/90495#M18773</link>
      <description>&lt;P&gt;You should probably read the following for guidance on how to skip indexing of some events.&lt;/P&gt;

&lt;P&gt;&lt;A href="http://docs.splunk.com/Documentation/Splunk/5.0.2/Deploy/Routeandfilterdatad#Keep_specific_events_and_discard_the_rest" target="_blank"&gt;http://docs.splunk.com/Documentation/Splunk/5.0.2/Deploy/Routeandfilterdatad#Keep_specific_events_and_discard_the_rest&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;In props.conf: &lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[your_sourcetype]
TRANSFORMS-set= setnull,setparsing
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;In transforms.conf: &lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[setnull]
REGEX = .
DEST_KEY = queue
FORMAT = nullQueue

[setparsing]
REGEX = some_string
DEST_KEY = queue
FORMAT = indexQueue
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;You'll have to replace 'some_sting' with a something that distinguishes the lines you want to keep, e.g. in your example "&lt;CODE&gt;phase&lt;/CODE&gt;" or "&lt;CODE&gt;curr_date&lt;/CODE&gt;" occur in the events you want to keep.&lt;/P&gt;

&lt;P&gt;When these transforms are called from props.conf, the order is important; first ALL events are set to be thrown away (setnull), then followed by the second transform (setparsing) that re-set the destination for the matching events from the &lt;CODE&gt;nullQueue&lt;/CODE&gt; back to the &lt;CODE&gt;indexQueue&lt;/CODE&gt;.&lt;/P&gt;

&lt;P&gt;Hope this helps,&lt;/P&gt;

&lt;P&gt;Kristian&lt;/P&gt;</description>
      <pubDate>Mon, 28 Sep 2020 13:42:55 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Log-file-is-not-getting-indexed/m-p/90495#M18773</guid>
      <dc:creator>kristian_kolb</dc:creator>
      <dc:date>2020-09-28T13:42:55Z</dc:date>
    </item>
    <item>
      <title>Re: Log file is not getting indexed</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Log-file-is-not-getting-indexed/m-p/90496#M18774</link>
      <description>&lt;P&gt;If I read your question correct, your file is not indexed.&lt;BR /&gt;
Am I right in assuming, that the start of the file is identical with other files for the first 256 bytes?&lt;BR /&gt;&lt;BR /&gt;
You might see some lines in splunkd.log that looks like this: "File will not be read, is too small to match seekptr checksum" I use the following search:  &lt;/P&gt;

&lt;P&gt;index=_internal source=*splunkd.log "File will not be read, is too small to match seekptr checksum" component="TailingProcessor" | dedup host file | table _time host file | sort host  &lt;/P&gt;

&lt;P&gt;Try to look at initCrcLength in inputs.conf, this option came in 5.0.1&lt;/P&gt;</description>
      <pubDate>Thu, 11 Apr 2013 12:06:19 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Log-file-is-not-getting-indexed/m-p/90496#M18774</guid>
      <dc:creator>las</dc:creator>
      <dc:date>2013-04-11T12:06:19Z</dc:date>
    </item>
    <item>
      <title>Re: Log file is not getting indexed</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Log-file-is-not-getting-indexed/m-p/90497#M18775</link>
      <description>&lt;P&gt;Oh..reading your answer, I think you understood better what the problem may be.&lt;/P&gt;</description>
      <pubDate>Sat, 13 Apr 2013 01:39:00 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Log-file-is-not-getting-indexed/m-p/90497#M18775</guid>
      <dc:creator>kristian_kolb</dc:creator>
      <dc:date>2013-04-13T01:39:00Z</dc:date>
    </item>
  </channel>
</rss>

