<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: not indexing 1 of 4 logs in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/not-indexing-1-of-4-logs/m-p/73149#M14956</link>
    <description>&lt;P&gt;What do you get if you do a search for source="&lt;EM&gt;/FEB14/&lt;/EM&gt;" with the time range of all time?&lt;/P&gt;

&lt;P&gt;EDIT:  THat's source="&lt;ASTRICKS&gt;/FEB14/&lt;ASTRICKS&gt;"  for some reason it converted it italics..o_O&lt;/ASTRICKS&gt;&lt;/ASTRICKS&gt;&lt;/P&gt;

&lt;P&gt;Brian&lt;/P&gt;</description>
    <pubDate>Fri, 17 Feb 2012 19:21:09 GMT</pubDate>
    <dc:creator>Brian_Osburn</dc:creator>
    <dc:date>2012-02-17T19:21:09Z</dc:date>
    <item>
      <title>not indexing 1 of 4 logs</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/not-indexing-1-of-4-logs/m-p/73145#M14952</link>
      <description>&lt;P&gt;I'm not really sure where else to look at troubleshooting this problem below:&lt;/P&gt;

&lt;P&gt;I have 4 data-input directories that are being watched by Splunk. Two hosts dirs, each with an apache and a tomcat dir. Once each day at midnite EST  logs are copied here into dated dirs. One directory and it's log is routinely ignored it seems from what I see doing a meta search on the index for the sources. &lt;/P&gt;

&lt;P&gt;I have tried deleting and re-adding the dir path in Splunk Manager (4.3). These dirs and files all have 644 and root:root and Splunk runs as root.&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;/opt/prod_log_spool/sjctriwa04/apache/FEB14  &amp;lt;&amp;lt; ignored 
/opt/prod_log_spool/sjctriwa04/tomcat/FEB14
/opt/prod_log_spool/sjctriwa03/apache/FEB14
/opt/prod_log_spool/sjctriwa03/tomcat/FEB14

/opt/prod_log_spool/sjctriwa04/apache/FEB15  &amp;lt;&amp;lt; ignored 
/opt/prod_log_spool/sjctriwa04/tomcat/FEB15
/opt/prod_log_spool/sjctriwa03/apache/FEB15
/opt/prod_log_spool/sjctriwa03/tomcat/FEB15
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;for example&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;# /opt/splunk/bin/splunk search "| metadata type=sources index="logprod*" earliest=-2d" | awk '/FEB14/ {print $4}'
/opt/prod_log_spool/sjctriwa03/tomcat/FEB14/catalina.out
/opt/prod_log_spool/sjctriwa04/tomcat/FEB14/catalina.out
/opt/prod_log_spool/sjctriwa03/apache/FEB14/extended_log
# 
# /opt/splunk/bin/splunk search "| metadata type=sources index="hznprod*" earliest=-2d" | awk '/FEB15/ {print $4}'
/opt/prod_log_spool/sjctriwa03/tomcat/FEB15/catalina.out
/opt/prod_log_spool/sjctriwa04/tomcat/FEB15/catalina.out
/opt/prod_log_spool/sjctriwa03/apache/FEB15/extended_log
# 
&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Thu, 16 Feb 2012 20:28:53 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/not-indexing-1-of-4-logs/m-p/73145#M14952</guid>
      <dc:creator>pcorchary</dc:creator>
      <dc:date>2012-02-16T20:28:53Z</dc:date>
    </item>
    <item>
      <title>Re: not indexing 1 of 4 logs</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/not-indexing-1-of-4-logs/m-p/73146#M14953</link>
      <description>&lt;P&gt;Anyone have any ideas? I don't even know where to look next ...&lt;/P&gt;</description>
      <pubDate>Fri, 17 Feb 2012 18:03:17 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/not-indexing-1-of-4-logs/m-p/73146#M14953</guid>
      <dc:creator>pcorchary</dc:creator>
      <dc:date>2012-02-17T18:03:17Z</dc:date>
    </item>
    <item>
      <title>Re: not indexing 1 of 4 logs</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/not-indexing-1-of-4-logs/m-p/73147#M14954</link>
      <description>&lt;P&gt;Do you see anything in $SPLUNK_HOME/var/log/splunkd.log on that server?  &lt;/P&gt;

&lt;P&gt;If you go to https://&lt;SERVER_NAME&gt;:8089/services/admin/inputstatus/TailingProcessor:FileStatus you should see what the server status is around what files it has open.&lt;/SERVER_NAME&gt;&lt;/P&gt;

&lt;P&gt;Another good resource is &lt;A href="http://blogs.splunk.com/2011/01/02/did-i-miss-christmas-2/"&gt;http://blogs.splunk.com/2011/01/02/did-i-miss-christmas-2/&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 17 Feb 2012 18:29:13 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/not-indexing-1-of-4-logs/m-p/73147#M14954</guid>
      <dc:creator>Brian_Osburn</dc:creator>
      <dc:date>2012-02-17T18:29:13Z</dc:date>
    </item>
    <item>
      <title>Re: not indexing 1 of 4 logs</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/not-indexing-1-of-4-logs/m-p/73148#M14955</link>
      <description>&lt;P&gt;Thanks Brian - That 'FileStatus' info is super helpful. I wasn't aware of that. And puzzeling it shows that the file was read.... I'm not sure what to make of that, because Splunk still shows it was not in the index, using the meta search. And in the splunk log (all), I see plenty of entries about FEB14/tomcat/catalina.log for both servers, but NONE for apache/extended.log ... but I can see from the meta output that one server's apache/extended.log WAS read ... ok. now i'm even more confused.&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;&amp;gt; /opt/prod_log_spool/sjctricwa04p/apache/FEB14 
&amp;gt; parent    /opt/prod_log_spool/sjctricwa04p/apache
&amp;gt; type  directory
&amp;gt; /opt/prod_log_spool/sjctricwa04p/apache/FEB14/extended_log    
&amp;gt; file position 111741482 file
&amp;gt; size  111741482
&amp;gt; parent    /opt/prod_log_spool/sjctricwa04p/apache
&amp;gt; percent   100.00 type finished reading
&amp;gt; /opt/prod_log_spool/sjctricwa04p/apache/FEB15 
&amp;gt; parent    /opt/prod_log_spool/sjctricwa04p/apache
&amp;gt; type  directory
&amp;gt; /opt/prod_log_spool/sjctricwa04p/apache/FEB15/extended_log    
&amp;gt; file position 117054516 file
&amp;gt; size  117054516
&amp;gt; parent    /opt/prod_log_spool/sjctricwa04p/apache
&amp;gt; percent   100.00 type finished reading

# /opt/splunk/bin/splunk search "| metadata type=sources index="hznprod*" earliest=-2d" | grep FEB14
1328623241 1329281946 1329321589 /opt/prod_log_spool/sjctricwa03p/tomcat/FEB14/catalina.out                1035125 sources
1329177600 1329281953 1329322086 /opt/prod_log_spool/sjctricwa04p/tomcat/FEB14/catalina.out                 786746 sources
1329195601 1329281999 1329321250 /opt/prod_log_spool/sjctricwa03p/apache/FEB14/extended_log                 318495 sources
# 
&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Fri, 17 Feb 2012 18:58:26 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/not-indexing-1-of-4-logs/m-p/73148#M14955</guid>
      <dc:creator>pcorchary</dc:creator>
      <dc:date>2012-02-17T18:58:26Z</dc:date>
    </item>
    <item>
      <title>Re: not indexing 1 of 4 logs</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/not-indexing-1-of-4-logs/m-p/73149#M14956</link>
      <description>&lt;P&gt;What do you get if you do a search for source="&lt;EM&gt;/FEB14/&lt;/EM&gt;" with the time range of all time?&lt;/P&gt;

&lt;P&gt;EDIT:  THat's source="&lt;ASTRICKS&gt;/FEB14/&lt;ASTRICKS&gt;"  for some reason it converted it italics..o_O&lt;/ASTRICKS&gt;&lt;/ASTRICKS&gt;&lt;/P&gt;

&lt;P&gt;Brian&lt;/P&gt;</description>
      <pubDate>Fri, 17 Feb 2012 19:21:09 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/not-indexing-1-of-4-logs/m-p/73149#M14956</guid>
      <dc:creator>Brian_Osburn</dc:creator>
      <dc:date>2012-02-17T19:21:09Z</dc:date>
    </item>
    <item>
      <title>Re: not indexing 1 of 4 logs</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/not-indexing-1-of-4-logs/m-p/73150#M14957</link>
      <description>&lt;P&gt;nothing ... or am I missing something here in concept or syntax?&lt;BR /&gt;
like this?&lt;/P&gt;

&lt;H1&gt;/opt/splunk/bin/splunk search source="&lt;ASTERIX&gt;/FEB14/&lt;ASTERIX&gt;"&lt;/ASTERIX&gt;&lt;/ASTERIX&gt;&lt;/H1&gt;

&lt;P&gt;or like this&lt;/P&gt;

&lt;H1&gt;/opt/splunk/bin/splunk search "| metadata type=sources index="hznprod_apache" source="&lt;ASTERIX&gt;/FEB14/&lt;ASTERIX&gt;"&lt;/ASTERIX&gt;&lt;/ASTERIX&gt;&lt;/H1&gt;

&lt;P&gt;or something else ... ???&lt;/P&gt;</description>
      <pubDate>Fri, 17 Feb 2012 21:45:40 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/not-indexing-1-of-4-logs/m-p/73150#M14957</guid>
      <dc:creator>pcorchary</dc:creator>
      <dc:date>2012-02-17T21:45:40Z</dc:date>
    </item>
    <item>
      <title>Re: not indexing 1 of 4 logs</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/not-indexing-1-of-4-logs/m-p/73151#M14958</link>
      <description>&lt;P&gt;&lt;A href="http://docs.splunk.com/Documentation/Splunk/latest/searchreference/metadata"&gt;http://docs.splunk.com/Documentation/Splunk/latest/searchreference/metadata&lt;/A&gt; says that the metadata command doesn't support option 'source'.&lt;/P&gt;</description>
      <pubDate>Sun, 19 Feb 2012 18:15:32 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/not-indexing-1-of-4-logs/m-p/73151#M14958</guid>
      <dc:creator>pcorchary</dc:creator>
      <dc:date>2012-02-19T18:15:32Z</dc:date>
    </item>
    <item>
      <title>Re: not indexing 1 of 4 logs</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/not-indexing-1-of-4-logs/m-p/73152#M14959</link>
      <description>&lt;P&gt;nothing? I'm still wedged.... &lt;span class="lia-unicode-emoji" title=":disappointed_face:"&gt;😞&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 29 Feb 2012 06:31:45 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/not-indexing-1-of-4-logs/m-p/73152#M14959</guid>
      <dc:creator>pcorchary</dc:creator>
      <dc:date>2012-02-29T06:31:45Z</dc:date>
    </item>
    <item>
      <title>Re: not indexing 1 of 4 logs</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/not-indexing-1-of-4-logs/m-p/73153#M14960</link>
      <description>&lt;P&gt;Lets make sure the data is in even in Splunk.&lt;/P&gt;

&lt;P&gt;Can you run the following search and let me know if it found anything:&lt;BR /&gt;
source="*apache/FEB14*" &lt;BR /&gt;
Make sure you set the search time for "All Time".  The most common issue I've seen around data not being in Splunk is that it got conjangled around the date / time stamp and actually set the event date as some future date.&lt;/P&gt;

&lt;P&gt;Brian&lt;/P&gt;</description>
      <pubDate>Wed, 29 Feb 2012 14:12:18 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/not-indexing-1-of-4-logs/m-p/73153#M14960</guid>
      <dc:creator>Brian_Osburn</dc:creator>
      <dc:date>2012-02-29T14:12:18Z</dc:date>
    </item>
    <item>
      <title>Re: not indexing 1 of 4 logs</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/not-indexing-1-of-4-logs/m-p/73154#M14961</link>
      <description>&lt;P&gt;nothing ... or am I missing something here in concept or syntax? like this?&lt;BR /&gt;
/opt/splunk/bin/splunk search source="&lt;ASTERIX&gt;/FEB14/&lt;ASTERIX&gt;"&lt;/ASTERIX&gt;&lt;/ASTERIX&gt;&lt;/P&gt;

&lt;P&gt;or like this&lt;BR /&gt;
/opt/splunk/bin/splunk search "| metadata type=sources index="hznprod_apache" source="&lt;ASTERIX&gt;/FEB14/&lt;ASTERIX&gt;"&lt;/ASTERIX&gt;&lt;/ASTERIX&gt;&lt;/P&gt;

&lt;P&gt;or something else ... ???&lt;/P&gt;

&lt;P&gt;&lt;A href="http://docs.splunk.com/Documentation/Splunk/latest/searchreference/metadata"&gt;http://docs.splunk.com/Documentation/Splunk/latest/searchreference/metadata&lt;/A&gt; says that the metadata command doesn't support option 'source'.&lt;/P&gt;

&lt;P&gt;please note here that I'm actually just (at this stage) looking for the file name/path to validate.&lt;/P&gt;

&lt;P&gt;Further I'm seeing something VERY odd ... i have a cron job that moves the 4 files each night at 00:20 EST. Two hours later I have cron job that does the meta search and emails me the results. I have noticed occassionally that Splunk will apparentely 'forget' about files that it has &lt;EM&gt;already&lt;/EM&gt; consumed!&lt;/P&gt;

&lt;P&gt;For instance.&lt;/P&gt;

&lt;P&gt;for logs of the 26th, i got this the first night:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;pdated=Mon 27 Feb 2012 05:25:37 AM GMT, source=/opt/prod_log_spool/sjctricwa04p/apache/FEB26/extended_log
updated=Tue 28 Feb 2012 05:00:14 PM GMT, source=/opt/prod_log_spool/sjctricwa03p/apache/FEB26/extended_log
updated=Mon 27 Feb 2012 05:26:13 AM GMT, source=/opt/prod_log_spool/sjctricwa04p/tomcat/FEB26/catalina.out
updated=Tue 28 Feb 2012 05:01:22 PM GMT, source=/opt/prod_log_spool/sjctricwa03p/tomcat/FEB26/catalina.out
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;the next night I got this - one log was 'forgotten'!&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;updated=Mon 27 Feb 2012 05:25:37 AM GMT, source=/opt/prod_log_spool/sjctricwa04p/apache/FEB26/extended_log
updated=Mon 27 Feb 2012 05:26:13 AM GMT, source=/opt/prod_log_spool/sjctricwa04p/tomcat/FEB26/catalina.out
updated=Tue 28 Feb 2012 05:01:22 PM GMT, source=/opt/prod_log_spool/sjctricwa03p/tomcat/FEB26/catalina.out
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;this has happened several times now. One time I did a one-time index of the file, verified it, and again two days later it disappeared AGAIN! I'm VERY VERY concerned about this bevhaviour if i'm seeing it with just 4 logs from 2 hosts (like 3GB/day). What's going to break when I scale up, and how am I going to track and verify that Splunk is actually doing what I have instructed it to do!&lt;/P&gt;

&lt;P&gt;FYI the output here is from metadata piped thru awk as below&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;querySplunk ()
    {
        echo "Splunk prod logs for the last $end days summary ..."; 
        for date in "${d[@]}"; do
            echo "## $date";
            /opt/splunk/bin/splunk search "| metadata type=sources index=hznprod_apache" \
-auth "$splUname:$splPass" | awk -v dateStr="$date" '$0 ~ dateStr {printf "updated=%s, source=%s\n", strftime("%c",$3), $4}'; 
            /opt/splunk/bin/splunk search "| metadata type=sources index=hznprod_tc" | awk -v dateStr="$date" '$0 ~ dateStr {printf "updated=%s, source=%s\n", strftime("%c",$3), $4}' ;
        done
    }
&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Wed, 29 Feb 2012 20:33:08 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/not-indexing-1-of-4-logs/m-p/73154#M14961</guid>
      <dc:creator>pcorchary</dc:creator>
      <dc:date>2012-02-29T20:33:08Z</dc:date>
    </item>
    <item>
      <title>Re: not indexing 1 of 4 logs</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/not-indexing-1-of-4-logs/m-p/73155#M14962</link>
      <description>&lt;P&gt;now today ...  for the 25th, three of the four logs are vanished! Even last night there were three of the four logs ... two more went missing overnight!!!! this is insane.&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;$ splunklogchk

Thu Mar  1 19:04:37 GMT 2012 - Splunk prod logs for the last 7 days summary ...
updated=Fri 24 Feb 2012 06:30:18 AM GMT, source=/opt/prod_log_spool/sjctricwa04p/apache/FEB23/extended_log
updated=Fri 24 Feb 2012 06:21:53 AM GMT, source=/opt/prod_log_spool/sjctricwa03p/apache/FEB23/extended_log
updated=Fri 24 Feb 2012 06:37:57 AM GMT, source=/opt/prod_log_spool/sjctricwa04p/tomcat/FEB23/catalina.out
updated=Fri 24 Feb 2012 06:25:59 AM GMT, source=/opt/prod_log_spool/sjctricwa03p/tomcat/FEB23/catalina.out

updated=Sat 25 Feb 2012 05:39:40 AM GMT, source=/opt/prod_log_spool/sjctricwa04p/apache/FEB24/extended_log
updated=Sat 25 Feb 2012 05:37:17 AM GMT, source=/opt/prod_log_spool/sjctricwa03p/apache/FEB24/extended_log
updated=Sat 25 Feb 2012 05:43:26 AM GMT, source=/opt/prod_log_spool/sjctricwa04p/tomcat/FEB24/catalina.out
updated=Sat 25 Feb 2012 05:38:00 AM GMT, source=/opt/prod_log_spool/sjctricwa03p/tomcat/FEB24/catalina.out

updated=Sun 26 Feb 2012 09:49:25 PM GMT, source=/opt/prod_log_spool/sjctricwa03p/tomcat/FEB25/catalina.out
***
*** three logs missing here
***
updated=Mon 27 Feb 2012 05:25:37 AM GMT, source=/opt/prod_log_spool/sjctricwa04p/apache/FEB26/extended_log
updated=Mon 27 Feb 2012 05:26:13 AM GMT, source=/opt/prod_log_spool/sjctricwa04p/tomcat/FEB26/catalina.out
updated=Tue 28 Feb 2012 05:01:22 PM GMT, source=/opt/prod_log_spool/sjctricwa03p/tomcat/FEB26/catalina.out
*** 1 log missing here

updated=Tue 28 Feb 2012 05:39:55 AM GMT, source=/opt/prod_log_spool/sjctricwa04p/apache/FEB27/extended_log
updated=Tue 28 Feb 2012 05:38:14 AM GMT, source=/opt/prod_log_spool/sjctricwa03p/apache/FEB27/extended_log
updated=Tue 28 Feb 2012 05:41:23 AM GMT, source=/opt/prod_log_spool/sjctricwa04p/tomcat/FEB27/catalina.out
updated=Tue 28 Feb 2012 05:39:10 AM GMT, source=/opt/prod_log_spool/sjctricwa03p/tomcat/FEB27/catalina.out

updated=Wed 29 Feb 2012 07:50:42 AM GMT, source=/opt/prod_log_spool/sjctricwa04p/apache/FEB28/extended_log
updated=Wed 29 Feb 2012 05:37:57 AM GMT, source=/opt/prod_log_spool/sjctricwa03p/apache/FEB28/extended_log
updated=Thu 01 Mar 2012 02:09:33 AM GMT, source=/opt/prod_log_spool/sjctricwa04p/tomcat/FEB28/catalina.out
updated=Wed 29 Feb 2012 05:38:48 AM GMT, source=/opt/prod_log_spool/sjctricwa03p/tomcat/FEB28/catalina.out

updated=Thu 01 Mar 2012 09:45:06 AM GMT, source=/opt/prod_log_spool/sjctricwa03p/apache/FEB29/extended_log
updated=Thu 01 Mar 2012 09:47:11 AM GMT, source=/opt/prod_log_spool/sjctricwa04p/apache/FEB29/extended_log
updated=Thu 01 Mar 2012 09:48:38 AM GMT, source=/opt/prod_log_spool/sjctricwa04p/tomcat/FEB29/catalina.out
updated=Thu 01 Mar 2012 09:46:39 AM GMT, source=/opt/prod_log_spool/sjctricwa03p/tomcat/FEB29/catalina.out
&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Thu, 01 Mar 2012 19:09:50 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/not-indexing-1-of-4-logs/m-p/73155#M14962</guid>
      <dc:creator>pcorchary</dc:creator>
      <dc:date>2012-03-01T19:09:50Z</dc:date>
    </item>
  </channel>
</rss>

