<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: ignoreolderthan in inputs.conf vs number of files in &amp;quot;splunk list monitor&amp;quot; performance in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/ignoreolderthan-in-inputs-conf-vs-number-of-files-in-quot-splunk/m-p/242462#M47010</link>
    <description>&lt;P&gt;Thanks for answering, what's odd though is that it seems Splunk isn't ignoring the files. For example, out of the 20,000 files, say 1,000 of them are the last 7 days.&lt;/P&gt;

&lt;P&gt;If I set ignoreOlderThan = 7d and restart Splunk, the splunk list monitor output still shows all the 20,000 files, so it doesn't look like they're ignored at all.&lt;/P&gt;</description>
    <pubDate>Sun, 27 Sep 2015 01:26:26 GMT</pubDate>
    <dc:creator>jwquah</dc:creator>
    <dc:date>2015-09-27T01:26:26Z</dc:date>
    <item>
      <title>ignoreolderthan in inputs.conf vs number of files in "splunk list monitor" performance</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/ignoreolderthan-in-inputs-conf-vs-number-of-files-in-quot-splunk/m-p/242460#M47008</link>
      <description>&lt;P&gt;Hi All,&lt;/P&gt;

&lt;P&gt;I'm trying to see if we can improve the performance of a Splunk instance and trying to optimize it - e.g. putting sourcetype instead of letting it being automatic/etc. There is a data input that's monitoring a directory, and there are about 20,000 files within said directory. I've added ignoreolderthan = 7d to the inputs.conf&lt;/P&gt;

&lt;P&gt;The question is:&lt;BR /&gt;
 - Does adding &lt;STRONG&gt;ignoreolderthan&lt;/STRONG&gt; in inputs.conf make Splunk ignore those files? Does that mean that I should be seeing less files being monitored in /splunk list monitor as well as the Data Inputs in the Splunk webpage?&lt;BR /&gt;
 - Or is the only way to minimize the number of files being monitored is to move those files OUT of the monitored directory?&lt;/P&gt;

&lt;P&gt;Thank you.&lt;/P&gt;</description>
      <pubDate>Fri, 25 Sep 2015 05:42:37 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/ignoreolderthan-in-inputs-conf-vs-number-of-files-in-quot-splunk/m-p/242460#M47008</guid>
      <dc:creator>jwquah</dc:creator>
      <dc:date>2015-09-25T05:42:37Z</dc:date>
    </item>
    <item>
      <title>Re: ignoreolderthan in inputs.conf vs number of files in "splunk list monitor" performance</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/ignoreolderthan-in-inputs-conf-vs-number-of-files-in-quot-splunk/m-p/242461#M47009</link>
      <description>&lt;P&gt;Using &lt;CODE&gt;ignoreOlderThan&lt;/CODE&gt; will cause Splunk to totally ignore files forever: only the filename will be checked.  HOWEVER, with 20k files (most of which you are "ignoring"), you still have to deal with the OS-level lag of accessing a list of files from a directory (calls to &lt;CODE&gt;stat&lt;/CODE&gt;) that is too cluttered and the slowness of walking through that list when you know most of the files are permanently useless to you.&lt;/P&gt;

&lt;P&gt;To avoid all of these problems, check out my (and other) answer here:&lt;BR /&gt;
&lt;A href="http://answers.splunk.com/answers/309910/how-to-monitor-a-folder-for-newest-files-only-file.html#answer-309937"&gt;http://answers.splunk.com/answers/309910/how-to-monitor-a-folder-for-newest-files-only-file.html#answer-309937&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Sat, 26 Sep 2015 16:50:54 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/ignoreolderthan-in-inputs-conf-vs-number-of-files-in-quot-splunk/m-p/242461#M47009</guid>
      <dc:creator>woodcock</dc:creator>
      <dc:date>2015-09-26T16:50:54Z</dc:date>
    </item>
    <item>
      <title>Re: ignoreolderthan in inputs.conf vs number of files in "splunk list monitor" performance</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/ignoreolderthan-in-inputs-conf-vs-number-of-files-in-quot-splunk/m-p/242462#M47010</link>
      <description>&lt;P&gt;Thanks for answering, what's odd though is that it seems Splunk isn't ignoring the files. For example, out of the 20,000 files, say 1,000 of them are the last 7 days.&lt;/P&gt;

&lt;P&gt;If I set ignoreOlderThan = 7d and restart Splunk, the splunk list monitor output still shows all the 20,000 files, so it doesn't look like they're ignored at all.&lt;/P&gt;</description>
      <pubDate>Sun, 27 Sep 2015 01:26:26 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/ignoreolderthan-in-inputs-conf-vs-number-of-files-in-quot-splunk/m-p/242462#M47010</guid>
      <dc:creator>jwquah</dc:creator>
      <dc:date>2015-09-27T01:26:26Z</dc:date>
    </item>
    <item>
      <title>Re: ignoreolderthan in inputs.conf vs number of files in "splunk list monitor" performance</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/ignoreolderthan-in-inputs-conf-vs-number-of-files-in-quot-splunk/m-p/242463#M47011</link>
      <description>&lt;P&gt;OK, so I did a test and set up a test instance with a data input monitoring a directory with 206 files. The files inside range from May to September (yesterday).&lt;/P&gt;

&lt;P&gt;In my inputs.conf, it's set to:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[monitor:///&amp;lt;dir&amp;gt;]
disabled = false
index = test_index
sourcetype = _json
ignoreOlderThan = 2d
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;There are only 13 files within the last two days, yet from the data inputs web view and even ./splunk list monitor, it shows the below. &lt;BR /&gt;
&lt;IMG src="https://cdn.pbrd.co/images/XTKMJzK.png" alt="alt text" /&gt;&lt;/P&gt;

&lt;P&gt;Is this expected? It seems to be that Splunk is still monitoring the whole directory. With few files, it probably doesn't matter, but it'll definitely slow down over time as more files heap up (assuming one doesn't rotate them out)...&lt;/P&gt;</description>
      <pubDate>Mon, 28 Sep 2015 01:08:25 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/ignoreolderthan-in-inputs-conf-vs-number-of-files-in-quot-splunk/m-p/242463#M47011</guid>
      <dc:creator>jwquah</dc:creator>
      <dc:date>2015-09-28T01:08:25Z</dc:date>
    </item>
    <item>
      <title>Re: ignoreolderthan in inputs.conf vs number of files in "splunk list monitor" performance</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/ignoreolderthan-in-inputs-conf-vs-number-of-files-in-quot-splunk/m-p/242464#M47012</link>
      <description>&lt;P&gt;I have not used &lt;CODE&gt;btool&lt;/CODE&gt; to verify the function of &lt;CODE&gt;ignoreOlderThan&lt;/CODE&gt; but your test surprises me.  I would open a case with support.&lt;/P&gt;</description>
      <pubDate>Sat, 31 Oct 2015 13:53:31 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/ignoreolderthan-in-inputs-conf-vs-number-of-files-in-quot-splunk/m-p/242464#M47012</guid>
      <dc:creator>woodcock</dc:creator>
      <dc:date>2015-10-31T13:53:31Z</dc:date>
    </item>
    <item>
      <title>Re: ignoreolderthan in inputs.conf vs number of files in "splunk list monitor" performance</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/ignoreolderthan-in-inputs-conf-vs-number-of-files-in-quot-splunk/m-p/242465#M47013</link>
      <description>&lt;P&gt;Sorry for not updating this. After further testing, we were able to confirm. Splunk will monitor files already indexed even if ignoreOlderThan is set, unless the conf is set before the index takes place. If the ignoreOlderThan is set after files are indexed, only new files will conform to the ignoreOlderThan config.&lt;/P&gt;</description>
      <pubDate>Mon, 02 Nov 2015 00:57:02 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/ignoreolderthan-in-inputs-conf-vs-number-of-files-in-quot-splunk/m-p/242465#M47013</guid>
      <dc:creator>jwquah</dc:creator>
      <dc:date>2015-11-02T00:57:02Z</dc:date>
    </item>
    <item>
      <title>Re: ignoreolderthan in inputs.conf vs number of files in "splunk list monitor" performance</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/ignoreolderthan-in-inputs-conf-vs-number-of-files-in-quot-splunk/m-p/242466#M47014</link>
      <description>&lt;P&gt;Which is pretty much what I was telling you (and why I pointed you to my other answer which is a good way around this whole mess).  You can flip back and forth between &lt;CODE&gt;ignoreOlderThan&lt;/CODE&gt; and not, by adding/removing the setting: no problem.  It is no surprise to find that Splunk is still monitoring them to some degree because it has to mark them as inactive and store that state somehow/somewhere.  The way to test if the &lt;CODE&gt;ignoreOlderThan&lt;/CODE&gt; setting is working is to wait the desired amount of days with no change at which point Splunk will mark it to ignore FOREVER. Then send new events to the file and confirm that those new events are not forwarded, which is the intention of the setting (but not what most people expect).&lt;/P&gt;</description>
      <pubDate>Sun, 08 Sep 2019 20:35:01 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/ignoreolderthan-in-inputs-conf-vs-number-of-files-in-quot-splunk/m-p/242466#M47014</guid>
      <dc:creator>woodcock</dc:creator>
      <dc:date>2019-09-08T20:35:01Z</dc:date>
    </item>
  </channel>
</rss>

