<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: monitoring files - how does splunk count the size? in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/monitoring-files-how-does-splunk-count-the-size/m-p/59390#M11697</link>
    <description>&lt;P&gt;hm.. it doesn't work&lt;BR /&gt;
I can still see in _internal index splunk is polling the data from archive. Current configuration is:&lt;/P&gt;

&lt;P&gt;followTail = 1&lt;BR /&gt;
recursive = false&lt;BR /&gt;
disabled = 0&lt;BR /&gt;
whitelist = *.log&lt;BR /&gt;
blacklist = *.zip #tried to exclude somehow zip files &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;</description>
    <pubDate>Fri, 27 Jan 2012 17:07:12 GMT</pubDate>
    <dc:creator>Vladimir</dc:creator>
    <dc:date>2012-01-27T17:07:12Z</dc:date>
    <item>
      <title>monitoring files - how does splunk count the size?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/monitoring-files-how-does-splunk-count-the-size/m-p/59385#M11692</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;

&lt;P&gt;I've configured a directory for monitoring in inputs.conf ([monitor://path_to_dir]) and separated index for this folder several days ago. Everything is ok except one thing... the total size of files is ~500 Mb but splunk shows (in index activity-&amp;gt;index volume) that it indexing ~800 Mb per hour ... how is it possible? There is 10 Mb of new logs/day only. Does splunk resend the whole file if it has been changed (even if added 1 row)?&lt;BR /&gt;
The total amount of events is ~800-900 per 1 hour. My rsyslog index with ~12-15 000 events/h is increased ~100 Mb/h only.&lt;/P&gt;

&lt;P&gt;The same situation I have for one more monitored folder.&lt;BR /&gt;
splunk v4.3,&lt;/P&gt;</description>
      <pubDate>Mon, 28 Sep 2020 10:22:10 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/monitoring-files-how-does-splunk-count-the-size/m-p/59385#M11692</guid>
      <dc:creator>Vladimir</dc:creator>
      <dc:date>2020-09-28T10:22:10Z</dc:date>
    </item>
    <item>
      <title>Re: monitoring files - how does splunk count the size?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/monitoring-files-how-does-splunk-count-the-size/m-p/59386#M11693</link>
      <description>&lt;P&gt;It shouldn't resend the whole file again.  It should only send the parity of the file.&lt;/P&gt;

&lt;P&gt;Keep in mind that when you monitor to path, are you doing a recursive search of the directory and all directories below (this is the default behavior)?  &lt;/P&gt;

&lt;P&gt;Additionally, are there files buried deep in that directory that might be causing your file size to blow up?&lt;/P&gt;

&lt;P&gt;Without having intimate knowledge of your environment, I'm having to hypothesize about what might be occurring here.&lt;/P&gt;</description>
      <pubDate>Fri, 27 Jan 2012 15:52:49 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/monitoring-files-how-does-splunk-count-the-size/m-p/59386#M11693</guid>
      <dc:creator>Lamar</dc:creator>
      <dc:date>2012-01-27T15:52:49Z</dc:date>
    </item>
    <item>
      <title>Re: monitoring files - how does splunk count the size?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/monitoring-files-how-does-splunk-count-the-size/m-p/59387#M11694</link>
      <description>&lt;P&gt;there is no any subfolders but I figured out there are several archive files (*.zip with old files) and looks like (in metrics.log) splunk unzipped it and indexed... arrrhh&lt;/P&gt;</description>
      <pubDate>Fri, 27 Jan 2012 16:16:58 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/monitoring-files-how-does-splunk-count-the-size/m-p/59387#M11694</guid>
      <dc:creator>Vladimir</dc:creator>
      <dc:date>2012-01-27T16:16:58Z</dc:date>
    </item>
    <item>
      <title>Re: monitoring files - how does splunk count the size?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/monitoring-files-how-does-splunk-count-the-size/m-p/59388#M11695</link>
      <description>&lt;P&gt;will recursive = false help in this case?&lt;/P&gt;</description>
      <pubDate>Fri, 27 Jan 2012 16:23:39 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/monitoring-files-how-does-splunk-count-the-size/m-p/59388#M11695</guid>
      <dc:creator>Vladimir</dc:creator>
      <dc:date>2012-01-27T16:23:39Z</dc:date>
    </item>
    <item>
      <title>Re: monitoring files - how does splunk count the size?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/monitoring-files-how-does-splunk-count-the-size/m-p/59389#M11696</link>
      <description>&lt;P&gt;It would in fact.&lt;/P&gt;</description>
      <pubDate>Fri, 27 Jan 2012 16:26:56 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/monitoring-files-how-does-splunk-count-the-size/m-p/59389#M11696</guid>
      <dc:creator>Lamar</dc:creator>
      <dc:date>2012-01-27T16:26:56Z</dc:date>
    </item>
    <item>
      <title>Re: monitoring files - how does splunk count the size?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/monitoring-files-how-does-splunk-count-the-size/m-p/59390#M11697</link>
      <description>&lt;P&gt;hm.. it doesn't work&lt;BR /&gt;
I can still see in _internal index splunk is polling the data from archive. Current configuration is:&lt;/P&gt;

&lt;P&gt;followTail = 1&lt;BR /&gt;
recursive = false&lt;BR /&gt;
disabled = 0&lt;BR /&gt;
whitelist = *.log&lt;BR /&gt;
blacklist = *.zip #tried to exclude somehow zip files &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 27 Jan 2012 17:07:12 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/monitoring-files-how-does-splunk-count-the-size/m-p/59390#M11697</guid>
      <dc:creator>Vladimir</dc:creator>
      <dc:date>2012-01-27T17:07:12Z</dc:date>
    </item>
    <item>
      <title>Re: monitoring files - how does splunk count the size?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/monitoring-files-how-does-splunk-count-the-size/m-p/59391#M11698</link>
      <description>&lt;P&gt;Keep in mind that your whitelist/blacklist needs to be in regex form.  So, you would want:&lt;/P&gt;

&lt;P&gt;&lt;CODE&gt;&lt;BR /&gt;
whitelist = \.log$&lt;BR /&gt;
blacklist = \.zip$&lt;BR /&gt;
&lt;/CODE&gt;&lt;/P&gt;

&lt;P&gt;This should work a bit better for what you're trying to accomplish.&lt;/P&gt;</description>
      <pubDate>Mon, 30 Jan 2012 17:35:56 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/monitoring-files-how-does-splunk-count-the-size/m-p/59391#M11698</guid>
      <dc:creator>Lamar</dc:creator>
      <dc:date>2012-01-30T17:35:56Z</dc:date>
    </item>
  </channel>
</rss>

