<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How is your volume usage calculated in Knowledge Management</title>
    <link>https://community.splunk.com/t5/Knowledge-Management/How-is-your-volume-usage-calculated/m-p/320723#M5601</link>
    <description>&lt;P&gt;No, it does not duplicate any data by itself. So I do not think Splunk is processing the whole file again.Even if it processes Splunk checks first and last 256 bytes of the file if they are same then it does not index again.&lt;BR /&gt;
Basically, when data gets indexed by an indexer, it counts towards your daily total.&lt;BR /&gt;
See here for more:&lt;/P&gt;

&lt;P&gt;&lt;A href="http://docs.splunk.com/Documentation/Splunk/latest/Admin/HowSplunklicensingworks"&gt;http://docs.splunk.com/Documentation/Splunk/latest/Admin/HowSplunklicensingworks&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;One way to check this is event count and dig into Splunk data.&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;| tstats count where index=&amp;lt;your_index&amp;gt;
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;cross-check your file and see the line count.&lt;/P&gt;

&lt;P&gt;let me know if this helps you!&lt;/P&gt;</description>
    <pubDate>Fri, 19 Jan 2018 13:24:51 GMT</pubDate>
    <dc:creator>mayurr98</dc:creator>
    <dc:date>2018-01-19T13:24:51Z</dc:date>
    <item>
      <title>How is your volume usage calculated</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/How-is-your-volume-usage-calculated/m-p/320722#M5600</link>
      <description>&lt;P&gt;Hi all&lt;/P&gt;

&lt;P&gt;I managed to generate a log file which I would need to use to display certain graphs.&lt;BR /&gt;
This logfile only increases a few times an hour and only with a few KB so I should be able to stay way below the 500MB trial license limit.&lt;BR /&gt;
However, after a few hours into using Splunk, I'm almost reaching the limit ?!&lt;BR /&gt;
I would expect that the Splunk Enterprise server processes the file once (or updates his DB when the logfile changes.. but is he processing the WHOLE again?) and puts the data in his DB, ready to be used for a search.&lt;/P&gt;

&lt;P&gt;Can someone help me to determine where this 400+ MB is coming from? As far as I can see, I only configured 1 file to be monitored and everything else is disabled.. all other scripts and files.&lt;/P&gt;</description>
      <pubDate>Fri, 19 Jan 2018 12:58:13 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/How-is-your-volume-usage-calculated/m-p/320722#M5600</guid>
      <dc:creator>bwouters</dc:creator>
      <dc:date>2018-01-19T12:58:13Z</dc:date>
    </item>
    <item>
      <title>Re: How is your volume usage calculated</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/How-is-your-volume-usage-calculated/m-p/320723#M5601</link>
      <description>&lt;P&gt;No, it does not duplicate any data by itself. So I do not think Splunk is processing the whole file again.Even if it processes Splunk checks first and last 256 bytes of the file if they are same then it does not index again.&lt;BR /&gt;
Basically, when data gets indexed by an indexer, it counts towards your daily total.&lt;BR /&gt;
See here for more:&lt;/P&gt;

&lt;P&gt;&lt;A href="http://docs.splunk.com/Documentation/Splunk/latest/Admin/HowSplunklicensingworks"&gt;http://docs.splunk.com/Documentation/Splunk/latest/Admin/HowSplunklicensingworks&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;One way to check this is event count and dig into Splunk data.&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;| tstats count where index=&amp;lt;your_index&amp;gt;
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;cross-check your file and see the line count.&lt;/P&gt;

&lt;P&gt;let me know if this helps you!&lt;/P&gt;</description>
      <pubDate>Fri, 19 Jan 2018 13:24:51 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/How-is-your-volume-usage-calculated/m-p/320723#M5601</guid>
      <dc:creator>mayurr98</dc:creator>
      <dc:date>2018-01-19T13:24:51Z</dc:date>
    </item>
    <item>
      <title>Re: How is your volume usage calculated</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/How-is-your-volume-usage-calculated/m-p/320724#M5602</link>
      <description>&lt;P&gt;Hi, &lt;/P&gt;

&lt;P&gt;could you give the config you have made for the monitor stanza in inputs.conf?&lt;/P&gt;</description>
      <pubDate>Fri, 19 Jan 2018 13:27:29 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/How-is-your-volume-usage-calculated/m-p/320724#M5602</guid>
      <dc:creator>horsefez</dc:creator>
      <dc:date>2018-01-19T13:27:29Z</dc:date>
    </item>
    <item>
      <title>Re: How is your volume usage calculated</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/How-is-your-volume-usage-calculated/m-p/320725#M5603</link>
      <description>&lt;PRE&gt;&lt;CODE&gt;/opt/splunk/etc/system/local$ cat inputs.conf 
[default]
host = SPLUNK

[monitor://$SPLUNK_HOME/etc/splunk.version]
disabled = 1

[monitor://$SPLUNK_HOME/var/log/splunk]
disabled = 1

[monitor://$SPLUNK_HOME/var/log/splunk/license_usage_summary.log]
disabled = 1

[batch://$SPLUNK_HOME/var/spool/splunk/...stash_new]
disabled = 1

[batch://$SPLUNK_HOME/var/spool/splunk]
disabled = 1
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;I used the web UI to do this.. I would expect that whatever I do using the web UI, is also put in the input.conf?&lt;/P&gt;</description>
      <pubDate>Fri, 19 Jan 2018 13:38:47 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/How-is-your-volume-usage-calculated/m-p/320725#M5603</guid>
      <dc:creator>bwouters</dc:creator>
      <dc:date>2018-01-19T13:38:47Z</dc:date>
    </item>
    <item>
      <title>Re: How is your volume usage calculated</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/How-is-your-volume-usage-calculated/m-p/320726#M5604</link>
      <description>&lt;P&gt;Yes, you are right, but all those stanzas seem to still be disabled&lt;/P&gt;</description>
      <pubDate>Fri, 19 Jan 2018 13:50:43 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/How-is-your-volume-usage-calculated/m-p/320726#M5604</guid>
      <dc:creator>horsefez</dc:creator>
      <dc:date>2018-01-19T13:50:43Z</dc:date>
    </item>
    <item>
      <title>Re: How is your volume usage calculated</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/How-is-your-volume-usage-calculated/m-p/320727#M5605</link>
      <description>&lt;P&gt;If I dig a bit deeper, I have like thousands of events per minute..&lt;BR /&gt;
When I check what these events are, they are exactly the same&lt;BR /&gt;
Line 1&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;2018-01-18 15:37:59,722 TRACE [HTTP worker thread 9] HttpInterface - [IP] [RequestId = ID] HTTP response send to IP
StatusCode = OK
Headers = 
  ResourceEvaluationCount: 1
  Cache-Control: max-age=7200
  Duration: 1 ms
  Content-Encoding: gzip
Body = &amp;lt;?xml version="1.0" encoding="utf-8"?&amp;gt;
&amp;lt;Session id="ID" xmlns="urn"&amp;gt;
  &amp;lt;State&amp;gt;Created&amp;lt;/State&amp;gt;
&amp;lt;/Session&amp;gt;
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;Line 2&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;2018-01-18 15:37:59,722 TRACE [HTTP worker thread 9] HttpInterface - [IP] [RequestId = ID] HTTP response send to IP
StatusCode = OK
Headers = 
  ResourceEvaluationCount: 1
  Cache-Control: max-age=7200
  Duration: 1 ms
  Content-Encoding: gzip
Body = &amp;lt;?xml version="1.0" encoding="utf-8"?&amp;gt;
&amp;lt;Session id="ID" xmlns="urn"&amp;gt;
  &amp;lt;State&amp;gt;Created&amp;lt;/State&amp;gt;
&amp;lt;/Session&amp;gt;
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;This data is put on 2 different lines and there are even more than 2 lines with exactly the same data..&lt;BR /&gt;
Probably I did something wrong here? Or how can I finetune this to have it only on one line?&lt;/P&gt;</description>
      <pubDate>Fri, 19 Jan 2018 13:51:41 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/How-is-your-volume-usage-calculated/m-p/320727#M5605</guid>
      <dc:creator>bwouters</dc:creator>
      <dc:date>2018-01-19T13:51:41Z</dc:date>
    </item>
    <item>
      <title>Re: How is your volume usage calculated</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/How-is-your-volume-usage-calculated/m-p/320728#M5606</link>
      <description>&lt;P&gt;Yes, because I wanted to see if these were the cause of it&lt;/P&gt;</description>
      <pubDate>Fri, 19 Jan 2018 13:54:13 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/How-is-your-volume-usage-calculated/m-p/320728#M5606</guid>
      <dc:creator>bwouters</dc:creator>
      <dc:date>2018-01-19T13:54:13Z</dc:date>
    </item>
    <item>
      <title>Re: How is your volume usage calculated</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/How-is-your-volume-usage-calculated/m-p/320729#M5607</link>
      <description>&lt;P&gt;have you checked the same in file? check the file with the same time do the file also have multiple events?&lt;/P&gt;</description>
      <pubDate>Fri, 19 Jan 2018 13:54:26 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/How-is-your-volume-usage-calculated/m-p/320729#M5607</guid>
      <dc:creator>mayurr98</dc:creator>
      <dc:date>2018-01-19T13:54:26Z</dc:date>
    </item>
    <item>
      <title>Re: How is your volume usage calculated</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/How-is-your-volume-usage-calculated/m-p/320730#M5608</link>
      <description>&lt;P&gt;In the log file there is indeed twice the same logging but then I counted the amount of repetitions in Splunk -&amp;gt; 14 times&lt;/P&gt;</description>
      <pubDate>Fri, 19 Jan 2018 14:07:12 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/How-is-your-volume-usage-calculated/m-p/320730#M5608</guid>
      <dc:creator>bwouters</dc:creator>
      <dc:date>2018-01-19T14:07:12Z</dc:date>
    </item>
    <item>
      <title>Re: How is your volume usage calculated</title>
      <link>https://community.splunk.com/t5/Knowledge-Management/How-is-your-volume-usage-calculated/m-p/320731#M5609</link>
      <description>&lt;P&gt;Data Input is configured as:&lt;BR /&gt;
Set host -&amp;gt; Constant value&lt;BR /&gt;
Host Field valie -&amp;gt; DEMO&lt;BR /&gt;
Set the Source Type -&amp;gt; Manual&lt;BR /&gt;
Source Type -&amp;gt; SessionsLogs&lt;BR /&gt;
Index -&amp;gt; main&lt;BR /&gt;
Whitelist -&amp;gt; Splunk*&lt;/P&gt;

&lt;P&gt;When checking the indexes, I see that 'main' only has 139MB logged to it.&lt;BR /&gt;
If I add all the indexes together, I only have 152 MB&lt;/P&gt;

&lt;P&gt;SourceType configuration:&lt;BR /&gt;
Destination app: Search &amp;amp; Reporting&lt;BR /&gt;
Category: Metrics&lt;BR /&gt;
Indexed Extractions: none&lt;BR /&gt;
Event Breaks: Auto&lt;BR /&gt;
Timestamp: Auto&lt;BR /&gt;
CHARSET: UTF-8&lt;BR /&gt;
NO_BINARY_CHECK: true&lt;BR /&gt;
SHOULD_LINEMERGE: true&lt;BR /&gt;
category: Metrics&lt;BR /&gt;
disabled: false&lt;BR /&gt;
pulldown_type: true&lt;/P&gt;</description>
      <pubDate>Tue, 29 Sep 2020 17:45:28 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Knowledge-Management/How-is-your-volume-usage-calculated/m-p/320731#M5609</guid>
      <dc:creator>bwouters</dc:creator>
      <dc:date>2020-09-29T17:45:28Z</dc:date>
    </item>
  </channel>
</rss>

