<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Daily index volume by sourcetype in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Daily-index-volume-by-sourcetype/m-p/62384#M12480</link>
    <description>&lt;P&gt;I downvoted this post because incomplete.  only top 10 sourcetypes recorded in metrics&lt;/P&gt;</description>
    <pubDate>Wed, 28 Nov 2018 20:58:28 GMT</pubDate>
    <dc:creator>landen99</dc:creator>
    <dc:date>2018-11-28T20:58:28Z</dc:date>
    <item>
      <title>Daily index volume by sourcetype</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Daily-index-volume-by-sourcetype/m-p/62375#M12471</link>
      <description>&lt;P&gt;How does one go about calculating daily index volume by sourcetype?  &lt;/P&gt;

&lt;P&gt;I'm currently capturing all logged data and sending it to the main index, but moving away from that method, for performance/scalability reasons.  We're going through a data classification exercise that will dictate our future state indexes, and as part of that exercise, want to get a better idea of the breakdown of data sizes, and thought sizing by 'sourcetype' would be a good starting point.&lt;/P&gt;

&lt;P&gt;thanks for any information!&lt;/P&gt;

&lt;P&gt;Regards,&lt;BR /&gt;
-m&lt;/P&gt;</description>
      <pubDate>Mon, 22 Aug 2011 22:27:28 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Daily-index-volume-by-sourcetype/m-p/62375#M12471</guid>
      <dc:creator>mtanadsk</dc:creator>
      <dc:date>2011-08-22T22:27:28Z</dc:date>
    </item>
    <item>
      <title>Re: Daily index volume by sourcetype</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Daily-index-volume-by-sourcetype/m-p/62376#M12472</link>
      <description>&lt;P&gt;index=_internal source=&lt;EM&gt;metrics.log splunk_server="&lt;/EM&gt;" | eval MB=kb/1024 | search group="per_sourcetype_thruput" | timechart span=1d sum(MB) by series&lt;/P&gt;</description>
      <pubDate>Mon, 28 Sep 2020 09:49:12 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Daily-index-volume-by-sourcetype/m-p/62376#M12472</guid>
      <dc:creator>cnk</dc:creator>
      <dc:date>2020-09-28T09:49:12Z</dc:date>
    </item>
    <item>
      <title>Re: Daily index volume by sourcetype</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Daily-index-volume-by-sourcetype/m-p/62377#M12473</link>
      <description>&lt;P&gt;In the Search app, hit the "status" dropdown and choose index activity.  I was able to get index volume by sourcetype / server / at any time span.  It was great for a built-in tool.&lt;/P&gt;

&lt;P&gt;Running Splunk 4.3.2&lt;/P&gt;</description>
      <pubDate>Thu, 01 Nov 2012 19:38:20 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Daily-index-volume-by-sourcetype/m-p/62377#M12473</guid>
      <dc:creator>wbfoxii</dc:creator>
      <dc:date>2012-11-01T19:38:20Z</dc:date>
    </item>
    <item>
      <title>Re: Daily index volume by sourcetype</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Daily-index-volume-by-sourcetype/m-p/62378#M12474</link>
      <description>&lt;P&gt;What if i want to know for a specific sourcetype in a specific index? &lt;BR /&gt;
We have over 50+ indexes but for a couple of indexes, lets say index=a, index=b, index=c, i want to know how much data on a daily basis, my windows system/security/events logs are generating (min,max,avg) for a 30 day range.&lt;BR /&gt;
Can someone write up a quick search for this? Much appreciated.&lt;/P&gt;</description>
      <pubDate>Thu, 10 Jul 2014 20:56:00 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Daily-index-volume-by-sourcetype/m-p/62378#M12474</guid>
      <dc:creator>gurinderbhatti</dc:creator>
      <dc:date>2014-07-10T20:56:00Z</dc:date>
    </item>
    <item>
      <title>Re: Daily index volume by sourcetype</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Daily-index-volume-by-sourcetype/m-p/62379#M12475</link>
      <description>&lt;P&gt;This works with Splunk 6.2 for supporting GB by index...&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;index=_internal source=*metrics.log | eval GB=kb/(1024*1024) | search group="per_sourcetype_thruput" | timechart span=1d sum(GB) by series limit=20
&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Tue, 27 Jan 2015 22:40:11 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Daily-index-volume-by-sourcetype/m-p/62379#M12475</guid>
      <dc:creator>slierninja</dc:creator>
      <dc:date>2015-01-27T22:40:11Z</dc:date>
    </item>
    <item>
      <title>Re: Daily index volume by sourcetype</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Daily-index-volume-by-sourcetype/m-p/62380#M12476</link>
      <description>&lt;P&gt;How can I get the index volume for a particular sourcetype, for example sourcetype=perfmon*..?&lt;/P&gt;</description>
      <pubDate>Wed, 08 Jun 2016 13:48:38 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Daily-index-volume-by-sourcetype/m-p/62380#M12476</guid>
      <dc:creator>kiran_mh</dc:creator>
      <dc:date>2016-06-08T13:48:38Z</dc:date>
    </item>
    <item>
      <title>Re: Daily index volume by sourcetype</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Daily-index-volume-by-sourcetype/m-p/62381#M12477</link>
      <description>&lt;P&gt;I am new to splunk&lt;BR /&gt;
I installed trial version which can index 500 MB/day. My question is that Domain controller can index how much events. Please just give me estimate.&lt;/P&gt;</description>
      <pubDate>Sun, 15 Apr 2018 18:32:44 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Daily-index-volume-by-sourcetype/m-p/62381#M12477</guid>
      <dc:creator>rashid47010</dc:creator>
      <dc:date>2018-04-15T18:32:44Z</dc:date>
    </item>
    <item>
      <title>Re: Daily index volume by sourcetype</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Daily-index-volume-by-sourcetype/m-p/62382#M12478</link>
      <description>&lt;P&gt;Try this query, it will give you size in GB for each day.&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;index=_internal source=*metrics.log
series=wineventlog:security
| eval formatted_time=strftime(_time, "%x")
| Rename series as sourcetypes
| chart sum(kb) over sourcetypes by formatted_time
| sort - [ makeresults | addinfo | eval time="\"".strftime(info_max_time-1, "%x")."\"" | return $time]
| foreach */* [eval "&amp;lt;&amp;lt;FIELD&amp;gt;&amp;gt;"=round('&amp;lt;&amp;lt;FIELD&amp;gt;&amp;gt;'/1024/1024,2)." GB"]
&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Wed, 20 Jun 2018 13:57:24 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Daily-index-volume-by-sourcetype/m-p/62382#M12478</guid>
      <dc:creator>ss026381</dc:creator>
      <dc:date>2018-06-20T13:57:24Z</dc:date>
    </item>
    <item>
      <title>Re: Daily index volume by sourcetype</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Daily-index-volume-by-sourcetype/m-p/62383#M12479</link>
      <description>&lt;P&gt;Try below, select the time last 30 days in time picker.&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt; index=_internal source=*metrics.log group="per_sourcetype_thruput"
 [| metadata type=hosts ( index=a OR index=b OR index=c)| table host | format] 
 (series=wineventlog:system OR series=wineventlog:security OR series=wineventlog:application) 
 | eval formatted_time=strftime(_indextime, "%x")
 | Rename series as sourcetypes
 | chart sum(kb) over sourcetypes by formatted_time
 | sort - [ makeresults | addinfo | eval time="\"".strftime(info_max_time-1, "%x")."\"" | return $time]
 | foreach */* [eval "&amp;lt;&amp;lt;FIELD&amp;gt;&amp;gt;"=round('&amp;lt;&amp;lt;FIELD&amp;gt;&amp;gt;'/1024/1024,2)." GB"]
&lt;/CODE&gt;&lt;/PRE&gt;</description>
      <pubDate>Wed, 20 Jun 2018 14:33:47 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Daily-index-volume-by-sourcetype/m-p/62383#M12479</guid>
      <dc:creator>ss026381</dc:creator>
      <dc:date>2018-06-20T14:33:47Z</dc:date>
    </item>
    <item>
      <title>Re: Daily index volume by sourcetype</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Daily-index-volume-by-sourcetype/m-p/62384#M12480</link>
      <description>&lt;P&gt;I downvoted this post because incomplete.  only top 10 sourcetypes recorded in metrics&lt;/P&gt;</description>
      <pubDate>Wed, 28 Nov 2018 20:58:28 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Daily-index-volume-by-sourcetype/m-p/62384#M12480</guid>
      <dc:creator>landen99</dc:creator>
      <dc:date>2018-11-28T20:58:28Z</dc:date>
    </item>
    <item>
      <title>Re: Daily index volume by sourcetype</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Daily-index-volume-by-sourcetype/m-p/62385#M12481</link>
      <description>&lt;P&gt;I downvoted this post because the results from this solution are incomplete. only the top 10 sourcetypes are recorded in metrics&lt;/P&gt;</description>
      <pubDate>Wed, 28 Nov 2018 20:59:18 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Daily-index-volume-by-sourcetype/m-p/62385#M12481</guid>
      <dc:creator>landen99</dc:creator>
      <dc:date>2018-11-28T20:59:18Z</dc:date>
    </item>
    <item>
      <title>Re: Daily index volume by sourcetype</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Daily-index-volume-by-sourcetype/m-p/62386#M12482</link>
      <description>&lt;P&gt;If i have 3 source type like  type, item and payable then what would be command. Can you please give command for my sourctetype.Thanks&lt;/P&gt;</description>
      <pubDate>Sun, 03 Feb 2019 13:58:19 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Daily-index-volume-by-sourcetype/m-p/62386#M12482</guid>
      <dc:creator>rakesh44</dc:creator>
      <dc:date>2019-02-03T13:58:19Z</dc:date>
    </item>
    <item>
      <title>Re: Daily index volume by sourcetype</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Daily-index-volume-by-sourcetype/m-p/552546#M91659</link>
      <description>&lt;P&gt;By default the metrics will only provide details on the top 10 items in a series - this applies to almost all the metrics that are collected.&lt;/P&gt;&lt;P&gt;&lt;A href="https://docs.splunk.com/Documentation/Splunk/8.1.3/Troubleshooting/Aboutmetricslog" target="_blank"&gt;https://docs.splunk.com/Documentation/Splunk/8.1.3/Troubleshooting/Aboutmetricslog&lt;/A&gt;&lt;/P&gt;&lt;P&gt;The way to address this, is to change the limit applied using the limits.conf file - the following stanza added to a indexer / HF will increase the number of items which are tracked&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;[metrics]
maxseries = 50
interval = 60&lt;/LI-CODE&gt;&lt;P&gt;This will ensure the top 50 items are included - so you can adjust this to reflect the likely number of sourcetypes if that is the main metric to be tracked&lt;/P&gt;&lt;P&gt;The "interval" reference allows you to change the frequency that this data is collected to control the volume of data generated in the log. In the same, the frequency is 60seconds to allow minute by minute tracking - but that may be excessive for some purposes.&lt;/P&gt;</description>
      <pubDate>Fri, 21 May 2021 10:23:56 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Daily-index-volume-by-sourcetype/m-p/552546#M91659</guid>
      <dc:creator>wyfwa4</dc:creator>
      <dc:date>2021-05-21T10:23:56Z</dc:date>
    </item>
  </channel>
</rss>

