<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Tracking indexing per source - without _internal index access in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Tracking-indexing-per-source-without-internal-index-access/m-p/590731#M103470</link>
    <description>&lt;P&gt;That method is close enough, but will be slow since you have to read every event to get its size.&lt;/P&gt;&lt;P&gt;To improve performance ever so slightly, add up the length of _raw then convert to MB/GB/TB at the end.&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;index=aws-index sourcetype=someSource
source="/some/source/file.log"
| eval raw_len=len(_raw)
| stats sum(raw_len) as B by source
| eval MB = B/1024/1024, eval GB = B/1024/1024/1024, eval TB = B/1024/1024/1024/1024&lt;/LI-CODE&gt;</description>
    <pubDate>Thu, 24 Mar 2022 16:50:36 GMT</pubDate>
    <dc:creator>richgalloway</dc:creator>
    <dc:date>2022-03-24T16:50:36Z</dc:date>
    <item>
      <title>Tracking indexing per source - without _internal index access</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Tracking-indexing-per-source-without-internal-index-access/m-p/590728#M103468</link>
      <description>&lt;P&gt;Looking to measure heavy sources and track how much is getting indexed per day by source.&lt;BR /&gt;the main problem is our Splunk admin team cannot give us access to the _internal index, so i cannot run the standard&amp;nbsp; _internal metrics commands such as:&lt;/P&gt;&lt;PRE&gt; index=_internal sourcetype=splunkd source=*metrics.log* group=per_source_thruput&lt;/PRE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Curious as to how accurate measuring actual log sizes with Splunk commands might be compared to _internal index stats. we dont need 100% accurate results just a ballpark estimate such as one source might be indexing 5-600Gbs per day or 1-1.5 Tb a day for example.&lt;BR /&gt;Thinking of trying something like&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;index=aws-index sourcetype=someSource
source="/some/source/file.log"
| eval raw_len=len(_raw)
| eval raw_len_kb = raw_len/1024
| eval raw_len_mb = raw_len/1024/1024
| eval raw_len_gb = raw_len/1024/1024/1024
| eval raw_len_tb = raw_len/1024/1024/1024/1024
| stats sum(raw_len_mb) as MB sum(raw_len_gb) as GB sum(raw_len_tb) as TB by source&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 24 Mar 2022 16:43:47 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Tracking-indexing-per-source-without-internal-index-access/m-p/590728#M103468</guid>
      <dc:creator>sonicZ</dc:creator>
      <dc:date>2022-03-24T16:43:47Z</dc:date>
    </item>
    <item>
      <title>Re: Tracking indexing per source - without _internal index access</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Tracking-indexing-per-source-without-internal-index-access/m-p/590729#M103469</link>
      <description>&lt;P&gt;Why not ask your admin team to setup a summary index for license usage logs and give you access to that summary index. That way you can have access to that data without having access to whole _internal index. Something like this:&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.splunk.com/t5/Getting-Data-In/How-to-create-a-summary-Index-that-will-give-license-usage-by/m-p/307561" target="_blank"&gt;https://community.splunk.com/t5/Getting-Data-In/How-to-create-a-summary-Index-that-will-give-license-usage-by/m-p/307561&lt;/A&gt;&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 24 Mar 2022 16:48:55 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Tracking-indexing-per-source-without-internal-index-access/m-p/590729#M103469</guid>
      <dc:creator>somesoni2</dc:creator>
      <dc:date>2022-03-24T16:48:55Z</dc:date>
    </item>
    <item>
      <title>Re: Tracking indexing per source - without _internal index access</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Tracking-indexing-per-source-without-internal-index-access/m-p/590731#M103470</link>
      <description>&lt;P&gt;That method is close enough, but will be slow since you have to read every event to get its size.&lt;/P&gt;&lt;P&gt;To improve performance ever so slightly, add up the length of _raw then convert to MB/GB/TB at the end.&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;index=aws-index sourcetype=someSource
source="/some/source/file.log"
| eval raw_len=len(_raw)
| stats sum(raw_len) as B by source
| eval MB = B/1024/1024, eval GB = B/1024/1024/1024, eval TB = B/1024/1024/1024/1024&lt;/LI-CODE&gt;</description>
      <pubDate>Thu, 24 Mar 2022 16:50:36 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Tracking-indexing-per-source-without-internal-index-access/m-p/590731#M103470</guid>
      <dc:creator>richgalloway</dc:creator>
      <dc:date>2022-03-24T16:50:36Z</dc:date>
    </item>
    <item>
      <title>Re: Tracking indexing per source - without _internal index access</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Tracking-indexing-per-source-without-internal-index-access/m-p/590732#M103471</link>
      <description>&lt;P&gt;trying to get our Splunk admin team to do anything here is like pulling teeth &lt;span class="lia-unicode-emoji" title=":slightly_smiling_face:"&gt;🙂&lt;/span&gt; but summary indexing might work thanks for that. Will probably take them weeks to get to unfortunately&lt;/P&gt;</description>
      <pubDate>Thu, 24 Mar 2022 16:52:36 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Tracking-indexing-per-source-without-internal-index-access/m-p/590732#M103471</guid>
      <dc:creator>sonicZ</dc:creator>
      <dc:date>2022-03-24T16:52:36Z</dc:date>
    </item>
    <item>
      <title>Re: Tracking indexing per source - without _internal index access</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Tracking-indexing-per-source-without-internal-index-access/m-p/590735#M103473</link>
      <description>&lt;P&gt;Thanks, that will most likely help a bit!&lt;BR /&gt;planning to run this a few times per day so we can populate results in a .csv lookup table as well&lt;BR /&gt;&lt;BR /&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 24 Mar 2022 17:04:10 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Tracking-indexing-per-source-without-internal-index-access/m-p/590735#M103473</guid>
      <dc:creator>sonicZ</dc:creator>
      <dc:date>2022-03-24T17:04:10Z</dc:date>
    </item>
  </channel>
</rss>

