<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic How does Splunk estimate the total raw data size? in Deployment Architecture</title>
    <link>https://community.splunk.com/t5/Deployment-Architecture/How-does-Splunk-estimate-the-total-raw-data-size/m-p/248961#M9386</link>
    <description>&lt;P&gt;I had just set up Splunk with indexer clustering (RF=3, SF=2) and no data, and then loaded a 1TB syslog file using oneshot.  The "Index Detail: Deployment" page showed the total index size as 1121GB and the total raw data size (uncompressed) as 1783GB, hence a Raw to Index Size Ratio of 1.59:1.&lt;/P&gt;

&lt;P&gt;My question is: how is it possible for a 1024GB (1TB) file to be treated as 1783GB?&lt;/P&gt;</description>
    <pubDate>Sun, 22 Jan 2017 09:21:55 GMT</pubDate>
    <dc:creator>srajarat2</dc:creator>
    <dc:date>2017-01-22T09:21:55Z</dc:date>
    <item>
      <title>How does Splunk estimate the total raw data size?</title>
      <link>https://community.splunk.com/t5/Deployment-Architecture/How-does-Splunk-estimate-the-total-raw-data-size/m-p/248961#M9386</link>
      <description>&lt;P&gt;I had just set up Splunk with indexer clustering (RF=3, SF=2) and no data, and then loaded a 1TB syslog file using oneshot.  The "Index Detail: Deployment" page showed the total index size as 1121GB and the total raw data size (uncompressed) as 1783GB, hence a Raw to Index Size Ratio of 1.59:1.&lt;/P&gt;

&lt;P&gt;My question is: how is it possible for a 1024GB (1TB) file to be treated as 1783GB?&lt;/P&gt;</description>
      <pubDate>Sun, 22 Jan 2017 09:21:55 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Deployment-Architecture/How-does-Splunk-estimate-the-total-raw-data-size/m-p/248961#M9386</guid>
      <dc:creator>srajarat2</dc:creator>
      <dc:date>2017-01-22T09:21:55Z</dc:date>
    </item>
    <item>
      <title>Re: How does Splunk estimate the total raw data size?</title>
      <link>https://community.splunk.com/t5/Deployment-Architecture/How-does-Splunk-estimate-the-total-raw-data-size/m-p/248962#M9387</link>
      <description>&lt;P&gt;The index size doesn't depend only on the uncompressed raw data size. Splunk creates compressed raw data files as well as a set of index files that make the data searchable. The index consists of both types of files. The compression ratio of the raw data files and the size of the index files depend on various factors. For more information, see the following documentation (the second link has examples of how Splunk calculates space).&lt;/P&gt;

&lt;P&gt;&lt;A href="http://docs.splunk.com/Documentation/Splunk/6.5.1/Indexer/HowSplunkstoresindexes"&gt;http://docs.splunk.com/Documentation/Splunk/6.5.1/Indexer/HowSplunkstoresindexes&lt;/A&gt;&lt;BR /&gt;
&lt;A href="http://docs.splunk.com/Documentation/Splunk/6.5.1/Indexer/Systemrequirements#Storage_requirement_examples"&gt;http://docs.splunk.com/Documentation/Splunk/6.5.1/Indexer/Systemrequirements#Storage_requirement_examples&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Sun, 22 Jan 2017 20:52:34 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Deployment-Architecture/How-does-Splunk-estimate-the-total-raw-data-size/m-p/248962#M9387</guid>
      <dc:creator>somesoni2</dc:creator>
      <dc:date>2017-01-22T20:52:34Z</dc:date>
    </item>
    <item>
      <title>Re: How does Splunk estimate the total raw data size?</title>
      <link>https://community.splunk.com/t5/Deployment-Architecture/How-does-Splunk-estimate-the-total-raw-data-size/m-p/248963#M9388</link>
      <description>&lt;P&gt;Sorry if I was not clear.  I am not asking about the index size.  I do understand the sizing calculations for rawdata (RF) and tsidx (SF) in a clustered indexer setup.  My question is specifically about the "Index Detail: Deployment" page, which shows the following information under "Index Structure Overview" (in 6.5.1).&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;  8 (Indexers)     1121GB (Total Index Size)     1783GB (Total Raw Data size (uncompressed))     1.59:1 (Raw to Index Size Ratio)
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;My question is specifically about how Splunk measures the "Total Raw Data size (uncompressed)". I just ingested a 1024GB syslog file, so I expected to see that as the total raw data size, not 1783GB.&lt;/P&gt;</description>
      <pubDate>Mon, 23 Jan 2017 02:59:44 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Deployment-Architecture/How-does-Splunk-estimate-the-total-raw-data-size/m-p/248963#M9388</guid>
      <dc:creator>srajarat2</dc:creator>
      <dc:date>2017-01-23T02:59:44Z</dc:date>
    </item>
    <item>
      <title>Re: How does Splunk estimate the total raw data size?</title>
      <link>https://community.splunk.com/t5/Deployment-Architecture/How-does-Splunk-estimate-the-total-raw-data-size/m-p/248964#M9389</link>
      <description>&lt;P&gt;You can see the screenshot here.&lt;/P&gt;

&lt;P&gt;&lt;IMG src="http://www.somu.us/hidden_sisv/" alt="alt text" /&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 23 Jan 2017 03:25:59 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Deployment-Architecture/How-does-Splunk-estimate-the-total-raw-data-size/m-p/248964#M9389</guid>
      <dc:creator>srajarat2</dc:creator>
      <dc:date>2017-01-23T03:25:59Z</dc:date>
    </item>
  </channel>
</rss>

