<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Estimating size of index in Deployment Architecture</title>
    <link>https://community.splunk.com/t5/Deployment-Architecture/Estimating-size-of-index/m-p/62589#M2127</link>
    <description>&lt;P&gt;martin_mueller,&lt;BR /&gt;
Thanks for your post!&lt;BR /&gt;
To observe a data sample's size is really a good idea, but It's hard to get it right now. So I would like to estimate the maximum size of the index file.&lt;/P&gt;</description>
    <pubDate>Fri, 15 Mar 2013 09:59:54 GMT</pubDate>
    <dc:creator>lzhang_soliton</dc:creator>
    <dc:date>2013-03-15T09:59:54Z</dc:date>
    <item>
      <title>Estimating size of index</title>
      <link>https://community.splunk.com/t5/Deployment-Architecture/Estimating-size-of-index/m-p/62587#M2125</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;

&lt;P&gt;I have been looking for how to calculate the range of index file in Splunk. I wrote the math according to the following description. Could anyone check it?&lt;/P&gt;

&lt;BLOCKQUOTE&gt;
&lt;P&gt;&lt;A href="http://docs.splunk.com/Documentation/Splunk/5.0.2/Installation/Estimateyourstoragerequirements"&gt;Estimate your storage requirements&lt;/A&gt;&lt;/P&gt;

&lt;P&gt;The compressed rawdata file is approximately 10% the size of the incoming, pre-indexed raw data.&lt;/P&gt;

&lt;P&gt;The associated index files range in size from approximately 10% to 110% of the rawdata file.&lt;/P&gt;
&lt;/BLOCKQUOTE&gt;

&lt;PRE&gt;&lt;CODE&gt;Raw data size: 9TB
"rawdata file size": 9TB x 10%
Minimum index size: (9TB x 10%) + ((9TB x 10%) x 10%)
Maximum index size: (9TB x 10%) + ((9TB x 10%) x 110%)
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;Thank you all in advance.&lt;/P&gt;</description>
      <pubDate>Fri, 15 Mar 2013 09:19:34 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Deployment-Architecture/Estimating-size-of-index/m-p/62587#M2125</guid>
      <dc:creator>lzhang_soliton</dc:creator>
      <dc:date>2013-03-15T09:19:34Z</dc:date>
    </item>
    <item>
      <title>Re: Estimating size of index</title>
      <link>https://community.splunk.com/t5/Deployment-Architecture/Estimating-size-of-index/m-p/62588#M2126</link>
      <description>&lt;P&gt;You've got one "x 10%" too many. The index size is based on the raw data, not the compressed raw data.&lt;/P&gt;

&lt;P&gt;To get a more accurate reading on your data you could take a 10GB sample and store it in a temporary index, take that size on disk as a baseline.&lt;/P&gt;

&lt;P&gt;As a real-life example, JSON data from Twitter is compressible to about 15% and yields indexes about 60% of the raw data size - in total you'd need about 75% of the raw size on disk.&lt;/P&gt;</description>
      <pubDate>Fri, 15 Mar 2013 09:27:25 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Deployment-Architecture/Estimating-size-of-index/m-p/62588#M2126</guid>
      <dc:creator>martin_mueller</dc:creator>
      <dc:date>2013-03-15T09:27:25Z</dc:date>
    </item>
    <item>
      <title>Re: Estimating size of index</title>
      <link>https://community.splunk.com/t5/Deployment-Architecture/Estimating-size-of-index/m-p/62589#M2127</link>
      <description>&lt;P&gt;martin_mueller,&lt;BR /&gt;
Thanks for your post!&lt;BR /&gt;
To observe a data sample's size is really a good idea, but It's hard to get it right now. So I would like to estimate the maximum size of the index file.&lt;/P&gt;</description>
      <pubDate>Fri, 15 Mar 2013 09:59:54 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Deployment-Architecture/Estimating-size-of-index/m-p/62589#M2127</guid>
      <dc:creator>lzhang_soliton</dc:creator>
      <dc:date>2013-03-15T09:59:54Z</dc:date>
    </item>
    <item>
      <title>Re: Estimating size of index</title>
      <link>https://community.splunk.com/t5/Deployment-Architecture/Estimating-size-of-index/m-p/62590#M2128</link>
      <description>&lt;P&gt;For ongoing knowledge of your indexers, I &lt;STRONG&gt;highly&lt;/STRONG&gt; recommend Sanford Owing's Fire Brigade &lt;A href="http://apps.splunk.com/app/1632/"&gt;Fire Brigade&lt;/A&gt; App on Splunkbase. It will give you a ton of information on the size of indexes and individual buckets for existing indexes, which lets you plan for future growth.&lt;/P&gt;

&lt;P&gt;(Edited to link to Fire Brigade Version 2 for Splunk 5 &amp;amp; 6)&lt;/P&gt;</description>
      <pubDate>Tue, 13 May 2014 16:08:58 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Deployment-Architecture/Estimating-size-of-index/m-p/62590#M2128</guid>
      <dc:creator>ckurtz</dc:creator>
      <dc:date>2014-05-13T16:08:58Z</dc:date>
    </item>
  </channel>
</rss>

