<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Compression rate for indexes / hot / warm / cold / frozen ? in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Compression-rate-for-indexes-hot-warm-cold-frozen/m-p/89904#M97322</link>
    <description>&lt;P&gt;No.&lt;BR /&gt;
But you can run a test by segregating each source/sourcetype to a different index, index a significant sample, then compare with the previous search.&lt;/P&gt;</description>
    <pubDate>Tue, 27 Aug 2013 17:43:56 GMT</pubDate>
    <dc:creator>yannK</dc:creator>
    <dc:date>2013-08-27T17:43:56Z</dc:date>
    <item>
      <title>Compression rate for indexes / hot / warm / cold / frozen ?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Compression-rate-for-indexes-hot-warm-cold-frozen/m-p/89899#M97317</link>
      <description>&lt;P&gt;I have a few easy question about splunk data compression rate.&lt;/P&gt;

&lt;OL&gt;
&lt;LI&gt;&lt;P&gt;What is the typical compression rate for english ASCII based data?&lt;/P&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;P&gt;Is the compression rate different from hot / warn / cold / frozen?&lt;/P&gt;&lt;/LI&gt;
&lt;LI&gt;&lt;P&gt;Does hot buckets also get compressed?&lt;/P&gt;&lt;/LI&gt;
&lt;/OL&gt;

&lt;P&gt;Easy, huh?  Thanks for your answer!&lt;/P&gt;</description>
      <pubDate>Tue, 03 Jul 2012 17:28:03 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Compression-rate-for-indexes-hot-warm-cold-frozen/m-p/89899#M97317</guid>
      <dc:creator>clyde772</dc:creator>
      <dc:date>2012-07-03T17:28:03Z</dc:date>
    </item>
    <item>
      <title>Re: Compression rate for indexes / hot / warm / cold / frozen ?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Compression-rate-for-indexes-hot-warm-cold-frozen/m-p/89900#M97318</link>
      <description>&lt;P&gt;easy :&lt;/P&gt;

&lt;P&gt;1 - roughly between 1 and infinite minus one.&lt;BR /&gt;
Seriously, it depends of your data, here is a method is to calculate it.&lt;BR /&gt;
I usually see about 40%~50% compression.&lt;/P&gt;

&lt;P&gt;in this example we look at index=_internal, please replace by your index.&lt;BR /&gt;
&lt;PRE&gt;&lt;BR /&gt;
| dbinspect index=_internal&lt;BR /&gt;
| fields state,id,rawSize,sizeOnDiskMB &lt;BR /&gt;
| stats sum(rawSize) AS rawTotal, sum(sizeOnDiskMB) AS diskTotalinMB&lt;BR /&gt;
| eval rawTotalinMB=(rawTotal / 1024 / 1024) | fields - rawTotal&lt;BR /&gt;
| eval compression=tostring(round(diskTotalinMB / rawTotalinMB * 100, 2)) + "%"&lt;BR /&gt;
| table rawTotalinMB, diskTotalinMB, compression&lt;BR /&gt;
&lt;/PRE&gt;&lt;/P&gt;

&lt;P&gt;2 - the compression rate is identical for hot / warm / cold / frozen&lt;BR /&gt;
However when a bucket is frozen, some metadata files are removed or compressed (it saves some MB), they can be recreated when thawed. &lt;/P&gt;

&lt;P&gt;3 - the hot buckets are been written already compressed.&lt;/P&gt;</description>
      <pubDate>Mon, 28 Sep 2020 12:01:13 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Compression-rate-for-indexes-hot-warm-cold-frozen/m-p/89900#M97318</guid>
      <dc:creator>yannK</dc:creator>
      <dc:date>2020-09-28T12:01:13Z</dc:date>
    </item>
    <item>
      <title>Re: Compression rate for indexes / hot / warm / cold / frozen ?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Compression-rate-for-indexes-hot-warm-cold-frozen/m-p/89901#M97319</link>
      <description>&lt;P&gt;Featuring "Sanford" for the search.&lt;/P&gt;</description>
      <pubDate>Tue, 03 Jul 2012 18:03:27 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Compression-rate-for-indexes-hot-warm-cold-frozen/m-p/89901#M97319</guid>
      <dc:creator>yannK</dc:creator>
      <dc:date>2012-07-03T18:03:27Z</dc:date>
    </item>
    <item>
      <title>Re: Compression rate for indexes / hot / warm / cold / frozen ?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Compression-rate-for-indexes-hot-warm-cold-frozen/m-p/89902#M97320</link>
      <description>&lt;P&gt;extra question :&lt;/P&gt;

&lt;UL&gt;
&lt;LI&gt;is the license volume counted on the stored compressed data or on the uncompressed data ?&lt;/LI&gt;
&lt;LI&gt;Answer -&amp;gt; on the uncompressed data.&lt;/LI&gt;
&lt;/UL&gt;</description>
      <pubDate>Tue, 22 Jan 2013 20:28:56 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Compression-rate-for-indexes-hot-warm-cold-frozen/m-p/89902#M97320</guid>
      <dc:creator>yannK</dc:creator>
      <dc:date>2013-01-22T20:28:56Z</dc:date>
    </item>
    <item>
      <title>Re: Compression rate for indexes / hot / warm / cold / frozen ?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Compression-rate-for-indexes-hot-warm-cold-frozen/m-p/89903#M97321</link>
      <description>&lt;P&gt;@yannk, is there a straightforward way to calculate compression ratios for different sources or sourcetypes within an index?&lt;/P&gt;</description>
      <pubDate>Tue, 27 Aug 2013 17:36:59 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Compression-rate-for-indexes-hot-warm-cold-frozen/m-p/89903#M97321</guid>
      <dc:creator>cphair</dc:creator>
      <dc:date>2013-08-27T17:36:59Z</dc:date>
    </item>
    <item>
      <title>Re: Compression rate for indexes / hot / warm / cold / frozen ?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Compression-rate-for-indexes-hot-warm-cold-frozen/m-p/89904#M97322</link>
      <description>&lt;P&gt;No.&lt;BR /&gt;
But you can run a test by segregating each source/sourcetype to a different index, index a significant sample, then compare with the previous search.&lt;/P&gt;</description>
      <pubDate>Tue, 27 Aug 2013 17:43:56 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Compression-rate-for-indexes-hot-warm-cold-frozen/m-p/89904#M97322</guid>
      <dc:creator>yannK</dc:creator>
      <dc:date>2013-08-27T17:43:56Z</dc:date>
    </item>
    <item>
      <title>Re: Compression rate for indexes / hot / warm / cold / frozen ?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Compression-rate-for-indexes-hot-warm-cold-frozen/m-p/89905#M97323</link>
      <description>&lt;P&gt;Always a good idea to add new data to a test index and check for &lt;/P&gt;

&lt;P&gt;compression&lt;BR /&gt;
line-breaking&lt;BR /&gt;
time-stamping&lt;/P&gt;

&lt;P&gt;before creating the input in a production environment.&lt;/P&gt;</description>
      <pubDate>Tue, 27 Aug 2013 17:53:11 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Compression-rate-for-indexes-hot-warm-cold-frozen/m-p/89905#M97323</guid>
      <dc:creator>lguinn2</dc:creator>
      <dc:date>2013-08-27T17:53:11Z</dc:date>
    </item>
    <item>
      <title>Re: Compression rate for indexes / hot / warm / cold / frozen ?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Compression-rate-for-indexes-hot-warm-cold-frozen/m-p/89906#M97324</link>
      <description>&lt;P&gt;With the above logic for most of the indexers I see 200+% compression an eg rawTotal=42726 and diskTotalinMB = 102921.&lt;/P&gt;

&lt;P&gt;As per documentation compression should be around 50% meaning diskTotalinMB should be halfth the rawTotal. But in my case it is more than 2.5 times. Any pointers why it consumes more disk space?&lt;/P&gt;</description>
      <pubDate>Fri, 06 Sep 2019 12:12:48 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Compression-rate-for-indexes-hot-warm-cold-frozen/m-p/89906#M97324</guid>
      <dc:creator>sakthiganesht</dc:creator>
      <dc:date>2019-09-06T12:12:48Z</dc:date>
    </item>
    <item>
      <title>Re: Compression rate for indexes / hot / warm / cold / frozen ?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Compression-rate-for-indexes-hot-warm-cold-frozen/m-p/89907#M97325</link>
      <description>&lt;P&gt;It all depends on your data.  If you are using indexed extractions on json data you will get virtually no reduction is total disk size since the tsidx files will be huge compared to typical syslog data (on which the documentation is built).&lt;/P&gt;</description>
      <pubDate>Fri, 06 Sep 2019 19:28:20 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Compression-rate-for-indexes-hot-warm-cold-frozen/m-p/89907#M97325</guid>
      <dc:creator>hunderliggur</dc:creator>
      <dc:date>2019-09-06T19:28:20Z</dc:date>
    </item>
    <item>
      <title>Re: Compression rate for indexes / hot / warm / cold / frozen ?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Compression-rate-for-indexes-hot-warm-cold-frozen/m-p/89908#M97326</link>
      <description>&lt;P&gt;Hello guys,&lt;BR /&gt;
it looks like frozen data is around -50% compared to hot/cold, is this correct?&lt;BR /&gt;
Thanks.&lt;/P&gt;</description>
      <pubDate>Tue, 10 Dec 2019 16:28:43 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Compression-rate-for-indexes-hot-warm-cold-frozen/m-p/89908#M97326</guid>
      <dc:creator>splunkreal</dc:creator>
      <dc:date>2019-12-10T16:28:43Z</dc:date>
    </item>
    <item>
      <title>Re: Compression rate for indexes / hot / warm / cold / frozen ?</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Compression-rate-for-indexes-hot-warm-cold-frozen/m-p/708259#M117016</link>
      <description>&lt;P&gt;See my reply here if it can help&lt;/P&gt;&lt;P&gt;&lt;A href="https://community.splunk.com/t5/Deployment-Architecture/Splunk-Storage-Sizing-Guidelines-and-calculations/m-p/708258/highlight/true#M29013" target="_blank"&gt;https://community.splunk.com/t5/Deployment-Architecture/Splunk-Storage-Sizing-Guidelines-and-calculations/m-p/708258/highlight/true#M29013&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Wed, 08 Jan 2025 15:14:58 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Compression-rate-for-indexes-hot-warm-cold-frozen/m-p/708259#M117016</guid>
      <dc:creator>edoardo_vicendo</dc:creator>
      <dc:date>2025-01-08T15:14:58Z</dc:date>
    </item>
  </channel>
</rss>

