<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Splunk compression rate for archiving data in Splunk Search</title>
    <link>https://community.splunk.com/t5/Splunk-Search/Splunk-compression-rate-for-archiving-data/m-p/37317#M178633</link>
    <description>&lt;P&gt;Check &lt;CODE&gt;/opt/splunk/var/lib/splunk/defaultdb/db/&lt;/CODE&gt;&lt;/P&gt;

&lt;P&gt;That's where the 'main' index (defaultdb) is stored. In this folder you will find the hot and warm buckets as subdirs, e.g. &lt;CODE&gt;db_1234123412_12341234325_33&lt;/CODE&gt;&lt;/P&gt;

&lt;P&gt;Inside a bucket there will be some metadata files and .tsidx-files (indexes for searching the raw data). Finally there will be a directory called 'rawdata' that contains the zipped raw data.&lt;/P&gt;</description>
    <pubDate>Fri, 16 Aug 2013 14:30:16 GMT</pubDate>
    <dc:creator>kristian_kolb</dc:creator>
    <dc:date>2013-08-16T14:30:16Z</dc:date>
    <item>
      <title>Splunk compression rate for archiving data</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Splunk-compression-rate-for-archiving-data/m-p/37314#M178630</link>
      <description>&lt;P&gt;i have to  set up a Archiving policy and storage requirements in SPlunk. Estimated logs per day would be 100 GB. So if i go by documentation SPlunk will index 50 GB(with a compression rate of 50%). Then As the data will get old it same move 50 Gb of data from Hot-&amp;gt;Warm-&amp;gt;Cold. At this point i will setup a archival policy to S3(AWS). I wanted to know whether splunk will archive whole 50GB or 100 Gb data in S3 and What amount of data   will be indexed back. Is it going to be 50Gb&amp;gt;&lt;/P&gt;

&lt;P&gt;Please help&lt;/P&gt;</description>
      <pubDate>Fri, 16 Aug 2013 09:39:08 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Splunk-compression-rate-for-archiving-data/m-p/37314#M178630</guid>
      <dc:creator>lohit</dc:creator>
      <dc:date>2013-08-16T09:39:08Z</dc:date>
    </item>
    <item>
      <title>Re: Splunk compression rate for archiving data</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Splunk-compression-rate-for-archiving-data/m-p/37315#M178631</link>
      <description>&lt;P&gt;Normally, on average, Splunk will compress raw data to about half the size, or thereabouts. So your original 100GB will now be 35GB of index-files and 15GB of compressed data, according to a rough estimate.&lt;/P&gt;

&lt;P&gt;When data is &lt;CODE&gt;frozen&lt;/CODE&gt; - which is what I assume you mean by "archival policy", only the compressed data is saved, and the index-files are deleted. So only about 15% of the original size of the raw data is archived. 15GB&lt;/P&gt;

&lt;P&gt;When/if you need to restore archived (frozen) data, you will need to rebuild the index-files before you can search it again. Back to 15+35 GB.&lt;/P&gt;

&lt;P&gt;/K&lt;/P&gt;</description>
      <pubDate>Fri, 16 Aug 2013 11:53:58 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Splunk-compression-rate-for-archiving-data/m-p/37315#M178631</guid>
      <dc:creator>kristian_kolb</dc:creator>
      <dc:date>2013-08-16T11:53:58Z</dc:date>
    </item>
    <item>
      <title>Re: Splunk compression rate for archiving data</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Splunk-compression-rate-for-archiving-data/m-p/37316#M178632</link>
      <description>&lt;P&gt;How can i check the compressed data size?&lt;/P&gt;</description>
      <pubDate>Fri, 16 Aug 2013 12:01:47 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Splunk-compression-rate-for-archiving-data/m-p/37316#M178632</guid>
      <dc:creator>lohit</dc:creator>
      <dc:date>2013-08-16T12:01:47Z</dc:date>
    </item>
    <item>
      <title>Re: Splunk compression rate for archiving data</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Splunk-compression-rate-for-archiving-data/m-p/37317#M178633</link>
      <description>&lt;P&gt;Check &lt;CODE&gt;/opt/splunk/var/lib/splunk/defaultdb/db/&lt;/CODE&gt;&lt;/P&gt;

&lt;P&gt;That's where the 'main' index (defaultdb) is stored. In this folder you will find the hot and warm buckets as subdirs, e.g. &lt;CODE&gt;db_1234123412_12341234325_33&lt;/CODE&gt;&lt;/P&gt;

&lt;P&gt;Inside a bucket there will be some metadata files and .tsidx-files (indexes for searching the raw data). Finally there will be a directory called 'rawdata' that contains the zipped raw data.&lt;/P&gt;</description>
      <pubDate>Fri, 16 Aug 2013 14:30:16 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Splunk-compression-rate-for-archiving-data/m-p/37317#M178633</guid>
      <dc:creator>kristian_kolb</dc:creator>
      <dc:date>2013-08-16T14:30:16Z</dc:date>
    </item>
    <item>
      <title>Re: Splunk compression rate for archiving data</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Splunk-compression-rate-for-archiving-data/m-p/37318#M178634</link>
      <description>&lt;P&gt;So the "50%" would be the size of the bucket as a whole, compared to the uncompressed .gz found in its rawdata directory. &lt;/P&gt;

&lt;P&gt;This can vary from bucket to bucket, and will depend on the compressability of the log data coming in. Over a diverse set of log sources, the figure "50%" is commonly mentioned as an average compression rate.&lt;/P&gt;</description>
      <pubDate>Fri, 16 Aug 2013 14:34:57 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Splunk-compression-rate-for-archiving-data/m-p/37318#M178634</guid>
      <dc:creator>kristian_kolb</dc:creator>
      <dc:date>2013-08-16T14:34:57Z</dc:date>
    </item>
    <item>
      <title>Re: Splunk compression rate for archiving data</title>
      <link>https://community.splunk.com/t5/Splunk-Search/Splunk-compression-rate-for-archiving-data/m-p/37319#M178635</link>
      <description>&lt;P&gt;Has anything changed in this topic?&lt;/P&gt;

&lt;P&gt;Are these calculations actual (I mean &lt;STRONG&gt;about&lt;/STRONG&gt; 15% for data and &lt;STRONG&gt;about&lt;/STRONG&gt; 35% for metadata)?&lt;/P&gt;</description>
      <pubDate>Thu, 09 Jun 2016 10:46:52 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Search/Splunk-compression-rate-for-archiving-data/m-p/37319#M178635</guid>
      <dc:creator>lukasz92</dc:creator>
      <dc:date>2016-06-09T10:46:52Z</dc:date>
    </item>
  </channel>
</rss>

