Deployment Architecture

Storage size thoughts and calculations

horsefez
Motivator

Hi fellow Splunkers!

I'm currently trying to figure out how much data storage my Splunk deployment would need if I indexed up to 10 GB of data per day.

What Splunk thinks about it:

Typically, the compressed rawdata file is 10% the size of the incoming, pre-indexed raw data. The associated index files range in size from approximately 10% to 110% of the rawdata file. The number of unique terms in the data affect this value.

http://docs.splunk.com/Documentation/Splunk/6.3.3/Capacity/Estimateyourstoragerequirements

What I think about it:

I have a log volume of 10 GB per day.
This would be an estimated rawdata size of 10 GB.
This would be an estimated compressed (10%) rawdata size of 1 GB.
The smallest possible index data size (10%) would be 1 GB.
The largest possible index data size (110%) would be 11 GB.

As a result, I would have to make room for between 2 GB and 12 GB of storage for every day of data I want to keep.
Am I right?
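
The arithmetic above can be written out as a minimal sketch. The function name and the `index_base` parameter are hypothetical, introduced only for illustration. The default (`"incoming"`) applies the 10% to 110% index range to the incoming data, as the calculation above does; `"rawdata"` applies it to the compressed rawdata file instead, which is another possible reading of the quoted documentation sentence.

```python
import math


def daily_storage_range_gb(daily_raw_gb, rawdata_ratio=0.10,
                           index_ratio_min=0.10, index_ratio_max=1.10,
                           index_base="incoming"):
    """Return (low, high) estimated on-disk GB for one day of indexed data.

    index_base is an assumption made for this sketch:
      "incoming" -> index percentages are taken from the incoming raw volume
      "rawdata"  -> index percentages are taken from the compressed rawdata file
    """
    # Compressed rawdata is estimated as a fixed fraction of the incoming data.
    rawdata_gb = daily_raw_gb * rawdata_ratio

    if index_base == "incoming":
        base_gb = daily_raw_gb
    elif index_base == "rawdata":
        base_gb = rawdata_gb
    else:
        raise ValueError("index_base must be 'incoming' or 'rawdata'")

    low = rawdata_gb + base_gb * index_ratio_min
    high = rawdata_gb + base_gb * index_ratio_max
    return low, high


if __name__ == "__main__":
    for base in ("incoming", "rawdata"):
        low, high = daily_storage_range_gb(10, index_base=base)
        print(f"{base}: {low:.1f} GB to {high:.1f} GB per day")
```

With the default reading this reproduces the 2 GB to 12 GB range from the post; with `index_base="rawdata"` the same 10 GB/day comes out at roughly 1.1 GB to 2.1 GB per day.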

What you think about it:
...

0 Karma
1 Solution

jmallorquin
Builder

Hi,

You can use this tool.

https://splunk-sizing.appspot.com/

Hope this helps.

woodcock
Esteemed Legend

The general rule of thumb calculation is:

raw_daily_bandwidth * days-to-retain-data * index-replication-factor / 2 (the division by 2 accounts for both the reduction from compression and the bloat from indexing overhead, ASSUMING you are NOT using `indexed_extractions`)
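
As a minimal sketch, the rule of thumb above can be expressed as a small function. The function and parameter names are hypothetical, and the 90-day retention and replication factor of 3 in the usage lines are illustrative values, not figures from this thread.

```python
def retained_storage_gb(raw_daily_gb, retention_days, replication_factor=1):
    """Rule-of-thumb total storage in GB.

    The division by 2 folds compression and indexing overhead into one
    factor and assumes indexed extractions are not in use.
    """
    return raw_daily_gb * retention_days * replication_factor / 2


if __name__ == "__main__":
    # Illustrative assumption: 10 GB/day kept for 90 days on a single indexer.
    print(retained_storage_gb(10, 90))
    # Same volume and retention with a hypothetical replication factor of 3.
    print(retained_storage_gb(10, 90, replication_factor=3))
```

For a non-clustered deployment the replication factor is simply 1, so only the daily volume and the retention period drive the estimate.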
0 Karma

gwiley_splunk
Splunk Employee

One key question here is how long do you want to keep the data for?

Other questions worthy of consideration are:

  • are you planning to use data models and/or summary indexes? Note that some apps on Splunkbase may use data models or summary indexes.
  • are you running a cluster and planning to use index replication?

The splunk-sizing web app will help you get most of the way there and allows you to specify storage contingency.

Cheers, Greg.

0 Karma

horsefez
Motivator

Thanks to you, too! 🙂
What would change if I plan to use data models?
Would I need even more storage?

0 Karma

jmallorquin
Builder

Only if you accelerate the data models.

0 Karma

horsefez
Motivator

Wow, this is an amazing tool. Thank you! 🙂

0 Karma