Indexing volume vs. data size received

Genti
Splunk Employee

What is the relationship between the size of the logs received by Splunk indexing servers and their indexing volume? On the load balancer we see a similar amount of data sent to each Splunk server, but their indexing volumes are drastically different.

piebob
Splunk Employee

Typically, the compressed, persisted data that Splunk extracts from your data inputs amounts to approximately 10% of the raw data that comes into Splunk. The indexes that are created to access this data can be anywhere from 10% to 110% of the size of the data that comes in. The index size is affected strongly by how many unique terms occur in your data.

(from http://www.splunk.com/base/Documentation/latest/Installation/HowHowmuchspaceyouwillneed )
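To make the arithmetic concrete, here is a rough sketch in Python that applies the ratios quoted above to a given amount of incoming data. The function name `estimate_disk_usage` and its parameters are hypothetical and not part of Splunk. The defaults are just the approximate figures from the answer (10% for compressed raw data, 10% to 110% for the index files), so treat the result as a ballpark, not a measurement.

```python
def estimate_disk_usage(raw_size,
                        rawdata_ratio=0.10,     # assumed: compressed raw data is ~10% of input
                        index_ratio_low=0.10,   # assumed: index files for data with few unique terms
                        index_ratio_high=1.10): # assumed: index files for data with many unique terms
    """Return (low, high) estimated on-disk size, in the same unit as raw_size."""
    rawdata = raw_size * rawdata_ratio
    low = rawdata + raw_size * index_ratio_low
    high = rawdata + raw_size * index_ratio_high
    return low, high


if __name__ == "__main__":
    # Example: 100 GB of raw logs arriving at one indexer.
    low, high = estimate_disk_usage(100.0)
    print(f"Estimated on-disk size: {low:.0f} GB to {high:.0f} GB")
```

Under these assumed ratios, two servers that receive the same 100 GB could end up anywhere between roughly 20 GB and 120 GB on disk, depending on how many unique terms their data contains.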
