Solved: Why does Splunk have multiple indexes?

maverick · ‎04-22-2010

With Splunk's normalizing timestamp-based event indexing capabilities combined with it's powerful search language and processing commands, one would think that all you need is one big main index.

So why is there more than one index and what are the reasons for creating additional indexes?

gkanapathy · ‎04-22-2010

Mulitple indexes are indicated usually for two reasons:

Physical data separation
- This may be related to access control of data, but it is not necessary to use separate indexes to control access to data, although with current (v4.1) Splunk management capabilities, access control is easiest to configure with separate indexes.
Differential retention periods for different data sets
- This includes summary indexing of different time densities, test indexes, as well as cases of some data having longer retention requirements than other (often extremely high-volume) data has shorter requirements.

Performance is not a typical consideration, and the effect of multiple indexes vs a single one for a given set of data varies greatly depending on the exact nature of the data and the exact queries or mix of queries to be performed against it.

View solution in original post

muebel · ‎04-22-2010

In addition to gkanapathy's answer, additional indexes seems to be part and parcel of how summary indexing works.

http://www.splunk.com/base/Documentation/4.1.1/Knowledge/Usesummaryindexing

View solution in original post

jrodman · ‎04-22-2010

There are performance goals as well, sparse data (login errors) will be more performant when searched apart from bulk data (firewall rule traversals). There's administrative overhead in creating multiple indexes (you have to configure them) but when you will have a large amount of data of quite different volumes in high performance environments this can be worthwhile. This is the main reason that summary indexing goes to a new index (it could use the same one).

There are more obscure cases as well for performance, such as different segmentation per index, but ideally this is not necessary.

muebel · ‎04-22-2010

In addition to gkanapathy's answer, additional indexes seems to be part and parcel of how summary indexing works.

http://www.splunk.com/base/Documentation/4.1.1/Knowledge/Usesummaryindexing

Genti · ‎10-06-2010

ha! bad Maverick, bad!

gkanapathy · ‎04-24-2010

maverick is on vendetta against me, jrodman, and other Splunk employees on this site.

muebel · ‎04-22-2010

I have no idea why this was considered the best answer hah

gkanapathy · ‎04-22-2010

Mulitple indexes are indicated usually for two reasons:

Physical data separation
- This may be related to access control of data, but it is not necessary to use separate indexes to control access to data, although with current (v4.1) Splunk management capabilities, access control is easiest to configure with separate indexes.
Differential retention periods for different data sets
- This includes summary indexing of different time densities, test indexes, as well as cases of some data having longer retention requirements than other (often extremely high-volume) data has shorter requirements.

Performance is not a typical consideration, and the effect of multiple indexes vs a single one for a given set of data varies greatly depending on the exact nature of the data and the exact queries or mix of queries to be performed against it.

Why does Splunk have multiple indexes?

Join the Splunk Community Slack to learn, troubleshoot, and make connections with fellow Splunk practitioners in real time!

Join Splunk User Groups to connect and learn in-person by region or remotely by topic or industry.

Laser Bananas and Edge Hubs: Exploring Operational Technology (OT) Data Through a ...

Event Series: Mastering AI Tokenomics and Splunk Agent Observability

span_metrics: The OpenTelemetry-Idiomatic Way to See Inside Your Services

Join the Conversation