Getting Data In

What are best practices on setting the replication factor for X number of indexers in an indexer cluster?

dhavamanis
Builder

Need your help,

We are increasing the number of indexer nodes in our indexer cluster to maximize availability. Can you share the recommended Splunk replication factor for a given number of indexer nodes? We want to use as little storage as possible while keeping all data available for search even if one or two nodes in the indexer pool go down.

1 Solution

somesoni2
Revered Legend

As far as I know, the replication factor depends on a single criterion: how many node failures you can tolerate without data loss.
So,

Replication Factor = Number of tolerated indexer failures + 1

If you have 4 indexers and want all data to remain available even with 2 nodes down, you need a replication factor of 3, and so on.
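
To make the arithmetic concrete, here is a minimal Python sketch of the rule above. The function name `replication_factor` and its parameters are hypothetical, made up for illustration and not part of any Splunk API. The check that the result cannot exceed the number of indexers reflects my understanding that a cluster needs at least as many peer nodes as the replication factor.

```python
def replication_factor(tolerated_failures: int, indexer_count: int) -> int:
    """Return the replication factor needed to survive `tolerated_failures`
    indexer failures without data loss (hypothetical helper, not a Splunk API)."""
    if tolerated_failures < 0:
        raise ValueError("tolerated_failures must be non-negative")

    # Rule from the answer: one copy per tolerated failure, plus one survivor.
    rf = tolerated_failures + 1

    # Assumption: the cluster cannot hold more copies than it has indexers.
    if rf > indexer_count:
        raise ValueError(
            f"replication factor {rf} needs at least {rf} indexers, "
            f"but only {indexer_count} are available"
        )
    return rf


# 4 indexers, tolerate 2 failed nodes
print(replication_factor(tolerated_failures=2, indexer_count=4))  # 3
```

Since each bucket is stored once per copy, a lower replication factor uses less storage, so the smallest value that covers your tolerated failures is the one to pick.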

See this for more information

http://docs.splunk.com/Documentation/Splunk/6.4.1/Indexer/Thereplicationfactor#Replication_factor_an...

