Getting Data In

Newbie Question: Getting Data into a Distributed Cluster

zadunn
Engager

Hey all!

I am trying to understand splunk a little better. I am trying to setup a search head and two indexers. I have all that configured (well everything is added into the search head). Now I am wondering, aside from the splunk forwarder handling automatic load balancing between the two index nodes, what is the best practice on getting data into the indexes? Put more clearly, say i want to collect rsyslogd data on port 514. Do I need to configure each indexer, and then make sure that I am alternating which 'nix boxes i am assigning to which indexer? Or do I need to configure the search head as a forwarder, use that as a single point of entry for everything (how well would that scale?) and then let the splunk forwarder LB it between the two indexers? Do i need to create the indexes manually on each index node?

Lots of questions, like I mentioned I am new to all this.

Thanks!

Zach

Tags (2)

gkanapathy
Splunk Employee
Splunk Employee

You should forward to a Splunk forwarder (preferably not your search head), which will then distribute the data among the nodes of your indexing cluster. For UDP syslog packets in particular, you can use a hardware load balancer or some other way to scatter the packets, but you can't do this with TCP streams. You don't really need a separate dedicated search head with only two indexers, and if it's similar hardware, I'd say that you will do better using it as a third indexer and then picking one of those and using it as your search head at the same time.

Get Updates on the Splunk Community!

Index This | What is broken 80% of the time by February?

December 2025 Edition   Hayyy Splunk Education Enthusiasts and the Eternally Curious!    We’re back with this ...

Unlock Faster Time-to-Value on Edge and Ingest Processor with New SPL2 Pipeline ...

Hello Splunk Community,   We're thrilled to share an exciting update that will help you manage your data more ...

Splunk MCP & Agentic AI: Machine Data Without Limits

Discover how the Splunk Model Context Protocol (MCP) Server can revolutionize the way your organization uses ...