Hi Everyone,
I hope all is well.
As Splunk consultant, how can you estimate the required hardware specs such as CPU and RAM:
Here, I will leave my Splunk infrastructure HLD, and I wish to know how roughly we can ask the right question and use the answers to request the proper CPU and RAM:
Assumption:
Data Volume: 130 GB/day
Number of sites : Three sites
Site1: Splunk infrastructure - Site2: Data collection - Site3: Data Collection
RF/SF : 2/1
Hi All,
Thank you for your valuable replies, it's really appreciated!
The main challenge here, we dont have any public sizing guidlines to follow when we do the splunk instance sizing task, so I truly depend on your feedback.
Hi @0xAli ,
as also @PickleRick said: the best approach is to use a certified Architect from a trusted Splunk Partner.
Ciao.
Giuseppe
This is the kind of question you don't want to rely on answer from some randos from internet on. This is something you want to work with your local Splunk Partner with so that you have some papers to back your sizing if anyone from management wants for justification of the cost or for where the performance problems come from. They will also ask some difficult questions about your data, your workload and so on which we in the community might not have time to think about and you might not be eager to share answers to with the whole world. So while we can discuss the general principles of sizing environments freely here I'd suggest engaging professional help for a specific project.
Hi at all,
I agree with all the considerations from @0xAli, there's only one additional information:
the sizing of using an indexer every 80-100 GB/day to index: this sizing arrives from the Enterprise Security Admin Course, but it isn't in any Splunk official documentation, infact working with the Splunk PS I received by them a different sizing: they hinted to use 150 GB/day per Indexer and, having very performant physical servers, also 200-250 GB/day per Indexer (this is the sizing from a Splunk architect of a project that I'm implementing ).
What is the correct answer?
I usually try to request a sizing to Splunk SE to be sure, if I must do the sizing by myself I use 150 GB/day per Indexer.
Ciao.
Giuseppe
Hey 0xAli — quick version: ES is what drives your sizing, not the 130 GB/day. Correlation searches and accelerated data models hit CPU and disk IOPS far harder than plain indexing, so size for the search load.
Rough starting point:
One design note: pointing network gear straight at an HF over UDP syslog drops events under load. Put SC4S (or syslog-ng/rsyslog writing to disk with a UF reading it) in front instead.
For real procurement numbers, don't eyeball it — run your inputs through a sizing calculator, match the topology against the Validated Architectures, and for an ES build it's worth a quick capacity-planning session with Splunk PS or a partner before you buy.
Docs worth referencing: