Getting Data In

Large WMI environment questions

Jason
Motivator

This question is helpful, but I have a client who needs more detail on a WMI-polling environment. Ideally a conference call with an architect. Specifically:

  • What is your largest known WMI-polling architecture?
  • What server in a search head - multiple Indexers - multiple LFs and collectors should do the WMI collection?
  • Is ~120 servers a best practice for number of polled per 16GB Splunk server?
  • What is an appropriate polling frequency? They want as close to real-time alerts as possible.
  • What is the effect on the polled machine?
  • Is load balancing possible, in case LF1 is down, that LF2 can handle the polling?

Let me know if I should break this out into multiple questions on here. Thanks!

0 Karma
1 Solution

gkanapathy
Splunk Employee
Splunk Employee
  • I don't know
  • A forwarder, or multiple forwarders
  • No. Number of servers is extremely approximate. The limitation is the number of WMI connections that a server can handle, and each query for a server requires a separate connection. So if you're issuing 5 queries, you can only collect from 1/5 as many servers. Furthermore, if you're collecting event logs rather than performance stats via WMI, it will be even more costly since every query must bring back the entire event log on each poll. It is best practice not to collect event logs via WMI, but instead to forward them, either with Splunk forwarders, syslog agents, or Windows event log forwarding.
  • As often as they can tolerate, keeping in mind this also is directly proportional to indexed volume. "As close to real-time as possible" of course comes at a cost that most people are in fact unwilling to bear.
  • Not that much, actually.
  • Yes, but not through pure native Splunk functionality.

View solution in original post

gkanapathy
Splunk Employee
Splunk Employee
  • I don't know
  • A forwarder, or multiple forwarders
  • No. Number of servers is extremely approximate. The limitation is the number of WMI connections that a server can handle, and each query for a server requires a separate connection. So if you're issuing 5 queries, you can only collect from 1/5 as many servers. Furthermore, if you're collecting event logs rather than performance stats via WMI, it will be even more costly since every query must bring back the entire event log on each poll. It is best practice not to collect event logs via WMI, but instead to forward them, either with Splunk forwarders, syslog agents, or Windows event log forwarding.
  • As often as they can tolerate, keeping in mind this also is directly proportional to indexed volume. "As close to real-time as possible" of course comes at a cost that most people are in fact unwilling to bear.
  • Not that much, actually.
  • Yes, but not through pure native Splunk functionality.
Got questions? Get answers!

Join the Splunk Community Slack to learn, troubleshoot, and make connections with fellow Splunk practitioners in real time!

Meet up IRL or virtually!

Join Splunk User Groups to connect and learn in-person by region or remotely by topic or industry.

Get Updates on the Splunk Community!

Announcing Modern Navigation: A New Era of Splunk User Experience

We are excited to introduce the Modern Navigation feature in the Splunk Platform, available to both cloud and ...

Modernize your Splunk Apps – Introducing Python 3.13 in Splunk

We are excited to announce that the upcoming releases of Splunk Enterprise 10.2.x and Splunk Cloud Platform ...

Step into “Hunt the Insider: An Splunk ES Premier Mystery” to catch a cybercriminal ...

After a whole week of being on call, you fell asleep on your keyboard, and you hit a sequence of buttons that ...