I am making a list of Splunk critical services to be notified about after hours & weekends to revive Splunk in case it is goes down & stay down until Monday !! We had an incident that a Splunk instance went down Friday night & we found out on Monday !! I am including Splunkd down, CPU on an instance going up/running at 95%. What other critical items would you add to this list please? I have a large environment, have Splunk Ent. ES & clustered environment.
Hi @SamHTexas,
You can use your Monitoring Console instance alerts.
You just need to enable DMC Alert - Search Peer Not Responding alert and set an action (like Send email).