I use bluepill to track the health of service process (like a java web service for example).
Bluepill outputs lines like:
host service1 Going from starting => up
host service1 Going from up => down
host service1 Going from down => starting
host service1 Going from up => unmonitored
The set of states is small and finite. Some are 'good' states like: 'up' some are 'bad': unmonitored, down.
What mechanisms are there to correlate and track this data?
For example:
1. I would like to display a graph that tracks services by host that transition from up => unmonitored and back often.
2. I would like to alert when a given service transitions too quickly.
3. I would like to see if all services transition to a bad state at the same time.
... View more