I have events like the below that are saying when a particular pool member was out of rotation for a particular period of time. What would be an ideal search would be to match all events that have the "was down for" and then the length of time and simply average that, and take the 95th percentile of that duration. Probably more difficult than it seems and I'm not sure how to approach it.
Hi @jyates76,
you have to extract the down duratio and then run a simple search:
index=your_index "was down for"
| rex "was\s+down\s+for\s+(?<hours>\d+)hr:(?<minutes>\d+)min:(?<seconds>\d+)sec"
| eval duration=hours*3600+minutes*60+seconds
| timechart perc90(duration) BY host
You can test the regex at https://regex101.com/r/75pRcf/1
then you can use other functions or aggregations.
Ciao.
Giuseppe