Alerting

Get alerted if same error happening on multiple hosts

jrlesch
New Member

I have the same suite of tests running on multiple hosts. I want to set up an alert if a specific test fails on 2 or more hosts.

sourcetype=teamcity LogFailure name="'tests/cluster/pay/*'"

is my search to get all the tests the failed. The "name" is the name of the test.

How do I tell if the test is throwing the same LogFailure on multiple hosts.

Thanks

0 Karma
1 Solution

somesoni2
Revered Legend

Try this

sourcetype=teamcity LogFailure name="'tests/cluster/pay/*'" | stats dc(host) as count | where count>=2 

Set an alert if above search returns any result.

Update

Try this

sourcetype=teamcity LogFailure name="'tests/cluster/pay/*'" | stats dc(host) as count  by name| where count>=2 

This should get distinct count of host for each values of name i.e. test names and will list of tests which have failed in more than 1 hosts. The alert condition should be same.

View solution in original post

somesoni2
Revered Legend

Try this

sourcetype=teamcity LogFailure name="'tests/cluster/pay/*'" | stats dc(host) as count | where count>=2 

Set an alert if above search returns any result.

Update

Try this

sourcetype=teamcity LogFailure name="'tests/cluster/pay/*'" | stats dc(host) as count  by name| where count>=2 

This should get distinct count of host for each values of name i.e. test names and will list of tests which have failed in more than 1 hosts. The alert condition should be same.

jrlesch
New Member

Thanks that helped!

0 Karma

jrlesch
New Member

I guess the problem with that is that it is checking for any test under tests/cluster/pay/ failing on more than 2. I was looking to search for specific tests failing on multiple hosts. But there are 50 tests under pay. Does that make sense?

0 Karma

somesoni2
Revered Legend

See if the updated answer, which counts no of hosts for each specific tests, suits your requirement.

0 Karma
Get Updates on the Splunk Community!

Accelerating Observability as Code with the Splunk AI Assistant

We’ve seen in previous posts what Observability as Code (OaC) is and how it’s now essential for managing ...

Integrating Splunk Search API and Quarto to Create Reproducible Investigation ...

 Splunk is More Than Just the Web Console For Digital Forensics and Incident Response (DFIR) practitioners, ...

Congratulations to the 2025-2026 SplunkTrust!

Hello, Splunk Community! We are beyond thrilled to announce our newest group of SplunkTrust members!  The ...