Alerting

Can you help me figure out this gap in the alert scheduling?

damucka
Builder

Hello,

I have an alert scheduled each minute.

Yesterday, I had a gap in scheduling between 16:15 and 16:51 and I am not able to find the reason for that:

alt text

There were no skipped "Anomaly Detection" alerts in this period:

alt text

Also, looking at the scheduler log, I am not getting any smarter; the two corresponding entries from 16:15 and the next from 16:51 are:

1/31/19
4:51:40.841 PM  
01-31-2019 16:51:40.841 +0100 INFO  SavedSplunker - savedsearch_id="nobody;mlbso;Anomaly Detection", search_type="scheduled", user="d046266", app="mlbso", savedsearch_name="Anomaly Detection", priority=default, status=success, digest_mode=1, scheduled_time=1548949620, window_time=0, dispatch_time=1548949886, run_time=13.882, result_count=0, alert_actions="", sid="scheduler__d046266__mlbso__RMD54eeec7fba2d5a846_at_1548949620_52", suppressed=0, thread_id="AlertNotifierWorker-0"
host =  mo-91aebdc20.mo.sap.corp source =   /opt/splunk/var/log/splunk/scheduler.log sourcetype =   scheduler
1/31/19
4:15:27.308 PM  
01-31-2019 16:15:27.308 +0100 INFO  SavedSplunker - savedsearch_id="nobody;mlbso;Anomaly Detection", search_type="scheduled", user="d046266", app="mlbso", savedsearch_name="Anomaly Detection", priority=default, status=success, digest_mode=1, scheduled_time=1548947580, window_time=0, dispatch_time=1548947720, run_time=6.333, result_count=0, alert_actions="", sid="scheduler__d046266__mlbso__RMD54eeec7fba2d5a846_at_1548947580_24969", suppressed=0, thread_id="AlertNotifierWorker-0"

Could you please help me analyze this issue?

Where would I look?

Kind regards,
Kamil

1 Solution

damucka
Builder

Hi,

The issue with the gap is clarified ... it was trivial, I asked the Splunk admin and he mentioned there would be some problems with the servers and they needed 10 min restart.

But anyway, perhaps you can help me with the second one I have, it is visible on the screenshot as well and this is a permanent lag of 2 mins between schedule time and dispatch time of this alert. This is actually not only for this alert but for many, not all though. Not sure where this can come from. There are no skipped alerts, at least not the "Anomaly Detection" ones, so I guess this is not the resource issue.
How would I configure the immediate dispatch of my alerts?
Especially for the "Anomaly Detection" one it cannot wait 2 minutes to alert.

Splunk Version:7.0.0
Splunk Build
c8a78efdd40f

Kind Regards,
Kamil

View solution in original post

0 Karma

damucka
Builder

Hi,

The issue with the gap is clarified ... it was trivial, I asked the Splunk admin and he mentioned there would be some problems with the servers and they needed 10 min restart.

But anyway, perhaps you can help me with the second one I have, it is visible on the screenshot as well and this is a permanent lag of 2 mins between schedule time and dispatch time of this alert. This is actually not only for this alert but for many, not all though. Not sure where this can come from. There are no skipped alerts, at least not the "Anomaly Detection" ones, so I guess this is not the resource issue.
How would I configure the immediate dispatch of my alerts?
Especially for the "Anomaly Detection" one it cannot wait 2 minutes to alert.

Splunk Version:7.0.0
Splunk Build
c8a78efdd40f

Kind Regards,
Kamil

0 Karma

woodcock
Esteemed Legend

You should click Accept to close your question.

0 Karma

woodcock
Esteemed Legend

What version of Splunk search head?

0 Karma

vishaltaneja070
Motivator

@damucka,

Check if any issue with splunkd service at that time. Check for internal logs at that time.

0 Karma
Get Updates on the Splunk Community!

Introducing Splunk Enterprise 9.2

WATCH HERE! Watch this Tech Talk to learn about the latest features and enhancements shipped in the new Splunk ...

Adoption of RUM and APM at Splunk

    Unleash the power of Splunk Observability   Watch Now In this can't miss Tech Talk! The Splunk Growth ...

Routing logs with Splunk OTel Collector for Kubernetes

The Splunk Distribution of the OpenTelemetry (OTel) Collector is a product that provides a way to ingest ...