Knowledge Management

Using mean(x) in summary search run every 5 minutes - when running a report on summary index using mean(x) per day or month, will that give the same result?

JYTTEJ
Communicator

Hi - I need to calculate the mean(response time) for at complete month based on summary index.

The summary index search has to run every 5 minutes - selecting last 5 minutes of data.

The search will look like this: Search.... |sistats mean(response time) by xxx yyyy www ttt

The monthly report search will look like this: index=summary REPORT=MEANRESP|timechart span=1month mean(response time)

QUESTION: Will the search run on the summary index give the same result as if I made the search run on the actual transaction log?

Tags (2)
0 Karma
1 Solution

somesoni2
Revered Legend

There will be some differences (magnitude depend upon the data) as there will be little rounding in mean calculation of summary indexes and subsequent mean calculation during data fetch from. See this runanywhere search example and compare the results.

SET 1
simulating summary index
| gentimes start=-1 | eval temp=mvrange(1,200,2) | table temp | mvexpand temp | eval data=temp+1 | bucket span=5s data | stats mean(temp) as temp by data | stats mean(temp) as temp

Direct calculation
| gentimes start=-1 | eval temp=mvrange(1,200,2) | table temp | mvexpand temp | eval data=temp+1 | stats mean(data) as temp

SET 2
simulating summary index
| gentimes start=-1 | eval temp=mvrange(2,200,2) | table temp | mvexpand temp | eval data=temp+1 | bucket span=5s data | stats mean(temp) as temp by data | stats mean(temp) as temp

Direct calculation
| gentimes start=-1 | eval temp=mvrange(2,200,2) | table temp | mvexpand temp | eval data=temp+1 | stats mean(data) as temp

View solution in original post

somesoni2
Revered Legend

There will be some differences (magnitude depend upon the data) as there will be little rounding in mean calculation of summary indexes and subsequent mean calculation during data fetch from. See this runanywhere search example and compare the results.

SET 1
simulating summary index
| gentimes start=-1 | eval temp=mvrange(1,200,2) | table temp | mvexpand temp | eval data=temp+1 | bucket span=5s data | stats mean(temp) as temp by data | stats mean(temp) as temp

Direct calculation
| gentimes start=-1 | eval temp=mvrange(1,200,2) | table temp | mvexpand temp | eval data=temp+1 | stats mean(data) as temp

SET 2
simulating summary index
| gentimes start=-1 | eval temp=mvrange(2,200,2) | table temp | mvexpand temp | eval data=temp+1 | bucket span=5s data | stats mean(temp) as temp by data | stats mean(temp) as temp

Direct calculation
| gentimes start=-1 | eval temp=mvrange(2,200,2) | table temp | mvexpand temp | eval data=temp+1 | stats mean(data) as temp
Get Updates on the Splunk Community!

Simplifying the Analyst Experience with Finding-based Detections

    Splunk invites you to an engaging Tech Talk focused on streamlining security operations with ...

[Puzzles] Solve, Learn, Repeat: Word Search

This challenge was first posted on Slack #puzzles channelThis puzzle is based on a letter grid containing ...

[Puzzles] Solve, Learn, Repeat: Advent of Code - Day 4

Advent of CodeIn order to participate in these challenges, you will need to register with the Advent of Code ...