Splunk Search

Help with summary indexing

yuwtennis
Communicator

Hi!

I would like to have some help with summary indexing.

My situations is like following:

I have events that comes into splunk everday like following
day 1:
_time StartTemperature EndTemperature SequenceNum ID
2014/1/9 00:00 20 21 1 A
2014/1/10 00:00 23 24 2 A
2014/1/11 00:00 25 27 3 B

However, at some point there are some delayed incoming event.
day 2:
_time StartTemperature EndTemperature SequenceNum ID
2014/1/9 00:00 20 21 1 A
2014/1/10 00:00 23 24 2 A
2014/1/8 00:00 27 21 5 A **** late event
2014/1/11 00:00 25 27 3 B

Everday I have to create total average temperature of difference of EndTemperature and NextRecord's Start Temperature. The problem is I can easily do re ordering with sort and streamstats but
there are not enough memory and disks to handle this since there are millions of events.

Since there are millions events, I am considering to organize with
daily summary index like following:

_time SumOfDifferenc(EndTemperature and NextRecord's Start Temperature) ID
xxxxx A

However, if there are late incoming events, I have to recreate the records that
has been already created. I am not sure if this is possible or not..
This might take lots of scripting....

Has anyone has tackled with these kind of situation?
And I would appreciate if one can share their solution.

Thanks,
Yu

Tags (2)
0 Karma

lukejadamec
Super Champion

Report acceleration might make more sense for you as it can handle late arriving events. See this doc:

http://docs.splunk.com/Documentation/Splunk/5.0.5/Knowledge/Aboutsummaryindexing

0 Karma

yuwtennis
Communicator

In addition , my version of splunk is ver 5.0.5

0 Karma
Got questions? Get answers!

Join the Splunk Community Slack to learn, troubleshoot, and make connections with fellow Splunk practitioners in real time!

Meet up IRL or virtually!

Join Splunk User Groups to connect and learn in-person by region or remotely by topic or industry.

Get Updates on the Splunk Community!

Think Like an Architect: Introducing the Splunk Certified Cybersecurity Defense ...

In cybersecurity, defenders respond to threats. Architects design the systems that stop them.    As ...

Best Practices: Splunk auto adjust pipeline queue

When you enable autoAdjustQueue in Splunk, maxSize should be understood as the queue size Splunk starts with ...

Announcing Modern Navigation: A New Era of Splunk User Experience

We are excited to introduce the Modern Navigation feature in the Splunk Platform, available to both cloud and ...