Solved: Performance issue with query

amoldesai · ‎03-11-2016

Hi,

I have a query written to find average exceptions per device on monthly basis for my use case. The query returns results as expected but the query performance is very poor. Below is the query and details on it. Request your help in optimising the query:

index="79390-np" sourcetype=np-cache-v2 source="*bp_detail*" [search index="79390-np" sourcetype=np-cache-v2 source="*bp_detail*" | join bpRuleId [search index="79390-np" sourcetype=np-cache-v2 (source="*bp_summary*" and bpPrimaryTechnology="*" ) ]| join deviceId [ search index="79390-np" sourcetype=np-cache-v2 source="*group_member*" groupId="*"] | fields deviceId]| eval month_num=strftime(_time,"%m") | eval Month=strftime(_time,"%b %Y")|stats dc(deviceId) as uniquedevices,dc(source) as sourcecount,count by Month|sort month_num| eval avgdeviceperupload=count/sourcecount | eval avguniquedevices = round(avgdeviceperupload/uniquedevices)| rename avguniquedevices as "Average Exceptions Per Device" | table Month,"Average Exceptions Per Device"
,
Details:
sources:bp_detail,bp_summary,group_member.
bp_detail fields: bpRuleId,deviceId
bp_summary fields: bpRuleId,bpPrimaryTechnology
group_members fields : groupId,deviceId

Thanks

somesoni2 · ‎03-11-2016

Give this a try

index="79390-np" sourcetype=np-cache-v2 source="*bp_detail*" 
[search index="79390-np" sourcetype=np-cache-v2 source="*bp_detail*" OR (source="*bp_summary*" and bpPrimaryTechnology="*" )
| stats values(deviceId) as deviceId by bpRuleId  | where isnotnull(devideId)
| append [search index="79390-np" sourcetype=np-cache-v2 source="*group_member*" groupId="*" | stats count by devideId groupId]
| stats values(groupId) as groupId by deviceId | where isnotnull(groupId) | tabel devideId]
| timechart span=1mon dc(deviceId) as uniquedevices,dc(source) as sourcecount,count 
| eval avgdeviceperupload=count/sourcecount | eval avguniquedevices = round(avgdeviceperupload/uniquedevices)| rename avguniquedevices as "Average Exceptions Per Device"
| eval Month=strftime(_time,"%b %Y") | table Month,"Average Exceptions Per Device"

View solution in original post

somesoni2 · ‎03-11-2016

Give this a try

index="79390-np" sourcetype=np-cache-v2 source="*bp_detail*" 
[search index="79390-np" sourcetype=np-cache-v2 source="*bp_detail*" OR (source="*bp_summary*" and bpPrimaryTechnology="*" )
| stats values(deviceId) as deviceId by bpRuleId  | where isnotnull(devideId)
| append [search index="79390-np" sourcetype=np-cache-v2 source="*group_member*" groupId="*" | stats count by devideId groupId]
| stats values(groupId) as groupId by deviceId | where isnotnull(groupId) | tabel devideId]
| timechart span=1mon dc(deviceId) as uniquedevices,dc(source) as sourcecount,count 
| eval avgdeviceperupload=count/sourcecount | eval avguniquedevices = round(avgdeviceperupload/uniquedevices)| rename avguniquedevices as "Average Exceptions Per Device"
| eval Month=strftime(_time,"%b %Y") | table Month,"Average Exceptions Per Device"

amoldesai · ‎03-12-2016

Thanks a lot. It helps in optimising the query.

Richfez · ‎03-11-2016

OK, so two things stand out. First, using a wildcard at the front of some search criteria (like source="*bp_detail*") isn't making it speedy, but there might not be any choice in that. The second, though, is join. Join is wonderful. But also amazingly inefficient and slow. Whenever possible (which is usually but not always), stats or transaction is preferred.

So, let's strip it down a bit. It looks like the "get my data into one nice event" stuff is joining together bp_detail to bp_summary on bpRuleId, then bp_detail to group_members on deviceId. Does that sound right?

I usually end up with transactions for my data, so that's what I'll use here. A similar technique can be used for stats, too, it just ends up with a bit of a different look.

index="79390-np" sourcetype=np-cache-v2 source="*bp_detail*" OR source="*group_member*" OR source="*bp_summary*"
| transaction bpRuleId deviceId
| ... evals and pretty output things here

First line just grabs all the events that you eventually want to join together.
The second line builds a transaction that says wherever a bpRuleId exists in two events and matches, join it together. Ditto where deviceId exists in two events and matches, join THAT in too. For more on this, read the docs for Transaction very carefully, especially right at the Transaction Usage section. There's also a great answer by gkanapathy that may be of use - if you find that helpful, be sure to upvote it! You'll likely want some maxspan=X and maxpause=Y and/or other things like that in there to make it even speedier, but you can play with those and if you have problems, feel free to ask again.

Once you have all the events linking up to each other properly, well, the rest is mostly what you already have.

JohnEGones · ‎07-21-2023

These sorts of replies are very useful (and I do understand that they take experience, attention and effort to craft), even, I suspect, for some people that are more competent than me at just coding queries.

In my (limited) experience, there are a lot of sometimes subtle and nuanced considerations, to say nothing of just things you may not know, that can go into even simple queries, let alone ones that perform a series of transforms, calculations and joins, and which need to be relatively fast and efficient because they will tend to be run regularly.

I'm happy I am taking this stuff rather more seriously, I'm learning a lot.

amoldesai · ‎03-12-2016

Thanks for your quick and detailed reply. It was useful to know about how transaction could be used here.
But as I require groupId and technology field(mandatory input to my form) to be explicitly set in the query,I will go with the stats based technique. Thanks.

Performance issue with query

Join the Splunk Community Slack to learn, troubleshoot, and make connections with fellow Splunk practitioners in real time!

Join Splunk User Groups to connect and learn in-person by region or remotely by topic or industry.

Quantify Your Splunk Investment Impact: Introducing Savings Metrics to Value Insights

Event Series: Telemetry Pipeline Management

Kick the Tires Before You Commit: A Hands-On Tour of the Splunk Observability Cloud ...

Join the Conversation