Splunk Search

Why is join slow?

chirsf
Explorer

I've seen a lot of advice against using join with subsearches because it's slow, and that proves to be true in practice.

What I would like to find out is why it is slow. Any insight here would be helpful.

1 Solution

nickhills
Ultra Champion

Take a look at these two articles, specifically the posts by @daljeanis:

https://answers.splunk.com/answers/561130/how-to-join-two-tables-where-the-key-is-named-diff.html
https://answers.splunk.com/answers/660008/which-is-the-best-approach-to-join-two-database-ta.html

The problem is that join is an SQL concept, and Splunk is not a relational database. The command exists (and works), but it's very often not the best approach.

If my comment helps, please give it a thumbs up!

somesoni2
Revered Legend

I believe it's slow because of the algorithm and the virtual memory the join command uses (it basically has to build a Cartesian product of the two datasets and then work from there). The amount of processing and memory consumption often causes join subsearches to time out as well. If you haven't read it already, here is an excellent piece of Splunk documentation on when to use join and when to use its alternatives.

https://docs.splunk.com/Documentation/Splunk/7.2.4/Search/Abouteventcorrelation
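
To make the Cartesian-product point concrete, here is a rough Python sketch. It is not Splunk's actual implementation, and the function names (nested_loop_join, group_by_key) and sample fields (user, action, dept) are made up for illustration. It only compares how much work a pairwise, join-style match does against a single grouping pass over both datasets, which is roughly the shape of a stats-based alternative.

```python
from collections import defaultdict


def nested_loop_join(left, right, key):
    """Join-style matching: compare every left row with every right row."""
    comparisons = 0
    joined = []
    for l_row in left:
        for r_row in right:
            comparisons += 1
            if l_row[key] == r_row[key]:
                joined.append({**l_row, **r_row})
    return joined, comparisons


def group_by_key(left, right, key):
    """Grouping-style matching: one pass over all rows, merged by key."""
    groups = defaultdict(dict)
    touched = 0
    for row in left + right:
        touched += 1
        groups[row[key]].update(row)
    return dict(groups), touched


# Hypothetical sample data: 1000 rows in each dataset, keyed on "user".
left = [{"user": f"u{i}", "action": "login"} for i in range(1000)]
right = [{"user": f"u{i}", "dept": f"d{i % 10}"} for i in range(1000)]

joined, comparisons = nested_loop_join(left, right, "user")
groups, touched = group_by_key(left, right, "user")

print(comparisons)  # 1000000 pairwise comparisons
print(touched)      # 2000 rows touched once each
```

Both approaches end up with the same 1000 merged records here, but the pairwise version does work proportional to the product of the two dataset sizes, while the grouping version does work proportional to their sum.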
