Splunk Search

Formatting Callout/Lookup Data to include non-exact matches. (Lookup/log/callout issue).

danfinan
Explorer

Hi all,

Apologies for the vague title, I have a lookup problem that I need help with, so any help is greatly appreciated.

I have a .CSV file containing a single field named 'high_risk_keywords' that I wish to use to help filter search results. My web filter logs are plugged into Splunk with SSL decrpytion enabled so I am able to pull out keywords from Google, Bing, Yahoo and YouTube using callouts

I'm currently using the following search to do this:

index=webfilter url="*bing*" [| inputlookup high_risk_words.csv | fields high_risk_words | rename high_risk_words as bing_callout | eval bing_callout="*"+bing_callout+"*"] | stats count by bing_callout, src, user | sort by -count

If the user searched the word 'Ferrari', I get results like:

ferrari
how+to+build+a+ferrari
ferarri%20
etc...

Instead of having 4 x search results with 1 x count, I would like to have 4 x hits for the word 'ferrari'. Is there a way to achieve this where the entry for 'ferrari' not only includes an exact match, that it also includes variations as shown above?

I've tried editing:

eval bing_callout="*"+bing_callout+"*"]

to 

eval bing_callout=bing_callout]

but this only includes exact matches for the word 'ferrari' and terms like 'how+to+build+a+ferarri' get missed.

I hope this all makes sense, any help would be really really appreciated!

Thank you 🙂

0 Karma
Get Updates on the Splunk Community!

Observe and Secure All Apps with Splunk

  Join Us for Our Next Tech Talk: Observe and Secure All Apps with SplunkAs organizations continue to innovate ...

Splunk Decoded: Business Transactions vs Business IQ

It’s the morning of Black Friday, and your e-commerce site is handling 10x normal traffic. Orders are flowing, ...

Fastest way to demo Observability

I’ve been having a lot of fun learning about Kubernetes and Observability. I set myself an interesting ...