Splunk Search

group certain URLs

gerard11
Engager

I have a search that returns events with many different URLs

 

 

index=test URL=*

 

 

I want to obtain a count of events per URL

However some of the URLs are slightly different so I want to group them together

Example of my URL values

/login/
/login/
/api/customer/5542-a44/data
/api/customer/5c77-59w/data
/api/customer/7a88-134/data
/weather/forecast/
/api/savedseach/7775
/api/savedseach/4788
/new/user

What I would like to end up with

URLCOUNT
/login/2
/api/customer//data3

/weather/forecast/

1

/api/savedseach/

2

/new/user

1

 

Im using | stats count by URL
However as mentioned above my issue is with the URLs that have ids or guids in them

Labels (1)
0 Karma
1 Solution

dmarling
Builder

You can normalize the url with regular expression, but you will need to account for all of your use cases.  Here's an example regex based on the examples you provided:

| rex mode=sed field=URL "s/\/(customer|savedsearch)\/[^\e\/]+/\/\1\//g"

 

Here's a run anywhere example that shows how it works:

| makeresults
| eval URL="/login/
/login/
/api/customer/5542-a44/data
/api/customer/5c77-59w/data
/api/customer/7a88-134/data
/weather/forecast/
/api/savedsearch/7775
/api/savedsearch/4788
/new/use"
| makemv URL tokenizer="(?<URL>[^\n]+)"
| mvexpand URL
| rex mode=sed field=URL "s/\/(customer|savedsearch)\/[^\e\/]+/\/\1\//g"
| stats count by URL
If this comment/answer was helpful, please up vote it. Thank you.

View solution in original post

0 Karma

dmarling
Builder

You can normalize the url with regular expression, but you will need to account for all of your use cases.  Here's an example regex based on the examples you provided:

| rex mode=sed field=URL "s/\/(customer|savedsearch)\/[^\e\/]+/\/\1\//g"

 

Here's a run anywhere example that shows how it works:

| makeresults
| eval URL="/login/
/login/
/api/customer/5542-a44/data
/api/customer/5c77-59w/data
/api/customer/7a88-134/data
/weather/forecast/
/api/savedsearch/7775
/api/savedsearch/4788
/new/use"
| makemv URL tokenizer="(?<URL>[^\n]+)"
| mvexpand URL
| rex mode=sed field=URL "s/\/(customer|savedsearch)\/[^\e\/]+/\/\1\//g"
| stats count by URL
If this comment/answer was helpful, please up vote it. Thank you.
0 Karma

gerard11
Engager

This is exactly what I was looking for, thank you.

0 Karma
Career Survey
First 500 qualified respondents will receive a $20 gift card! Tell us about your professional Splunk journey.
Get Updates on the Splunk Community!

Tech Talk Recap | Mastering Threat Hunting

Mastering Threat HuntingDive into the world of threat hunting, exploring the key differences between ...

Observability for AI Applications: Troubleshooting Latency

If you’re working with proprietary company data, you’re probably going to have a locally hosted LLM or many ...

Splunk AI Assistant for SPL vs. ChatGPT: Which One is Better?

In the age of AI, every tool promises to make our lives easier. From summarizing content to writing code, ...