Splunk Search

group certain URLs

gerard11
Engager

I have a search that returns events with many different URLs:

index=test URL=*

I want to obtain a count of events per URL.

However, some of the URLs differ only by an embedded ID, so I want to group them together.

Example of my URL values:

/login/
/login/
/api/customer/5542-a44/data
/api/customer/5c77-59w/data
/api/customer/7a88-134/data
/weather/forecast/
/api/savedsearch/7775
/api/savedsearch/4788
/new/user

What I would like to end up with:

URL                    COUNT
/login/                2
/api/customer//data    3
/weather/forecast/     1
/api/savedsearch/      2
/new/user              1

I'm using | stats count by URL
However, as mentioned above, my issue is with the URLs that have IDs or GUIDs in them.


dmarling
Builder

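You can normalize the URL with a regular expression, but you will need to account for all of your use cases. Here's an example regex based on the examples you provided. It matches the path segment that follows /customer/ or /savedsearch/ and drops it, so the ID disappears and the rest of the path is kept: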

| rex mode=sed field=URL "s/\/(customer|savedsearch)\/[^\e\/]+/\/\1\//g"

 

Here's a run-anywhere example that shows how it works:

| makeresults
| eval URL="/login/
/login/
/api/customer/5542-a44/data
/api/customer/5c77-59w/data
/api/customer/7a88-134/data
/weather/forecast/
/api/savedsearch/7775
/api/savedsearch/4788
/new/user"
| makemv URL tokenizer="(?<URL>[^\n]+)"
| mvexpand URL
| rex mode=sed field=URL "s/\/(customer|savedsearch)\/[^\e\/]+/\/\1\//g"
| stats count by URL
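
To check the substitution outside Splunk, the same pattern can be tried in a few lines of Python. This is only a rough sketch and not part of the search above: the `normalize` helper and the `ID_SEGMENT` name are made up for illustration, and the sample list is copied from the question.

```python
import re
from collections import Counter

# Sample URL values copied from the question.
urls = [
    "/login/",
    "/login/",
    "/api/customer/5542-a44/data",
    "/api/customer/5c77-59w/data",
    "/api/customer/7a88-134/data",
    "/weather/forecast/",
    "/api/savedsearch/7775",
    "/api/savedsearch/4788",
    "/new/user",
]

# Hypothetical name: mirrors the sed expression by matching the
# resource name plus the ID segment that follows it.
ID_SEGMENT = re.compile(r"/(customer|savedsearch)/[^/]+")


def normalize(url: str) -> str:
    """Keep the resource name and drop the ID segment after it."""
    return ID_SEGMENT.sub(r"/\1/", url)


# Equivalent in spirit to `| stats count by URL` after the rex step.
counts = Counter(normalize(u) for u in urls)

for url, count in sorted(counts.items()):
    print(url, count)
```

Any new resource type with an embedded ID has to be added to the alternation by hand, which is the "account for all of your use cases" caveat above.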
If this comment/answer was helpful, please up vote it. Thank you.


gerard11
Engager

This is exactly what I was looking for, thank you.
