All Apps and Add-ons

How are categorical outliers detected?

AayushSmarten
Observer

I am looking for a technical understanding for detecting a "Univariate Categorical Outlier".

I have used the ML Toolkit on Splunk and basically, I am trying to detect the "rare" categories which are really having low frequencies for the given variable of the dataset. 

I have also followed the thread here but I couldn't find the information I am looking for. Tough I could see the links like this which discuss different methods like histogram, IQR, and ZScore for anomaly detection but couldn't find any technical overview.

If anyone could help me with finding the "rare" category automatically, it will be a huge help. Because setting a static threshold like 0.05 doesn't work for all datasets. There has to be some way around like the histogram method.

Please give me the sources on how splunk finds the rare categories. It is fine if you can provide me with the univariate variable only instead of the multivariate.

Thanks

Labels (4)
0 Karma
Career Survey
First 500 qualified respondents will receive a $20 gift card! Tell us about your professional Splunk journey.
Get Updates on the Splunk Community!

Observe and Secure All Apps with Splunk

 Join Us for Our Next Tech Talk: Observe and Secure All Apps with SplunkAs organizations continue to innovate ...

What's New in Splunk Observability - August 2025

What's New We are excited to announce the latest enhancements to Splunk Observability Cloud as well as what is ...

Introduction to Splunk AI

How are you using AI in Splunk? Whether you see AI as a threat or opportunity, AI is here to stay. Lucky for ...