Splunk Enterprise Security

Is it possible to optimize hyperparameters in MLTK?

rosho
Communicator

Hi

I am using MLTK for anomaly detection. So I am benchmarking algorithms. I was wondering if it is possible to optimize the hyperparameters using techniques like: random search, grid search, bayesian optimization, etc.

I am not sure if it could be possible to apply it with real-time data. Has anyone done it before?

Thank you

0 Karma
1 Solution

astein_splunk
Splunk Employee
Splunk Employee

Optimize the hyperparameters in an automated way is not something supported out of the box, but if your Splunk instance is a separate node and you won't impact production servers you can write your own optimization via the ML APIs by creating a clone of the algorithm and inserting the technique of your choice. Check out https://docs.splunk.com/Documentation/MLApp/4.2.0/API/Overview or the Github for more examples.

I would not try to do so with an |apply step, which you seem to be hinting at, but partial_fit might be an option.

View solution in original post

astein_splunk
Splunk Employee
Splunk Employee

Optimize the hyperparameters in an automated way is not something supported out of the box, but if your Splunk instance is a separate node and you won't impact production servers you can write your own optimization via the ML APIs by creating a clone of the algorithm and inserting the technique of your choice. Check out https://docs.splunk.com/Documentation/MLApp/4.2.0/API/Overview or the Github for more examples.

I would not try to do so with an |apply step, which you seem to be hinting at, but partial_fit might be an option.

Career Survey
First 500 qualified respondents will receive a $20 gift card! Tell us about your professional Splunk journey.
Get Updates on the Splunk Community!

Introduction to Splunk AI

How are you using AI in Splunk? Whether you see AI as a threat or opportunity, AI is here to stay. Lucky for ...

Splunk + ThousandEyes: Correlate frontend, app, and network data to troubleshoot ...

Are you tired of troubleshooting delays caused by siloed frontend, application, and network data? We've got a ...

Maximizing the Value of Splunk ES 8.x

Splunk Enterprise Security (ES) continues to be a leader in the Gartner Magic Quadrant, reflecting its pivotal ...