All Apps and Add-ons

Customize TFIDF Stop Words

ericchaucl
Path Finder

I'm now trying to use the TFIDF algorithm and use the parameter "stop_words=english".

Where are the stop_words defined and how can I customize it? Thanks!

0 Karma
1 Solution

ericchaucl
Path Finder

Thanks for the advise. Finally I can find it!

It's in the path:
$SPLUNK HOME\etc\apps\Splunk_SA_Scientific_Python_windows_x86_64\bin\windows_x86_64\Lib\site-packages\sklearn\feature_extraction\stop_words.py

View solution in original post

0 Karma

ericchaucl
Path Finder

Thanks for the advise. Finally I can find it!

It's in the path:
$SPLUNK HOME\etc\apps\Splunk_SA_Scientific_Python_windows_x86_64\bin\windows_x86_64\Lib\site-packages\sklearn\feature_extraction\stop_words.py

0 Karma

parkz
Explorer

Is there a similar file path for Linux Splunk app?

0 Karma

cmerriman
Super Champion

If you go through the splunk docs for the algoriths, they bring you to the scikit-learn that houses the documentation. if you go through that enough, you can land on this page. https://github.com/scikit-learn/scikit-learn/blob/ef5cb84a805efbe4bb06516670a9b8c690992bd7/sklearn/f...

0 Karma
Get Updates on the Splunk Community!

Stay Connected: Your Guide to January Tech Talks, Office Hours, and Webinars!

What are Community Office Hours? Community Office Hours is an interactive 60-minute Zoom series where ...

[Puzzles] Solve, Learn, Repeat: Reprocessing XML into Fixed-Length Events

This challenge was first posted on Slack #puzzles channelFor a previous puzzle, I needed a set of fixed-length ...

Data Management Digest – December 2025

Welcome to the December edition of Data Management Digest! As we continue our journey of data innovation, the ...