<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic How to resolve Splunk machine learning knowledge bundle size issues? in All Apps and Add-ons</title>
    <link>https://community.splunk.com/t5/All-Apps-and-Add-ons/How-to-resolve-Splunk-machine-learning-knowledge-bundle-size/m-p/610415#M77461</link>
    <description>&lt;P&gt;We run a number of machine learning models and routinely run into limitations of the "knowledge bundle" getting too big with errors like this&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="bundle errors" style="width: 548px;"&gt;&lt;img src="https://community.splunk.com/t5/image/serverpage/image-id/21135i4851018FD67517AA/image-size/large?v=v2&amp;amp;px=999" role="button" title="snowybundles-2022-08-23_10-10-53.png" alt="bundle errors" /&gt;&lt;span class="lia-inline-image-caption" onclick="event.preventDefault();"&gt;bundle errors&lt;/span&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;We increased the limits.conf to alleviate it but error came back after a few more models were made.&lt;/P&gt;
&lt;P&gt;I've noticed that these likely need to be included in the knowledge bundle since they are not explicitly blacklisted from the distsearch.conf&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;[replicationSettings:refineConf]
replicate.algos = true
replicate.mlspl = true
replicate.scorings = true

[replicationBlacklist]
non_model_lookups = apps[/\\]Splunk_ML_Toolkit[/\\]lookups[/\\](?!__mlspl_)*.csv
non_model_lookups_docs = apps[/\\]Splunk_ML_Toolkit[/\\]lookups[/\\]docs[/\\]...&lt;/LI-CODE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Now looking at the users directory there are a lot of double ups.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;/opt/splunk/etc/users/theusername/Splunk_ML_Toolkit/lookups&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="users ML lookup directory" style="width: 999px;"&gt;&lt;img src="https://community.splunk.com/t5/image/serverpage/image-id/21136i9F7B08551CD9C8EE/image-size/large?v=v2&amp;amp;px=999" role="button" title="snowybundles-2022-08-23_09-13-39.png" alt="users ML lookup directory" /&gt;&lt;span class="lia-inline-image-caption" onclick="event.preventDefault();"&gt;users ML lookup directory&lt;/span&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;Is there a way to get rid of these _draft_ ones in the Machine Learning GUI?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
    <pubDate>Wed, 25 Jan 2023 15:11:35 GMT</pubDate>
    <dc:creator>cameronjust</dc:creator>
    <dc:date>2023-01-25T15:11:35Z</dc:date>
    <item>
      <title>How to resolve Splunk machine learning knowledge bundle size issues?</title>
      <link>https://community.splunk.com/t5/All-Apps-and-Add-ons/How-to-resolve-Splunk-machine-learning-knowledge-bundle-size/m-p/610415#M77461</link>
      <description>&lt;P&gt;We run a number of machine learning models and routinely run into limitations of the "knowledge bundle" getting too big with errors like this&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="bundle errors" style="width: 548px;"&gt;&lt;img src="https://community.splunk.com/t5/image/serverpage/image-id/21135i4851018FD67517AA/image-size/large?v=v2&amp;amp;px=999" role="button" title="snowybundles-2022-08-23_10-10-53.png" alt="bundle errors" /&gt;&lt;span class="lia-inline-image-caption" onclick="event.preventDefault();"&gt;bundle errors&lt;/span&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;We increased the limits.conf to alleviate it but error came back after a few more models were made.&lt;/P&gt;
&lt;P&gt;I've noticed that these likely need to be included in the knowledge bundle since they are not explicitly blacklisted from the distsearch.conf&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;[replicationSettings:refineConf]
replicate.algos = true
replicate.mlspl = true
replicate.scorings = true

[replicationBlacklist]
non_model_lookups = apps[/\\]Splunk_ML_Toolkit[/\\]lookups[/\\](?!__mlspl_)*.csv
non_model_lookups_docs = apps[/\\]Splunk_ML_Toolkit[/\\]lookups[/\\]docs[/\\]...&lt;/LI-CODE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Now looking at the users directory there are a lot of double ups.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;/opt/splunk/etc/users/theusername/Splunk_ML_Toolkit/lookups&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="users ML lookup directory" style="width: 999px;"&gt;&lt;img src="https://community.splunk.com/t5/image/serverpage/image-id/21136i9F7B08551CD9C8EE/image-size/large?v=v2&amp;amp;px=999" role="button" title="snowybundles-2022-08-23_09-13-39.png" alt="users ML lookup directory" /&gt;&lt;span class="lia-inline-image-caption" onclick="event.preventDefault();"&gt;users ML lookup directory&lt;/span&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;Is there a way to get rid of these _draft_ ones in the Machine Learning GUI?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 25 Jan 2023 15:11:35 GMT</pubDate>
      <guid>https://community.splunk.com/t5/All-Apps-and-Add-ons/How-to-resolve-Splunk-machine-learning-knowledge-bundle-size/m-p/610415#M77461</guid>
      <dc:creator>cameronjust</dc:creator>
      <dc:date>2023-01-25T15:11:35Z</dc:date>
    </item>
    <item>
      <title>Re: How to Resolve Splunk Machine Learning Knowledge Bundle Size Issues?</title>
      <link>https://community.splunk.com/t5/All-Apps-and-Add-ons/How-to-resolve-Splunk-machine-learning-knowledge-bundle-size/m-p/610885#M77484</link>
      <description>&lt;P&gt;Answering my own question here. I did some testing and the _draft_ ones are updated when you are playing around in the experiments section with each model types interfaces.&lt;BR /&gt;&lt;BR /&gt;They update the non _draft_ ones when you save. Which is pretty obvious.&lt;BR /&gt;&lt;BR /&gt;As for whether the models are required in the knowledge bundle it all depends on the search you are running to build your data set. If the job inspector phase0 search includes the fit command then you will likely run into issues if you don't distribute these in the bundle.&lt;/P&gt;&lt;P&gt;I added this to /opt/splunk/etc/apps/Splunk_ML_Toolkit/local/distsearch.conf to see if I could "break" it.&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;[replicationBlacklist]
# non_draft_models = apps/Splunk_ML_Toolkit/lookups/(?!__mlspl__exp_draft)*.mlmodel
user_models = users/*/Splunk_ML_Toolkit/lookups/*.mlmodel
# all_models = apps/Splunk_ML_Toolkit/lookups/*.mlmodel&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;Essentially stops all users data models being included in the knowledge bundle.&lt;/P&gt;&lt;P&gt;I tried many different combinations of searches using fit and couldn't get a search that tried to run the fit command on the indexers. Like this pointless model&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;index=_internal (status=2* OR status=3* OR status=4*)
| fit DensityFunction "status" by "method" dist=auto threshold=0.0001 show_density=true show_options="feature_variables,split_by,params" into "_exp_draft_212803036b7b408a9d467eedf8c6a3a6"&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;P&gt;The fit always ran on the search head so I'd say 9 times out of 10 you don't need these in the bundle.&lt;/P&gt;</description>
      <pubDate>Thu, 25 Aug 2022 23:27:07 GMT</pubDate>
      <guid>https://community.splunk.com/t5/All-Apps-and-Add-ons/How-to-resolve-Splunk-machine-learning-knowledge-bundle-size/m-p/610885#M77484</guid>
      <dc:creator>cameronjust</dc:creator>
      <dc:date>2022-08-25T23:27:07Z</dc:date>
    </item>
    <item>
      <title>Re: How to Resolve Splunk Machine Learning Knowledge Bundle Size Issues?</title>
      <link>https://community.splunk.com/t5/All-Apps-and-Add-ons/How-to-resolve-Splunk-machine-learning-knowledge-bundle-size/m-p/628226#M78376</link>
      <description>&lt;P&gt;Replying to my own post of the other requirement if you have large knowledge bundles. You need this on the indexers server.conf as well if you continue to have bundles bigger than 2GB.&lt;/P&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;[httpServer]

# Required on indexers due to increasing knowledge bundle sizes to 5GB. Large Knowledge Bundles caused by machine learning models. Default is 2147483648 (2GB)
max_content_length = 5368709120&lt;/LI-CODE&gt;&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 25 Jan 2023 06:19:43 GMT</pubDate>
      <guid>https://community.splunk.com/t5/All-Apps-and-Add-ons/How-to-resolve-Splunk-machine-learning-knowledge-bundle-size/m-p/628226#M78376</guid>
      <dc:creator>cameronjust</dc:creator>
      <dc:date>2023-01-25T06:19:43Z</dc:date>
    </item>
  </channel>
</rss>

