Hi ryanprayacn,
There are no hard limits, but there are configureable resource limits defined in the following conf file:
Splunk_ML_Toolkit/default/mlspl.conf
Because MLTK fits models on the search head, we give customers the ability to control these limits via the file above, with reasonable defaults.
As an aside, you should consider whether you really want to fit on that many events. Linear regression, for example, is unlikely to yield significantly different results on 7M events compared with, say, 500k random events. Depends on your data and application, of course. 🙂
Does this answer your question?
Cheers,
- Adam
... View more