All Apps and Add-ons

MLTK training run failing with 'Usecols do not match columns, columns expected but not found' error


I'm utilizing Principal Component Analysis (PCA) on a RandomForestRegressor model to process some of the text fields in my data, which results in a certain number of PCA fields (around 30, I would say). The model look good upon the initial `fit` from within the experiment window, so I saved the model and scheduled a training run to occur every morning.

However, the scheduled training fails with a 'Usecols do not match columns, columns expected but not found' failure. It normally reports a handful of PC_* fields on the higher end of the range (like PC_27 - PC_31) not being found. The error appears to be related directly to the pandas python library but I don't have the capability to troubleshoot the code itself and hoping to resolve the issue via MLTK. Can anyone assist?

Labels (3)
Tags (3)
0 Karma
Get Updates on the Splunk Community!

New Splunk Observability innovations: Deeper visibility and smarter alerting to ...

You asked, we delivered. Splunk Observability Cloud has several new innovations giving you deeper visibility ...

Synthetic Monitoring: Not your Grandma’s Polyester! Tech Talk: DevOps Edition

Register today and join TekStream on Tuesday, February 28 at 11am PT/2pm ET for a demonstration of Splunk ...

Instrumenting Java Websocket Messaging

Instrumenting Java Websocket MessagingThis article is a code-based discussion of passing OpenTelemetry trace ...