All Apps and Add-ons

Split data into a training set and a testing set

emma1
Engager

Hi,

I am trying to train a LSTM time series classifier in the search window in Splunk, step 4 in Model Development Guide in deep learning toolkit.

It says that the data can be split into a training set and a testing set using the | sample command. To my understanding, the datasets is then constructed by taken events randomly.  I do not want datasets that is constructed by taking events randomly. I want the data to be sorted by time and then split with a ratio, how can I do this?

Splunk version: 7.2.4

Deep Learning Toolkit for Splunk Version: 2.3.0

Labels (1)
0 Karma
1 Solution

richgalloway
SplunkTrust
SplunkTrust

You could take every other (or some number) event using the modulo operator.

| where (_time % 2 = 0)
---
If this reply helps you, Karma would be appreciated.

View solution in original post

0 Karma

richgalloway
SplunkTrust
SplunkTrust

You could take every other (or some number) event using the modulo operator.

| where (_time % 2 = 0)
---
If this reply helps you, Karma would be appreciated.
0 Karma
Get Updates on the Splunk Community!

Extending Observability Content to Splunk Cloud

Watch Now!   In this Extending Observability Content to Splunk Cloud Tech Talk, you'll see how to leverage ...

More Control Over Your Monitoring Costs with Archived Metrics!

What if there was a way you could keep all the metrics data you need while saving on storage costs?This is now ...

New in Observability Cloud - Explicit Bucket Histograms

Splunk introduces native support for histograms as a metric data type within Observability Cloud with Explicit ...