All Apps and Add-ons

Split data into a training set and a testing set

emma1
Engager

Hi,

I am trying to train a LSTM time series classifier in the search window in Splunk, step 4 in Model Development Guide in deep learning toolkit.

It says that the data can be split into a training set and a testing set using the | sample command. To my understanding, the datasets is then constructed by taken events randomly.  I do not want datasets that is constructed by taking events randomly. I want the data to be sorted by time and then split with a ratio, how can I do this?

Splunk version: 7.2.4

Deep Learning Toolkit for Splunk Version: 2.3.0

Labels (1)
0 Karma
1 Solution

richgalloway
SplunkTrust
SplunkTrust

You could take every other (or some number) event using the modulo operator.

| where (_time % 2 = 0)
---
If this reply helps you, an upvote would be appreciated.

View solution in original post

0 Karma

richgalloway
SplunkTrust
SplunkTrust

You could take every other (or some number) event using the modulo operator.

| where (_time % 2 = 0)
---
If this reply helps you, an upvote would be appreciated.

View solution in original post

0 Karma
.conf21 Now Fully Virtual!
Register for FREE Today!

We've made .conf21 totally virtual and totally FREE! Our completely online experience will run from 10/19 through 10/20 with some additional events, too!