All Apps and Add-ons

Split data into a training set and a testing set

emma1
Engager

Hi,

I am trying to train a LSTM time series classifier in the search window in Splunk, step 4 in Model Development Guide in deep learning toolkit.

It says that the data can be split into a training set and a testing set using the | sample command. To my understanding, the datasets is then constructed by taken events randomly.  I do not want datasets that is constructed by taking events randomly. I want the data to be sorted by time and then split with a ratio, how can I do this?

Splunk version: 7.2.4

Deep Learning Toolkit for Splunk Version: 2.3.0

Labels (1)
0 Karma
1 Solution

richgalloway
SplunkTrust
SplunkTrust

You could take every other (or some number) event using the modulo operator.

| where (_time % 2 = 0)
---
If this reply helps you, Karma would be appreciated.

View solution in original post

0 Karma

richgalloway
SplunkTrust
SplunkTrust

You could take every other (or some number) event using the modulo operator.

| where (_time % 2 = 0)
---
If this reply helps you, Karma would be appreciated.
0 Karma
Get Updates on the Splunk Community!

Observe and Secure All Apps with Splunk

  Join Us for Our Next Tech Talk: Observe and Secure All Apps with SplunkAs organizations continue to innovate ...

Splunk Decoded: Business Transactions vs Business IQ

It’s the morning of Black Friday, and your e-commerce site is handling 10x normal traffic. Orders are flowing, ...

Fastest way to demo Observability

I’ve been having a lot of fun learning about Kubernetes and Observability. I set myself an interesting ...