Split data into a training set and a testing set

emma1
Engager

Hi,

I am trying to train an LSTM time series classifier in the Splunk search window, following step 4 of the Model Development Guide in the Deep Learning Toolkit.

The guide says that the data can be split into a training set and a testing set using the | sample command. As I understand it, the datasets are then built by picking events at random, which I do not want. I want the data to be sorted by time and then split by a ratio. How can I do this?

Splunk version: 7.2.4

Deep Learning Toolkit for Splunk Version: 2.3.0

1 Solution

richgalloway
SplunkTrust

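You could take every other event (or every Nth event) using the modulo operator on _time. Note that the filter below keeps events whose _time is an even number of epoch seconds, so it selects exactly every other event only when events arrive one per second.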

| where (_time % 2 = 0)
---
If this reply helps you, Karma would be appreciated.
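
For the chronological ratio split asked about in the question, one possible approach is to number the events in time order and compare each event's position with the total count. The search below is a minimal sketch, not a tested recipe for the Deep Learning Toolkit: index=your_index stands in for your own base search, the field names row, total and partition are hypothetical, and the 0.8 ratio (the earliest 80% of events for training) is only an illustration.

```
index=your_index
| sort 0 _time
| streamstats count AS row
| eventstats count AS total
| eval partition=if(row <= total * 0.8, "train", "test")
| where partition="train"
```

Here sort 0 _time orders all events from oldest to newest without the default result limit, streamstats assigns a running row number, and eventstats attaches the total event count to every event. Changing the last line to where partition="test" should return the most recent 20% as the testing set. On large result sets, sort and eventstats can be expensive, so it may be worth narrowing the time range first.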
