Splunk Search

Regex tokenizer when the multivalue field has values within an event that are not always connected in a structured way?

boris
Path Finder

I want to make my DATASET field a multivalue field.

The regex extracting the field using Splunkweb's Field Extraction Manager page is:

(umi\.|%3D|,|%3B|\=|/catalog/w+/)(?P{DATASET}[a-z0-9_\-%]+)\.(geometry|\w*geom|\w+\.\w+)

The dataset values in an event are not delimited in an structured way.

An example event with 4 DATASET values:

"select=VALUE1,umi.VALUE2&from=VALUE3%BVALUE4.gemetry"

QUESTIONS:

  • How should define my regex tokenizer for the DATASET field?
  • Should I define tokenizer in fields.conf or in Splunkweb's Transform Manager page?
Tags (2)
0 Karma

yannK
Splunk Employee
Splunk Employee

There is no manager for that.

you can test is with that in a search

<my search> | makemv tokenizer="([^,].*)" DATASET

then deploy a fields.conf to make it automatic
see http://docs.splunk.com/Documentation/Splunk/6.0.2/Knowledge/ConfigureSplunktoparsemulti-valuefields

0 Karma
Get Updates on the Splunk Community!

Splunk Forwarders and Forced Time Based Load Balancing

Splunk customers use universal forwarders to collect and send data to Splunk. A universal forwarder can send ...

NEW! Log Views in Splunk Observability Dashboards Gives Context From a Single Page

Today, Splunk Observability releases log views, a new feature for users to add their logs data from Splunk Log ...

Last Chance to Submit Your Paper For BSides Splunk - Deadline is August 12th!

Hello everyone! Don't wait to submit - The deadline is August 12th! We have truly missed the community so ...