Splunk Search

How can we include sourcetype in the beginning of subdirectories and still have optimal searches in HUNK?

EricLloyd79
Builder

We are currently using MapR and HUNK to index files with the following structure:

/user/mapr/(sourcetype)/(year)/(month)/(day)/(hour)/xyz.log

Let's say our sourcetype is foo.

With HUNK, you must specify a regex that extracts the date and time from the path so that searches run optimally and do not scan every file and directory before finishing.

So for the regex I have:

/user/mapr/foo/(\d+)/(\d+)/(\d+)/(\d+)/.*

This works, but the problem is we will have to create a different virtual index for every different sourcetype.

I have tried /user/mapr/*/(\d+)/(\d+)/(\d+)/(\d+)/.*
But that doesn't work; the search goes through all the files and subdirectories.

So I need something to replace foo with, and * doesn't work. Putting (\w+) there doesn't work either, as it captures "foo" and tries to use it as part of the string for the time lookup.
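
The failure with a capturing group can be reproduced outside HUNK. The sketch below assumes, as the behavior described above suggests, that every capture group in the regex is concatenated and the result is parsed with the configured format. Python's `%Y%m%d%H` stands in for `yyyyMMddHH`; the sample path and the helper `extract_time` are hypothetical and not part of HUNK.

```python
import re
from datetime import datetime

# Hypothetical sample path following the layout described in the question.
PATH = "/user/mapr/foo/2015/03/07/14/xyz.log"

def extract_time(regex, path, fmt="%Y%m%d%H"):
    """Join all capture groups and parse them as a timestamp.

    Mimics the assumed behavior only; returns None if the path does not
    match or the joined string does not fit the format.
    """
    match = re.search(regex, path)
    if match is None:
        return None
    joined = "".join(match.groups())
    try:
        return datetime.strptime(joined, fmt)
    except ValueError:
        return None

# Capturing the sourcetype puts "foo" in front of the digits, so parsing fails.
capturing = r"/user/mapr/(\w+)/(\d+)/(\d+)/(\d+)/(\d+)/.*"
# Matching the sourcetype without a capture group leaves only the date parts.
non_capturing = r"/user/mapr/\w+/(\d+)/(\d+)/(\d+)/(\d+)/.*"

print(extract_time(capturing, PATH))      # None
print(extract_time(non_capturing, PATH))  # 2015-03-07 14:00:00
```

The only difference between the two patterns is the pair of parentheses around the sourcetype segment, which is what pulls "foo" into the time string.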

I have attached the indexes.conf file (minus the provider) below:

[mapr1]
vix.input.1.accept = 
vix.input.1.et.format = yyyyMMddHH
vix.input.1.et.regex = /user/mapr/*/(\d+)/(\d+)/(\d+)/(\d+)/.*
vix.input.1.lt.format = yyyyMMddHH
vix.input.1.lt.offset = 3600
vix.input.1.lt.regex = /user/mapr/*/(\d+)/(\d+)/(\d+)/(\d+)/.*
vix.input.1.path = /user/mapr/${sourcetype}/...
vix.provider = maproly
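
To make the role of the et/lt settings concrete, here is a rough sketch of how a time window per directory could be derived and used to skip directories. It assumes the earliest time is the timestamp parsed from the path and the latest time is that timestamp plus `lt.offset` seconds. It uses the single-sourcetype regex from earlier in the post, since that one is known to work. `directory_window` and `overlaps` are hypothetical helpers, not HUNK functions, and the search range is made up.

```python
import re
from datetime import datetime, timedelta

# The single-sourcetype regex from the question, known to work for "foo".
REGEX = r"/user/mapr/foo/(\d+)/(\d+)/(\d+)/(\d+)/.*"
FMT = "%Y%m%d%H"   # Python stand-in for yyyyMMddHH
LT_OFFSET = 3600   # seconds, as in vix.input.1.lt.offset

def directory_window(path):
    """Return (earliest, latest) for a path, or None if no time is extracted."""
    match = re.search(REGEX, path)
    if match is None:
        return None
    start = datetime.strptime("".join(match.groups()), FMT)
    return start, start + timedelta(seconds=LT_OFFSET)

def overlaps(window, search_start, search_end):
    """True if the directory's window intersects the search time range."""
    earliest, latest = window
    return earliest < search_end and latest > search_start

# Hypothetical search over 14:00-15:00 on 2015-03-07.
search = (datetime(2015, 3, 7, 14), datetime(2015, 3, 7, 15))
paths = [
    "/user/mapr/foo/2015/03/07/13/xyz.log",
    "/user/mapr/foo/2015/03/07/14/xyz.log",
    "/user/mapr/foo/2015/03/07/15/xyz.log",
]
to_read = [p for p in paths if overlaps(directory_window(p), *search)]
print(to_read)  # ['/user/mapr/foo/2015/03/07/14/xyz.log']
```

A path under any other sourcetype gives `None` from `directory_window` here, which corresponds to the case where no time can be extracted and the files cannot be skipped.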
1 Solution

EricLloyd79
Builder

The solution ended up being:

/user/mapr/.*?/(\d+)/(\d+)/(\d+)/(\d+)/.*

somesoni2
Revered Legend

Give this a try

/user/mapr/[^\/]+/(\d+)/(\d+)/(\d+)/(\d+)/.*

OR

/user/mapr/\w+/(\d+)/(\d+)/(\d+)/(\d+)/.*
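
The wildcards proposed in this thread are not interchangeable, and a quick check in Python shows where they differ. The patterns are written here with the lazy form `.*?` and a trailing `.*`, which is an assumption about what the posts intended. The sample paths, the sourcetype names, and the helper `joined_groups` are hypothetical, and Python's `re` may differ in details from the regex engine HUNK uses.

```python
import re

# Sourcetype wildcards proposed in this thread (none of them captures).
PATTERNS = {
    "lazy_any": r"/user/mapr/.*?/(\d+)/(\d+)/(\d+)/(\d+)/.*",
    "non_slash": r"/user/mapr/[^\/]+/(\d+)/(\d+)/(\d+)/(\d+)/.*",
    "word": r"/user/mapr/\w+/(\d+)/(\d+)/(\d+)/(\d+)/.*",
}

# Hypothetical paths: a plain sourcetype, one with a hyphen, one nested deeper.
PATHS = [
    "/user/mapr/foo/2015/03/07/14/xyz.log",
    "/user/mapr/access-combined/2015/03/07/14/xyz.log",
    "/user/mapr/foo/archive/2015/03/07/14/xyz.log",
]

def joined_groups(regex, path):
    """Return the concatenated capture groups, or None if there is no match."""
    match = re.search(regex, path)
    return "".join(match.groups()) if match else None

results = {
    name: [joined_groups(regex, path) for path in PATHS]
    for name, regex in PATTERNS.items()
}
for name, values in results.items():
    print(name, values)
```

In this sketch `\w+` fails on the hyphenated sourcetype, `[^\/]+` accepts any name but only one directory level, and `.*?` also matches when extra directory levels sit between the sourcetype and the year. Which one fits depends on how the sourcetype directories are actually named.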