Archive

Creating Hunk 6.1 Virtual Index

techdiverdown
Path Finder

Using CDH5 (MR2) and Hunk 6.1 on Centos 6.4...

I have my netflow ascii data in the HDFS file system in 15 minute increments with each day being a higher level directory and each file having 15 minutes of netflow data. Something like this:

/user/netflow/2015-05-25/asciiflow2014-05-25-02-45-01.csv
/user/netflow/2015-05-25/asciiflow2014-05-25-03-00-01.csv
..
..
/user/netflow/2015-05-26/asciiflow2014-05-26-02-45-01.csv
..

Given this I am wondering about the virtual index configuration I have, listed below, is correct?
I seem to search the same amount of time no mater what the time period is....

Time Capturing Regex is "/user/netflow/(\d+)-(\d+)-(\d+)/"

Time Format is "yyyyMMdd"
Time Adjustment is 15 Minutes??
Time Range is 1 day ??

Tags (4)
0 Karma

Ledion_Bitincka
Splunk Employee
Splunk Employee

You can either extract the time range from the parent dir:

Time Capturing Regex: "/user/netflow/(d+)-(d+)-(d+)/"
Time Format: "yyyyMMdd" 
Time Adjustment: 0 
Time Range: 1 day 

or your can extract the more granular timestamp at the file level:

Time Capturing Regex: "asciiflow(\d+)-(\d+)-(\d+)-(\d+)-(\d+)-\d+.csv$"
Time Format: "yyyyMMddHHmm" 
Time Adjustment: 0  
Time Range: 15 minutes  

Tune In & Win!

Don't miss out on your
chance to take home free
prizes by helping our players
save the Splunk Cloudom!

Dungeons & Data
Monsters: Splunk O11y
Day Editions Games
stream live:
5/4 at 6:30pm PST
5/5 at 7:00pm PST
on