Virtual Index with Hunk

boulderdevdude · ‎05-27-2014

Platform: CDH5.0.0 + Splunk 6.1 and Hunk.
We have configured virtual indexes. We are successfully issuing queries against Splunk and getting data back from MR jobs.

When MR jobs run, should they produce Splunk index files, so that the next time Splunk queries this data it doesn't have to run an MR job again?

I see result files with names like 1401212205.1082/0/part-m-00043 (one per MR task, I think). Each file contains what looks like a hash and a path to a different file. Example:
81a76d523faab948b16fcd2eb6dd56a0 /user/splunk/scratch/dispatch/1401212205.1082/0/splunk-m-0000043

But the splunk-m-0000043 file (and others like it) does not exist at that directory location. Are they supposed to exist?
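For anyone inspecting these part-m files, here is a minimal Python sketch of splitting one such line into its two fields. It assumes the fields are separated by whitespace, as in the example line above. The names `ResultPointer` and `parse_pointer_line` are made up for illustration and are not part of Hunk, and the meaning of the first field (it only looks like a 32-character hex hash) is a guess.

```python
from typing import NamedTuple


class ResultPointer(NamedTuple):
    """One line of a part-m file: an apparent hash plus an HDFS path (hypothetical type)."""
    digest: str  # looks like a hex hash; the algorithm is not stated in the thread
    path: str    # path the line points to


def parse_pointer_line(line: str) -> ResultPointer:
    # Split on the first run of whitespace only, so the path is kept whole.
    digest, path = line.strip().split(maxsplit=1)
    return ResultPointer(digest, path)


example = (
    "81a76d523faab948b16fcd2eb6dd56a0 "
    "/user/splunk/scratch/dispatch/1401212205.1082/0/splunk-m-0000043"
)
pointer = parse_pointer_line(example)
print(pointer.digest)
print(pointer.path)
```

Collecting the `path` values this way makes it easy to check each one against HDFS and confirm which referenced files are actually missing.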

Ledion_Bitincka · ‎05-27-2014

Hunk does not create any persistent artifacts or index after running a search, i.e. it does not "index" the data after the first search. However, for searches that you intend to run frequently, in Hunk 6.1 you can enable report acceleration to cache the intermediate results so they are reused the next time the search runs.

Here's another related question - not sure if it's the same question from two co-workers 🙂
