Getting Data In

For Hunk, how do you configure indexes.conf to use MR v2?

jimjh
Path Finder

My provider configuration inside indexes.conf looks like

[provider:analytics-emr]
vix.env.HADOOP_HOME = /opt/hadoop-2.2.0
vix.env.JAVA_HOME = /usr/lib/jvm/java-7-oracle-1.7.0.45/jre
vix.family = hadoop
vix.fs.default.name = xxx
vix.mapreduce.framework.name = yarn
vix.yarn.resourcemanager.address = xxx:8032
vix.yarn.resourcemanager.scheduler.address= xxx:8030
vix.splunk.home.hdfs = /hunk-dir
vix.splunk.setup.package = /opt/splunk_packages/hunk-6.1.1.tgz

According to the Web UI, this provider is using MR v1. How do I configure it to use MR v2?

Tags (2)
1 Solution

jimjh
Path Finder

I wanted to do this without going through the UI, mainly so that I can launch a new Hunk node programmatically.

After some messing around, I found that if I set the following

vix.command.arg.3 = $SPLUNK_HOME/bin/jars/SplunkMR-s6.0-hy2.0.jar

Hunk knows that I want to use YARN. There is probably another value for MR v2.

View solution in original post

jimjh
Path Finder

I wanted to do this without going through the UI, mainly so that I can launch a new Hunk node programmatically.

After some messing around, I found that if I set the following

vix.command.arg.3 = $SPLUNK_HOME/bin/jars/SplunkMR-s6.0-hy2.0.jar

Hunk knows that I want to use YARN. There is probably another value for MR v2.

nhaddadkaveh_sp
Splunk Employee
Splunk Employee

You can select the Haoop version from the UI under "Provider"
Also you need to add these in your indexes.conf:
vix.mapred.job.tracker = jobtracker.hadoop.splunk.com:8021
vix.fs.default.name = hdfs://hdfs.hadoop.splunk.com:8020
vix.splunk.home.datanode = /

For more detail you can check here:
http://docs.splunk.com/Documentation/Hunk/6.1.1/Hunk/Setupavirtualindex

0 Karma
Get Updates on the Splunk Community!

Monitoring Postgres with OpenTelemetry

Behind every business-critical application, you’ll find databases. These behind-the-scenes stores power ...

Mastering Synthetic Browser Testing: Pro Tips to Keep Your Web App Running Smoothly

To start, if you're new to synthetic monitoring, I recommend exploring this synthetic monitoring overview. In ...

Splunk Edge Processor | Popular Use Cases to Get Started with Edge Processor

Splunk Edge Processor offers more efficient, flexible data transformation – helping you reduce noise, control ...