I am getting this error in Hunk:
03-11-2014 08:45:08.354 ERROR SearchOperator:stdin - Cannot consume data with unset stream_type
03-11-2014 08:45:08.355 ERROR ResultProvider - Error in 'SearchOperator:stdin': Cannot consume data with unset stream_type
And no results are returning.
This is using the Hunk Tutorial settings.
Can you please check what hadoop version you selected while creating the Provider on UI.
In the drop down Hadoop version is this selected Hadoop 2.x,(MRv1) ?
Can you please check what hadoop version you selected while creating the Provider on UI.
In the drop down Hadoop version is this selected Hadoop 2.x,(MRv1) ?
BINGO! Changing it to 2.x worked!
1.x is selected currently
file is too large to post. Here is the error section:
03-11-2014 10:43:04.242 ERROR SearchOperator:stdin - Cannot consume data with unset stream_type
03-11-2014 10:43:04.242 ERROR ResultProvider - Error in 'SearchOperator:stdin': Cannot consume data with unset stream_type
0
Please paste your search.log in here
You need to change vix.fs.default.name = hdfs://localhost/8020 to vix.fs.default.name = hdfs://localhost:8020
And make sure you have your data in hdfs under /data
Changed that. Still getting the same error.
[ponyindex]
vix.input.1.accept = .gz$
vix.input.1.path = /data/...
vix.provider = PonyProvider
[cloudera@localhost local]$ cat indexes.conf
[provider:PonyProvider]
vix.env.HADOOP_HOME = /usr/lib/hadoop
vix.env.JAVA_HOME = /usr/java/jdk1.6.0_32
vix.family = hadoop
vix.fs.default.name = hdfs://localhost:8020
vix.mapred.job.tracker = localhost:8021
vix.splunk.home.hdfs = /user/root/splunkmr
[ponyindex]
vix.input.1.accept = .gz$
vix.input.1.path = /data/...
vix.provider = PonyProvider
you have a slash before 8020 but you need colon (:8020)
Can you clarify that? Those are the same 🙂
ps: there is a backslash before the .gz that somehow this webpage doesn't show.
[provider:PonyProvider]
vix.env.HADOOP_HOME = /usr/lib/hadoop
vix.env.JAVA_HOME = /usr/java/jdk1.6.0_32
vix.family = hadoop
vix.fs.default.name = hdfs://localhost/8020
vix.mapred.job.tracker = localhost:8021
vix.splunk.home.hdfs = /user/root/splunkmr
[ponyindex]
vix.input.1.accept = .gz$
vix.input.1.path = /data/...
vix.provider = PonyProvider
I need to see your provider and virtual indexes. I am guessing this one:
/home/cloudera/splunk/etc/apps/search/local/indexes.conf (if you haven't moved it under your App)
I am using the Cloudera Quickstart VM.
There are several indexes.conf files... which one do you want?
/home/cloudera/splunk/etc/apps/sample_app/default/indexes.conf
/home/cloudera/splunk/etc/apps/SplunkLightForwarder/default/indexes.conf
/home/cloudera/splunk/etc/apps/search/local/indexes.conf
/home/cloudera/splunk/etc/system/default/indexes.conf
/home/cloudera/splunk/etc/master-apps/_cluster/default/indexes.conf
Can you copy your indexes.conf in here. What version of Hadoop are you using?