
Settings to configure Hunk for a Hive ORC table

pawantiwari3987
Explorer

Hi Team,
Please help us configure a Hadoop provider/virtual index so that we can map it to a Hive ORC table. We are able to search a simple text table; however, when we give the path of an ORC table, the output is shown in an unreadable format.

Regards
Pawan Tiwari


hyan_splunk
Splunk Employee

For any Hive file format other than text, you need to specify the fileformat property. So in your case:

vix.input.1.splitter.hive.fileformat = orc


rdagan_splunk
Splunk Employee

If you have access to the Metastore URI, the configuration should look similar to this example.

In the Provider, add these two flags (you can find the metastore URI value in hive-site.xml):
vix.splunk.search.splitter = HiveSplitGenerator
vix.hive.metastore.uris = thrift://sandbox:9083

In the Virtual Index, you will need to point to the actual ORC file path, the DB name, and the table name:
[employee_orc]
vix.input.1.path = /apps/hive/warehouse/employees_orc
vix.provider = HiveHDPProvider
vix.input.1.splitter.hive.dbname = default
vix.input.1.splitter.hive.tablename = employees_orc
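
Putting the pieces from this thread together, a rough sketch of the relevant indexes.conf stanzas might look like the following. The provider stanza header and vix.family line are assumptions based on the usual Hunk provider layout, and the table name assumes the Hive table is called employees_orc (matching the warehouse path). Other provider settings that a working setup needs (Java and Hadoop locations, the default filesystem, and so on) are left out here. The fileformat line comes from the earlier reply and may be redundant when the metastore already reports the table's format.

```
# Provider stanza (stanza name assumed to match vix.provider below)
[provider:HiveHDPProvider]
vix.family = hadoop
vix.splunk.search.splitter = HiveSplitGenerator
# Metastore URI as found in hive-site.xml
vix.hive.metastore.uris = thrift://sandbox:9083

# Virtual index mapped to the ORC table
[employee_orc]
vix.provider = HiveHDPProvider
vix.input.1.path = /apps/hive/warehouse/employees_orc
vix.input.1.splitter.hive.dbname = default
# Assumed table name, matching the warehouse directory
vix.input.1.splitter.hive.tablename = employees_orc
# From the earlier reply: needed for non-text Hive formats
vix.input.1.splitter.hive.fileformat = orc
```

After restarting Splunk or reloading the configuration, a search such as index=employee_orc should return readable rows if the splitter is being applied.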
