Getting Data In

How to configure Splunk to store indexed data into HDFS?

srini_daruna
New Member

HI,

I am writing a Java program to index the data into a Splunk index. How do I configure the index to store the indexed data in HDFS?

Tags (4)
0 Karma

lguinn2
Legend

AFAIK, you can't do write directly to HDFS using Splunk or the Splunk REST API. If you are using Hunk (Splunk for Hadoop), then you must write to the HDFS using one of the libraries for HDFS, not Splunk. Then the data will be searchable using Hunk, provided the virtual indexes are properly configured.

If you write data to Splunk, it will be stored in a Splunk index and that requires a Splunk license. You can move data from a Splunk index into Hadoop HDFS. When rolling buckets from cold to frozen in a Splunk index, you can send the frozen data to Hadoop - if you also have a Hunk license. This is an archiving technique.. The idea is to keep your most recent data in Splunk, where the access speed is very fast - and then roll your old data into Hadoop, where it is still available but more cheaply and slower.

0 Karma
Career Survey
First 500 qualified respondents will receive a $20 gift card! Tell us about your professional Splunk journey.
Get Updates on the Splunk Community!

Tech Talk Recap | Mastering Threat Hunting

Mastering Threat HuntingDive into the world of threat hunting, exploring the key differences between ...

Observability for AI Applications: Troubleshooting Latency

If you’re working with proprietary company data, you’re probably going to have a locally hosted LLM or many ...

Splunk AI Assistant for SPL vs. ChatGPT: Which One is Better?

In the age of AI, every tool promises to make our lives easier. From summarizing content to writing code, ...