Solved: Hadoop Data Roll and archiving replicated buckets

delappml_2 · ‎10-24-2017

I have an indexer cluster with a replication factor of 3. If I were to implement Hadoop Data Roll (HDR), would only one copy of each event be archived to Hadoop at freeze time, or would all three bucket copies be archived? I'm trying to find out whether HDR would reduce raw archive storage costs compared with archiving frozen buckets to a set of NFS mounts.

rdagan_splunk · ‎10-24-2017

HDR copies only one journal.gz (the bucket's raw data) per bucket, no matter how many replicated copies exist. A replication factor of 3 on the Splunk side therefore does not multiply storage on the Hadoop side.
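
As a rough illustration of the difference, here is a back-of-the-envelope sketch in Python. The 100 GB volume and the `archived_gb` helper are made up for the example. It assumes the non-HDR alternative ends up keeping one archived journal.gz per replicated bucket copy, and it ignores any replication that HDFS applies on its own side.

```python
def archived_gb(journal_gb: float, copies_archived: int) -> float:
    """Raw archive footprint when `copies_archived` copies of each journal.gz are kept."""
    return journal_gb * copies_archived


# Hypothetical volume of compressed raw data (journal.gz) reaching freeze time.
journal_gb = 100.0
# Replication factor from the question.
replication_factor = 3

# Assumption: every replicated bucket copy gets archived separately.
per_copy_archive = archived_gb(journal_gb, replication_factor)
# HDR case from the answer: one journal.gz per bucket.
hdr_archive = archived_gb(journal_gb, 1)

print(f"one archive per bucket copy: {per_copy_archive:.0f} GB")
print(f"HDR, single copy:            {hdr_archive:.0f} GB")
```

Swap in your own freeze volume to estimate the gap for your cluster.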

mattymo · ‎10-24-2017

HDR is by far the best archiving solution, unless you really want to write your own dedup logic (spoiler: you don't lol).

- MattyMo
