All Apps and Add-ons

Shuttl setup/configuration with S3

marksnelling
Communicator

I looking at using Shuttl combined with Amazon S3, I'm unfamiliar with Hadoop but have successfully installed and tested it.
Firstly, although the Shuttl documentation says that it works with Hadoop and S3 storage there doesn't seem to be instructions on how to set this up.
Secondly, I'm unclear about if I'm using S3 as the storage for my frozen buckets where does Hadoop run, does it run on-site with my Splunk installation or as an Amazon EC2 instance?

Tags (3)
1 Solution

Petter_Eriksson
Splunk Employee
Splunk Employee

The configuration has changed a little since you asked the question.
We've setup a page with examples and detailed documentation here:
https://github.com/splunk/splunk-shuttl/tree/master/examples

View solution in original post

Petter_Eriksson
Splunk Employee
Splunk Employee

The configuration has changed a little since you asked the question.
We've setup a page with examples and detailed documentation here:
https://github.com/splunk/splunk-shuttl/tree/master/examples

bchen
Splunk Employee
Splunk Employee

A belated response that I'll answer for others on Splunkbase.

To summarize:

If you use S3 as your storage, setting up Hadoop to use the data is up to you. You can use Hadoop in EC2, or EMR. (keep in mind you need to archive in CSV format; only Splunk can understand the Splunk Bucket format)

In order to setup Shuttl for Amazon S3, you provide a URI indicating the backend. Inside, $SPLUNK_HOME/etc/apps/shuttl/conf you'll see archiver.xml. In that file, you'll edit the archiverRootURI to have a value of the form: s3://:@/

In addition, keep in mind that though "s3n" will work as well for Shuttl, you will need to keep in mind the limitation of file size. In general, it's safest to use "s3".

See the Quickstart guide for more information: https://github.com/splunk/splunk-shuttl/wiki/Quickstart-Guide

0 Karma
Got questions? Get answers!

Join the Splunk Community Slack to learn, troubleshoot, and make connections with fellow Splunk practitioners in real time!

Meet up IRL or virtually!

Join Splunk User Groups to connect and learn in-person by region or remotely by topic or industry.

Get Updates on the Splunk Community!

Observability Simplified: Combining User Experience, Application Performance & ...

Tech Talk Observability Simplified: Combining User Experience, Application Performance & Network ...

Event Series May & June: From Network Visibility to Service Intelligence

Unifying the Network: Moving from Alert Noise to Service Intelligence with Splunk ITSI In today’s hybrid ...

Global Splunk User Group Events: May + June 2026

Your Splunk Community Awaits: Discover Upcoming User Group Events Worldwide    Staying ahead in the fast-paced ...