Solved: Can a Splunk deployment configured in AWS cloud (I...

andrewtrobec · ‎12-16-2021

Hello,

I am a bit confused as to how Splunk manages its indexes through AWS cloud services, and I am not sure whether both EBS and S3 services are interchangeable for thsi type of deployment. For example, is S3 only for archiving frozen buckets, or can it be used for hot/warm/cold buckets as well?

Is there some documentation about best practices here? Compare and contrast?

Thanks!

Andrew

aasabatini · ‎12-16-2021

Hi @andrewtrobec

Splunk on aws manage the buckets always in the same way if you configured the storage like a file system.

If you want use S3 bucket with splunk you need to use the smartstore function.

docs.splunk.com/Documentation/Splunk/8.2.3/Indexer/AboutSmartStore

anyway you have to consider the sinzing of your log and how many day you need the data on line.

I hope I’ve been clear

“The answer is out there, Neo, and it’s looking for you, and it will find you if you want it to.”

View solution in original post

aasabatini · ‎12-16-2021

Hi @andrewtrobec

Splunk on aws manage the buckets always in the same way if you configured the storage like a file system.

If you want use S3 bucket with splunk you need to use the smartstore function.

docs.splunk.com/Documentation/Splunk/8.2.3/Indexer/AboutSmartStore

anyway you have to consider the sinzing of your log and how many day you need the data on line.

I hope I’ve been clear

“The answer is out there, Neo, and it’s looking for you, and it will find you if you want it to.”

isoutamo · ‎12-16-2021

Hi

Usually basic splunk core installation is using "local" storage for all indexes. That local storage can be an EC2 instance storage or EBS. Chose between instance and EBS based on your way to use nodes. Instance storage is lost when instance is terminated, EBS didn't go away, and you could use it again with a new EC2 instance.

In "normal" splunk instance you can move frozen buckets to S3 for archive. S3 can be a traditional S3 or a Glacier or you can even define automated migration from S3 to Glacier based on e.g. time.

When You are using Splunk's Smartstore, then there is fundamental difference how instance is used storage. In SmartStore only hot data is in instance storage (only). All other events are stored to S3 (automatically) remote volumes. Partially that data is stored also on local instances cache partition (same where hot buckets are). With SmartStore it's crucial to use instances which have enough NVMe based local storage for performance point of view.

In shortly said EBS and S3 are not interchangeable. There are separate usage profile for both.

r. Ismo

Can a Splunk deployment configured in AWS cloud (IaaS) read and write to both S3 buckets and EBS?

indexer

other

Earn a $35 Gift Card for Answering our Splunk Admins & App Developer Survey

Continuing Innovation & New Integrations Unlock Full Stack Observability For Your ...

Monitoring Amazon Elastic Kubernetes Service (EKS)