Reading AWS S3 gz files without an extension in Splunk?

prash
Loves-to-Learn Everything

I am having difficulty getting Splunk to ingest gzipped log files from an S3 bucket. The files themselves do not have extensions, and Splunk reads them as binary.

I tried setting unarchive_cmd to auto, gunzip -c, and gzip -d in props.conf, with no luck:

[source::/xxx/*]
unarchive_cmd = gunzip -c
NO_BINARY_CHECK = true

gunzip -c works in the shell; gzip -d does not work without the .gz suffix.

* Using the AWS add-on.

Due to the nature of the environment, the files can't be renamed. Has anyone experienced this before?


prash
Loves-to-Learn Everything

I did try that; it made no difference. Per the docs, unarchive_cmd is only invoked when invalid_cause is specified.

richgalloway
SplunkTrust

I noticed the contradiction in the docs.  I suggest contacting Splunk Support.

---
If this reply helps you, Karma would be appreciated.

richgalloway
SplunkTrust

Have you tried adding invalid_cause=archive to the stanza?  The docs have conflicting information about it, but it's worth a try.
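
For reference, here is a minimal sketch of what that suggestion could look like. It rests on an assumption: as I read the props.conf spec, invalid_cause belongs on a sourcetype stanza while unarchive_cmd belongs on a source stanza, so the source stanza has to assign a sourcetype that carries invalid_cause. The sourcetype name s3_gz_noext is hypothetical, and the path is the placeholder from the question. This also assumes a file-monitor style input. Whether data pulled by the AWS add-on goes through the archive processor at all is not confirmed in this thread.

```ini
# props.conf (sketch, not a verified fix)

# Source stanza: match the extensionless files and hand them to a
# decompression command that reads stdin and writes stdout.
[source::/xxx/*]
unarchive_cmd = gunzip -c
# Hypothetical sourcetype name, used only to attach invalid_cause below.
sourcetype = s3_gz_noext
NO_BINARY_CHECK = true

# Sourcetype stanza: mark the data as an archive so that, per the spec,
# unarchive_cmd is actually invoked.
[s3_gz_noext]
invalid_cause = archive
```

If this has no effect either, that would support the idea that the add-on's input path bypasses these settings, which is a question for Splunk Support.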
