We are new to splunk and we are trying to find about all the vast capabilities that splunk offers.
So here is the scenario.
We have a repository that contains100s of zip files that are accessible through a network share.
Within these zip files contains a csv file that contains the data we need to ingest.
So the question is, is it possible for splunk to ingest just the csv within these individual zip files without having to unzip the entire archive first.
Splunk will ingest ZIP files
https://docs.splunk.com/Documentation/Splunk/latest/Data/Monitorfilesanddirectories
Thanks. Is it possible though to only ingest the CSV file within the zip files?
Firstly - there is no way to whitelist/blacklist/select subset of files from an archive. If you tell Splunk to ingest an archive, it will happily pull contents of all files contained within.
Secondly - ingesting archives can be tricky - any change to archive will cause complete reingestion of archive contents so you might end up with several copies of the same events (and huge overconsumption of license).
If the zip contains other files, then I don't know how you would exclude those files