Splunk Search

Index bz2 files take too much time

indeed_2000
Motivator

Hi

I have a directory that contain 60 bz2 files. Totally 27 GB

After 24 hours still index processing not completed!

How can I check index status of this directory? (How much remain? How much pass?)

How can I tune splunk to index compress files more quickly?

FYI: there is no issue about license limitation.

FYI: I have enough disk space.

 

any idea?

Thanks

Labels (2)
Tags (2)
0 Karma
1 Solution

venkatasri
SplunkTrust
SplunkTrust

@indeed_2000 

The thruput is limited to 256 Kbps in limit.conf on UF,  it depends on network speed you have to calculate how much data can be ingested.

Issue following command under $SPLUNK_HOME/bin and when prompt provide the username/password. This will provide the list of the files being read by UF and their current stage/progress.

./splunk list inputstatus

 

----

An upvote would be appreciated and Accept solution if it helps!

View solution in original post

codebuilder
Influencer

To ingest these files Splunk first has to decompress them and that is a single threaded process (and consumes a lot of memory).

You'll see better performance by ingesting them before they are compressed, or decompress them prior to ingestion.

----
An upvote would be appreciated and Accept Solution if it helps!
0 Karma

venkatasri
SplunkTrust
SplunkTrust

@indeed_2000 

The thruput is limited to 256 Kbps in limit.conf on UF,  it depends on network speed you have to calculate how much data can be ingested.

Issue following command under $SPLUNK_HOME/bin and when prompt provide the username/password. This will provide the list of the files being read by UF and their current stage/progress.

./splunk list inputstatus

 

----

An upvote would be appreciated and Accept solution if it helps!

Career Survey
First 500 qualified respondents will receive a $20 gift card! Tell us about your professional Splunk journey.
Get Updates on the Splunk Community!

Tech Talk Recap | Mastering Threat Hunting

Mastering Threat HuntingDive into the world of threat hunting, exploring the key differences between ...

Observability for AI Applications: Troubleshooting Latency

If you’re working with proprietary company data, you’re probably going to have a locally hosted LLM or many ...

Splunk AI Assistant for SPL vs. ChatGPT: Which One is Better?

In the age of AI, every tool promises to make our lives easier. From summarizing content to writing code, ...