Getting Data In

Reindex gz files not working : Why is already indexed as a non-archive?

splunkreal
Motivator

Hello,
we have issue reindexing archives as gz files even using crcSalt = <SOURCE> or crcSalt = REINDEXMPLEASE
We CAN'T go on each UF and clean fishbucket.

 

UF (V7.1.4) linux splunkd.log :
07-19-2022 18:19:09.129 +0200 INFO ArchiveProcessor - Handling file=/var/log/MAJ-OS.log-20220601.gz
07-19-2022 18:19:09.130 +0200 INFO ArchiveProcessor - reading path=/var/log/MAJ-OS.log-20220601.gz (seek=0 len=1356)
07-19-2022 18:19:09.281 +0200 INFO ArchiveProcessor - Archive with path="/var/log/MAJ-OS.log-20220601.gz" was already indexed as a non-archive, skipping.
07-19-2022 18:19:09.281 +0200 INFO ArchiveProcessor - Finished processing file '/var/log/MAJ-OS.log-20220601.gz', removing from stats

It also says "new tailer already processed path..."


inputs.conf app from deployment-apps (V8.2.2) :
[monitor:///var/log/MAJ-OS.log*]
blacklist = archives
disabled = false
index = inf-servers
sourcetype = MAJ-OS
crcSalt = <SOURCE>

 

Thanks for your help.

 

 

* If this helps, please upvote or accept solution if it solved *
Labels (3)
Tags (3)
0 Karma
1 Solution

splunkreal
Motivator

Solved with support help:

 

add:

crcSalt = <SOURCE>

initCrcLength = 1000

ignoreOlderThan = 90d

 

---

 

Another personal workaround that could do the job : example :

 

[script://./bin/MAJ-OS_zcat.sh]

source = MAJ-OS_zcat

interval = 2592000

disabled = true

index = inf-servers

sourcetype = MAJ-OS

 

MAJ-OS_zcat.sh :

#!/bin/sh

content=`zcat /var/log/MAJ-OS.log-20220701.gz | grep -i status`

echo $content "(catchup 07/20/2022)"

 

🙂

* If this helps, please upvote or accept solution if it solved *

View solution in original post

0 Karma

splunkreal
Motivator

Solved with support help:

 

add:

crcSalt = <SOURCE>

initCrcLength = 1000

ignoreOlderThan = 90d

 

---

 

Another personal workaround that could do the job : example :

 

[script://./bin/MAJ-OS_zcat.sh]

source = MAJ-OS_zcat

interval = 2592000

disabled = true

index = inf-servers

sourcetype = MAJ-OS

 

MAJ-OS_zcat.sh :

#!/bin/sh

content=`zcat /var/log/MAJ-OS.log-20220701.gz | grep -i status`

echo $content "(catchup 07/20/2022)"

 

🙂

* If this helps, please upvote or accept solution if it solved *
0 Karma
Career Survey
First 500 qualified respondents will receive a $20 gift card! Tell us about your professional Splunk journey.

Can’t make it to .conf25? Join us online!

Get Updates on the Splunk Community!

Community Content Calendar, September edition

Welcome to another insightful post from our Community Content Calendar! We're thrilled to continue bringing ...

Splunkbase Unveils New App Listing Management Public Preview

Splunkbase Unveils New App Listing Management Public PreviewWe're thrilled to announce the public preview of ...

Leveraging Automated Threat Analysis Across the Splunk Ecosystem

Are you leveraging automation to its fullest potential in your threat detection strategy?Our upcoming Security ...