Getting Data In

Reindex gz files not working : Why is already indexed as a non-archive?

splunkreal
Motivator

Hello,
we have issue reindexing archives as gz files even using crcSalt = <SOURCE> or crcSalt = REINDEXMPLEASE
We CAN'T go on each UF and clean fishbucket.

 

UF (V7.1.4) linux splunkd.log :
07-19-2022 18:19:09.129 +0200 INFO ArchiveProcessor - Handling file=/var/log/MAJ-OS.log-20220601.gz
07-19-2022 18:19:09.130 +0200 INFO ArchiveProcessor - reading path=/var/log/MAJ-OS.log-20220601.gz (seek=0 len=1356)
07-19-2022 18:19:09.281 +0200 INFO ArchiveProcessor - Archive with path="/var/log/MAJ-OS.log-20220601.gz" was already indexed as a non-archive, skipping.
07-19-2022 18:19:09.281 +0200 INFO ArchiveProcessor - Finished processing file '/var/log/MAJ-OS.log-20220601.gz', removing from stats

It also says "new tailer already processed path..."


inputs.conf app from deployment-apps (V8.2.2) :
[monitor:///var/log/MAJ-OS.log*]
blacklist = archives
disabled = false
index = inf-servers
sourcetype = MAJ-OS
crcSalt = <SOURCE>

 

Thanks for your help.

 

 

* If this helps, please upvote or accept solution if it solved *
Labels (3)
Tags (3)
0 Karma
1 Solution

splunkreal
Motivator

Solved with support help:

 

add:

crcSalt = <SOURCE>

initCrcLength = 1000

ignoreOlderThan = 90d

 

---

 

Another personal workaround that could do the job : example :

 

[script://./bin/MAJ-OS_zcat.sh]

source = MAJ-OS_zcat

interval = 2592000

disabled = true

index = inf-servers

sourcetype = MAJ-OS

 

MAJ-OS_zcat.sh :

#!/bin/sh

content=`zcat /var/log/MAJ-OS.log-20220701.gz | grep -i status`

echo $content "(catchup 07/20/2022)"

 

🙂

* If this helps, please upvote or accept solution if it solved *

View solution in original post

0 Karma

splunkreal
Motivator

Solved with support help:

 

add:

crcSalt = <SOURCE>

initCrcLength = 1000

ignoreOlderThan = 90d

 

---

 

Another personal workaround that could do the job : example :

 

[script://./bin/MAJ-OS_zcat.sh]

source = MAJ-OS_zcat

interval = 2592000

disabled = true

index = inf-servers

sourcetype = MAJ-OS

 

MAJ-OS_zcat.sh :

#!/bin/sh

content=`zcat /var/log/MAJ-OS.log-20220701.gz | grep -i status`

echo $content "(catchup 07/20/2022)"

 

🙂

* If this helps, please upvote or accept solution if it solved *
0 Karma
Get Updates on the Splunk Community!

Index This | What’s a riddle wrapped in an enigma?

September 2025 Edition Hayyy Splunk Education Enthusiasts and the Eternally Curious!  We’re back with this ...

BORE at .conf25

Boss Of Regular Expression (BORE) was an interactive session run again this year at .conf25 by the brilliant ...

OpenTelemetry for Legacy Apps? Yes, You Can!

This article is a follow-up to my previous article posted on the OpenTelemetry Blog, "Your Critical Legacy App ...