Getting Data In

Reindex gz files not working : Why is already indexed as a non-archive?

splunkreal
Motivator

Hello,
we have issue reindexing archives as gz files even using crcSalt = <SOURCE> or crcSalt = REINDEXMPLEASE
We CAN'T go on each UF and clean fishbucket.

 

UF (V7.1.4) linux splunkd.log :
07-19-2022 18:19:09.129 +0200 INFO ArchiveProcessor - Handling file=/var/log/MAJ-OS.log-20220601.gz
07-19-2022 18:19:09.130 +0200 INFO ArchiveProcessor - reading path=/var/log/MAJ-OS.log-20220601.gz (seek=0 len=1356)
07-19-2022 18:19:09.281 +0200 INFO ArchiveProcessor - Archive with path="/var/log/MAJ-OS.log-20220601.gz" was already indexed as a non-archive, skipping.
07-19-2022 18:19:09.281 +0200 INFO ArchiveProcessor - Finished processing file '/var/log/MAJ-OS.log-20220601.gz', removing from stats

It also says "new tailer already processed path..."


inputs.conf app from deployment-apps (V8.2.2) :
[monitor:///var/log/MAJ-OS.log*]
blacklist = archives
disabled = false
index = inf-servers
sourcetype = MAJ-OS
crcSalt = <SOURCE>

 

Thanks for your help.

 

 

* If this helps, please upvote or accept solution if it solved *
Labels (3)
Tags (3)
0 Karma
1 Solution

splunkreal
Motivator

Solved with support help:

 

add:

crcSalt = <SOURCE>

initCrcLength = 1000

ignoreOlderThan = 90d

 

---

 

Another personal workaround that could do the job : example :

 

[script://./bin/MAJ-OS_zcat.sh]

source = MAJ-OS_zcat

interval = 2592000

disabled = true

index = inf-servers

sourcetype = MAJ-OS

 

MAJ-OS_zcat.sh :

#!/bin/sh

content=`zcat /var/log/MAJ-OS.log-20220701.gz | grep -i status`

echo $content "(catchup 07/20/2022)"

 

🙂

* If this helps, please upvote or accept solution if it solved *

View solution in original post

0 Karma

splunkreal
Motivator

Solved with support help:

 

add:

crcSalt = <SOURCE>

initCrcLength = 1000

ignoreOlderThan = 90d

 

---

 

Another personal workaround that could do the job : example :

 

[script://./bin/MAJ-OS_zcat.sh]

source = MAJ-OS_zcat

interval = 2592000

disabled = true

index = inf-servers

sourcetype = MAJ-OS

 

MAJ-OS_zcat.sh :

#!/bin/sh

content=`zcat /var/log/MAJ-OS.log-20220701.gz | grep -i status`

echo $content "(catchup 07/20/2022)"

 

🙂

* If this helps, please upvote or accept solution if it solved *
0 Karma
Get Updates on the Splunk Community!

CX Day is Coming!

Customer Experience (CX) Day is on October 7th!! We're so excited to bring back another day full of wonderful ...

Strengthen Your Future: A Look Back at Splunk 10 Innovations and .conf25 Highlights!

The Big One: Splunk 10 is Here!  The moment many of you have been waiting for has arrived! We are thrilled to ...

Now Offering the AI Assistant Usage Dashboard in Cloud Monitoring Console

Today, we’re excited to announce the release of a brand new AI assistant usage dashboard in Cloud Monitoring ...