Problem with indexing the same filename

krylov — Wed, 06 Oct 2021 13:10:39 GMT

Good afternoon!

I have a XPRT_002_SYSAT-41777_202110020712.csv file. After some time, exactly the same XPRT_002_SYSAT-41777_202110020712.csv file appears in my directory, with exactly the same content, but with a different modification time. In this case, the system indexes all events from this file twice and I have duplicates. I know that they can be filtered by means of dedup _raw, but it is not my way because it very strongly worsens search performance. Are there any other ways to configure indexing based on file changes rather than name and size, and if they match, do not index again?

Tried:

crcSalt = <SOURCE>

CHECK_METHOD = modtime

topic Problem with indexing the same filename in Splunk Enterprise

Problem with indexing the same filename