Getting Data In

How to index duplicate files which has different name

snehalk
Communicator

Hello All,

Is there any application or method in Splunk, where we can index the files(which has same contain) in Splunk but with different file name?

Or Can we identify the duplicate files in Splunk without indexing?

Note: one option is there where we can delete the fishbucket and reindex, but i dont want to go with this option.

Can any one help me on this.

Thanks in Advance.

1 Solution

gcusello
SplunkTrust
SplunkTrust

insert in your inputs.conf stanza the row

crcSalt = <SOURCE>

as described in http://docs.splunk.com/Documentation/Splunk/latest/admin/inputsconf

Bye.
Giuseppe

View solution in original post

gcusello
SplunkTrust
SplunkTrust

insert in your inputs.conf stanza the row

crcSalt = <SOURCE>

as described in http://docs.splunk.com/Documentation/Splunk/latest/admin/inputsconf

Bye.
Giuseppe

snehalk
Communicator

Hello Cusello,

Thank you for quick response, am using below config as suggested by you, and its indexing duplicate files in splunk.

inputs.conf
[monitor://C:\sampleduplicate\duplicatefiles\*]
index=main
sourcetype=vendorduplicate
crcSalt = <SOURCE>

Thank you once again !!

0 Karma

Samir__
Engager

Follow up question on the same topic. I also have files with different name but most of them have exact same contents.

After adding crcSalt = < SOURCE> in inputs.conf, does splunk automatically index previously excluded "duplicates"?

0 Karma
Get Updates on the Splunk Community!

The All New Performance Insights for Splunk

Splunk gives you amazing tools to analyze system data and make business-critical decisions, react to issues, ...

Good Sourcetype Naming

When it comes to getting data in, one of the earliest decisions made is what to use as a sourcetype. Often, ...

See your relevant APM services, dashboards, and alerts in one place with the updated ...

As a Splunk Observability user, you have a lot of data you have to manage, prioritize, and troubleshoot on a ...