<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Problem with indexing the same filename in Splunk Enterprise</title>
    <link>https://community.splunk.com/t5/Splunk-Enterprise/Problem-with-indexing-the-same-filename/m-p/569880#M10252</link>
    <description>&lt;P&gt;Good afternoon!&lt;/P&gt;&lt;P&gt;I have a &lt;SPAN&gt;XPRT_002_SYSAT-41777_202110020712.csv&lt;/SPAN&gt; file. After some time, exactly the same &lt;SPAN&gt;XPRT_002_SYSAT-41777_202110020712.csv&lt;/SPAN&gt; file appears in my directory, with exactly the same content, but with a different modification time. In this case, the system indexes all events from this file twice and I have duplicates. I know that they can be filtered by means of dedup _raw, but it is not my way because it very strongly worsens search performance. Are there any other ways to configure indexing based on file changes rather than name and size, and if they match, do not index again?&lt;/P&gt;&lt;P&gt;Tried:&lt;/P&gt;&lt;P&gt;crcSalt = &amp;lt;SOURCE&amp;gt;&lt;/P&gt;&lt;P&gt;CHECK_METHOD = modtime&lt;/P&gt;</description>
    <pubDate>Wed, 06 Oct 2021 13:10:39 GMT</pubDate>
    <dc:creator>krylov</dc:creator>
    <dc:date>2021-10-06T13:10:39Z</dc:date>
    <item>
      <title>Problem with indexing the same filename</title>
      <link>https://community.splunk.com/t5/Splunk-Enterprise/Problem-with-indexing-the-same-filename/m-p/569880#M10252</link>
      <description>&lt;P&gt;Good afternoon!&lt;/P&gt;&lt;P&gt;I have a &lt;SPAN&gt;XPRT_002_SYSAT-41777_202110020712.csv&lt;/SPAN&gt; file. After some time, exactly the same &lt;SPAN&gt;XPRT_002_SYSAT-41777_202110020712.csv&lt;/SPAN&gt; file appears in my directory, with exactly the same content, but with a different modification time. In this case, the system indexes all events from this file twice and I have duplicates. I know that they can be filtered by means of dedup _raw, but it is not my way because it very strongly worsens search performance. Are there any other ways to configure indexing based on file changes rather than name and size, and if they match, do not index again?&lt;/P&gt;&lt;P&gt;Tried:&lt;/P&gt;&lt;P&gt;crcSalt = &amp;lt;SOURCE&amp;gt;&lt;/P&gt;&lt;P&gt;CHECK_METHOD = modtime&lt;/P&gt;</description>
      <pubDate>Wed, 06 Oct 2021 13:10:39 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Splunk-Enterprise/Problem-with-indexing-the-same-filename/m-p/569880#M10252</guid>
      <dc:creator>krylov</dc:creator>
      <dc:date>2021-10-06T13:10:39Z</dc:date>
    </item>
  </channel>
</rss>

