<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: How to avoid same log with same content ingestion in splunk from different servers in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/How-to-avoid-same-log-with-same-content-ingestion-in-splunk-from/m-p/578533#M102152</link>
    <description>&lt;P&gt;I would see only one way to do this. And it's ugly. ...&lt;/P&gt;&lt;P&gt;Mount the 10 file systems on NFS from the 10 servers on one single UF.&amp;nbsp;&lt;/P&gt;&lt;P&gt;That way, the fishbucket will consider treat the file as unique/identical on the 10 different paths, and it will be indexed only once.&lt;/P&gt;</description>
    <pubDate>Wed, 15 Dec 2021 20:00:33 GMT</pubDate>
    <dc:creator>ldongradi_SPL</dc:creator>
    <dc:date>2021-12-15T20:00:33Z</dc:date>
    <item>
      <title>How to avoid same log with same content ingestion in splunk from different servers</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-avoid-same-log-with-same-content-ingestion-in-splunk-from/m-p/578490#M102140</link>
      <description>&lt;P&gt;Hello,&lt;/P&gt;&lt;P&gt;I have 10 servers for same purpose. If one server is down others will be active so that no loss of business continuity.&amp;nbsp;&lt;/P&gt;&lt;P&gt;We have ABC.log generates across all the servers with same content. We need to add all the 10 servers in serverclass.conf and we did&amp;nbsp; the same. But we are getting ABC.log to splunk multiple times I.e., 5 to 6 times or one event repeating 5 to 6 times.&amp;nbsp;&lt;/P&gt;&lt;P&gt;I appreciate any help to avoid mutiple ingestion of same log from different servers or avoid duplicate events.&amp;nbsp;&lt;/P&gt;&lt;P&gt;Added crcSalt in inputs.conf, but not working.&amp;nbsp;&lt;/P&gt;&lt;P&gt;Thanks&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Wed, 15 Dec 2021 14:53:37 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-avoid-same-log-with-same-content-ingestion-in-splunk-from/m-p/578490#M102140</guid>
      <dc:creator>narmadak</dc:creator>
      <dc:date>2021-12-15T14:53:37Z</dc:date>
    </item>
    <item>
      <title>Re: How to avoid same log with same content ingestion in splunk from different servers</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-avoid-same-log-with-same-content-ingestion-in-splunk-from/m-p/578501#M102144</link>
      <description>&lt;P&gt;The crcSalt setting helps only with a single monitored file.&amp;nbsp; Splunk has no way of knowing whether the data from several different servers is duplicated or not.&amp;nbsp; For all it knows, the same event hit all of the servers at about the same time.&lt;/P&gt;&lt;P&gt;The workaround is to remove duplicates at search time.&lt;/P&gt;</description>
      <pubDate>Wed, 15 Dec 2021 15:58:40 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-avoid-same-log-with-same-content-ingestion-in-splunk-from/m-p/578501#M102144</guid>
      <dc:creator>richgalloway</dc:creator>
      <dc:date>2021-12-15T15:58:40Z</dc:date>
    </item>
    <item>
      <title>Re: How to avoid same log with same content ingestion in splunk from different servers</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-avoid-same-log-with-same-content-ingestion-in-splunk-from/m-p/578533#M102152</link>
      <description>&lt;P&gt;I would see only one way to do this. And it's ugly. ...&lt;/P&gt;&lt;P&gt;Mount the 10 file systems on NFS from the 10 servers on one single UF.&amp;nbsp;&lt;/P&gt;&lt;P&gt;That way, the fishbucket will consider treat the file as unique/identical on the 10 different paths, and it will be indexed only once.&lt;/P&gt;</description>
      <pubDate>Wed, 15 Dec 2021 20:00:33 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-avoid-same-log-with-same-content-ingestion-in-splunk-from/m-p/578533#M102152</guid>
      <dc:creator>ldongradi_SPL</dc:creator>
      <dc:date>2021-12-15T20:00:33Z</dc:date>
    </item>
    <item>
      <title>Re: How to avoid same log with same content ingestion in splunk from different servers</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/How-to-avoid-same-log-with-same-content-ingestion-in-splunk-from/m-p/578538#M102154</link>
      <description>&lt;P&gt;Unfortunately, this does not solve possible license consumption issues.&lt;/P&gt;&lt;P&gt;It's a very unusual case and there is no ready-made solution for this (at least none that I know of). Splunk on its own does not implement deduplication on inputs. Also, you have to remember that "the same" event could be ingested from different sources/forwarders and get sent to other peers in a cluster. Splunk would have no way of knowing that the event is duplicated.&lt;/P&gt;&lt;P&gt;A slightly ugly solution would be to create your own modular input reading events from all those sources and performjng deduplication. But that would mean creating a SPOF in your infrastructure since you'd have to have a single collection point.&lt;/P&gt;&lt;P&gt;Another possibility (albeit ugly as hell and even more license-consuming that the original one) would be to ingest the events initially into a temporary index and then periodically deduplicate them in search time collecting them into a destination index. Ugh.&lt;/P&gt;</description>
      <pubDate>Wed, 15 Dec 2021 20:17:06 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/How-to-avoid-same-log-with-same-content-ingestion-in-splunk-from/m-p/578538#M102154</guid>
      <dc:creator>PickleRick</dc:creator>
      <dc:date>2021-12-15T20:17:06Z</dc:date>
    </item>
  </channel>
</rss>

