<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: What causes duplicate data in an indexer cluster? in Deployment Architecture</title>
    <link>https://community.splunk.com/t5/Deployment-Architecture/What-causes-duplicate-data-in-an-indexer-cluster/m-p/386369#M14059</link>
    <description>&lt;P&gt;any difference between the original and it duplicate/s.&lt;BR /&gt;
i.e. for each event,  how does it differ from its duplicate? Is there only 1 copy or more of each of the duplicates?&lt;/P&gt;</description>
    <pubDate>Sun, 18 Nov 2018 21:34:18 GMT</pubDate>
    <dc:creator>laurie_gellatly</dc:creator>
    <dc:date>2018-11-18T21:34:18Z</dc:date>
    <item>
      <title>What causes duplicate data in an indexer cluster?</title>
      <link>https://community.splunk.com/t5/Deployment-Architecture/What-causes-duplicate-data-in-an-indexer-cluster/m-p/386367#M14057</link>
      <description>&lt;P&gt;We're seeing an issue on our indexer cluster where ~25% of events are duplicated. The raw logs do not contain duplicates, nor are there duplicate or overlapping monitor stanzas. When looking at &lt;STRONG&gt;bucket ID&lt;/STRONG&gt;, &lt;STRONG&gt;Index Time&lt;/STRONG&gt;, and &lt;STRONG&gt;Splunk Server&lt;/STRONG&gt;, all are identical across the duplicates.&lt;/P&gt;

&lt;P&gt;Our indexers are clustered, and we're running Enterprise version 6.6.3 on Windows Server 2012 R2.&lt;/P&gt;

&lt;P&gt;Here's our aggregated outputs.conf from a Universal Forwarder:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;\splunk btool outputs list

[syslog]
maxEventSize = 1024
priority = &amp;lt;13&amp;gt;
type = udp
[tcpout]
ackTimeoutOnShutdown = 30
autoLBFrequency = 30
autoLBVolume = 0
blockOnCloning = true
blockWarnThreshold = 100
cipherSuite = ECDHE-ECDSA-AES256-GCM-SHA384:ECDHE-RSA-AES256-GCM-SHA384:ECDHE-ECDSA-AES128-GCM-SHA256:ECDHE-RSA-AES128-GCM-SHA256:ECDHE-ECDSA-AES256-SHA384:ECDHE-RSA-AES256-SHA384:ECDHE-ECDSA-AES128-SHA256:ECDHE-RSA-AES128-SHA256:AES256-GCM-SHA384:AES128-GCM-SHA256:AES128-SHA256:ECDH-ECDSA-AES256-GCM-SHA384:ECDH-ECDSA-AES128-GCM-SHA256:ECDH-ECDSA-AES256-SHA384:ECDH-ECDSA-AES128-SHA256
compressed = false
connectionTimeout = 20
defaultGroup = gd_indexers
disabled = false
dropClonedEventsOnQueueFull = 5
dropEventsOnQueueFull = -1
ecdhCurves = prime256v1, secp384r1, secp521r1
forceTimebasedAutoLB = false
forwardedindex.0.whitelist = .*
forwardedindex.1.blacklist = _.*
forwardedindex.2.whitelist = (_audit|_introspection|_internal|_telemetry)
forwardedindex.filter.disable = false
heartbeatFrequency = 30
indexAndForward = false
maxConnectionsPerIndexer = 2
maxFailuresPerInterval = 2
maxQueueSize = auto
readTimeout = 300
secsInFailureInterval = 1
sendCookedData = true
sslQuietShutdown = false
sslVersions = tls1.2
tcpSendBufSz = 0
useACK = true
writeTimeout = 300
[tcpout:gd_indexers]
server = &amp;lt;List of Internal IPs&amp;gt;
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;If anyone can suggest an avenue for troubleshooting, it would be greatly appreciated. Please also let me know if I can provide more relevant information.&lt;/P&gt;</description>
      <pubDate>Thu, 15 Nov 2018 19:45:36 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Deployment-Architecture/What-causes-duplicate-data-in-an-indexer-cluster/m-p/386367#M14057</guid>
      <dc:creator>schofiet</dc:creator>
      <dc:date>2018-11-15T19:45:36Z</dc:date>
    </item>
    <item>
      <title>Re: What causes duplicate data in an indexer cluster?</title>
      <link>https://community.splunk.com/t5/Deployment-Architecture/What-causes-duplicate-data-in-an-indexer-cluster/m-p/386368#M14058</link>
      <description>&lt;P&gt;You said you looked at indextime. Did that include looking at the indextime for both copies of the same event?&lt;BR /&gt;
Pick a few events that are duplicated and look at any differences between the events. &lt;BR /&gt;
indextime, host, splunkserver... is there anything you can see as different?&lt;/P&gt;

&lt;P&gt;...Laurie:{)&lt;/P&gt;</description>
      <pubDate>Sun, 18 Nov 2018 21:31:46 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Deployment-Architecture/What-causes-duplicate-data-in-an-indexer-cluster/m-p/386368#M14058</guid>
      <dc:creator>laurie_gellatly</dc:creator>
      <dc:date>2018-11-18T21:31:46Z</dc:date>
    </item>
    <item>
      <title>Re: What causes duplicate data in an indexer cluster?</title>
      <link>https://community.splunk.com/t5/Deployment-Architecture/What-causes-duplicate-data-in-an-indexer-cluster/m-p/386369#M14059</link>
      <description>&lt;P&gt;any difference between the original and it duplicate/s.&lt;BR /&gt;
i.e. for each event,  how does it differ from its duplicate? Is there only 1 copy or more of each of the duplicates?&lt;/P&gt;</description>
      <pubDate>Sun, 18 Nov 2018 21:34:18 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Deployment-Architecture/What-causes-duplicate-data-in-an-indexer-cluster/m-p/386369#M14059</guid>
      <dc:creator>laurie_gellatly</dc:creator>
      <dc:date>2018-11-18T21:34:18Z</dc:date>
    </item>
    <item>
      <title>Re: What causes duplicate data in an indexer cluster?</title>
      <link>https://community.splunk.com/t5/Deployment-Architecture/What-causes-duplicate-data-in-an-indexer-cluster/m-p/386370#M14060</link>
      <description>&lt;P&gt;I used the following search:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;index=&amp;lt;my_index&amp;gt; sourcetype=&amp;lt;my_sourcetype&amp;gt;
| eval bucket=_bkt
| eval indextime=_indextime
| table _time, indextime, bucket splunk_server _raw
| convert ctime(indextime)
| stats count list(*) as * by _raw
| where count&amp;gt;1
| fields * _raw
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;Under the indextime field, I saw one value repeated for each of the duplicate events, same with bucket and splunk_server.&lt;/P&gt;

&lt;P&gt;There appears to be no difference between duplicates, aside from occasionally there are 3 to 5 copies in an indexer, but most of the time just two copies. It's not always the same indexer either, it seems relatively evenly distributed.&lt;/P&gt;</description>
      <pubDate>Mon, 19 Nov 2018 17:18:13 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Deployment-Architecture/What-causes-duplicate-data-in-an-indexer-cluster/m-p/386370#M14060</guid>
      <dc:creator>schofiet</dc:creator>
      <dc:date>2018-11-19T17:18:13Z</dc:date>
    </item>
    <item>
      <title>Re: What causes duplicate data in an indexer cluster?</title>
      <link>https://community.splunk.com/t5/Deployment-Architecture/What-causes-duplicate-data-in-an-indexer-cluster/m-p/386371#M14061</link>
      <description>&lt;P&gt;Found this: &lt;A href="https://answers.splunk.com/answers/365914/why-are-we-seeing-duplicate-events-found-in-an-ind.html"&gt;https://answers.splunk.com/answers/365914/why-are-we-seeing-duplicate-events-found-in-an-ind.html&lt;/A&gt;&lt;BR /&gt;
An additional incentive to update???&lt;/P&gt;</description>
      <pubDate>Mon, 19 Nov 2018 20:50:21 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Deployment-Architecture/What-causes-duplicate-data-in-an-indexer-cluster/m-p/386371#M14061</guid>
      <dc:creator>laurie_gellatly</dc:creator>
      <dc:date>2018-11-19T20:50:21Z</dc:date>
    </item>
    <item>
      <title>Re: What causes duplicate data in an indexer cluster?</title>
      <link>https://community.splunk.com/t5/Deployment-Architecture/What-causes-duplicate-data-in-an-indexer-cluster/m-p/386372#M14062</link>
      <description>&lt;P&gt;We're working on upgrading to 7.2.x as soon as we can get it scheduled. The linked question looks like it's talking about 6.4 as the solution; we're on 6.6. Appreciate you taking the time to post a suggestion though!&lt;/P&gt;</description>
      <pubDate>Mon, 19 Nov 2018 21:22:29 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Deployment-Architecture/What-causes-duplicate-data-in-an-indexer-cluster/m-p/386372#M14062</guid>
      <dc:creator>schofiet</dc:creator>
      <dc:date>2018-11-19T21:22:29Z</dc:date>
    </item>
  </channel>
</rss>

