<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Sporadic "Timed out waiting for peer" messages when querying search peers / indexer cluster in Deployment Architecture</title>
    <link>https://community.splunk.com/t5/Deployment-Architecture/Sporadic-quot-Timed-out-waiting-for-peer-quot-messsages-when/m-p/557558#M18940</link>
    <description>&lt;P&gt;Recently we've been noticing a lot of searches have been getting connection timeouts when trying to query our indexer cluster.&lt;/P&gt;&lt;P&gt;We keep getting the message:&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;2 errors occurred while the search was executing. Therefore, search results might be incomplete. Hide errors.
Error connecting: Connect Timeout Timeout error.
Timed out waiting for peer searchpeer01. Search results might be incomplete! If this occurs frequently, receiveTimeout in distsearch.conf might need to be increased.&lt;/LI-CODE&gt;&lt;P&gt;&lt;BR /&gt;Delving into the search.log, we see that we are getting 502 Bad Gateway from the indexer cluster:&lt;/P&gt;&lt;PRE&gt;06-28-2021 12:45:14.663 ERROR SearchResultTransaction - Got status 502 from https://10.0.0.43:8089/services/streams/search?sh_sid=scheduler__username_aW52X2NpdF9zbm93X3NlYXJjaA__RMD565f4e7f87d23277d_at_1624880700_38630
06-28-2021 12:45:14.663 ERROR SearchResultParser - HTTP error status message from https://10.0.0.43:8089/services/streams/search?sh_sid=scheduler__username_aW52X2NpdF9zbm93X3NlYXJjaA__RMD565f4e7f87d23277d_at_1624880700_38630: Error connecting: Connect Timeout
06-28-2021 12:45:14.663 WARN  SearchResultCollator - Failure received on retry collector. _unresolvedRetries=1
06-28-2021 12:45:14.663 WARN  SearchResultParserExecutor - Error connecting: Connect Timeout Timeout error. for collector=searchpeer01
06-28-2021 12:45:14.663 ERROR DispatchThread - sid:scheduler__username_aW52X2NpdF9zbm93X3NlYXJjaA__RMD565f4e7f87d23277d_at_1624880700_38630 Timed out waiting for peer searchpeer01.  Search results might be incomplete! If this occurs frequently, receiveTimeout in distsearch.conf might need to be increased.&lt;/PRE&gt;&lt;P&gt;Considering that receiveTimeout is already 600 seconds, I don't think increasing it will change anything. I'm not sure where these 502 errors are coming from or what to do about them.&lt;/P&gt;&lt;P&gt;Does anyone have any insight into what may be happening? We're running version 8.1.3 on the search head and 7.3.3 on the indexer cluster (though we plan to upgrade to 8.1.4 as soon as we are able to).&lt;/P&gt;&lt;P&gt;Thanks!&lt;/P&gt;</description>
    <pubDate>Tue, 29 Jun 2021 08:17:19 GMT</pubDate>
    <dc:creator>althomas</dc:creator>
    <dc:date>2021-06-29T08:17:19Z</dc:date>
    <item>
      <title>Sporadic "Timed out waiting for peer" messages when querying search peers / indexer cluster</title>
      <link>https://community.splunk.com/t5/Deployment-Architecture/Sporadic-quot-Timed-out-waiting-for-peer-quot-messsages-when/m-p/557558#M18940</link>
      <description>&lt;P&gt;Recently we've been noticing a lot of searches have been getting connection timeouts when trying to query our indexer cluster.&lt;/P&gt;&lt;P&gt;We keep getting the message:&lt;/P&gt;&lt;LI-CODE lang="markup"&gt;2 errors occurred while the search was executing. Therefore, search results might be incomplete. Hide errors.
Error connecting: Connect Timeout Timeout error.
Timed out waiting for peer searchpeer01. Search results might be incomplete! If this occurs frequently, receiveTimeout in distsearch.conf might need to be increased.&lt;/LI-CODE&gt;&lt;P&gt;&lt;BR /&gt;Delving into the search.log, we see that we are getting 502 Bad Gateway from the indexer cluster:&lt;/P&gt;&lt;PRE&gt;06-28-2021 12:45:14.663 ERROR SearchResultTransaction - Got status 502 from https://10.0.0.43:8089/services/streams/search?sh_sid=scheduler__username_aW52X2NpdF9zbm93X3NlYXJjaA__RMD565f4e7f87d23277d_at_1624880700_38630
06-28-2021 12:45:14.663 ERROR SearchResultParser - HTTP error status message from https://10.0.0.43:8089/services/streams/search?sh_sid=scheduler__username_aW52X2NpdF9zbm93X3NlYXJjaA__RMD565f4e7f87d23277d_at_1624880700_38630: Error connecting: Connect Timeout
06-28-2021 12:45:14.663 WARN  SearchResultCollator - Failure received on retry collector. _unresolvedRetries=1
06-28-2021 12:45:14.663 WARN  SearchResultParserExecutor - Error connecting: Connect Timeout Timeout error. for collector=searchpeer01
06-28-2021 12:45:14.663 ERROR DispatchThread - sid:scheduler__username_aW52X2NpdF9zbm93X3NlYXJjaA__RMD565f4e7f87d23277d_at_1624880700_38630 Timed out waiting for peer searchpeer01.  Search results might be incomplete! If this occurs frequently, receiveTimeout in distsearch.conf might need to be increased.&lt;/PRE&gt;&lt;P&gt;Considering that receiveTimeout is already 600 seconds, I don't think increasing it will change anything. I'm not sure where these 502 errors are coming from or what to do about them.&lt;/P&gt;&lt;P&gt;Does anyone have any insight into what may be happening? We're running version 8.1.3 on the search head and 7.3.3 on the indexer cluster (though we plan to upgrade to 8.1.4 as soon as we are able to).&lt;/P&gt;&lt;P&gt;Thanks!&lt;/P&gt;</description>
      <pubDate>Tue, 29 Jun 2021 08:17:19 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Deployment-Architecture/Sporadic-quot-Timed-out-waiting-for-peer-quot-messsages-when/m-p/557558#M18940</guid>
      <dc:creator>althomas</dc:creator>
      <dc:date>2021-06-29T08:17:19Z</dc:date>
    </item>
    <item>
      <title>Re: Sporadic "Timed out waiting for peer" messages when querying search peers / indexer cluster</title>
      <link>https://community.splunk.com/t5/Deployment-Architecture/Sporadic-quot-Timed-out-waiting-for-peer-quot-messsages-when/m-p/557653#M18943</link>
      <description>&lt;P&gt;Have you checked network latency between your SHC nodes and the indexers? A simple ping is a good place to start...&lt;/P&gt;</description>
      <pubDate>Tue, 29 Jun 2021 18:02:01 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Deployment-Architecture/Sporadic-quot-Timed-out-waiting-for-peer-quot-messsages-when/m-p/557653#M18943</guid>
      <dc:creator>codebuilder</dc:creator>
      <dc:date>2021-06-29T18:02:01Z</dc:date>
    </item>
    <item>
      <title>Re: Sporadic "Timed out waiting for peer" messages when querying search peers / indexer cluster</title>
      <link>https://community.splunk.com/t5/Deployment-Architecture/Sporadic-quot-Timed-out-waiting-for-peer-quot-messsages-when/m-p/557766#M18951</link>
      <description>&lt;P&gt;It's on the same network -- ping is 0-1 ms.&lt;/P&gt;</description>
      <pubDate>Wed, 30 Jun 2021 07:48:51 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Deployment-Architecture/Sporadic-quot-Timed-out-waiting-for-peer-quot-messsages-when/m-p/557766#M18951</guid>
      <dc:creator>althomas</dc:creator>
      <dc:date>2021-06-30T07:48:51Z</dc:date>
    </item>
    <item>
      <title>Re: Sporadic "Timed out waiting for peer" messages when querying search peers / indexer cluster</title>
      <link>https://community.splunk.com/t5/Deployment-Architecture/Sporadic-quot-Timed-out-waiting-for-peer-quot-messsages-when/m-p/566605#M24858</link>
      <description>&lt;P&gt;Hi&lt;/P&gt;&lt;P&gt;I see the exact same problems on an 8.0.4 indexer cluster and search head cluster. We have sporadic errors and timeouts.&lt;/P&gt;&lt;P&gt;The servers are 80-core dual-socket machines with 386 GB RAM, all SSD, and a fiber network. Ping is around 1 ms between all servers. We also have no ingestion errors or other network-related errors; it ONLY affects searches.&lt;/P&gt;&lt;P&gt;I also see many errors of this type (though only logged as warnings?) in splunkd.log on all indexers:&lt;/P&gt;&lt;PRE&gt;09-10-2021 12:39:03.296 +0200 WARN HttpListener - Socket error from "IPaddress":47270 while accessing /services/streams/search: Broken pipe&lt;/PRE&gt;&lt;P&gt;When we see many of these, several searches log the exact same errors in search.log as posted above, i.e. searches fail to retrieve the correct results.&lt;/P&gt;&lt;P&gt;Have any of you had any luck mitigating this? Or should the next step be a support case?&lt;/P&gt;</description>
      <pubDate>Fri, 10 Sep 2021 10:53:09 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Deployment-Architecture/Sporadic-quot-Timed-out-waiting-for-peer-quot-messsages-when/m-p/566605#M24858</guid>
      <dc:creator>agneticdk</dc:creator>
      <dc:date>2021-09-10T10:53:09Z</dc:date>
    </item>
    <item>
      <title>Re: Sporadic "Timed out waiting for peer" messages when querying search peers / indexer cluster</title>
      <link>https://community.splunk.com/t5/Deployment-Architecture/Sporadic-quot-Timed-out-waiting-for-peer-quot-messsages-when/m-p/566606#M24859</link>
      <description>&lt;P&gt;We had, for various reasons, different versions of enterprise servers due to a merging of sites and a stilted roll-forward schedule. Because of these issues, we pushed to move everything onto the same version, and this resolved most of the issues.&lt;/P&gt;&lt;P&gt;We still have other issues because we have multiple sites, some with lots of latency, but this isn't one of them.&lt;/P&gt;&lt;P&gt;I would probably recommend a support case or an upgrade to the latest 8.1.X.&lt;/P&gt;&lt;P&gt;FYI 8.0.X is EOL from next month.&lt;/P&gt;</description>
      <pubDate>Fri, 10 Sep 2021 11:00:19 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Deployment-Architecture/Sporadic-quot-Timed-out-waiting-for-peer-quot-messsages-when/m-p/566606#M24859</guid>
      <dc:creator>althomas</dc:creator>
      <dc:date>2021-09-10T11:00:19Z</dc:date>
    </item>
    <item>
      <title>Re: Sporadic "Timed out waiting for peer" messages when querying search peers / indexer cluster</title>
      <link>https://community.splunk.com/t5/Deployment-Architecture/Sporadic-quot-Timed-out-waiting-for-peer-quot-messsages-when/m-p/566607#M24860</link>
      <description>&lt;P&gt;OK, thank you.&lt;/P&gt;&lt;P&gt;Yes, an upgrade is definitely also in the works. We might do that before raising a ticket.&lt;/P&gt;&lt;P&gt;André&lt;/P&gt;</description>
      <pubDate>Fri, 10 Sep 2021 11:00:15 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Deployment-Architecture/Sporadic-quot-Timed-out-waiting-for-peer-quot-messsages-when/m-p/566607#M24860</guid>
      <dc:creator>agneticdk</dc:creator>
      <dc:date>2021-09-10T11:00:15Z</dc:date>
    </item>
    <item>
      <title>Re: Sporadic "Timed out waiting for peer" messages when querying search peers / indexer cluster</title>
      <link>https://community.splunk.com/t5/Deployment-Architecture/Sporadic-quot-Timed-out-waiting-for-peer-quot-messsages-when/m-p/566609#M24861</link>
      <description>&lt;P&gt;We have seen this "broken pipe" error in our environments as well. Not to a great extent, but we still see it, and we have to rerun the affected searches. We're not sure what causes it.&lt;/P&gt;</description>
      <pubDate>Fri, 10 Sep 2021 11:06:25 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Deployment-Architecture/Sporadic-quot-Timed-out-waiting-for-peer-quot-messsages-when/m-p/566609#M24861</guid>
      <dc:creator>ktatrifork</dc:creator>
      <dc:date>2021-09-10T11:06:25Z</dc:date>
    </item>
    <item>
      <title>Re: Sporadic "Timed out waiting for peer" messages when querying search peers / indexer cluster</title>
      <link>https://community.splunk.com/t5/Deployment-Architecture/Sporadic-quot-Timed-out-waiting-for-peer-quot-messsages-when/m-p/566610#M24862</link>
      <description>&lt;P&gt;We're seeing the same issue on 8.2.1. We're also not seeing any hardware or network issues, and the server is heavily spec'ed.&lt;/P&gt;</description>
      <pubDate>Fri, 10 Sep 2021 11:06:32 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Deployment-Architecture/Sporadic-quot-Timed-out-waiting-for-peer-quot-messsages-when/m-p/566610#M24862</guid>
      <dc:creator>Terpz</dc:creator>
      <dc:date>2021-09-10T11:06:32Z</dc:date>
    </item>
    <item>
      <title>Re: Sporadic "Timed out waiting for peer" messages when querying search peers / indexer cluster</title>
      <link>https://community.splunk.com/t5/Deployment-Architecture/Sporadic-quot-Timed-out-waiting-for-peer-quot-messsages-when/m-p/577585#M25083</link>
      <description>&lt;P&gt;Adding my 2 cents here - we have the exact same error messages. We also run a multisite cluster and a search head cluster, all hardware-based.&lt;/P&gt;&lt;P&gt;This issue started to occur after we updated to 8.2.2. We have timeouts on our search head cluster members:&lt;/P&gt;&lt;P&gt;"Timed out waiting for peer [XXX] . Search results might be incomplete! If this occurs frequently, receiveTimeout in distsearch.conf might need to be increased."&lt;/P&gt;&lt;P&gt;And we also have the broken pipe events for our indexers. Splunk Support couldn't help so far. Their last resort was to look at the network and OS level.&lt;/P&gt;&lt;P&gt;Before we updated we had no issues, now they started...&lt;/P&gt;</description>
      <pubDate>Tue, 07 Dec 2021 10:16:45 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Deployment-Architecture/Sporadic-quot-Timed-out-waiting-for-peer-quot-messsages-when/m-p/577585#M25083</guid>
      <dc:creator>DanielAmlung</dc:creator>
      <dc:date>2021-12-07T10:16:45Z</dc:date>
    </item>
    <item>
      <title>Re: Sporadic "Timed out waiting for peer" messages when querying search peers / indexer cluster</title>
      <link>https://community.splunk.com/t5/Deployment-Architecture/Sporadic-quot-Timed-out-waiting-for-peer-quot-messsages-when/m-p/577611#M25084</link>
      <description>&lt;P&gt;Just an update on my end on this.&lt;/P&gt;&lt;P&gt;An upgrade fixed the problem. I think the cause was an internal Splunk setting related to SSL compression.&lt;/P&gt;&lt;P&gt;In the new version (8.2.2) this setting is false; it was true in the old version we ran (8.1.3).&lt;/P&gt;&lt;P&gt;In server.conf on both the search heads (search head cluster) and the indexers (indexer cluster):&lt;/P&gt;&lt;PRE&gt;[sslConfig]
useClientSSLCompression = false&lt;/PRE&gt;&lt;P&gt;I saw that this fixed the same problems for another customer on 8.1.4 (I think).&lt;/P&gt;&lt;P&gt;useClientSSLCompression defaults to true in older versions and to false in the newer ones.&lt;/P&gt;&lt;P&gt;If you run an older version of Splunk with a search head cluster (I have not seen it with a single search head and an indexer cluster), you could try the above to see if it works.&lt;/P&gt;&lt;P&gt;Regards&lt;/P&gt;&lt;P&gt;André&lt;/P&gt;</description>
      <pubDate>Tue, 07 Dec 2021 12:47:44 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Deployment-Architecture/Sporadic-quot-Timed-out-waiting-for-peer-quot-messsages-when/m-p/577611#M25084</guid>
      <dc:creator>agneticdk</dc:creator>
      <dc:date>2021-12-07T12:47:44Z</dc:date>
    </item>
  </channel>
</rss>

