<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Purge queue on forwarder / indexer down in Getting Data In</title>
    <link>https://community.splunk.com/t5/Getting-Data-In/Purge-queue-on-forwarder-indexer-down/m-p/104613#M22020</link>
    <description>&lt;P&gt;I had one of my indexers go down a couple weeks back.  Since then each of my forwarders is trying to send events to that indexer but failing with errors like&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;WARN  TcpOutputFd - Connect to 10.1.4.183:9998 failed. Connection refused
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;So I modified my outputs.conf to remove that target indexer and restarted the forwarder (heavy).  However, I'm still seeing that error.  Also I'm seeing queueing errors on the forwarder:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;INFO  TailingProcessor - Could not send data to output queue (parsingQueue), retrying...
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;I'm thinking that the queue has retained the old indexer and is continuing to attempt event delivery.   As I noted, cycling splunkd on the forwarder doesn't make a difference.  I also think this is causing delays in sending events to my other indexers (5-15 minutes will go by before any events show up).&lt;/P&gt;

&lt;P&gt;How can I alleviate this problem (aside from standing up an indexer on the failed IP noted above)?&lt;/P&gt;</description>
    <pubDate>Fri, 20 Jul 2012 18:44:20 GMT</pubDate>
    <dc:creator>nocostk</dc:creator>
    <dc:date>2012-07-20T18:44:20Z</dc:date>
    <item>
      <title>Purge queue on forwarder / indexer down</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Purge-queue-on-forwarder-indexer-down/m-p/104613#M22020</link>
      <description>&lt;P&gt;I had one of my indexers go down a couple weeks back.  Since then each of my forwarders is trying to send events to that indexer but failing with errors like&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;WARN  TcpOutputFd - Connect to 10.1.4.183:9998 failed. Connection refused
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;So I modified my outputs.conf to remove that target indexer and restarted the forwarder (heavy).  However, I'm still seeing that error.  Also I'm seeing queueing errors on the forwarder:&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;INFO  TailingProcessor - Could not send data to output queue (parsingQueue), retrying...
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;I'm thinking that the queue has retained the old indexer and is continuing to attempt event delivery.   As I noted, cycling splunkd on the forwarder doesn't make a difference.  I also think this is causing delays in sending events to my other indexers (5-15 minutes will go by before any events show up).&lt;/P&gt;

&lt;P&gt;How can I alleviate this problem (aside from standing up an indexer on the failed IP noted above)?&lt;/P&gt;</description>
      <pubDate>Fri, 20 Jul 2012 18:44:20 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Purge-queue-on-forwarder-indexer-down/m-p/104613#M22020</guid>
      <dc:creator>nocostk</dc:creator>
      <dc:date>2012-07-20T18:44:20Z</dc:date>
    </item>
    <item>
      <title>Re: Purge queue on forwarder / indexer down</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Purge-queue-on-forwarder-indexer-down/m-p/104614#M22021</link>
      <description>&lt;P&gt;Did you find a solution for this? I think I'm seeing the same problem.&lt;/P&gt;</description>
      <pubDate>Tue, 21 Oct 2014 14:11:23 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Purge-queue-on-forwarder-indexer-down/m-p/104614#M22021</guid>
      <dc:creator>sloshburch</dc:creator>
      <dc:date>2014-10-21T14:11:23Z</dc:date>
    </item>
    <item>
      <title>Re: Purge queue on forwarder / indexer down</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Purge-queue-on-forwarder-indexer-down/m-p/104615#M22022</link>
      <description>&lt;P&gt;Any solution or comment on this? We are in the same situation&lt;/P&gt;</description>
      <pubDate>Fri, 21 Nov 2014 21:42:35 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Purge-queue-on-forwarder-indexer-down/m-p/104615#M22022</guid>
      <dc:creator>tkiss</dc:creator>
      <dc:date>2014-11-21T21:42:35Z</dc:date>
    </item>
    <item>
      <title>Re: Purge queue on forwarder / indexer down</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Purge-queue-on-forwarder-indexer-down/m-p/104616#M22023</link>
      <description>&lt;P&gt;Voting the question up is one way of saying you think this is important.&lt;/P&gt;</description>
      <pubDate>Fri, 21 Nov 2014 21:47:28 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Purge-queue-on-forwarder-indexer-down/m-p/104616#M22023</guid>
      <dc:creator>aljohnson_splun</dc:creator>
      <dc:date>2014-11-21T21:47:28Z</dc:date>
    </item>
    <item>
      <title>Re: Purge queue on forwarder / indexer down</title>
      <link>https://community.splunk.com/t5/Getting-Data-In/Purge-queue-on-forwarder-indexer-down/m-p/104617#M22024</link>
      <description>&lt;P&gt;We found that a couple of things were causing such issues. These are not necessarily the same issue you're seeing.&lt;/P&gt;

&lt;P&gt;I did some math and realized that we had some blocking because our Universal Forwarder was hitting its default &lt;CODE&gt;limits.conf&lt;/CODE&gt;&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[thruput]
maxKBps = 256
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;So we changed that to 0, which makes it unlimited. Keep in mind this impacts CPU on the host system where the forwarder lives.&lt;/P&gt;

&lt;P&gt;This allowed the forwarder to catch up to itself. I was then able to analyze the &lt;CODE&gt;metrics.log&lt;/CODE&gt; on the forwarder to see what thruput was required based on actual (the other option is to do some math and figure out how much thruput you need).&lt;BR /&gt;
The other thing was that we had to disable &lt;CODE&gt;useACK&lt;/CODE&gt; in my forwarder's &lt;CODE&gt;outputs.conf&lt;/CODE&gt; so its &lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[tcpout:mygroup]
useACK = false
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;This was because the ACKs caused even more thruput and pauses.&lt;/P&gt;

&lt;P&gt;So in conclusion, check out the &lt;CODE&gt;metrics.log&lt;/CODE&gt; and take a hard look at where the pipeline is backing up.&lt;/P&gt;

&lt;P&gt;Hopefully that helps you as well?!&lt;BR /&gt;
,We found that a couple of things were causing such issues. These are not necessarily the same issue you're seeing.&lt;/P&gt;

&lt;P&gt;I did some math and realized that we had some blocking because our Universal Forwarder was hitting its default &lt;CODE&gt;limits.conf&lt;/CODE&gt;&lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[thruput]
maxKBps = 256
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;So we changed that to 0, which makes it unlimited. Keep in mind this impacts CPU on the host system where the forwarder lives.&lt;/P&gt;

&lt;P&gt;This allowed the forwarder to catch up to itself. I was then able to analyze the &lt;CODE&gt;metrics.log&lt;/CODE&gt; on the forwarder to see what thruput was required based on actual (the other option is to do some math and figure out how much thruput you need).&lt;BR /&gt;
The other thing was that we had to disable &lt;CODE&gt;useACK&lt;/CODE&gt; in my forwarder's &lt;CODE&gt;outputs.conf&lt;/CODE&gt; so its &lt;/P&gt;

&lt;PRE&gt;&lt;CODE&gt;[tcpout:mygroup]
useACK = false
&lt;/CODE&gt;&lt;/PRE&gt;

&lt;P&gt;This was because the ACKs caused even more thruput and pauses.&lt;/P&gt;

&lt;P&gt;So in conclusion, check out the &lt;CODE&gt;metrics.log&lt;/CODE&gt; and take a hard look at where the pipeline is backing up.&lt;/P&gt;

&lt;P&gt;Hopefully that helps you as well?!&lt;/P&gt;</description>
      <pubDate>Fri, 21 Nov 2014 22:05:25 GMT</pubDate>
      <guid>https://community.splunk.com/t5/Getting-Data-In/Purge-queue-on-forwarder-indexer-down/m-p/104617#M22024</guid>
      <dc:creator>sloshburch</dc:creator>
      <dc:date>2014-11-21T22:05:25Z</dc:date>
    </item>
  </channel>
</rss>

