Monitoring Splunk

Indexer Queues are full

ronny_wang
Explorer

Hi,

We've noticed an issue with our upgrade after upgrading Splunk from 7.3.2 to version 8.0.5. 
We're on a cluster environment, with 3 indexers and 3 SHs. We're forcing python 3.7 on all of the Splunk servers.

Since the upgrade, all 3 indexer Indexing queues have been full, as you can see in the screenshot below.

ronny_wang_0-1618963122864.png
There have been no changes to the amount of data we're ingesting since the upgrade, however a few of the apps did need to also be upgraded to be python 3.7 compatible. 

Here is what we've tried:

  • Restarting - Alleviates the queues for a little bit, but inevitably gets blocked
  • Increasing the queue sizes - We've increased the queue sizes from the default to 80mb, and this increased the time until the queues were blocked. Noticeably, one indexer would block first, then the others would get blocked after some more minutes
  • Validated all the permissions and ownership

There's been two things of note that could be related to this issue:

ronny_wang_1-1618963819168.png

This graph shows that the indexer pipe is directly correlating to the FwdDataReceiverThread. Unfortunately, doesn't seem to be much info concerning this thread out there. 
We've noticed that we've been getting the following errors concerning this thread.

  • ERROR Watchdog - No response received from IMonitoredThread=0x7fb47f7feb50 within 8000 ms. Looks like thread name='FwdDataReceiverThread' tid=6894 is busy !? Starting to trace with 8000 ms interval.

There have also been a number of crashlogs since the upgrade on the Indexers. These crashlogs include items like the following:

ronny_wang_2-1618964405316.png

It seems to be related to a particular search, so not sure if this is related to the issue.

 

Does anyone have any ideas about these items?

Labels (2)
0 Karma
Get Updates on the Splunk Community!

Exporting Splunk Apps

Join us on Monday, October 21 at 11 am PT | 2 pm ET!With the app export functionality, app developers and ...

Cisco Use Cases, ITSI Best Practices, and More New Articles from Splunk Lantern

Splunk Lantern is a Splunk customer success center that provides advice from Splunk experts on valuable data ...

Build Your First SPL2 App!

Watch the recording now!.Do you want to SPL™, too? SPL2, Splunk's next-generation data search and preparation ...