We have a distributed architecture
Search head cluster with 6 hosts across 3 data centres
Index cluster with 6 index peers and 1 index master
Forwarders on all servers in environment - web tier, app tier, load balancer tier
Few months back , web tier stopped sending - log stopped coming to splunk ; but other tiers are are working
When checked the activity on web-tier , there was a patching happened and splunkd was restarted -after that forwarding stopped in web-tier
But splunkd process came up fine - still running in those
And observed below WARN messages started coming exactly same time
[ See the highlighted in red starting from 10 seconds it grows ]
WARN TcpOutputProc - Tcpout Processor: The TCP output processor has paused the data flow. Forwarding to output group index_peers has been blocked for 10 seconds. This will probably stall the data flow towards indexing and other network outputs. Review the receiving system's health in the Splunk Monitoring Console. It is probably not accepting data.
----------
------
+0000 WARN TcpOutputProc - Tcpout Processor: The TCP output processor has paused the data flow. Forwarding to output group index-peers has been blocked for 9725460 seconds. This will probably stall the data flow towards indexing and other network outputs. Review the receiving system's health in the Splunk Monitoring Console. It is probably not accepting data. +0000 WARN TcpOutputProc - Tcpout Processor: The TCP output processor has paused the data flow. Forwarding to output group index-peers has been blocked for 9725470 seconds. This will probably stall the data flow towards indexing and other network outputs. Review the receiving system's health in the Splunk Monitoring Console. It is probably not accepting data.
=============================================================================
Why we picked this WARN message may be cause - as same happened in other tier recently
load lancer tier stopped stopped forwarding recently. Above WARN started showing same time onwards - starting with "blocked for 10 seconds "
splunk forwarder is running fine in all these
App tier still working -sending data , so indexers are fine
not disk space or memory issue in any of these
No config changes done any where ( inputs or outputs conf or any file that matter) -its same , just that stopped working suddenly
What could have caused this sudden stopping of forwarding ?
Splunk Enterprise
Version:7.2.1Build:be11b2c46e23
... View more