Getting Data In

How to troubleshoot why my intermediate forwarder is not working, causing 600 universal forwarders to not send data to indexers?

hartfoml
Motivator

I have a ticket in with support but this may be faster.

My intermediate forwarder is not working right. When I restart it, everything works for a few minutes then stops working. I have checked everything that I know to help.

Please help with suggestions. 600 systems are down!!!

1 Solution

hartfoml
Motivator

Splunk Support was relatively quick to respond. Rajpal Bal got on the line and at my request she quickly setup a webex. we looked at SOS and could see that the tcpout on the Intermediate forwarder (IF) was full and the tcpin for the indexers was very low. on Thursday there was a mix up in DNS but this did not affect the IF until Splunk was restarted yesterday. Rajpal suggested and helped me to add the connection_host entry below to the inputs.conf to force Splunk to use IP and not look-up DNS names. we did this on both the IF and the indexers. it did not immediately resolve the issues but over a few hours the IF started its normal behavior and we can fix DNS on Monday.

Thanks Rajpal, for fast, appropriate and extra effort in staying beyond work hours to solve this tricky problem

in the inputs.conf that has the "splunktcp" stanza the "connection_host = ip" for app ports like below

[splunktcp:://]
connection_host = ip

View solution in original post

hartfoml
Motivator

Splunk Support was relatively quick to respond. Rajpal Bal got on the line and at my request she quickly setup a webex. we looked at SOS and could see that the tcpout on the Intermediate forwarder (IF) was full and the tcpin for the indexers was very low. on Thursday there was a mix up in DNS but this did not affect the IF until Splunk was restarted yesterday. Rajpal suggested and helped me to add the connection_host entry below to the inputs.conf to force Splunk to use IP and not look-up DNS names. we did this on both the IF and the indexers. it did not immediately resolve the issues but over a few hours the IF started its normal behavior and we can fix DNS on Monday.

Thanks Rajpal, for fast, appropriate and extra effort in staying beyond work hours to solve this tricky problem

in the inputs.conf that has the "splunktcp" stanza the "connection_host = ip" for app ports like below

[splunktcp:://]
connection_host = ip

MuS
Legend

follow @martin_mueller advice and check the servers ulimit settings; usually if something works for a few minutes and then stops on *nix systems, indicates ulimit being too low.

0 Karma

martin_mueller
SplunkTrust
SplunkTrust

Usually there would be some indication of what's wrong in the IF's internal logs, especially splunkd.log.

0 Karma
Career Survey
First 500 qualified respondents will receive a $20 gift card! Tell us about your professional Splunk journey.

Can’t make it to .conf25? Join us online!

Get Updates on the Splunk Community!

Take Action Automatically on Splunk Alerts with Red Hat Ansible Automation Platform

 Are you ready to revolutionize your IT operations? As digital transformation accelerates, the demand for ...

Calling All Security Pros: Ready to Race Through Boston?

Hey Splunkers, .conf25 is heading to Boston and we’re kicking things off with something bold, competitive, and ...

Beyond Detection: How Splunk and Cisco Integrated Security Platforms Transform ...

Financial services organizations face an impossible equation: maintain 99.9% uptime for mission-critical ...