We have a stand-alone splunk instance in a closed area. We had to roll back the server to a snapshot and now the clients only phone home when we restart the splunk server. I've looked at the splunk log, phonehome log, checked the outputs.conf. I've run telnet server:8089 and 9997 from the clients and the ports are open listening. Any help would be appreciated. We are on version 9.3.1
What do you mean by "clients phoning home only when you restart the DS"? How did you determine this? The clients phone home on schedule - it's asynchronous versus whatever the DS is doing.
How did you determine this? - This is what the Forwarder Management Web UI shows us, client phone home time stamp coincides with the restart.
OK. So this is not (or at least might not be) about the phonehomes as such but on the info shown in the DS console.
I'd go for
1) Verifying on selected forwarders that the phonehomes are shown in the splunkd.log
2) Checking the logs on the DS itself to see if it can see the phonehomes.
3) Checking if you have the selective routing properly configured on the DS. https://help.splunk.com/en/splunk-enterprise/administer/manage-distributed-deployments/9.2/configure... (it's not about upgraded instances only; we had this issue lately on a new installation of 9.3.something).
Still no success after attempting all the steps below. Checked splunkd log on a few fowarders as well as the Deployment server and neither indicated connection errors. One question I have is in regards to indexes. From the webui i see the _dsphonehome, _dsappevent, _dsclient, but I don't see those indexes in the indexes.conf file on the deployment server. Another note is I found this and wondering if this could help? Our Splunk instance is at version 9.3.1.