Getting Data In

INFO entries in the metrics.log indicate some problem with installing a deployment app on a forwarder. Any idea what it is?

wrangler2x
Motivator

In /opt/splunk/var/log/splunk/metrics.log I am seeing this type of log entry for one forwarder:

06-24-2014 13:59:32.428 -0700 INFO  DeploymentMetrics - ip=nnn.nnn.nnn.nnn, dns=systemname.dom.uci.edu, hostname=dc2-finaid, mgmt=8089, build=110225, name=deploymentClient, id=connection_nnn.nnn.nnn.nnn_8089_systemname.dom.uci.edu_dc2-finaid_deploymentClient, utsname=windows-x64 scName=XYZ_OUTPUT_9998, appName=XYZ_OUTPUT_9998, fqname=C:\Program Files\Splunk\etc\apps\XYZ_OUTPUT_9998, event=install, status=failed, reason=Failed to install app : C:\Program Files\Splunk\etc\apps\XYZ_OUTPUT_9998. Cannot update application info: /nobody/XYZ_OUTPUT_9998/app/install/state = enabled: Metadata could not be written: /nobody/XYZ_OUTPUT_9998/app/install/state: {  }, removable: yes

There are two deployment apps going to the system "systemname" and only one is generating this INFO item. XYZ_OUTPUT_9998 is a global app that goes to all our forwarders through the deployment server, and this is the only one generating this INFO item in our logs. The forwarder is a heavy forwarder on a Windows domain controller. I don't have admin access to it. Looking for ideas to share with the admins who installed the forwarder.

0 Karma

grijhwani
Motivator

The biggest clue is "Metadata could not be written". To me that suggests that there is a permission problem, the Splunk user is over quota on the system disk, or the disk partition is full. Most probably the failing app deployment could not create its directory tree to write into, whereas the working one already has.

You say there are two app deployments, only one of which is failing. My question would be is the first already present as part of the standard system build, or was it already deployed earlier? Either of these might explain why one app is deployable (because it already has been, and hence there is no write-permission/disk capacity issue), but a new one isn't.

0 Karma

lguinn2
Legend

My first guess would be "check permissions." Specifically, does the Splunk user (that is running the services on the forwarder) have the ability to write to the folders that contain the app?

Second thought: what happens if you manually install this app on a test server? An app must have a certain directory structure and the metadata files must also exist. If the structure or file contents are wrong, the install could fail. If this is the problem, you should see the error even on a manual install.

Finally, a possible fix: on a test server, create a new app with the same name, using the Splunk GUI. Copy in the configuration files from the original app, except for the metadata files. Restart Splunk on the test server, then examine the app's configuration using the Splunk GUI. Modify as needed. Replace the old app with the new one on the deployment server.

0 Karma
Got questions? Get answers!

Join the Splunk Community Slack to learn, troubleshoot, and make connections with fellow Splunk practitioners in real time!

Meet up IRL or virtually!

Join Splunk User Groups to connect and learn in-person by region or remotely by topic or industry.

Get Updates on the Splunk Community!

Network to App: Observability Unlocked [May & June Series]

In today’s digital landscape, your environment is no longer confined to the data center. It spans complex ...

SPL2 Deep Dives, AppDynamics Integrations, SAML Made Simple and Much More on Splunk ...

Splunk Lantern is Splunk’s customer success center that provides practical guidance from Splunk experts on key ...

[Puzzles] Solve, Learn, Repeat: Matching cron expressions

This puzzle (first published here) is based on matching timestamps to cron expressions.All the timestamps ...