Solved: Linear memory growth with Splunk 9.4.0 and above

hrawat · ‎02-25-2025

Linear memory growth occurs on any Splunk instance configured to receive data on splunktcpin, tcpin, or udpin ports.

The following config in server.conf fixes the memory growth.

[prometheus]
disabled = true

hrawat · ‎02-25-2025

Although it was not documented, in etc/system/default/server.conf of 9.3.x/9.2.x/9.1.x you will find

[prometheus]
disabled = true

It was added to server.conf to prevent unwanted memory growth caused by prometheus. The stanza was unintentionally removed in 9.4.0, so you need to restore it.
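
To check whether a given server.conf already carries the stanza, something like the following could help. It is only a minimal sketch: the function name `prometheus_disabled` and the sample text are made up for illustration, it looks at a single file rather than Splunk's merged configuration, and it assumes that `true` or `1` counts as enabled for the `disabled` key.

```python
def prometheus_disabled(conf_text: str) -> bool:
    """Return True if conf_text has a [prometheus] stanza with disabled = true.

    Hypothetical helper: parses one file's text only and does not
    reproduce Splunk's configuration layering.
    """
    stanza = None
    for raw in conf_text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        if line.startswith("[") and line.endswith("]"):
            stanza = line[1:-1].strip()  # entering a new stanza
            continue
        if stanza == "prometheus" and "=" in line:
            key, value = (part.strip() for part in line.split("=", 1))
            if key == "disabled":
                # Assumption: "true" or "1" means the feature is disabled.
                return value.lower() in ("true", "1")
    return False


# Illustrative sample, not taken from a real instance.
sample = "[general]\nserverName = hf01\n\n[prometheus]\ndisabled = true\n"
print(prometheus_disabled(sample))  # True
```

For the effective, merged view on a real instance, `splunk btool server list prometheus --debug` is the more reliable check, since it also shows which file each setting comes from.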


jotne · ‎02-28-2025

Can confirm that it fixed the memory leak that we saw on our upgraded HF server.

This is not fixed in the newly released 9.4.1.

But what does prometheus do in Splunk? Is it some new function that was added to the 9.x server and was set to disabled? I cannot find any info in the server.conf docs.

hrawat · ‎02-28-2025

It's support for https://prometheus.io/ that was added almost 4 years ago (8.2.0). It has been disabled since 8.2.1 due to memory explosion.

jotne · ‎02-28-2025

4 years?????

And nothing has been done to fix it. This part should then be removed from the code.

On one of our HFs that was upgraded from 9.3.2 to 9.4, memory grows until it is all used up; the instance then runs for a few more hours and dies. We reported this issue just a few days after 9.4.0 was released, and only got the fix just now.

gjanders · ‎02-25-2025

I noticed that https://docs.splunk.com/Documentation/Splunk/9.4.0/Admin/Serverconf does not mention prometheus.

Is this an undocumented feature that is getting disabled to prevent a memory leak issue?



AndyM · ‎05-07-2025

Is this issue likely to be fixed in an upcoming version release?

hrawat · ‎05-07-2025

It's not fixed in the upcoming releases.
However, the fix (whenever it becomes part of a release) will be the same as the workaround.

[prometheus]
disabled = true

jotne · ‎05-07-2025

I do not see any fix for this in the just-released 9.4.2, which came out months after the issue was discovered in 9.4.0.

There is a setting for it in the next Splunk beta, so maybe the fix will come in 9.4.3.

It is also strange that this setting is not mentioned in the latest documentation:
https://docs.splunk.com/Documentation/Splunk/latest/Admin/Serverconf

[prometheus]
disabled = true
