Splunk User Behavior Analytics

UBA caspida-realtimeruleexec Service Crash with OutOfMemoryError on 7-node Deployment

aldo
New Member

Hi Splunk Community,

I'm currently running a 7-node UBA deployment and encountering persistent errors with the `caspida-realtimeruleexec` service.

Screenshot 2025-08-29 112423.png

The service fails to respond in the UBA Health Monitor UI, and upon checking the logs at:

/var/vcap/sys/log/caspida/ruleengine/realtimeruleexecutor.log :
Screenshot 2025-08-29 113509.png

I see the following error:

Screenshot 2025-08-29 113324.png

java.lang.OutOfMemoryError: Java heap space

When inspecting the JVM options in the `/opt/caspida/bin/CaspidaCommonEnv.sh` file, I found that the heap size was set as:
-Xmx4M

This seems too low for a rule executor component. My node has around 64 GB of RAM and ~34 GB free memory available (`free -m` confirms this). I am planning to increase the heap size to `-Xmx8192m`.

### My Questions:

1. Is this the correct cause of the crash?
2. Is it safe and recommended to increase the heap size to 8 GB or more for `caspida-realtimeruleexec` in a 7-node cluster?
3. Are there any best practices or official docs on tuning JVM heap for this component?
4. Should I also consider increasing other services' heap sizes?

Thanks in advance for your help!

Labels (2)
0 Karma

livehybrid
SplunkTrust
SplunkTrust

Hi @aldo 

This is a slightly tricky one because the documentation is so light on the subject! However my take on this is that, yes, you should be okay to increase the heap size across the deployment for this service *however* what we dont know is how the other services will behave once this service is back up and running. e.g. just because there is 30gb RAM available now, when everything starts working properly and other services start working together the RAM Usage could increase (if that makes sense).

I'd be tempted to increase is gradually until you see the issue resolve and then closely monitor the usage moving forwards. I dont know if you've seen https://help.splunk.com/en/security-offerings/splunk-user-behavior-analytics/administer/5.4.1/monito... but there is some good info here on monitoring your UBA deployment so that might help once things have stabilised.

🌟 Did this answer help you? If so, please consider:

  • Adding karma to show it was useful
  • Marking it as the solution if it resolved your issue
  • Commenting if you need any clarification

Your feedback encourages the volunteers in this community to continue contributing

0 Karma
Career Survey
First 500 qualified respondents will receive a $20 gift card! Tell us about your professional Splunk journey.
Get Updates on the Splunk Community!

Introduction to Splunk AI

How are you using AI in Splunk? Whether you see AI as a threat or opportunity, AI is here to stay. Lucky for ...

Splunk + ThousandEyes: Correlate frontend, app, and network data to troubleshoot ...

Are you tired of troubleshooting delays caused by siloed frontend, application, and network data? We've got a ...

Maximizing the Value of Splunk ES 8.x

Splunk Enterprise Security (ES) continues to be a leader in the Gartner Magic Quadrant, reflecting its pivotal ...