We are using OpenShift version 4.13.24 and it is actually on the ROSA AWS managed solution. I've been looking at some metrics for the splunk-otel-collector-agent pods that we have running, and in particular we review kubernetes metrics with Dynatrace. The alerts I am seeing are "High CPU Throttling" which basically translates into the CPU Throttling metric being nearly at the same level, or at the same level, as the CPU Usage metric.
The pods are configured for Splunk Platform
For these pods, I reviewed the YAML for the running instance and we include the following configuration:
- resources: limits: cpu: 200m memory: 500Mi requests: cpu: 200m memory: 500Mi |
As a workaround I was thinking to increase the cpu value under requests (and limits), however I haven't tried this yet. Has anyone else observed high CPU throttling issues? Thank you.