Thanos querier pods gets OOMKilled when loading the API performance dashboard with large time ranges in RHOCP 4
Issue
- When using the "API Performance" dashboard under "Observe --> Dashboard" and trying to query the metrics for more than 1 week, the
thanos-querierpod getsOOMKilledor the dashboard returnsError Loading AlertGateway Time-out. -
The
thanos-querierpod shows the container as terminated withreason: OOMKilled:[...] lastState: terminated: containerID: cri-o://XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX exitCode: 137 [...] reason: OOMKilled [...] -
The dashboard shows a warning similar to the below images:


Environment
- Red Hat OpenShift Container Platform (RHOCP)
- 4
- Thanos
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.