Thanos Querier container frequently restarts due to failed LivesnessProbe
Issue
- Thanos Querier LivenessProbe is failing:
Type Reason Age From Message
---- ------ ---- ---- -------
Warning BackOff 66m (x3403 over 2d20h) kubelet Back-off restarting failed container
Warning Unhealthy 5m50s (x4243 over 2d21h) kubelet Liveness probe failed: command timed out
Warning Unhealthy 107s (x4198 over 2d21h) kubelet Readiness probe failed: command timed out
- Thanos logs don't show any issues and appear healthy
- After an upgrade, we see failures in thanos pods causing frequent restarts
Environment
- Red Hat OpenShift Container Platform (OCP)
- 4.7
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.