Thanos Querier container frequently restarts due to failed LivesnessProbe

Solution In Progress - Updated -

Issue

  • Thanos Querier LivenessProbe is failing:
  Type     Reason     Age                       From     Message
  ----     ------     ----                      ----     -------
  Warning  BackOff    66m (x3403 over 2d20h)    kubelet  Back-off restarting failed container
  Warning  Unhealthy  5m50s (x4243 over 2d21h)  kubelet  Liveness probe failed: command timed out
  Warning  Unhealthy  107s (x4198 over 2d21h)   kubelet  Readiness probe failed: command timed out
  • Thanos logs don't show any issues and appear healthy
  • After an upgrade, we see failures in thanos pods causing frequent restarts

Environment

  • Red Hat OpenShift Container Platform (OCP)
    • 4.7

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In

New to Red Hat?

Learn more about Red Hat subscriptions

Using a Red Hat product through a public cloud?

How to access this content