Monitoring operator degraded because Alertmanager and ThanosRuler cannot reconcile in RHOCP 4

Solution Verified

Issue

  • The monitoring cluster operator reports a degraded state:

    $ oc get co monitoring
    NAME         VERSION   AVAILABLE   PROGRESSING   DEGRADED   SINCE
    monitoring   4.14.12   False       True          True       3d2h
    
  • The monitoring cluster operator reports the following status messages:

    $ oc get co monitoring -o json | jq .status.conditions
    {
      "lastTransitionTime": "2024-03-01T08:04:07Z",
      "message": "waiting for Alertmanager object changes failed: waiting for Alertmanager openshift-monitoring/main: context deadline exceeded, waiting for ThanosRuler object changes failed: waiting for Thanos Ruler openshift-user-workload-monitoring/user-workload: context deadline exceeded",
      "reason": "MultipleTasksFailed",
      "status": "False",
      "type": "Available"
    },
    {
      "lastTransitionTime": "2024-03-01T08:04:07Z",
      "message": "waiting for Alertmanager object changes failed: waiting for Alertmanager openshift-monitoring/main: context deadline exceeded, waiting for ThanosRuler object changes failed: waiting for Thanos Ruler openshift-user-workload-monitoring/user-workload: context deadline exceeded",
      "reason": "MultipleTasksFailed",
      "status": "True",
      "type": "Degraded"
    }
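
    The "message" field joins every failed task into a single comma-separated string, which is hard to scan. As a minimal sketch, the shell helper below prints each failed task on its own line. The function name split_failed_tasks is hypothetical and not part of oc. It assumes that the individual task messages are joined by ", " and that each one begins with "waiting for", as in the output above.

```shell
# Hypothetical helper (not part of oc): read a condition message on stdin
# and print one failed task per line.
# Assumption: task messages are joined by ", " and each starts with "waiting for".
split_failed_tasks() {
  awk -F', waiting for ' '{
    # The first field already carries its "waiting for" prefix.
    print $1
    # Later fields lost the prefix to the field separator, so restore it.
    for (i = 2; i <= NF; i++) print "waiting for " $i
  }'
}
```

    One possible way to use it is to feed it the message of the Degraded condition:

    $ oc get co monitoring -o jsonpath='{.status.conditions[?(@.type=="Degraded")].message}' | split_failed_tasks

    For the status shown above, this would list the Alertmanager task and the ThanosRuler task on separate lines.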
    

Environment

  • Red Hat OpenShift Container Platform (RHOCP)
    • 4
  • Portworx by Pure Storage
