CephFS MDSs are in CLBO due to rook-ceph-mgr-a being Unable to Perform Delete Ops - OpenShift Data Foundation (ODF)

Solution Verified - Updated 2024-05-17T14:40:46+00:00

Issue

Both MDS pods enter an Error/CrashLoopBackOff state after rook-ceph-mgr-a fails to perform a delete operation. The most likely trigger is a database workload.

NAME                                                             READY  STATUS   RESTARTS  AGE
rook-ceph-mds-ocs-storagecluster-cephfilesystem-a-9b68f854zbw9z  1/2    Running  5         5m30s
rook-ceph-mds-ocs-storagecluster-cephfilesystem-b-c7c87476b7gq6  1/2    Running  5         5m30s

    name: mds
    ready: false
    restartCount: 4
    started: false
    state:
      waiting:
        message: back-off 1m20s restarting failed container=mds pod=rook-ceph-mds-ocs-storagecluster-cephfilesystem-a-9b68f854zbw9z_openshift-storage(10c20d44-014b-4b62-b9e2-83df4a6d2177)
        reason: CrashLoopBackOff
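
The pod listing above shows STATUS Running, so the CrashLoopBackOff is only visible in the per-container status. As a minimal sketch, the check below scans a pod object for containers waiting in CrashLoopBackOff. It assumes the pod was exported as JSON (for example with `oc get pod <pod-name> -n openshift-storage -o json`); the function name `crashlooping_containers`, the `sample_pod` data, and the second container named `sidecar` are hypothetical and exist only for illustration.

```python
from __future__ import annotations


def crashlooping_containers(pod: dict) -> list[tuple[str, int, str]]:
    """Return (name, restart count, message) for each container
    whose current state is waiting with reason CrashLoopBackOff."""
    found = []
    for status in pod.get("status", {}).get("containerStatuses", []):
        # "waiting" is absent (or null) when the container is running.
        waiting = status.get("state", {}).get("waiting") or {}
        if waiting.get("reason") == "CrashLoopBackOff":
            found.append(
                (
                    status.get("name", ""),
                    status.get("restartCount", 0),
                    waiting.get("message", ""),
                )
            )
    return found


# Hypothetical sample modeled on the status excerpt above.
# The "sidecar" entry is an assumption standing in for the pod's
# second, healthy container (the pod reports 1/2 ready).
sample_pod = {
    "status": {
        "containerStatuses": [
            {
                "name": "mds",
                "ready": False,
                "restartCount": 4,
                "started": False,
                "state": {
                    "waiting": {
                        "message": "back-off 1m20s restarting failed container=mds",
                        "reason": "CrashLoopBackOff",
                    }
                },
            },
            {
                "name": "sidecar",
                "ready": True,
                "restartCount": 0,
                "started": True,
                "state": {"running": {}},
            },
        ]
    }
}

for name, restarts, message in crashlooping_containers(sample_pod):
    print(f"{name}: restarts={restarts} ({message})")
```

Running the same function against the JSON of each MDS pod confirms whether the `mds` container, rather than another container in the pod, is the one restarting.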

Environment

Red Hat OpenShift Data Foundation (RHODF) v4.x
Red Hat OpenShift Container Storage (RHOCS) v4.x
