Ceph OSD pods in CLBO state due to PG Dup log issue in ODF Environment

Solution Verified - Updated -

Issue

  • One or more of:
    • Ceph OSD Pods in restart loop with high memory utilization
    • OSD pods getting OOMKilled and in CLBO state
    • OSD logs have tcmalloc: large alloc entries
    • Long running peering events during OSD state change
    • OSD flapping during peering

Environment

  • Red Hat Openshift Data foundation 4.9 and above
  • Red Hat Ceph Storage 5.0 (all versions)
  • Red Hat Ceph Storage 5.1 (before 5.1z3)

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In

New to Red Hat?

Learn more about Red Hat subscriptions

Using a Red Hat product through a public cloud?

How to access this content