Ceph OSD pods in CLBO state due to PG Dup log issue in ODF Environment
Issue
- One or more of the following symptoms:
- Ceph OSD pods in a restart loop with high memory utilization
- OSD pods getting OOMKilled and entering the CrashLoopBackOff (CLBO) state
- OSD logs contain "tcmalloc: large alloc" entries
- Long-running peering events during OSD state changes
- OSD flapping during peering
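To check for the "tcmalloc: large alloc" symptom listed above, a saved OSD log can be scanned for matching lines. The following is a minimal sketch, not a supported tool: the function name find_large_allocs is hypothetical, and the assumption that the message is followed by a byte count ("... large alloc <N> bytes") may not hold for every build, so the size is treated as optional.

```python
import re

# Assumed shape of the warning line; the byte count is parsed only when present.
LARGE_ALLOC = re.compile(r"tcmalloc: large alloc(?:\s+(\d+)\s+bytes)?")


def find_large_allocs(log_text):
    """Return (line_number, size_in_bytes_or_None) for each matching log line."""
    hits = []
    for number, line in enumerate(log_text.splitlines(), start=1):
        match = LARGE_ALLOC.search(line)
        if match:
            size = int(match.group(1)) if match.group(1) else None
            hits.append((number, size))
    return hits
```

The function takes the log contents as a string, so it can be fed from a file that was previously saved from the OSD pod's log output. Any hit only confirms this one symptom; it does not by itself establish the root cause.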
Environment
- Red Hat OpenShift Data Foundation 4.9 and above
- Red Hat Ceph Storage 5.0 (all versions)
- Red Hat Ceph Storage 5.1 (before 5.1z3)