Ceph MON processes crash or lose quorum during data rebalance in RHCS 4

Solution Verified - Updated -

Issue

During data rebalancing ( recovery or backfill ), Ceph MON processes create larger and larger memory allocation requests, potentially crashing with 'segfault' if the Kernel cannot honor the memory allocation request.

Environment

Red Hat Ceph Storage 4.x during data rebalancing.
Problem is more prominent with high OSD counts.

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In

New to Red Hat?

Learn more about Red Hat subscriptions

Using a Red Hat product through a public cloud?

How to access this content