Cluster services become blocked in a recovering state when multiple nodes start at once and the initial start attempt fails in RHEL 5
Issue
- When nodes are booting and starting rgmanager, if the service fails to start and needs to be recovered on another node, clusvcadm commands may hang.
- Service gets stuck in 'recovering' state when being started
Environment
- Red Hat Enterprise Linux (RHEL) 5 or 6 with the High Availability Add On
- RHEL 5: rgmanager prior to release 2.0.52-29.el5_8.5 or 2.0.52-37.el5
- RHEL 6: rgmanager prior to release 3.0.12.1-17.el6
- One or more services in /etc/cluster/cluster.conf
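
To check whether a node falls into the environment described here, the installed rgmanager release and the current service states can be inspected from a shell. The following is a minimal sketch, not an official diagnostic: the function name show_rgmanager_state is hypothetical, and it assumes the standard rpm and clustat commands are available on the node. It only prints information; comparing the reported release against those listed above is left to the reader.

```shell
#!/bin/sh
# Hypothetical helper: prints the installed rgmanager release and the
# cluster service states. It never changes cluster state.
show_rgmanager_state() {
    # Installed rgmanager package; compare by hand with the affected releases.
    if command -v rpm >/dev/null 2>&1; then
        rpm -q rgmanager || echo "rgmanager does not appear to be installed"
    else
        echo "rpm is not available on this host"
    fi

    # Service states; look for services reported as 'recovering'.
    if command -v clustat >/dev/null 2>&1; then
        clustat || echo "clustat could not query the cluster"
    else
        echo "clustat is not available on this host"
    fi
}

show_rgmanager_state
```

A service that remains in the 'recovering' state in the clustat output after all nodes have finished booting matches the symptom described in the Issue section.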