clurgmgrd (rgmanager) segfaults and causes nodes to reboot when using restricted failover domains in RHEL 5 Update 3 and earlier
Issue
- On one node rgmanager was stopped. The other four nodes get a segfault and the watchdog reboots them
Jan 25 22:52:18 node1 kernel: clurgmgrd[3108]: segfault at 0000000000000000 rip 000000000040a944 rsp 00007fff8b467910 error 4
- When using
rgmanager
with a restricted failover domain where some of the nodes of the domain are offline during a failover event,rgmanager
can crash
Environment
- Red Hat Enterprise Linux (RHEL) 5 Update 3 or earlier
rgmanager
prior to release2.0.52-1.el5
- One or more
failoverdomain
s withrestricted="1"
in/etc/cluster/cluster.conf
- A service failover/relocation event when one or more nodes in the
restricted failoverdomain
is not a member
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.