clurgmgrd (rgmanager) segfaults and causes nodes to reboot when using restricted failover domains in RHEL 5 Update 3 and earlier
Issue
- On one node rgmanager was stopped. The other four nodes get a segfault and the watchdog reboots them
Jan 25 22:52:18 node1 kernel: clurgmgrd[3108]: segfault at 0000000000000000 rip 000000000040a944 rsp 00007fff8b467910 error 4
- When using
rgmanagerwith a restricted failover domain where some of the nodes of the domain are offline during a failover event,rgmanagercan crash
Environment
- Red Hat Enterprise Linux (RHEL) 5 Update 3 or earlier
rgmanagerprior to release2.0.52-1.el5- One or more
failoverdomains withrestricted="1"in/etc/cluster/cluster.conf - A service failover/relocation event when one or more nodes in the
restricted failoverdomainis not a member
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.