clurgmgrd (rgmanager) segfaults and causes nodes to reboot when using restricted failover domains in RHEL 5 Update 3 and earlier
Issue
- On one node rgmanager was stopped. The other four nodes get a segfault and the watchdog reboots them
Jan 25 22:52:18 node1 kernel: clurgmgrd[3108]: segfault at 0000000000000000 rip 000000000040a944 rsp 00007fff8b467910 error 4
- When using
rgmanagerwith a restricted failover domain where some of the nodes of the domain are offline during a failover event,rgmanagercan crash
Environment
- Red Hat Enterprise Linux (RHEL) 5 Update 3 or earlier
rgmanagerprior to release2.0.52-1.el5- One or more
failoverdomains withrestricted="1"in/etc/cluster/cluster.conf - A service failover/relocation event when one or more nodes in the
restricted failoverdomainis not a member
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase of over 48,000 articles and solutions.
Welcome! Check out the Getting Started with Red Hat page for quick tours and guides for common tasks.
