Why does rgmanager fail service migration with error '#75: Failed changing service status' during cman membership transition?

Solution Verified - Updated -

Issue

  • When cluster services (rgmanager, cman) are stopped on a node that is running an rgmanager service, a timing issue can cause that service to fail to start on the other node with the message:
        #75: Failed changing service status
  • The surviving node cannot take over the services because it does not receive the acknowledgement from cman/ais on the failed node. This leads to timeouts in view formation, and the service terminates with an error.

    Dec 15 14:49:48 lsphc1e02 clurgmgrd[8702]: <notice> Member 1 shutting down
    Dec 15 14:49:54 lsphc1e02 clurgmgrd[8702]: <notice> Starting stopped service service:lsphc1e-srv
    ...
    Dec 15 14:50:32 lsphc1e02 clurgmgrd[8702]: <err> #75: Failed changing service status
    Dec 15 14:50:32 lsphc1e02 clurgmgrd[8702]: <notice> Stopping service service:lsphc1e-srv
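
To check whether a node's logs show the same sequence (a member shutdown followed by error #75 on the surviving node), a small log scan can help. The following is a minimal sketch in Python, not a Red Hat tool. The line layout is assumed from the excerpt above, and the names `LINE_RE`, `find_failed_takeovers` and `SAMPLE` are hypothetical helpers introduced only for illustration.

```python
import re

# Layout assumed from the syslog excerpt above:
# "<Mon> <day> <HH:MM:SS> <host> clurgmgrd[<pid>]: <<level>> <message>"
LINE_RE = re.compile(
    r"^(?P<ts>\w{3}\s+\d+\s+[\d:]{8})\s+(?P<host>\S+)\s+"
    r"clurgmgrd\[\d+\]:\s+<(?P<level>\w+)>\s+(?P<msg>.*)$"
)


def find_failed_takeovers(lines):
    """Return (shutdown_ts, error_ts, host) for each #75 error
    logged after a "Member N shutting down" notice on the same host."""
    hits = []
    shutdown_ts = {}  # host -> timestamp of the latest member shutdown notice
    for raw in lines:
        m = LINE_RE.match(raw.strip())
        if not m:
            continue  # skip lines that are not clurgmgrd messages
        host, msg, ts = m["host"], m["msg"], m["ts"]
        if re.fullmatch(r"Member \d+ shutting down", msg):
            shutdown_ts[host] = ts
        elif msg.startswith("#75: Failed changing service status") and host in shutdown_ts:
            hits.append((shutdown_ts.pop(host), ts, host))
    return hits


# The excerpt from this article, used as sample input.
SAMPLE = [
    "Dec 15 14:49:48 lsphc1e02 clurgmgrd[8702]: <notice> Member 1 shutting down",
    "Dec 15 14:49:54 lsphc1e02 clurgmgrd[8702]: <notice> Starting stopped service service:lsphc1e-srv",
    "...",
    "Dec 15 14:50:32 lsphc1e02 clurgmgrd[8702]: <err> #75: Failed changing service status",
    "Dec 15 14:50:32 lsphc1e02 clurgmgrd[8702]: <notice> Stopping service service:lsphc1e-srv",
]

if __name__ == "__main__":
    for shutdown, error, host in find_failed_takeovers(SAMPLE):
        print(f"{host}: member shutdown at {shutdown}, error #75 at {error}")
```

On a live system the same function could be fed the lines of the node's syslog file instead of `SAMPLE`; the file location depends on the local syslog configuration.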
    

Environment
