Temporary loss of quorum when a node starts to rejoin in RHEL8

Solution Verified - Updated -

Issue

  • A two-node cluster with an additional quorum-only node. When the active node is gracefully rebooted, everything initially works as expected: resources migrate to the other node, start, and function normally. However, when the rebooted node starts to come back up, the other node appears to lose quorum temporarily, even though it still has communication with the quorum node. This causes the resources to stop until quorum is reestablished.

  • It happens with knet + ffsplit, only in a two-node cluster, and only when the node with the lowest node ID is restarted.

  • It happens with knet + ffsplit, only in a two-node cluster, regardless of which node is rebooted or panics.
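
The vote arithmetic behind the symptom above can be sketched as follows. This is a minimal illustration, not corosync code: it assumes each cluster node carries one vote and the quorum device contributes one vote, and the function names are hypothetical. It shows only why the surviving node is expected to stay quorate while it can reach the quorum node, and why it would stop being quorate if the device vote were not counted for a moment. It does not claim to show the root cause.

```python
def quorum_threshold(expected_votes: int) -> int:
    """Votes needed for quorum: a strict majority of the expected votes."""
    return expected_votes // 2 + 1


def is_quorate(votes_present: int, expected_votes: int) -> bool:
    """True when the votes currently visible reach the majority threshold."""
    return votes_present >= quorum_threshold(expected_votes)


# Assumed layout (not taken from the article): two cluster nodes with one
# vote each, plus one vote contributed by the quorum device.
NODE_VOTES = 1
QDEVICE_VOTES = 1
EXPECTED_VOTES = 2 * NODE_VOTES + QDEVICE_VOTES

# Surviving node alone, with the quorum device vote counted.
with_qdevice = is_quorate(NODE_VOTES + QDEVICE_VOTES, EXPECTED_VOTES)

# Surviving node alone, if the quorum device vote is not counted.
without_qdevice = is_quorate(NODE_VOTES, EXPECTED_VOTES)

print("threshold:", quorum_threshold(EXPECTED_VOTES))
print("quorate with device vote:", with_qdevice)
print("quorate without device vote:", without_qdevice)
```

With these assumed values, the surviving node holds its own vote plus the device vote, which meets the majority threshold, so a loss of quorum while the quorum node is still reachable is unexpected.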

Environment

  • Red Hat Enterprise Linux 8
  • corosync-qdevice
