Split-Brain occurs on two-node pacmeaker cluster while stonith is disabled and token loss occurs
Issue
-
In a "two-node" cluster, we encounter an issue where both nodes show the other offline, and both servers reflect active resources:
- Node 01:
[root@rhel8-node1 ~]# pcs status Cluster name: rhel8-cluster Cluster Summary: -----------------------------------------8<----------------------------------------- Node List: * Online: [ rhel8-node1 ] * OFFLINE: [ rhel8-node2 ] <--- Node1 sees node2 as offline Full List of Resources: -----------------------------------------8<----------------------------------------- * postgresql (ocf::heartbeat:pgsql): Started rhel8-node1 <--- all resources active on each * xvmfence (stonith:fence_xvm): Started rhel8-node1 individual "cluster instance" * temp1 (ocf::heartbeat:Dummy): Started rhel8-node1 * temp2 (ocf::heartbeat:Dummy): Started rhel8-node1 * temp3 (ocf::heartbeat:Dummy): Started rhel8-node1- Node 02:
[root@rhel8-node2 ~]# pcs status Cluster name: rhel8-cluster Cluster Summary: -----------------------------------------8<----------------------------------------- Node List: * Online: [ rhel8-node2 ] * OFFLINE: [ rhel8-node1 ] <--- Node2 sees node1 as offline Full List of Resources: -----------------------------------------8<----------------------------------------- * postgresql (ocf::heartbeat:pgsql): Started rhel8-node2 * xvmfence (stonith:fence_xvm): Stopped <--- because network down * temp1 (ocf::heartbeat:Dummy): Started rhel8-node2 * temp2 (ocf::heartbeat:Dummy): Started rhel8-node2 * temp3 (ocf::heartbeat:Dummy): Started rhel8-node2
Environment
- Red Hat Enterprise Linux Server 7 and higher (with the High Availability Add On and Resilient Storage Add Ons)
- Pacemaker Cluster
- Two Node Cluster
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.