After corosync segfaults, qdiskd evicts other cluster nodes on Red Hat Enterprise Linux 6

Solution Unverified - Updated -

Issue

  • The following issue was occurred on our 3 node cluster suddenly:
  1. Added a new service, and started this service from Luci.
  2. After starting the service, node1 and node3 were rebooted suddenly.
  3. We found that corosync suffered a SIGABRT and coredumped on node2

Environment

  • Red Hat Enterprise Linux Server 6 (with the High Availability and Resilient Storage Add Ons)
  • Red Hat High Availability Cluster with 2 or more nodes
  • Quorum disk configured
  • Issue is triggered by a failure of a cluster process (observed when corosync segfaulted)

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In

New to Red Hat?

Learn more about Red Hat subscriptions

Using a Red Hat product through a public cloud?

How to access this content