clogd repeats the error "Sync checkpoint section create retry" and LVM commands block in a cluster when a node activates a mirrored volume in RHEL 5

Solution Unverified - Updated -

Issue

  • After a network outage, only one node of a two-node cluster can stay up. When the second node comes up, joins the cluster, and starts clvmd, the whole cluster hangs and no services remain available. The cluster is currently running on one node only, with cluster services stopped on the other.
  • When a node joins the cluster and starts clvmd, clvmd hangs. LVM commands block, and the other node logs "Sync checkpoint section create retry" messages over and over:
Nov 14 11:38:28 node1 kernel: dlm: connecting to 1
Nov 14 11:38:28 node1 kernel: dlm: got connection from 1
Nov 14 11:38:34 node1 clogd[6632]: Sync checkpoint section create retry
Nov 14 11:38:34 node1 clogd[6632]: Sync checkpoint section create retry
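
To confirm that a node is showing this symptom, the repeated clogd message can be counted per host in a syslog excerpt. The following is a minimal sketch, not part of the original solution: the function name count_retries, the SAMPLE data, and the assumed syslog line layout (timestamp, host, clogd[pid], message) are illustrative and taken only from the log lines shown above.

```python
import re
from collections import Counter

# Assumed line layout, based on the excerpt above:
#   Nov 14 11:38:34 node1 clogd[6632]: Sync checkpoint section create retry
RETRY_RE = re.compile(
    r"^\w{3}\s+\d+\s+\d{2}:\d{2}:\d{2}\s+(?P<host>\S+)\s+clogd\[\d+\]:\s+"
    r"Sync checkpoint section create retry\s*$"
)

# Sample input copied from the log excerpt in this article.
SAMPLE = [
    "Nov 14 11:38:28 node1 kernel: dlm: connecting to 1",
    "Nov 14 11:38:28 node1 kernel: dlm: got connection from 1",
    "Nov 14 11:38:34 node1 clogd[6632]: Sync checkpoint section create retry",
    "Nov 14 11:38:34 node1 clogd[6632]: Sync checkpoint section create retry",
]


def count_retries(lines):
    """Return a Counter mapping host name to the number of clogd retry messages."""
    counts = Counter()
    for line in lines:
        match = RETRY_RE.match(line)
        if match:
            counts[match.group("host")] += 1
    return counts


if __name__ == "__main__":
    # Prints: Counter({'node1': 2})
    print(count_retries(SAMPLE))
```

To check a live system, the same function could be fed the lines of the system log file (commonly /var/log/messages on RHEL 5, which is an assumption here); a count that keeps growing while LVM commands are blocked matches the behavior described in this issue.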

Environment

  • Red Hat Enterprise Linux (RHEL) 5 with the High Availability Add-On
  • cmirror-1.1.39-8.el5
  • openais-0.80.6-28.el5_6.1
