clogd repeats error "Sync checkpoint section create entry" and lvm commands block in a cluster when a node activates a mirrored volume in RHEL 5
Issue
- After a network outage, only one node of a two node cluster can be up. If the second node gets up, will be member of the cluster and starts
clmvd, the whole cluster hangs and no service is up anymore. Cluster solution is running only on one node at the moment while it is stopped on the other - When a node joins the cluster and starts
clvmd, it hangs.lvmcommands block and the other node repeats "Sync checkpoint section create retry" messages over and over:
Nov 14 11:38:28 node1 kernel: dlm: connecting to 1
Nov 14 11:38:28 node1 kernel: dlm: got connection from 1
Nov 14 11:38:34 node1 clogd[6632]: Sync checkpoint section create retry
Nov 14 11:38:34 node1 clogd[6632]: Sync checkpoint section create retry
Environment
- Red Hat Enterprise Linux (RHEL) RHEL 5 with the High Availability Add On
cmirror-1.1.39-8.el5openais-0.80.6-28.el5_6.1
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase of over 48,000 articles and solutions.
Welcome! Check out the Getting Started with Red Hat page for quick tours and guides for common tasks.
