Cluster service containing lvm resource fails to recover with error "Some else owns this volume group" after a node is fenced in a RHEL 6 High Availability cluster
Issue
- When a node reboots on its own (power blip, kernel panic, etc) and rejoins the cluster before its token is lost or it is fenced, recovery of services containing lvm resources fail:
Jul 05 10:44:47 rgmanager [lvm] Someone else owns this volume group
Jul 05 10:44:47 rgmanager start on lvm "myVG" returned 1 (generic error)
- Service containing
lvm
resource fails when being recovered after a node has been fenced.
Environment
- Red Hat Enterprise Linux (RHEL) 6 with the High Availability Add On
rgmanager
- HA-LVM using the tagging variant
<totem token="xxxxx"/>
set to a value higher than the amount of time it takes for a node to boot up to the point of starting thecman
service, or some other condition that allows a node to reboot and rejoin the cluster before fencing of that node has fully completed
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.