Pacemaker vdo-vol resource fails when using with ceph:rbd resource
Issue
- The
vdo-vol
resource probes fail which causes the resource to be stopped on all cluster nodes.
Nov 10 12:30:56 rhel8-3 pacemaker-controld[3781]: notice: Requesting local execution of probe operation for vdo on rhel8-3.examplerh.com
Nov 10 12:30:56 rhel8-3 pacemaker-controld[3781]: notice: Initiating monitor operation vdo_monitor_0 on rhel8-2.examplerh.com
Nov 10 12:30:56 rhel8-3 pacemaker-controld[3781]: notice: Initiating monitor operation vdo_monitor_0 on rhel8-1.examplerh.com
Nov 10 12:30:56 rhel8-3 pacemaker-controld[3781]: notice: Result of probe operation for virtfence_xvm on rhel8-3.examplerh.com: not running
Nov 10 12:30:56 rhel8-3 pacemaker-controld[3781]: notice: Initiating start operation virtfence_xvm_start_0 on rhel8-1.examplerh.com
Nov 10 12:30:56 rhel8-3 pacemaker-controld[3781]: notice: Initiating monitor operation virtfence_xvm_monitor_60000 on rhel8-1.examplerh.com
Nov 10 12:30:56 rhel8-3 vdo[4471]: ERROR - vdodumpconfig: Failed to make FileLayer from '/dev/disk/by-id/dm-uuid-mpath-36001405ebc0fa991959487e9792fd4b5' with No such file or directory
Nov 10 12:30:56 rhel8-3 vdo-vol(vdo)[4466]: ERROR: VDO volume(s): vdo1 failed\nvdo: ERROR - vdodumpconfig: Failed to make FileLayer from '/dev/disk/by-id/dm-uuid-mpath-36001405ebc0fa991959487e9792fd4b5' with No such file or directory
Nov 10 12:30:56 rhel8-3 pacemaker-controld[3781]: notice: Result of probe operation for vdo on rhel8-3.examplerh.com: error
Nov 10 12:30:56 rhel8-3 pacemaker-controld[3781]: notice: vdo_monitor_0@rhel8-3.examplerh.com output [ 'vdo1': No such file or directory\n ]
Nov 10 12:30:56 rhel8-3 pacemaker-controld[3781]: notice: Transition 13 aborted by operation vdo_monitor_0 'modify' on rhel8-3.examplerh.com: Event failed
Nov 10 12:30:56 rhel8-3 pacemaker-controld[3781]: notice: Transition 13 action 6 (vdo_monitor_0 on rhel8-3.examplerh.com): expected 'not running' but got 'error'
Nov 10 12:30:56 rhel8-3 pacemaker-controld[3781]: notice: Transition 13 action 2 (vdo_monitor_0 on rhel8-1.examplerh.com): expected 'not running' but got 'ok'
Nov 10 12:30:56 rhel8-3 pacemaker-controld[3781]: notice: Transition 13 (Complete=8, Pending=0, Fired=0, Skipped=0, Incomplete=2, Source=/var/lib/pacemaker/pengine/pe-input-275.bz2): Complete
Nov 10 12:30:56 rhel8-3 pacemaker-schedulerd[3780]: warning: Unexpected result (error) was recorded for probe of vdo on rhel8-3.examplerh.com at Nov 10 12:30:56 2022
Nov 10 12:30:56 rhel8-3 pacemaker-schedulerd[3780]: notice: If it is not possible for vdo to run on rhel8-3.examplerh.com, see the resource-discovery option for location constraints
Nov 10 12:30:56 rhel8-3 pacemaker-schedulerd[3780]: warning: Unexpected result (error) was recorded for probe of vdo on rhel8-3.examplerh.com at Nov 10 12:30:56 2022
Nov 10 12:30:56 rhel8-3 pacemaker-schedulerd[3780]: notice: If it is not possible for vdo to run on rhel8-3.examplerh.com, see the resource-discovery option for location constraints
Nov 10 12:30:56 rhel8-3 pacemaker-schedulerd[3780]: error: ocf resource vdo might be active on 2 nodes (attempting recovery)
Nov 10 12:30:56 rhel8-3 pacemaker-schedulerd[3780]: notice: See https://wiki.clusterlabs.org/wiki/FAQ#Resource_is_Too_Active for more information
Nov 10 12:30:56 rhel8-3 pacemaker-schedulerd[3780]: notice: Actions: Recover vdo ( rhel8-3.examplerh.com -> rhel8-1.examplerh.com )
Nov 10 12:30:56 rhel8-3 pacemaker-schedulerd[3780]: error: Calculated transition 14 (with errors), saving inputs in /var/lib/pacemaker/pengine/pe-error-6.bz2
Environment
- Red Hat Enterprise Linux Server 8, 9 (with the High Availability Add On)
- A pacemaker managed resource configured:
vdo-vol
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.