Pacemaker canceled a cluster resource's migration and restarted the resource instead

Solution In Progress - Updated -

Issue

  • A cluster resource was stopped on both nodes after a successful migrate_to operation, and then it was restarted on one node. The migration did not complete.
  • A Pacemaker cluster resource's migration was canceled with a message like one of those below.

    Oct 21 11:36:14 [75806] node1.example.com    pengine:   notice: native_create_actions:  Resource VirtDomain can no longer migrate to node2.example.com. Stopping on node1.example.com too
    # # or # #
    Oct 31 02:16:35 fastvm-rhel-8-0-23 pacemaker-schedulerd[76201]: notice: Resource dummy1 can no longer migrate from node2 to node1 (will stop on both nodes)
    
  • An ocf:heartbeat:VirtualDomain resource migration appeared to be running successfully. Then the scheduler canceled the migration and stopped the resource on both nodes.

    Oct 21 11:32:32 node2 pengine[75806]:   notice:  * Migrate    vm1         ( node2 -> node1 )
    Oct 21 11:32:32 node2 VirtualDomain(vm1)[19228]: INFO: vm1: Starting live migration to node1 (using: virsh --connect=qemu:///system --quiet migrate --live  vm1 qemu+ssh://node1/system ).
    Oct 21 11:36:13 node2 VirtualDomain(vm1)[19228]: INFO: vm1: live migration to node1 succeeded.
    Oct 21 11:36:13 node2 crmd[75807]:   notice: Result of migrate_to operation for vm1 on node2: 0 (ok)
    Oct 21 11:36:14 node2 pengine[75806]:   notice: Resource vm1 can no longer migrate to node1. Stopping on node2 too
    Oct 21 11:36:14 node2 pengine[75806]:   notice:  * Move       vm1         ( node1 -> node2 )
    Oct 21 11:36:14 node2 crmd[75807]:   notice: Initiating stop operation vm1_stop_0 locally on node2
    Oct 21 11:36:14 node2 crmd[75807]:   notice: Initiating stop operation vm1_stop_0 on node1
    Oct 21 11:36:14 node2 crmd[75807]:   notice: Result of stop operation for vm1 on node2: 0 (ok)
    Oct 21 11:36:38 node1 crmd[12811]:   notice: Result of stop operation for vm1 on node1: 0 (ok)
    Oct 21 11:39:54 node2 pengine[75806]:   notice:  * Start      vm1         (                      node1 )
    Oct 21 11:39:54 node2 crmd[75807]:   notice: Initiating start operation vm1_start_0 on node1
    Oct 21 11:39:56 node1 crmd[12811]:   notice: Result of start operation for vm1 on node1: 0 (ok)
    

Environment

  • Red Hat Enterprise Linux 7 (with the High Availability Add-on)
  • Red Hat Enterprise Linux 8 (with the High Availability Add-on)

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In

New to Red Hat?

Learn more about Red Hat subscriptions

Using a Red Hat product through a public cloud?

How to access this content