Solve pcs resource stuck on OpenStack controller

Solution Unverified - Updated -

Issue

On Openstack 7, 8 and 9 there is lots of constraint in between all openstack services which are started as a pcs resource.

In some case of network failure, we could get some service fail on one controller and a resource cleanup do not resolve the issue.

Failed Actions:
* rabbitmq_monitor_10000 on overcloud-controller-0 'not running' (7): call=63, status=complete, exitreason='none',
    last-rc-change='Fri Jan 26 17:40:45 2018', queued=0ms, exec=0ms
* openstack-heat-engine_monitor_60000 on overcloud-controller-0 'not running' (7): call=324, status=complete, exitreason='none',
    last-rc-change='Fri Jan 26 17:39:12 2018', queued=0ms, exec=0ms

Try to cleanup

pcs resource cleanup
Error: Cleaning up all resources on all nodes will execute more than 100 operations in the cluster, which may negatively impact the responsiveness of the cluster. Consider specifying resource and/or node, use --force to override

pcs resource cleanup --force

But some resource stay stopped as

 Clone Set: openstack-nova-conductor-clone [openstack-nova-conductor]
     Started: [ overcloud-controller-1 overcloud-controller-2 ]
     Stopped: [ overcloud-controller-0 ]

Environment

  • Red Hat Openstack 7
  • Red Hat Openstack 8
  • Red Hat Openstack 9

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In

New to Red Hat?

Learn more about Red Hat subscriptions

Using a Red Hat product through a public cloud?

How to access this content