OpenStack overcloud update fails waiting for Ceph clean pgs

Solution Unverified - Updated -

Issue

The openstack overcloud update will fail with an error similar to the following:

2020-12-04 13:11:55Z [overcloud-AllNodesDeploySteps-2d3a5xmhthpb.WorkflowTasks_Step2]: UPDATE_COMPLETE  state changed
2020-12-04 13:11:56Z [overcloud-AllNodesDeploySteps-2d3a5xmhthpb.WorkflowTasks_Step2_Execution]: CREATE_IN_PROGRESS  state changed
2020-12-04 13:47:53Z [overcloud-AllNodesDeploySteps-2d3a5xmhthpb.WorkflowTasks_Step2_Execution]: CREATE_FAILED  resources.WorkflowTasks_Step2_Execution: Failure caused
by error in tasks: ceph_base_ansible_workflow

  ceph_base_ansible_workflow [task_ex_id=207c6837-efcb-467b-b882-c4b5a9a3abcd] -> Failure caused by error in tasks: ceph_install

  ceph_install [task_e
2020-12-04 13:47:53Z [overcloud-AllNodesDeploySteps-2d3a5xmhabcd]: UPDATE_FAILED  Resource CREATE failed: resources.WorkflowTasks_Step2_Execution: Failure caused by err
or in tasks: ceph_base_ansible_workflow

  ceph_base_ansible_workflow [task_ex_id=207c6837-efcb-467b-b882-c4b5a9a3abcd] -> Failure caused by error in tasks: ceph_install
2020-12-04 13:47:53Z [AllNodesDeploySteps]: UPDATE_FAILED  resources.AllNodesDeploySteps: Resource CREATE failed: resources.WorkflowTasks_Step2_Execution: Failure cause
d by error in tasks: ceph_base_ansible_workflow

  ceph_base_ansible_workflow [task_ex_id=207c6837-efcb-467b-b882-c4b5a9a3abcd] -> Failure caused
2020-12-04 13:47:53Z [overcloud]: UPDATE_FAILED  Resource UPDATE failed: resources.AllNodesDeploySteps: Resource CREATE failed: resources.WorkflowTasks_Step2_Execution:
 Failure caused by error in tasks: ceph_base_ansible_workflow

  ceph_base_ansible_workflow [task_ex_id=207c6837-efcb-467b-b882-c4b5abcd

 Stack overcloud UPDATE_FAILED

overcloud.AllNodesDeploySteps.WorkflowTasks_Step2_Execution:
  resource_type: OS::TripleO::WorkflowSteps
  physical_resource_id: 846bfbc3-641a-49ac-9380-73554cdaabcd
  status: CREATE_FAILED
  status_reason: |
    resources.WorkflowTasks_Step2_Execution: Failure caused by error in tasks: ceph_base_ansible_workflow

      ceph_base_ansible_workflow [task_ex_id=207c6837-efcb-467b-b882-c4b5a9a3abcd] -> Failure caused by error in tasks: ceph_install

      ceph_install [task_ex_id=80d7b0f8-8657-44a2-86bf-026d7c12abcd] -> One or more actions had failed.

And the ceph-install-workflow.log will show messages similar to these:

2020-12-04 08:46:47,171 p=2691 u=mistral |  FAILED - RETRYING: waiting for clean pgs... (2 retries left).
2020-12-04 08:47:17,908 p=2691 u=mistral |  FAILED - RETRYING: waiting for clean pgs... (1 retries left).
2020-12-04 08:47:48,635 p=2691 u=mistral |  fatal: [192.168.132.46 -> 192.168.132.28]: FAILED! => {"attempts": 40, "changed": true, "cmd".........

Environment

  • Red Hat OpenStack Platform 13

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In

New to Red Hat?

Learn more about Red Hat subscriptions

Using a Red Hat product through a public cloud?

How to access this content