Overcloud deployment failed with

Solution In Progress - Updated -

Issue

  • overcloud deployment is failing on new compute nodes at this step:
Pre-fetch all the containers

with an ansible error similar to this one:

2024-10-23 15:25:52,401 p=534438 u=mistral n=ansible | 2024-10-23 15:25:52.401294 | 566fb47d-008b-5ae4-aa8a-00000008d0bd |      FATAL | Pre-fetch all the containers | cell1-compute-121 | item=ourregistry.localdomain/rhosp-rhel8/openstack-
collectd:16.2.6 | error={"ansible_loop_var": "prefetch_image", "attempts": 5, "changed": false, "msg": "Failed to pull image ourregistry.localdomain/rhosp-rhel8/openstack-collectd:16.2.6", "prefetch_image": "ourregistry.localdomain/rhosp-
rhel8/openstack-collectd:16.2.6"}
  • If we set "validate_certs=false", this works fine so we're deducing there's a certificate which we confirmed by running the following:
[root@overcloud-compute-118 ~]# podman login ourregistry.localdomain
Username: admin
Password: 
Error: error authenticating creds for "ourregistry.localdomain": error pinging docker registry ourregistry.localdomain: Get "https://ourregistry.localdomain/v2/": x509: certificate signed by unknown authority

Environment

  • Red Hat OpenStack Platform 16.2 (RHOSP)

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In

New to Red Hat?

Learn more about Red Hat subscriptions

Using a Red Hat product through a public cloud?

How to access this content