Overcloud deployment failed with
Issue
- overcloud deployment is failing on new compute nodes at this step:
Pre-fetch all the containers
with an ansible error similar to this one:
2024-10-23 15:25:52,401 p=534438 u=mistral n=ansible | 2024-10-23 15:25:52.401294 | 566fb47d-008b-5ae4-aa8a-00000008d0bd | FATAL | Pre-fetch all the containers | cell1-compute-121 | item=ourregistry.localdomain/rhosp-rhel8/openstack-
collectd:16.2.6 | error={"ansible_loop_var": "prefetch_image", "attempts": 5, "changed": false, "msg": "Failed to pull image ourregistry.localdomain/rhosp-rhel8/openstack-collectd:16.2.6", "prefetch_image": "ourregistry.localdomain/rhosp-
rhel8/openstack-collectd:16.2.6"}
- If we set "validate_certs=false", this works fine so we're deducing there's a certificate which we confirmed by running the following:
[root@overcloud-compute-118 ~]# podman login ourregistry.localdomain
Username: admin
Password:
Error: error authenticating creds for "ourregistry.localdomain": error pinging docker registry ourregistry.localdomain: Get "https://ourregistry.localdomain/v2/": x509: certificate signed by unknown authority
Environment
- Red Hat OpenStack Platform 16.2 (RHOSP)
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.