IPI private cluster installation of OpenShift 4.9 on Azure into a pre-existing VNet times out "FATAL Bootstrap failed to complete"

Solution In Progress - Updated -

Environment

  • Red Hat OpenShift Container Platform (RHOCP)
    • 4.9
  • Azure

Issue

  • When attempting an IPI install of OpenShift 4.9 on Azure into a pre-existing VNet, why does the installation timeout out waiting for bootstrap to complete?
  • On each installation attempt the install fails differently.

Resolution

If an ExpressRoute link is used for connectivity to the Microsoft Cloud Services via Corporate Network > Firewall > Internet:

  • Check and ensure that the ExpressRoute link is not busy nor is bandwidth restricted.
  • Increase the bandwidth limit and re-attempt the installation.

Disclaimer: Links contained herein to external website(s) are provided for convenience only. Red Hat has not reviewed the links and is not responsible for the content or its availability. The inclusion of any link to an external website does not imply endorsement by Red Hat of the website or their entities, products or services. You agree that Red Hat is not responsible or liable for any loss or expenses that may result due to your use of (or reliance on) the external site or content.

Root Cause

The ExpressRoute link can be busy and severely bandwidth restricted. As a result the RHOCP installation can fail and each installation attempt may fail differently.

Diagnostic Steps

Review the install log output for bootstrap timeout errors. Refer to official documentation Gathering logs from a failed installation for detailed instructions.

ERROR Attempted to gather debug logs after installation failure: could not read control plane IPs from outputs file "install/cluster.tfvars.json"
ERROR Bootstrap failed to complete: timed out waiting for the condition
ERROR Failed to wait for bootstrapping to complete. This error usually happens when there is a problem with control plane hosts that prevents the control plane operators from creating the control plane.
ERROR Attempted to analyze the debug logs after installation failure: could not open the gather bundle: open : no such file or directory
FATAL Bootstrap failed to complete

This solution is part of Red Hat’s fast-track publication program, providing a huge library of solutions that Red Hat engineers have created while supporting our customers. To give you the knowledge you need the instant it becomes available, these articles may be presented in a raw and unedited form.

Comments