Cluster Updates Without Error but Machine Config Pools Degraded with `Marking Degraded due to: unexpected on-disk state` on OCP 4.5 and older

Solution Verified - Updated -

Issue

  • After performing an update to a newer version of OpenShift Container Platform, the node versions are inconsistent. For example:

    $ oc get node
    NAME                      STATUS  ROLES   AGE  VERSION
    master-0.ocp.example.net  Ready   master  34d  v1.17.1+9d33dd3
    master-1.ocp.example.net  Ready   master  34d  v1.17.1+9d33dd3
    master-2.ocp.example.net  Ready   master  34d  v1.17.1+9d33dd3
    worker-0.ocp.example.net  Ready   worker  34d  v1.17.1+9d33dd3
    worker-1.ocp.example.net  Ready   worker  34d  v1.17.1+9d33dd3
    worker-2.ocp.example.net  Ready   worker  34d  v1.17.1+912792b
    
  • A machine config pool is degraded and shows the errors specified in "Diagnostic Steps" section.

Environment

  • Red Hat OpenShift Container Platform (RHOCP, OCP)
    • 4.5 and older

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In

New to Red Hat?

Learn more about Red Hat subscriptions

Using a Red Hat product through a public cloud?

How to access this content