Cluster Updates Without Error but Machine Config Pools Degraded with `Marking Degraded due to: unexpected on-disk state` on OCP 4.5 and older

Solution Verified - Updated -

Issue

  • After updating to a newer version of OpenShift Container Platform, the nodes report inconsistent versions (in the example below, worker-2 differs from the other nodes):

    $ oc get node
    NAME                      STATUS  ROLES   AGE  VERSION
    master-0.ocp.example.net  Ready   master  34d  v1.17.1+9d33dd3
    master-1.ocp.example.net  Ready   master  34d  v1.17.1+9d33dd3
    master-2.ocp.example.net  Ready   master  34d  v1.17.1+9d33dd3
    worker-0.ocp.example.net  Ready   worker  34d  v1.17.1+9d33dd3
    worker-1.ocp.example.net  Ready   worker  34d  v1.17.1+9d33dd3
    worker-2.ocp.example.net  Ready   worker  34d  v1.17.1+912792b
    
  • A machine config pool is degraded and shows the errors listed in the "Diagnostic Steps" section.
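
As a quick way to confirm the first symptom, the following minimal sketch tallies nodes per reported version from `oc get node` output. The helper name `count_node_versions` is hypothetical and not part of `oc`; it assumes the default column layout shown above, where VERSION is the last column.

```shell
# Hypothetical helper: reads `oc get node` output on stdin, skips the header
# row, and prints "<count> <version>" for each distinct value in the last
# column (VERSION), most common version first.
count_node_versions() {
  awk 'NR > 1 && NF > 0 { count[$NF]++ } END { for (v in count) print count[v], v }' | sort -rn
}

# Usage against a live cluster (requires a logged-in `oc` session):
#   oc get node | count_node_versions
#   oc get machineconfigpool
```

More than one line of output means the nodes are not all on the same version. `oc get machineconfigpool` then shows which pool reports a degraded state.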

Environment

  • Red Hat OpenShift Container Platform (RHOCP, OCP)
    • 4.5 and older
