Cluster Updates Without Error but Machine Config Pools Degraded with `Marking Degraded due to: unexpected on-disk state` on OCP 4.5 and older
Issue
-
After performing an update to a newer version of OpenShift Container Platform, the node versions are inconsistent. For example:
$ oc get node NAME STATUS ROLES AGE VERSION master-0.ocp.example.net Ready master 34d v1.17.1+9d33dd3 master-1.ocp.example.net Ready master 34d v1.17.1+9d33dd3 master-2.ocp.example.net Ready master 34d v1.17.1+9d33dd3 worker-0.ocp.example.net Ready worker 34d v1.17.1+9d33dd3 worker-1.ocp.example.net Ready worker 34d v1.17.1+9d33dd3 worker-2.ocp.example.net Ready worker 34d v1.17.1+912792b
-
A machine config pool is degraded and shows the errors specified in "Diagnostic Steps" section.
Environment
- Red Hat OpenShift Container Platform (RHOCP, OCP)
- 4.5 and older
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.