OCP Node cannot rejoin cluster after reboot due to ocp-tuned-one-shot.service failure
Issue
-
After a reboot, a OCP node becomes NotReady and fails to rejoin the cluster:
$ oc get node NAME STATUS ROLES AGE VERSION master0 NotReady,SchedulingDisabled control-plane,master,worker 195d v1.27.16+03a907c master1 Ready control-plane,master,worker 195d v1.27.16+03a907c master2 Ready control-plane,master,worker 195d v1.27.16+03a907c worker0 Ready worker 195d v1.27.16+03a907c worker1 Ready worker 195d v1.27.16+03a907c worker2 Ready worker 195d v1.27.16+03a907c
-
On the node,
ocp-tuned-one-shot.service
is marked asfailed
# systemctl status ocp-tuned-one-shot.service × ocp-tuned-one-shot.service - TuneD service from NTO image Loaded: loaded (/etc/systemd/system/ocp-tuned-one-shot.service; enabled; preset: disabled) Active: failed (Result: exit-code) since Thu 2025-01-01 00:00:00 UTC; 4min 17s ago :
Environment
- Red Hat Openshift Container Platform 4.14.38 or lower
- Red Hat Openshift Container Platform 4.15.35 or lower
- Red Hat Openshift Container Platform 4.16.14 or lower
- Red Hat Openshift Container Platform 4.17.0
- Red Hat Openshift Container Platform 4.18.0
- Cluster-wide proxy configured
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.