Node object is removed after being NotReady for some time on vSphere

Solution Unverified - Updated -

Issue

  • A node object is removed from the cluster after it has been NotReady for some time on vSphere, even though its VM has not been deleted

    $ oc get nodes | grep wrongnode
    wrongnode.example.com     Ready,SchedulingDisabled   master   2y151d   v1.23.5+8471591
    $ sleep $SOME_TIME
    $ oc get nodes | grep wrongnode
    $

    An example of an issue that can cause the node to become NotReady is the one described in this solution

  • Messages like the ones in "Diagnostic Steps" appear in the kube-controller-manager logs

  • If the node recovers, it is listed again in the output of oc get nodes
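
To capture when the node object disappears and reappears, the manual check above can be wrapped in a small helper. This is a minimal sketch, not part of the original solution: the function name node_state is hypothetical, and it assumes the output format of `oc get nodes --no-headers` (node name in the first column, status in the second).

```shell
# node_state NODE
# Reads `oc get nodes --no-headers` output on stdin and prints the STATUS
# column for NODE, or "absent" when the node object is not listed.
# (Hypothetical helper name; not an oc subcommand.)
node_state() {
  awk -v node="$1" '
    $1 == node { print $2; found = 1 }
    END { if (!found) print "absent" }
  '
}

# Example polling loop (commented out so that sourcing this file has no
# side effects); the 60-second interval is an arbitrary assumption:
# while true; do
#   printf '%s %s\n' "$(date -u +%FT%TZ)" \
#     "$(oc get nodes --no-headers | node_state wrongnode.example.com)"
#   sleep 60
# done
```

Logging a timestamp with each sample makes it easier to correlate the moment the node goes "absent" with the kube-controller-manager messages.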

Environment

  • Red Hat OpenShift Container Platform 4 (RHOCP 4)
    • 4.x
    • vSphere UPI or IPI
    • Not bare metal UPI (even if installed on top of vSphere VMs)
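
To tell which of the cases above a cluster falls into, the platform type recorded in the Infrastructure resource can be inspected. The following is a rough sketch under stated assumptions: the function name platform_verdict is hypothetical, and it assumes that vSphere IPI/UPI clusters report the platform type "VSphere" while bare metal UPI style installations on vSphere VMs report a different value such as "None".

```shell
# platform_verdict TYPE
# Prints "applies" when TYPE is the vSphere platform type, otherwise
# "does not apply". (Hypothetical helper; the mapping is an assumption.)
platform_verdict() {
  case "$1" in
    VSphere) echo "applies" ;;
    *)       echo "does not apply" ;;
  esac
}

# Example usage against a live cluster (commented out; requires a
# logged-in oc session):
# platform_verdict "$(oc get infrastructure cluster \
#   -o jsonpath='{.status.platformStatus.type}')"
```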
