vSphere machines are stuck in deleting phase if associated OpenShift node object is deleted

Solution In Progress - Updated -

Issue

  • vSphere machine is stuck in deleting phase if associated node is deleted.

  • Below logs are observed in machine-api-controller pod:

2021-07-26T18:25:35.907168430Z I0726 18:25:35.907131       1 controller.go:168] worker-1: reconciling Machine
2021-07-26T18:25:35.907168430Z I0726 18:25:35.907155       1 controller.go:208] worker-1: reconciling machine triggers delete
2021-07-26T18:25:35.911705275Z I0726 18:25:35.911684       1 controller.go:337] Could not find node from noderef, it may have already been deleted: worker-1
2021-07-26T18:25:35.911705275Z I0726 18:25:35.911700       1 actuator.go:150] worker-1: actuator deleting machine
2021-07-26T18:25:36.024206052Z I0726 18:25:36.024183       1 machine_scope.go:148] Node "worker-1" not found
2021-07-26T18:25:36.024217091Z I0726 18:25:36.024204       1 machine_scope.go:102] worker-1: patching machine
2021-07-26T18:25:36.041096582Z E0726 18:25:36.041059       1 actuator.go:57] worker-1 error: worker-1: reconciler failed to Delete machine: worker-1: Can't check node status before vm destroy: nodes "worker-1" not found
2021-07-26T18:25:36.041123410Z E0726 18:25:36.041105       1 controller.go:229] worker-1: failed to delete machine: worker-1: reconciler failed to Delete machine: worker-1: Can't check node status before vm destroy: nodes "worker-1" not found

Environment

  • Red Hat OpenShift Container Platform (RHOCP)
    • 4.7
  • VMware vSphere

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In