vSphere machines are stuck in deleting phase if associated OpenShift node object is deleted

Solution In Progress - Updated -

Issue

  • vSphere machine is stuck in deleting phase if associated node is deleted.

  • Below logs are observed in machine-api-controller pod:

2021-07-26T18:25:35.907168430Z I0726 18:25:35.907131       1 controller.go:168] worker-1: reconciling Machine
2021-07-26T18:25:35.907168430Z I0726 18:25:35.907155       1 controller.go:208] worker-1: reconciling machine triggers delete
2021-07-26T18:25:35.911705275Z I0726 18:25:35.911684       1 controller.go:337] Could not find node from noderef, it may have already been deleted: worker-1
2021-07-26T18:25:35.911705275Z I0726 18:25:35.911700       1 actuator.go:150] worker-1: actuator deleting machine
2021-07-26T18:25:36.024206052Z I0726 18:25:36.024183       1 machine_scope.go:148] Node "worker-1" not found
2021-07-26T18:25:36.024217091Z I0726 18:25:36.024204       1 machine_scope.go:102] worker-1: patching machine
2021-07-26T18:25:36.041096582Z E0726 18:25:36.041059       1 actuator.go:57] worker-1 error: worker-1: reconciler failed to Delete machine: worker-1: Can't check node status before vm destroy: nodes "worker-1" not found
2021-07-26T18:25:36.041123410Z E0726 18:25:36.041105       1 controller.go:229] worker-1: failed to delete machine: worker-1: reconciler failed to Delete machine: worker-1: Can't check node status before vm destroy: nodes "worker-1" not found

Environment

  • Red Hat OpenShift Container Platform (RHOCP)
    • 4.7
  • VMware vSphere

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In

New to Red Hat?

Learn more about Red Hat subscriptions

Using a Red Hat product through a public cloud?

How to access this content