GPU Pods Failing to Schedule After OpenShift Cluster Upgrade Due to 'node.cloudprovider.kubernetes.io/uninitialized' Taint

Solution Verified - Updated 2025-08-30T10:39:35+00:00

Issue

After upgrading the OpenShift cluster, GPU pods could not be scheduled onto newly provisioned GPU nodes. The pods were created successfully but remained in the Pending state because no GPU node was eligible to run them.
The pod status showed the following scheduling error:

    message: '0/10 nodes are available: 1 node(s) had untolerated taint {node.cloudprovider.kubernetes.io/uninitialized:
      true}, 2 node(s) had untolerated taint {node-role.kubernetes.io/infra: }, 3
      node(s) had untolerated taint {node-role.kubernetes.io/master: }, 4 node(s)
      didn''t match Pod''s node affinity/selector. preemption: 0/10 nodes are available:
      10 Preemption is not helpful for scheduling.'
    reason: Unschedulable
    status: "False"
    type: PodScheduled
  phase: Pending
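
The scheduler message above packs several per-node reasons into one string. As a reading aid, here is a minimal Python sketch that splits the message into a count per reason. The helper name `summarize_unschedulable` is hypothetical, and the parsing assumes the message layout shown above; the scheduler's wording is not a stable interface and may differ between versions.

```python
import re

# The message from the pod status above, with YAML line folding and the
# doubled single quotes ('') already unescaped.
MESSAGE = (
    "0/10 nodes are available: 1 node(s) had untolerated taint "
    "{node.cloudprovider.kubernetes.io/uninitialized: true}, "
    "2 node(s) had untolerated taint {node-role.kubernetes.io/infra: }, "
    "3 node(s) had untolerated taint {node-role.kubernetes.io/master: }, "
    "4 node(s) didn't match Pod's node affinity/selector. "
    "preemption: 0/10 nodes are available: "
    "10 Preemption is not helpful for scheduling."
)


def summarize_unschedulable(message):
    """Return {reason: node_count} for the filtering part of the message.

    Hypothetical helper: assumes the layout seen in this article.
    """
    # Keep only the part before the preemption summary.
    head = message.split(" preemption:", 1)[0].strip().rstrip(".")
    # Drop the leading "0/10 nodes are available" prefix.
    _, _, reasons = head.partition(": ")
    summary = {}
    # Each reason starts with "<count> node(s) "; split on the separating
    # comma only when another count follows it.
    for part in re.split(r", (?=\d+ node\(s\) )", reasons):
        match = re.match(r"(\d+) node\(s\) (.+)", part)
        if match:
            summary[match.group(2)] = int(match.group(1))
    return summary


if __name__ == "__main__":
    for reason, count in summarize_unschedulable(MESSAGE).items():
        print(f"{count:>2}  {reason}")
```

Read this way, the counts account for all 10 nodes. The single node rejected for the `node.cloudprovider.kubernetes.io/uninitialized` taint stands apart from the infra nodes, the master nodes, and the nodes excluded by the pod's node affinity/selector.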

Environment

Red Hat OpenShift Container Platform 4.x
Cloud Provider: AWS

