Cannot Deploy NVIDIA GPU Driver in OpenShift

Solution Verified - Updated -

Issue

When attempting to deploy a NVIDIA GPU cluster policy on the OpenShift Container Platform the nvidia-driver-daemonset pod gets stuck in an indefinite wait state after the driver build portion completes and the cluster policy never goes to fully running ready state.

Environment

Red Hat OpenShift Container Platform 4.17
Red Hat OpenShift Container Platform 4.18

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In

New to Red Hat?

Learn more about Red Hat subscriptions

Using a Red Hat product through a public cloud?

How to access this content