sriov-network-operator pod not recovered after worker node reboot in RHOCP 4
Issue
- SR-IOV application Pod in
ContainerCreating
status after worker node restart - SR-IOV operator configured with PF under selector
SriovNetworkNodePolicy.spec.nicSelector.pfNames
causes application Pod inContainerCreating
status -
SR-IOV application Pod replica's output log presents the following error:
Normal AddedInterface 2m18s multus Add eth0 [10.129.10.19/23] from ovn-kubernetes │ │ Warning FailedCreatePodSandBox 2m17s kubelet Failed to create pod sandbox: rpc error: code = Unknown desc = failed to create pod network sandbox k8s_eric-pc- up-data-plane-5d98b75c57-v9fr2_pcg_9c59cea9-cf72-4ff1-baa9-0870aa63 │ │ f83d_0(b7b5521ce475b672b0b0cbd535f1da787c661a224a6ba09da0ee6fd90d2834ef): error adding pod <pod-name>-5d98b75c57-v9fr2 to CNI network "multus-cni-network": [ns/<podname>-5d98b75c57-v9fr2:eric-pc-up-data-plane-net1]: error │ │ adding container to network "eric-pc-up-data-plane-net1": SRIOV-CNI failed to load netconf: LoadConf(): failed to get VF information: "lstat /sys/bus/pci/devices/0000:37:00.0/physfn/net: no such file or directory"
Environment
- Red Hat OpenShift Container Platform (RHOCP)
o 4.11
o 4.10
o 4.9
o 4.8
o 4.7 - SR-IOV operator
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.