node-exporter throws errors reading the InfiniBand class counter info in RHOCP
Issue
- InfiniBand counters are not scrapped by Prometheus
-
in the
node-exportpods are observed errors indicating that not able to read the Infiniband counters2025-09-30T16:23:04.696886438+05:30 ts=2025-09-30T10:53:04.696Z caller=collector.go:169 level=error msg="collector failed" name=infiniband duration_seconds=0.000299582 err="error obtaining InfiniBand class info: failed to read file \"/host/sys/class/infiniband/qedr0/ports/1/counters/VL15_dropped\": invalid argument"
Environment
- Red Hat OpenShift Container Platform (RHOCP)
- 4.18.22
- Prometheus
- Infiniband
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.