node-exporter throws errors reading the InfiniBand class counter info in RHOCP

Solution Verified - Updated -

Issue

  • InfiniBand counters are not scraped by Prometheus.
  • The node-exporter pods log errors indicating that the InfiniBand counters cannot be read:

    2025-09-30T16:23:04.696886438+05:30 ts=2025-09-30T10:53:04.696Z caller=collector.go:169 level=error msg="collector failed" name=infiniband duration_seconds=0.000299582 err="error obtaining InfiniBand class info: failed to read file \"/host/sys/class/infiniband/qedr0/ports/1/counters/VL15_dropped\": invalid argument"
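
To see which counter files trigger the error, the read that fails in the log can be reproduced by hand. The following is a minimal diagnostic sketch, not part of node-exporter: the function name `find_unreadable_counters` is hypothetical, and it assumes the `/host/sys` path in the log corresponds to `/sys` on the node itself. It walks the same directory layout shown in the error message and lists every counter file whose read fails, together with the error code.

```python
import errno
from pathlib import Path


def find_unreadable_counters(sysfs_root="/sys"):
    """Return (path, error_name) pairs for InfiniBand counter files that cannot be read.

    sysfs_root is an assumption: "/sys" on the node, or "/host/sys" when run
    from inside a container that mounts the host sysfs there.
    """
    failures = []
    base = Path(sysfs_root) / "class" / "infiniband"
    # Layout taken from the path in the error message:
    # <root>/class/infiniband/<device>/ports/<port>/counters/<counter>
    for counter in sorted(base.glob("*/ports/*/counters/*")):
        if not counter.is_file():
            continue
        try:
            counter.read_bytes()
        except OSError as exc:
            # Map the numeric errno to its symbolic name, e.g. EINVAL.
            name = errno.errorcode.get(exc.errno, str(exc.errno))
            failures.append((str(counter), name))
    return failures


if __name__ == "__main__":
    for path, err in find_unreadable_counters():
        print(f"{path}: {err}")
```

A counter such as `VL15_dropped` on `qedr0` would be expected to appear in the output with `EINVAL` if the read fails on the node in the same way it does for the collector.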
    

Environment

  • Red Hat OpenShift Container Platform (RHOCP)
    • 4.18.22
  • Prometheus
  • InfiniBand
