Chapter 5. Remote health monitoring

OpenShift Data Foundation collects anonymized aggregated information about the health, usage, and size of clusters and reports it to Red Hat via an integrated component called Telemetry. This information allows Red Hat to improve OpenShift Data Foundation and to react to issues that impact customers more quickly.

A cluster that reports data to Red Hat via Telemetry is considered a connected cluster.

5.1. About Telemetry

Telemetry sends a carefully chosen subset of the cluster monitoring metrics to Red Hat. These metrics are sent continuously and describe:

  • The size of an OpenShift Data Foundation cluster
  • The health and status of OpenShift Data Foundation components
  • The health and status of any upgrade being performed
  • Limited usage information about OpenShift Data Foundation components and features
  • Summary info about alerts reported by the cluster monitoring component

This continuous stream of data is used by Red Hat to monitor the health of clusters in real time and to react as necessary to problems that impact our customers. It also allows Red Hat to roll out OpenShift Data Foundation upgrades to customers so as to minimize service impact and continuously improve the upgrade experience.

This debugging information is available to Red Hat Support and engineering teams with the same restrictions as accessing data reported via support cases. All connected cluster information is used by Red Hat to help make OpenShift Data Foundation better and more intuitive to use. None of the information is shared with third parties.

5.2. Information collected by Telemetry

Primary information collected by Telemetry includes:

  • The size of the Ceph cluster in bytes : "ceph_cluster_total_bytes",
  • The amount of the Ceph cluster storage used in bytes : "ceph_cluster_total_used_raw_bytes",
  • Ceph cluster health status : "ceph_health_status",
  • The total count of object storage devices (OSDs) : "job:ceph_osd_metadata:count",
  • The total number of OpenShift Data Foundation Persistent Volumes (PVs) present in the Red Hat OpenShift Container Platform cluster : "job:kube_pv:count",
  • The total input/output operations per second (IOPS) (reads+writes) value for all the pools in the Ceph cluster : "job:ceph_pools_iops:total",
  • The total IOPS (reads+writes) value in bytes for all the pools in the Ceph cluster : "job:ceph_pools_iops_bytes:total",
  • The total count of the Ceph cluster versions running : "job:ceph_versions_running:count"
  • The total number of unhealthy NooBaa buckets : "job:noobaa_total_unhealthy_buckets:sum",
  • The total number of NooBaa buckets : "job:noobaa_bucket_count:sum",
  • The total number of NooBaa objects : "job:noobaa_total_object_count:sum",
  • The count of NooBaa accounts : "noobaa_accounts_num",
  • The total usage of storage by NooBaa in bytes : "noobaa_total_usage",
  • The total amount of storage requested by the persistent volume claims (PVCs) from a particular storage provisioner in bytes: "cluster:kube_persistentvolumeclaim_resource_requests_storage_bytes:provisioner:sum",
  • The total amount of storage used by the PVCs from a particular storage provisioner in bytes: "cluster:kubelet_volume_stats_used_bytes:provisioner:sum".

Telemetry does not collect identifying information such as user names, passwords, or the names or addresses of user resources.