Ceph/ODF: alertname = MDSCacheUsageHigh: MDS cache usage for the daemon mds.ocs-storagecluster-cephfilesystem-X has exceeded above 95% of the requested value.
Issue
alertname = "MDSCacheUsageHigh": MDS cache usage for the daemon mds.ocs-storagecluster-cephfilesystem-X has exceeded above 95% of the requested value.
The above alert is "firing" in our OpenShift clusters. Here is the full text of the alert:
Labels
alertname = MDSCacheUsageHigh
ceph_daemon = mds.ocs-storagecluster-cephfilesystem-a
endpoint = ceph-exporter-http-metrics
instance = 10.131.2.24:9926
job = kube-state-metrics
managedBy = ocs-storagecluster
namespace = openshift-storage
openshift_io_alert_source = platform
pod = rook-ceph-exporter-oirnocp0088-595878d8c-gdgpv
prometheus = openshift-monitoring/k8s
service = rook-ceph-exporter
severity = critical
Annotations
description = MDS cache usage for the daemon mds.ocs-storagecluster-cephfilesystem-a has exceeded above 95% of the requested value. Increase the memory request for mds.ocs-storagecluster-cephfilesystem-a pod.
message = High MDS cache usage for the daemon mds.ocs-storagecluster-cephfilesystem-a.
runbook_url = https://github.com/openshift/runbooks/blob/master/alerts/openshift-container-storage-operator/CephMdsCacheUsageHigh.md
severity_level = error
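The condition in the alert description (cache usage above 95% of the requested value) can be illustrated with a small arithmetic sketch. This is not the actual Prometheus rule behind MDSCacheUsageHigh; the function names, the byte figures, and the reading of "requested value" as a single byte count are assumptions made only for illustration.

```python
# Hypothetical illustration of the "above 95% of the requested value" condition.
# Names and numbers are assumptions, not taken from the real alert rule.

GIB = 1024 ** 3


def cache_usage_ratio(cache_bytes: int, requested_bytes: int) -> float:
    """Return cache usage as a fraction of the requested value."""
    if requested_bytes <= 0:
        raise ValueError("requested_bytes must be positive")
    return cache_bytes / requested_bytes


def alert_would_fire(cache_bytes: int, requested_bytes: int,
                     threshold: float = 0.95) -> bool:
    """True when usage is strictly above the threshold fraction."""
    return cache_usage_ratio(cache_bytes, requested_bytes) > threshold


if __name__ == "__main__":
    # Assumed example: 3.9 GiB of cache measured against a 4 GiB requested value.
    print(alert_would_fire(int(3.9 * GIB), 4 * GIB))
    # Assumed example: 3 GiB of cache measured against the same 4 GiB value.
    print(alert_would_fire(3 * GIB, 4 * GIB))
```

Under these assumptions, raising the requested value lowers the ratio for the same cache size, which is consistent with the alert's advice to increase the memory request for the MDS pod.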
Environment
Red Hat OpenShift Container Platform (OCP) 4.x
Red Hat OpenShift Container Storage (OCS) 4.x
Red Hat OpenShift Data Foundation (ODF) 4.x
Red Hat Ceph Storage (RHCS) 6.x
Red Hat Ceph Storage (RHCS) 7.x