Red Hat Training

A Red Hat training course is available for Red Hat Ceph Storage

Chapter 6. Create an Alert

Administrators can create monitors that track the metrics of the Ceph cluster and generate alerts. For example, if an OSD is down, Datadog can alert an administrator that one or more OSDs are down.

Click Monitors to see an overview of the Datadog monitors.

datadog manage monitors

To create a monitor, select Monitors→New Monitor. At step 1, select the detection method. For example, "Threshold Alert."

datadog new monitor

At step 2, define the metric. To create an advanced alert, click on the Advanced…​ link. Then, select a metric from the combo box. For example, select the ceph.num_in_osds Ceph metric. Then, click Add Query+ to add another query.

datadog monitor ceph metric 1

Select another metric from the combo box. For example, select the ceph.num_up_osds Ceph metric.

datadog monitor ceph metric 2

In the Express these queries as: field, enter a-b, where a is the value of ceph.num_in_osds and b is the value of ceph.num_up_osds. When the difference is 1 or greater, there is at least one OSD down.

At step 3, set the alert conditions. For example, set the trigger to be above or equal to, the threshold to in total and the time elapsed to 1 minute. Then, set the Alert threshold field to 1. When at least one OSD is in the cluster and it is not up and running, the monitor will alert the user.

At step 4, give the monitor a title in the input field below Preview and Edit. This is required to save the monitor. Enter a description of the alert in the text field.

datadog monitor ceph metric 3

The text field supports metric variables and Markdown syntax.

At step 5, add the recipients of the alert. This will add an email address to the text field of step 4. When the alert gets triggered, the recipients will receive the alert.