Chapter 6. Create an Alert
Administrators can create monitors that track the metrics of the Ceph cluster and generate alerts. For example, if an OSD is down, Datadog can alert an administrator that one or more OSDs are down.
Click Monitors to see an overview of the Datadog monitors.
To create a monitor, select Monitors→New Monitor. At step 1, select the detection method. For example, "Threshold Alert."
At step 2, define the metric. To create an advanced alert, click on the Advanced… link. Then, select a metric from the combo box. For example, select the
ceph.num_in_osds Ceph metric. Then, click Add Query+ to add another query.
Select another metric from the combo box. For example, select the
ceph.num_up_osds Ceph metric.
In the Express these queries as: field, enter
a is the value of
b is the value of
ceph.num_up_osds. When the difference is
1 or greater, there is at least one OSD down.
At step 3, set the alert conditions. For example, set the trigger to be above or equal to, the threshold to in total and the time elapsed to 1 minute. Then, set the Alert threshold field to
1. When at least one OSD is in the cluster and it is not up and running, the monitor will alert the user.
At step 4, give the monitor a title in the input field below Preview and Edit. This is required to save the monitor. Enter a description of the alert in the text field.
The text field supports metric variables and Markdown syntax.
At step 5, add the recipients of the alert. This will add an email address to the text field of step 4. When the alert gets triggered, the recipients will receive the alert.