Ceph - Gnocchi: slow request from pool metrics

Solution Verified - Updated -

Issue

On RHOSP 10 with backed RHCS and Gnocchi for collecting metrics are experienced slow request against PGs from pool metrics.
These slow request are affecting other pools which share the OSDs with pool metrics.
OSDs may failing with:

osd/ReplicatedPG.cc: 387: FAILED assert(needs_recovery)

or with:

7fd479246700  1 heartbeat_map is_healthy 'OSD::osd_op_tp thread 0x7fd4517d0700' had suicide timed out after 150
7fd479246700 -1 common/HeartbeatMap.cc: In function 'bool ceph::HeartbeatMap::_check(const ceph::heartbeat_handle_d*, const char*, time_t)' thread 7fd479246700 time 2017-05-30 23:10:57.905609
common/HeartbeatMap.cc: 86: FAILED assert(0 == "hit suicide timeout")

Environment

Red Hat Ceph Storage 2.x
Red Hat Ceph Storage 1.3.x.
Red Hat OpenStack Platform 10.
Red Hat OpenStack Platform 11.
Gnocchi 3.0.13 and below for RHOSP 10
Gnocchi 3.1.10 and below for RHOSP 11

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase of over 48,000 articles and solutions.

Current Customers and Partners

Log in for full access

Log In
Close

Welcome! Check out the Getting Started with Red Hat page for quick tours and guides for common tasks.