Cluster node is rebooted by qdiskd after ping heuristic fails tko times in RHEL
Issue
- Cluster heuristic program
ping
sometimes fails and this causes the node to reboot
Dec 20 15:24:25 node1 qdiskd[8279]: <info> Heuristic: 'ping -c 1 -w 1 10.192.12.1' DOWN (10/10)
Dec 20 15:24:27 node1 qdiskd[8279]: <notice> Score insufficient for master operation (0/10; required=3); downgrading
- Host rebooted seeing
ping
errors in syslog.
Environment
- Red Hat Enterprise Linux (RHEL) 5 or 6 with the High Availability Add On
- Configuration contains a quorum device and
ping
heuristic- NOTE: This solution does not cover situations in which the
ping
heuristic is timing out rather than failing. If the failure in question is from a heuristic timeout, this alternative solution may correct the problem.
- NOTE: This solution does not cover situations in which the
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.