The watchdog rebooted a node after the fence_scsi binary failed with a return code of 1 in a Red Hat High Availability cluster

Solution Verified - Updated -

Issue

  • A cluster node with the fence_scsi watchdog script configured was gracefully rebooted. /var/log/messages shows errors like the following:
Mar 28 10:28:08 node1 watchdog[22429]: test binary /etc/watchdog.d/fence_scsi_check_hardreboot returned 1
Mar 28 10:28:08 node1 watchdog[22429]: repair binary /etc/watchdog.d/fence_scsi_check_hardreboot returned 1
Mar 28 10:28:08 node1 watchdog[22429]: shutting down the system because of error 1

Environment

  • Red Hat Enterprise Linux 6, 7, or 8 (with the High Availability Add-on)

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase of over 48,000 articles and solutions.

Current Customers and Partners

Log in for full access

Log In