The watchdog rebooted a node after the fence_scsi/fence_mpath binary failed with a return code of 1 in a Red Hat High Availability cluster

Solution Verified

Issue

  • A cluster node with the fence_scsi watchdog script configured was gracefully rebooted by the watchdog daemon. /var/log/messages shows errors like the following:
Mar 28 10:28:08 node1 watchdog[22429]: test binary /etc/watchdog.d/fence_scsi_check_hardreboot returned 1
Mar 28 10:28:08 node1 watchdog[22429]: repair binary /etc/watchdog.d/fence_scsi_check_hardreboot returned 1
Mar 28 10:28:08 node1 watchdog[22429]: shutting down the system because of error 1
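To check whether a node shows the same symptom, the log can be scanned for watchdog test or repair binaries that returned a non-zero code. The following is a minimal sketch, not a Red Hat tool: the function name `find_watchdog_failures` and the regular expression are illustrative assumptions, and the pattern is derived only from the three sample lines above, so it may need adjusting for other watchdog versions or syslog formats.

```python
import re

# Pattern inferred from the sample log lines above (an assumption, not an
# official log format specification).
WATCHDOG_RE = re.compile(
    r"watchdog\[(?P<pid>\d+)\]: (?P<kind>test|repair) binary "
    r"(?P<path>\S+) returned (?P<rc>\d+)"
)


def find_watchdog_failures(lines):
    """Return (kind, path, return_code) for each non-zero test/repair result."""
    failures = []
    for line in lines:
        match = WATCHDOG_RE.search(line)
        if match and int(match.group("rc")) != 0:
            failures.append(
                (match.group("kind"), match.group("path"), int(match.group("rc")))
            )
    return failures


# The sample lines from the Issue section, used here as test input.
SAMPLE = [
    "Mar 28 10:28:08 node1 watchdog[22429]: test binary /etc/watchdog.d/fence_scsi_check_hardreboot returned 1",
    "Mar 28 10:28:08 node1 watchdog[22429]: repair binary /etc/watchdog.d/fence_scsi_check_hardreboot returned 1",
    "Mar 28 10:28:08 node1 watchdog[22429]: shutting down the system because of error 1",
]

if __name__ == "__main__":
    for kind, path, rc in find_watchdog_failures(SAMPLE):
        print(f"{kind} binary {path} returned {rc}")
```

To run this against a real node, pass an open file handle for /var/log/messages (root privileges are usually required to read it) instead of `SAMPLE`.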

Environment

  • Red Hat Enterprise Linux 6, 7, 8, 9 (with the High Availability Add-on)
