The `watchdog` failed to perform a hardreboot with error `child process did not return in time`
Issue
- After configuring the watchdog script, attempting to fence the node reports the following error:

    Jul 16 02:46:42 node01 watchdog[5208]: test-binary /etc/watchdog.d/fence_scsi_check_hardreboot exceeded time limit
    Jul 16 02:47:43 node01 watchdog[5208]: test-binary /etc/watchdog.d/fence_scsi_check_hardreboot exceeded time limit 60
    Jul 16 02:47:43 node01 watchdog[5208]: Retry timed-out at 61 seconds for /etc/watchdog.d/fence_scsi_check_hardreboot
    Jul 16 02:47:43 node01 watchdog[5208]: repair binary /etc/watchdog.d/fence_scsi_check_hardreboot returned 247 = 'child process did not return in time'

- The watchdog is unable to reboot the node after it is fenced with `fence_scsi` or `fence_mpath`.
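To confirm that a node is hitting this symptom, the log can be scanned for the `repair binary ... returned` message. The following is a minimal Python sketch, not part of the original solution: the function name `find_repair_failures` and the regular expression are illustrative assumptions based only on the message layout in the excerpt above, and other watchdog versions may word the message differently.

```python
import re

# Pattern modelled on the "repair binary ... returned N = '...'" line from
# the log excerpt above. The exact wording is an assumption and may differ
# between watchdog versions.
REPAIR_FAILURE = re.compile(
    r"watchdog\[(?P<pid>\d+)\]: repair binary (?P<binary>\S+) "
    r"returned (?P<code>\d+) = '(?P<reason>[^']*)'"
)


def find_repair_failures(log_text):
    """Return one dict per 'repair binary ... returned N' line in log_text."""
    failures = []
    for line in log_text.splitlines():
        match = REPAIR_FAILURE.search(line)
        if match:
            failures.append({
                "binary": match.group("binary"),
                "code": int(match.group("code")),
                "reason": match.group("reason"),
            })
    return failures


# Sample line copied from the log excerpt in this article.
sample = (
    "Jul 16 02:47:43 node01 watchdog[5208]: repair binary "
    "/etc/watchdog.d/fence_scsi_check_hardreboot returned 247 = "
    "'child process did not return in time'"
)
```

Calling `find_repair_failures(sample)` yields a single entry for `/etc/watchdog.d/fence_scsi_check_hardreboot` with code 247. The same function can be applied to log text saved from the affected node.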
Environment
- Red Hat Enterprise Linux 8, 9, 10
- High Availability Pacemaker cluster
- Fence agent used with watchdog script: `fence_scsi` or `fence_mpath`