The `watchdog` failed to perform a hardreboot with error `child process did not return in time`

Solution Verified - Updated -

Issue

  • After configuring the watchdog script and when attempting to fence the node, the following error is reported:

    Jul 16 02:46:42 node01 watchdog[5208]: test-binary /etc/watchdog.d/fence_scsi_check_hardreboot exceeded time limit
    Jul 16 02:47:43 node01 watchdog[5208]: test-binary /etc/watchdog.d/fence_scsi_check_hardreboot exceeded time limit 60
    Jul 16 02:47:43 node01 watchdog[5208]: Retry timed-out at 61 seconds for /etc/watchdog.d/fence_scsi_check_hardreboot
    Jul 16 02:47:43 node01 watchdog[5208]: repair binary /etc/watchdog.d/fence_scsi_check_hardreboot returned 247 = 'child process did not return in time'
    
  • Watchdog unable to reboot the node after fencing using fence_scsi/fence_mpath

Environment

  • Red Hat Enterprise Linux 8, 9, 10
  • High Availability Pacemaker cluster
  • Fence agent used with watchdog script: fence_scsi or fence_mpath

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In

New to Red Hat?

Learn more about Red Hat subscriptions

Using a Red Hat product through a public cloud?

How to access this content