Scripts or programs in /etc/watchdog.d to be checked by watchdog are listed in 'ps' output as "defunct" in RHEL 6
Issue
- I noticed several fence_scsi_check.pl zombie processes on each cluster node. These zombie processes appeared on the systems after half of the SAN paths were unavailable for the storage LUNs.
USER PID %CPU %MEM VSZ RSS TTY STAT START TIME COMMAND
root 2380 0.0 0.0 0 0 ? Z Jan29 0:00 [fence_scsi_chec] <defunct>
root 2389 0.0 0.0 0 0 ? Z Jan29 0:00 [fence_scsi_chec] <defunct>
- I have a script in /etc/watchdog.d that will reboot the host if there is an error. In ps output, I see several of these listed as <defunct>, and they don't seem to be going away.
USER PID %CPU %MEM VSZ RSS TTY STAT START TIME COMMAND
root 17468 0.0 0.0 0 0 ? Z 12:16 0:00 [watchdog-test.b] <defunct>
root 17525 0.0 0.0 0 0 ? Z 12:16 0:00 [watchdog-test.b] <defunct>
root 17535 0.0 0.0 0 0 ? Z 12:17 0:00 [watchdog-test.b] <defunct>
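To check whether a system shows the same symptom, the defunct entries can be pulled out of ps output together with their parent PID. The following is a minimal sketch, not part of the original article: the function names filter_zombies and list_zombies are hypothetical, and it assumes a procps-style ps that accepts `-eo pid,ppid,stat,comm`. A zombie stays in the process table until its parent collects its exit status, so the PPID column shows which process has not yet reaped it.

```shell
# Hypothetical helper: read `ps -eo pid,ppid,stat,comm` style input on stdin
# and keep only zombie processes, i.e. rows whose STAT column starts with "Z".
# Prints: PID PPID COMMAND
filter_zombies() {
    # NR > 1 skips the header row; $3 is the STAT column.
    awk 'NR > 1 && $3 ~ /^Z/ { print $1, $2, $4 }'
}

# Hypothetical helper: list the zombies on the running system with their parent PID.
list_zombies() {
    ps -eo pid,ppid,stat,comm | filter_zombies
}
```

Running list_zombies on an affected node should print one line per defunct process; if the PPID is the same for all of them, that parent is the process that has not collected its children.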
Environment
- Red Hat Enterprise Linux (RHEL) 6
- watchdog
- One or more scripts or programs exist in /etc/watchdog.d