RHEL host reboots on its own when all storage paths fail and 'kernel.hung_task_panic = 1'
Issue
- We're doing storage failure testing and when we take down all paths on one host, that host reboots on its own
- A cluster node is losing all paths to a GFS2 storage device and then it stops responding and gets fenced
- Why does a host reboot with
kernel.hung_task_panicwhen storage devices become inaccessible?
Environment
- Red Hat Enterprise Linux (RHEL) 5, 6, 7, 8, 9
kernel.hung_task_panic = 1in/etc/sysctl.conf, or/proc/sys/kernel/hung_task_paniccontains the value1- Utilizing
device-mapper-multipathfor managing redundant storage paths- Device is configured to queue I/O through
no_path_retryand/orqueue_if_no_path
- Device is configured to queue I/O through
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.