RHEL7.4: kdump fails due to megaraid_sas driver issues in kexec kernel

Solution In Progress

Issue

  • kdump fails: in the kexec kernel, megaraid_sas reports a failure from megasas_init_fw, so the kexec kernel cannot find the root disk. Console output:
[ 2316.719676] RIP  [<ffffffff81400816>] sysrq_handle_crash+0x16/0x20   <------------------------ original kernel
[ 2316.730786]  RSP <ffff880c116d3e58>
[ 2316.738876] CR2: 0000000000000000
[    0.000000] Initializing cgroup subsys cpuset    <-------------------- kexec kernel starts
[    0.000000] Initializing cgroup subsys cpu
[    0.000000] Initializing cgroup subsys cpuacct
[    0.000000] Linux version 3.10.0-693.17.1.el7.x86_64 (mockbuild@x86-041.build.eng.bos.redhat.com) (gcc version 4.8.5 20150623 (Red Hat 4.8.5-16) (GCC) ) #1 SMP Sun Jan 14 10:36:03 EST 2018
...
[  196.367537] megaraid_sas 0000:03:00.0: Ignore DCMD timeout: megasas_get_ctrl_info 4735
[  197.379488] megaraid_sas 0000:03:00.0: Could not get controller info. Fail from megasas_init_adapter_fusion 1370
[  197.393697] megaraid_sas 0000:03:00.0: Failed from megasas_init_fw 5537        <-------------- megaraid_sas fails to initialize the controller
[  310.971018] dracut-initqueue[268]: Warning: dracut-initqueue timeout - starting timeout scripts
[  311.490496] dracut-initqueue[268]: Warning: dracut-initqueue timeout - starting timeout scripts
[  312.000287] dracut-initqueue[268]: Warning: dracut-initqueue timeout - starting timeout scripts
...
[  371.658379] dracut-initqueue[268]: Warning: dracut-initqueue timeout - starting timeout scripts
[  371.670274] dracut-initqueue[268]: Warning: Could not boot.
[  371.678278] dracut-initqueue[268]: Warning: /dev/disk/by-uuid/e61f6978-2a9f-4a7d-b90b-fa08ccbe0d69 does not exist
[  371.691265] dracut-initqueue[268]: Warning: /dev/mapper/rhel-root does not exist   <----------- root device missing because megaraid_sas failed
[  371.701262] dracut-initqueue[268]: Warning: /dev/rhel/root does not exist
[  371.709260] dracut-initqueue[268]: Warning: /dev/rhel/swap does not exist
[  OK  ] Stopped target Basic System.
[  OK  ] Stopped target System Initialization.
         Starting Kdump Emergency...
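
To check a captured console log for the same failure signature, a small script can look for the two messages that matter in the output above: the megasas_init_fw failure and the dracut "does not exist" warnings. The following is only a minimal sketch, assuming the console output has been saved as text; the function name summarize_kdump_console and the regular expressions are illustrative and are not part of any Red Hat tool.

```python
import re

# Patterns derived from the console messages shown above.
# The names below are hypothetical helpers for illustration only.
DRIVER_FAILURE = re.compile(r"megaraid_sas \S+: Failed from megasas_init_fw")
DEVICE_MISSING = re.compile(
    r"dracut-initqueue\[\d+\]: Warning: (\S+) does not exist"
)


def summarize_kdump_console(log_text):
    """Return (driver_failed, missing_devices) for a console log string."""
    driver_failed = False
    missing_devices = []
    for line in log_text.splitlines():
        # The storage driver gave up initializing the controller.
        if DRIVER_FAILURE.search(line):
            driver_failed = True
        # dracut lists each device it waited for and never saw.
        match = DEVICE_MISSING.search(line)
        if match:
            missing_devices.append(match.group(1))
    return driver_failed, missing_devices
```

If the function returns True together with a non-empty device list, the log is consistent with the pattern described here: the driver failed first, and the root device never appeared afterwards. The log text would typically come from a serial console or remote management capture, since the kexec kernel could not write a vmcore.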

Environment

  • Red Hat Enterprise Linux 7.4
    • kernel-3.10.0-693.17.1.el7
    • kdump / kexec kernel
    • root filesystem on xfs over megaraid_sas driver
  • megaraid_sas driver as shipped by Red Hat (not a third-party build)
    • modinfo output (see Diagnostic Steps for more details):
filename:       /lib/modules/3.10.0-693.17.1.el7.x86_64/kernel/drivers/scsi/megaraid/megaraid_sas.ko.xz
description:    Avago MegaRAID SAS Driver
author:         megaraidlinux.pdl@avagotech.com
version:        07.701.17.00-rh1
license:        GPL
rhelversion:    7.4
srcversion:     5DA9AE7D4C4BA4FE7B77BBA
  • Hardware: Cisco UCS-C240-M4
  • Storage controller: MegaRAID SAS-3 3108 [Invader] [1000:005d] (rev 02)
    • lspci output (see Diagnostic Steps for detailed info):
03:00.0 RAID bus controller [0104]: LSI Logic / Symbios Logic MegaRAID SAS-3 3108 [Invader] [1000:005d] (rev 02)
    Subsystem: Cisco Systems Inc Device [1137:00db]
    Physical Slot: 0-2
...
    Kernel driver in use: megaraid_sas
    Kernel modules: megaraid_sas
