System crash at qedf_free_mp_resc+0x2b/0x310

Solution Unverified

Issue

  • The system crashes with a NULL pointer dereference in qedf_free_mp_resc():
[3268184.165534] BUG: unable to handle kernel NULL pointer dereference at 0000000000000248
[3268184.165548] IP: [<ffffffffc027ba0b>] qedf_free_mp_resc+0x2b/0x310 [qedf]
[3268184.165550] PGD 0 
[3268184.165553] Oops: 0000 [#1] SMP 
....
[3268184.165633] CPU: 23 PID: 30690 Comm: kworker/23:0 Kdump: loaded Tainted: G           OE  ------------   3.10.0-1062.1.1.el7.x86_64 #1
[3268184.165634] Hardware name: HPE Synergy 480 Gen10/Synergy 480 Gen10 Compute Module, BIOS I42 04/08/2020
[3268184.165644] Workqueue: qedf_3_timer qedf_cmd_timeout [qedf]
[3268184.165646] task: ffff89d2b7b98000 ti: ffff89d802b6c000 task.ti: ffff89d802b6c000
[3268184.165656] RIP: 0010:[<ffffffffc027ba0b>]  [<ffffffffc027ba0b>] qedf_free_mp_resc+0x2b/0x310 [qedf]
[3268184.165658] RSP: 0018:ffff89d802b6fd90  EFLAGS: 00010286
[3268184.165660] RAX: ffff89ae26b3d480 RBX: ffffb33c59b050b8 RCX: 0000000000000000
[3268184.165662] RDX: ffff89d9ce35c000 RSI: 0000000000000282 RDI: ffffb33c59b050b8
[3268184.165663] RBP: ffff89d802b6fdc0 R08: ffffffff9ea018bc R09: ffffffff9ea90f4a
[3268184.165665] R10: 00000000000709c0 R11: 0000000000100000 R12: 0000000000000000
[3268184.165667] R13: ffffb33c59aa9000 R14: 0000000000000000 R15: 00000000000005c0
....
[3268184.165678] Call Trace:
[3268184.165690]  [<ffffffffc027bd49>] qedf_release_cmd+0x59/0x90 [qedf]
[3268184.165701]  [<ffffffffc027df9d>] qedf_cmd_timeout+0x12d/0x240 [qedf]
[3268184.165710]  [<ffffffff9dabd0ff>] process_one_work+0x17f/0x440
[3268184.165715]  [<ffffffff9dabe216>] worker_thread+0x126/0x3c0
[3268184.165720]  [<ffffffff9dabe0f0>] ? manage_workers.isra.26+0x2a0/0x2a0
[3268184.165724]  [<ffffffff9dac50d1>] kthread+0xd1/0xe0
[3268184.165729]  [<ffffffff9dac5000>] ? insert_kthread_work+0x40/0x40
[3268184.165734]  [<ffffffff9e18cd37>] ret_from_fork_nospec_begin+0x21/0x21
[3268184.165738]  [<ffffffff9dac5000>] ? insert_kthread_work+0x40/0x40
[3268184.165780] Code: 0f 1f 44 00 00 55 48 89 e5 41 56 41 55 41 54 53 48 89 fb 48 83 ec 10 48 8b 97 80 01 00 00 48 8b 47 28 48 85 d2 4c 8b 60 28 74 70 <49> 8b 84 24 48 02 00 00 48 8b 8f 88 01 00 00 49 89 c6 49 81 c6 
[3268184.165788] RIP  [<ffffffffc027ba0b>] qedf_free_mp_resc+0x2b/0x310 [qedf]
[3268184.165789]  RSP <ffff89d802b6fd90>
[3268184.165789] CR2: 0000000000000248
  • Messages suggesting port events are logged shortly before the crash:
[3268170.859345] sd 3:0:2:6: [alua] Sense Key : Illegal Request [current] 
[3268170.860089] sd 3:0:2:6: [alua] Add. Sense: Logical unit not supported
[3268170.860819] device-mapper: multipath: Failing path 66:176.
[3268170.972878] [0000:61:00.3]:[qedf_scsi_completion:1166]:3: Firmware detected underrun: xid=0x205 fcp_rsp.flags=0x0a fcp_resid=8192 fw_residual=0x80.
[3268171.130967] sd 1:0:2:6: alua: port group 01 state A preferred supports tolusnA
[3268171.218920] sd 1:0:2:6: alua: port group 01 state A preferred supports tolusnA
[3268174.109497] [0000:61:00.3]:[qedf_cmd_timeout:52]:3: ABTS timeout, xid=0x781.
[3268174.110620] [0000:61:00.3]:[qedf_initiate_cleanup:1836]:3: io_req xid=0x781 already in cleanup processing or already completed.
[3268174.112120] [0000:61:00.3]:[qedf_restart_rport:358]:3: LOGO port_id=2ce80.
[3268174.112947] [0000:61:00.3]:[qedf_flush_active_ios:1474]:3: Could not get kref for ELS io_req=0xffffb33c59ab0b78 xid=0x2f.
[3268174.113748] [0000:61:00.3]:[qedf_flush_active_ios:1474]:3: Could not get kref for ELS io_req=0xffffb33c59ab1b38 xid=0x35.
[3268174.114535] [0000:61:00.3]:[qedf_flush_active_ios:1474]:3: Could not get kref for ELS io_req=0xffffb33c59ab1dd8 xid=0x36.
....
[3268174.176043] [0000:61:00.3]:[qedf_flush_active_ios:1474]:3: Could not get kref for ELS io_req=0xffffb33c59ba7f58 xid=0x612.
[3268174.176797] [0000:61:00.3]:[qedf_flush_active_ios:1474]:3: Could not get kref for ELS io_req=0xffffb33c59baa958 xid=0x622.
[3268174.177539] [0000:61:00.3]:[qedf_flush_active_ios:1490]:3: Could not get kref for abort io_req=0xffffb33c59baabf8 xid=0x623.
[3268175.194548] [0000:00:00.0]:[qedf_send_rrq:246]: fcport is no longer offloaded.
[3268175.386017] [0000:00:00.0]:[qedf_send_rrq:246]: fcport is no longer offloaded.
[3268179.663937] [0000:00:00.0]:[qedf_send_rrq:246]: fcport is no longer offloaded.
[3268180.845232] [0000:00:00.0]:[qedf_send_rrq:246]: fcport is no longer offloaded.
[3268180.972929] [0000:00:00.0]:[qedf_send_rrq:246]: fcport is no longer offloaded.
[3268184.165394] [0000:00:00.0]:[qedf_cmd_timeout:78]: ELS timeout, xid=0x232.
[3268184.165499] [0000:00:00.0]:[qedf_cmd_timeout:78]: ELS timeout, xid=0x231.
[3268184.165503] [0000:00:00.0]:[qedf_initiate_cleanup:1821]: tgt not offloaded
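
To compare a crash against this signature, the key fields of the oops can be pulled out of the log text. The following is a minimal Python sketch, not a Red Hat tool: the function name summarize_oops and the returned dictionary keys are illustrative assumptions, and the input is an excerpt of the oops above with timestamps removed. In this oops, the faulting bytes marked in the Code: line (<49> 8b 84 24 48 02 00 00) appear to decode to a load from offset 0x248 of R12, and R12 is zero in the register dump, which is consistent with the CR2 value of 0x248.

```python
import re

# Excerpt copied from the oops above (timestamps omitted for brevity).
OOPS = """\
BUG: unable to handle kernel NULL pointer dereference at 0000000000000248
IP: [<ffffffffc027ba0b>] qedf_free_mp_resc+0x2b/0x310 [qedf]
RAX: ffff89ae26b3d480 RBX: ffffb33c59b050b8 RCX: 0000000000000000
RBP: ffff89d802b6fdc0 R08: ffffffff9ea018bc R09: ffffffff9ea90f4a
R10: 00000000000709c0 R11: 0000000000100000 R12: 0000000000000000
CR2: 0000000000000248
"""


def summarize_oops(text):
    """Extract the faulting symbol, module, fault address and zeroed registers.

    Hypothetical helper for illustration; it assumes the RHEL 7 (3.10 kernel)
    oops layout shown in this article.
    """
    ip = re.search(
        r"IP: \[<[0-9a-f]+>\] (\w+)\+(0x[0-9a-f]+)/0x[0-9a-f]+ \[(\w+)\]", text
    )
    cr2 = re.search(r"CR2: ([0-9a-f]{16})", text)
    if ip is None or cr2 is None:
        raise ValueError("text does not look like a kernel oops")

    # General-purpose registers are printed as three-character names
    # (RAX, R08, ...) followed by a 16-digit hexadecimal value.
    regs = {
        name: int(value, 16)
        for name, value in re.findall(r"\b(R[A-Z0-9]{2}): ([0-9a-f]{16})", text)
    }

    return {
        "symbol": ip.group(1),
        "offset": int(ip.group(2), 16),
        "module": ip.group(3),
        "fault_addr": int(cr2.group(1), 16),
        # Registers holding zero are candidates for the NULL base pointer.
        "zero_regs": sorted(name for name, value in regs.items() if value == 0),
    }


if __name__ == "__main__":
    summary = summarize_oops(OOPS)
    print(summary["symbol"], summary["module"], hex(summary["fault_addr"]))
    print("zero registers:", ", ".join(summary["zero_regs"]))
```

A small fault address combined with a zeroed register usually means a structure member was read through a NULL pointer. The sketch only lists candidates; confirming which pointer was NULL still requires disassembling the function in the vmcore.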

Environment

  • Red Hat Enterprise Linux 7
