The kernel crashes with a message, "kernel BUG at lib/list_debug.c:47!"

Solution Verified - Updated -

Issue

  • The kernel crashes with a message, "kernel BUG at lib/list_debug.c:47!"
[3401516.867207] ------------[ cut here ]------------
[3401516.877050] kernel BUG at lib/list_debug.c:47!
[3401516.887807] invalid opcode: 0000 [#1] SMP NOPTI
[3401516.897914] CPU: 32 PID: 1946 Comm: kworker/32:1H Kdump: loaded Tainted: G        W  OE    --------- -  - 4.18.0-348.12.2.el8_5.jump2.x86_64 #1
[3401516.924856] Hardware name: Dell Inc. PowerEdge C6525/04DK47, BIOS 2.3.6 07/06/2021
[3401516.938803] Workqueue: ib-comp-wq ib_cq_poll_work [ib_core]
[3401516.951134] RIP: 0010:__list_del_entry_valid.cold.1+0x12/0x4c
[3401516.963338] Code: ca ff 0f 0b 48 89 c1 4c 89 c6 48 c7 c7 38 74 d1 88 e8 4c ac ca ff 0f 0b 48 89 fe 48 89 c2 48 c7 c7 c8 74 d1 88 e8 38 ac ca ff <0f> 0b 48 c7 c7 78 75 d1 88 e8 2a ac ca ff 0f 0b 48 89 f2 48 89 fe
[3401516.994114] RSP: 0018:ffffa81f9cb97dd8 EFLAGS: 00010246
[3401517.006211] RAX: 000000000000004e RBX: ffff9bed8ec1e300 RCX: 0000000000000000
[3401517.019913] RDX: 0000000000000000 RSI: ffff9b773fa16858 RDI: ffff9b773fa16858
[3401517.034500] RBP: ffff9c78c8170798 R08: 0000000000057a90 R09: 0000000000aaaaaa
[3401517.049026] R10: 0000000000000000 R11: ffffa81fb2eff020 R12: ffff9c78c8170000
[3401517.063092] R13: ffff9c50ede17000 R14: ffff9bed8ec1e300 R15: ffff9c50ede14080
[3401517.077677] FS:  0000000000000000(0000) GS:ffff9b773fa00000(0000) knlGS:0000000000000000
[3401517.093799] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[3401517.108331] CR2: 00007ffff62b8000 CR3: 0000019e8b284000 CR4: 0000000000150fe0
[3401517.122822] Call Trace:
[3401517.133382]  frwr_mr_recycle+0x43/0xe0 [rpcrdma]
[3401517.145495]  frwr_wc_localinv+0x48/0xa0 [rpcrdma]
[3401517.157600]  __ib_process_cq+0x89/0x170 [ib_core]
[3401517.171139]  ib_cq_poll_work+0x26/0x80 [ib_core]
[3401517.183995]  process_one_work+0x1a7/0x360
[3401517.196223]  ? create_worker+0x1a0/0x1a0
[3401517.208729]  worker_thread+0x30/0x390
[3401517.221252]  ? create_worker+0x1a0/0x1a0
[3401517.232893]  kthread+0x116/0x130
[3401517.244271]  ? kthread_flush_work_fn+0x10/0x10
[3401517.256632]  ret_from_fork+0x1f/0x40
[3401517.268418] Modules linked in: ... skipping ...

Environment

  • Red Hat Enterprise Linux 8.5.z - kernel-4.18.0-348.12.2.el8_5
    • RDMA provider (IOW InfiniBand driver) installed and loaded (e.g. mlx5 driver in this case)

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In

New to Red Hat?

Learn more about Red Hat subscriptions

Using a Red Hat product through a public cloud?

How to access this content