Kdump fails to generate vmcore on a system with an Intel SRF/SPR/EMR/GNR CPU

Solution Verified - Updated -

Issue

  • Kdump fails to generate vmcore on a system with an Intel SRF/SPR/EMR/GNR CPU.

  • The kdump kernel crashed due to a divide by zero error in the rebalance_wq_table() function of the module (iaa_crypto).

[   26.145092] idxd: crypto: iaa_crypto now ENABLED
[   26.151960] divide error: 0000 [#1] PREEMPT SMP NOPTI
[   26.218428] CPU: 0 PID: 384 Comm: systemd-udevd Not tainted 5.14.0-427.3.1.el9_4.x86_64 #1
[   26.227764] Hardware name: Intel Corporation AvenueCity/AvenueCity, BIOS xxxxxx.2780.D02.2311070514 11/07/2023
[   26.239626] RIP: 0010:rebalance_wq_table+0x19c/0x240 [iaa_crypto]
[   26.246522] Code: 74 c1 45 31 e4 8b 35 73 ca c7 c2 44 89 e2 48 39 f2 73 4f 48 89 ef e8 a3 5c 11 c1 39 05 5d ca c7 c2 89 c6 76 3d 31 d2 44 89 e0 <f7> 35 e6 b5 00 00 83 fa 01 41 83 d5 00 44 89 ef e8 df f9 ff ff 85
[   26.267647] RSP: 0018:ffa0000000a6bba0 EFLAGS: 00010246
[   26.273568] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000000
[   26.281632] RDX: 0000000000000000 RSI: 0000000000000000 RDI: ff1100005bffafc0
[   26.289696] RBP: ff1100005bffafc0 R08: 0000000000000001 R09: 0000000000000000
[   26.297760] R10: 0000000000000000 R11: ff1100005bffafc0 R12: 0000000000000000
[   26.305824] R13: 00000000ffffffff R14: 0000000000000000 R15: ff110000577738f0
[   26.313888] FS:  00007f9c7ee1e540(0000) GS:ff1100005ba00000(0000) knlGS:0000000000000000
[   26.323019] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[   26.329525] CR2: 0000559273091bc8 CR3: 000000005aae2006 CR4: 0000000000771eb0
[   26.337589] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[   26.345654] DR3: 0000000000000000 DR6: 00000000fffe07f0 DR7: 0000000000000400
[   26.353717] PKRU: 55555554
[   26.356809] Call Trace:
[   26.359603]  <TASK>
[   26.362012]  ? show_trace_log_lvl+0x1c4/0x2df
[   26.366954]  ? show_trace_log_lvl+0x1c4/0x2df
[   26.371892]  ? iaa_crypto_probe+0xff/0x240 [iaa_crypto]
[   26.377815]  ? __die_body.cold+0x8/0xd
[   26.382070]  ? die+0x2b/0x50
[   26.385357]  ? do_trap+0xce/0x120
[   26.389132]  ? rebalance_wq_table+0x19c/0x240 [iaa_crypto]
[   26.395341]  ? do_error_trap+0x65/0x80
[   26.399595]  ? rebalance_wq_table+0x19c/0x240 [iaa_crypto]
[   26.405803]  ? exc_divide_error+0x36/0x50
[   26.410347]  ? rebalance_wq_table+0x19c/0x240 [iaa_crypto]
[   26.416556]  ? asm_exc_divide_error+0x16/0x20
[   26.421496]  ? rebalance_wq_table+0x19c/0x240 [iaa_crypto]
[   26.427705]  iaa_crypto_probe+0xff/0x240 [iaa_crypto]
[   26.433423]  really_probe+0xde/0x390
[   26.437489]  ? pm_runtime_barrier+0x50/0x90
[   26.442235]  __driver_probe_device+0xd6/0x130
[   26.447172]  driver_probe_device+0x1e/0x90
[   26.451822]  __driver_attach+0xd2/0x1c0
[   26.456172]  ? __pfx___driver_attach+0x10/0x10
[   26.461217]  bus_for_each_dev+0x75/0xd0
[   26.465580]  bus_add_driver+0xc2/0x1f0
[   26.469834]  driver_register+0x70/0xd0
[   26.474089]  ? __pfx_init_module+0x10/0x10 [iaa_crypto]
[   26.480010]  iaa_crypto_init_module+0x15c/0xff0 [iaa_crypto]
[   26.486410]  do_one_initcall+0x41/0x210
[   26.490775]  ? kmalloc_trace+0x25/0xa0
[   26.495031]  do_init_module+0x5c/0x270
[   26.499288]  __do_sys_init_module+0x12e/0x1b0
[   26.504227]  do_syscall_64+0x59/0x90
[   26.508290]  ? handle_mm_fault+0xcd/0x290
[   26.512833]  ? do_user_addr_fault+0x1d6/0x6a0
[   26.517772]  ? exc_page_fault+0x62/0x150
[   26.522230]  entry_SYSCALL_64_after_hwframe+0x72/0xdc
[   26.527947] RIP: 0033:0x7f9c7fa64f1e
[   26.532017] Code: 48 8b 0d 05 af 0e 00 f7 d8 64 89 01 48 83 c8 ff c3 66 2e 0f 1f 84 00 00 00 00 00 90 f3 0f 1e fa 49 89 ca b8 af 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d d2 ae 0e 00 f7 d8 64 89 01 48
[   26.553142] RSP: 002b:00007ffd1d68d7c8 EFLAGS: 00000246 ORIG_RAX: 00000000000000af
[   26.561686] RAX: ffffffffffffffda RBX: 0000559272857c40 RCX: 00007f9c7fa64f1e
[   26.569750] RDX: 00007f9c7fbc632c RSI: 0000000000024d68 RDI: 0000559273070c60
[   26.577814] RBP: 0000559273070c60 R08: 0000559272859570 R09: 0000000000024010
[   26.585877] R10: 0000000000000005 R11: 0000000000000246 R12: 00007f9c7fbc632c
[   26.593942] R13: 0000559272857bd0 R14: 0000000000000007 R15: 000055927285b9b0
[   26.602006]  </TASK>

Environment

  • Red Hat Enterprise Linux 9.4
  • kernel-5.14.0-427.13.1.el9
  • Intel Sierra Forest (SRF) CPU-based system
  • Intel Sapphire Rapids (SPR) CPU-based system
  • Intel Emerald Rapids (EMR) CPU-based system
  • Intel Granite Rapids (GNR) CPU-based system
  • Intel IAA device
  • Single CPU (nr_cpus=1)

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In

New to Red Hat?

Learn more about Red Hat subscriptions

Using a Red Hat product through a public cloud?

How to access this content