A hard lockup occurs where many IPIs for TLB flushing are crushing into one CPU

Solution Verified - Updated -

Issue

  • A hard lockup occurs where many IPIs for TLB flushing are crushing into one CPU
[468984.926833] Kernel panic - not syncing: Hard LOCKUP
[468984.926834] CPU: 24 PID: 3622885 Comm: python Kdump: loaded Tainted: P        W  OE    --------- -t - 4.18.0-240.el8.x86_64 #1
[468984.926834] Hardware name: Dell Inc. PowerEdge XE8545/099K88, BIOS 2.2.5 04/14/2021
[468984.926834] Call Trace:
[468984.926834]  <NMI>
[468984.926835]  dump_stack+0x5c/0x80
[468984.926835]  panic+0xe7/0x2a9
[468984.926835]  nmi_panic.cold.9+0xc/0xc
[468984.926835]  watchdog_overflow_callback.cold.7+0x5c/0x70
[468984.926836]  __perf_event_overflow+0x52/0xf0
[468984.926836]  x86_pmu_handle_irq+0x106/0x160
[468984.926836]  ? __set_pte_vaddr+0x32/0x50
[468984.926836]  ? __native_set_fixmap+0x24/0x30
[468984.926837]  ? ghes_copy_tofrom_phys+0xd3/0x1c0
[468984.926837]  ? __ghes_peek_estatus.isra.12+0x49/0xa0
[468984.926837]  ? ghes_notify_nmi+0x86/0x310
[468984.926837]  amd_pmu_handle_irq+0xa/0x40
[468984.926838]  perf_event_nmi_handler+0x2d/0x50
[468984.926838]  nmi_handle+0x63/0x110
[468984.926838]  default_do_nmi+0x4e/0x100
[468984.926838]  do_nmi+0x128/0x190
[468984.926838]  end_repeat_nmi+0x16/0x6a
[468984.926839] RIP: 0010:flush_smp_call_function_queue+0x4d/0xe0
[468984.926839] Code: 9e 2e 00 48 89 c3 45 84 e4 74 17 eb 4f e8 bb b1 aa 00 f6 43 18 01 74 3c c7 43 18 00 00 00 00 48 89 eb 48 85 db 74 24 8b 43 18 <48> 8b 2b 48 8b 53 08 48 8b 7b 10 a8 02 75 d4 a8 01 74 1b c7 43 18
[468984.926839] RSP: 0000:ffff9affbf803e88 EFLAGS: 00000082
[468984.926840] RAX: 0000000000000003 RBX: ffffba7670b677e0 RCX: 0000000000029640
[468984.926840] RDX: ffff9b1fb0b47900 RSI: 0000000000000001 RDI: ffffba767131b840
[468984.926841] RBP: ffffba7670b677e0 R08: 0000000000000002 R09: 0000000000029600
[468984.926841] R10: 000468da5ee3a78a R11: 0000000000000000 R12: 0000000000000001
[468984.926841] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
[468984.926842]  ? flush_smp_call_function_queue+0x4d/0xe0
[468984.926842]  ? flush_smp_call_function_queue+0x4d/0xe0
[468984.926842]  </NMI>
[468984.926842]  <IRQ>
[468984.926843]  smp_call_function_single_interrupt+0x3a/0xd0
[468984.926843]  call_function_single_interrupt+0xf/0x20
[468984.926843] RIP: 0010:__do_softirq+0x77/0x2f8
[468984.926844] Code: 05 ba 5b e1 56 00 01 00 00 c7 44 24 20 0a 00 00 00 44 89 74 24 04 48 c7 c0 80 95 02 00 65 66 c7 00 00 00 fb 66 0f 1f 44 00 00 <b8> ff ff ff ff 48 c7 c6 00 51 80 a9 0f bc 44 24 04 83 c0 01 49 89
[468984.926844] RSP: 0000:ffff9affbf803f70 EFLAGS: 00000206 ORIG_RAX: ffffffffffffff04
[468984.926845] RAX: 0000000000029580 RBX: ffff9affbf815fc0 RCX: 000000000000001f
[468984.926845] RDX: 000000000000001e RSI: 0000000030648809 RDI: ffffffc2e9914e60
[468984.926845] RBP: 0000000000000000 R08: 0000000000000002 R09: 0000000000029600
[468984.926846] R10: 000468cbba5ef054 R11: 0000000000000001 R12: 000000000000000
[468984.926846] Lost 61 message(s)!

Environment

  • Red Hat Enterprise Linux 8.3 - kernel-4.18.0-240.el8

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In

New to Red Hat?

Learn more about Red Hat subscriptions

Using a Red Hat product through a public cloud?

How to access this content