A hard lockup that is probably caused by the faulty hardware

Solution Verified - Updated -

Issue

  • A hard lockup happened on one of CPUs that just idled:
[2277765.557305] Kernel panic - not syncing: Hard LOCKUP
[2277765.557306] CPU: 4 PID: 0 Comm: swapper/4 Kdump: loaded Tainted: G        W  OE    --------- -  - 4.18.0-305.57.1.el8_4.x86_64 #1
[2277765.557306] Hardware name: Lenovo ThinkSystem SR630 -[7X02CTO1WW]-/-[7X02CTO1WW]-, BIOS -[IVE178I-3.31]- 05/04/2022
[2277765.557307] Call Trace:
[2277765.557307]  <NMI>
[2277765.557307]  dump_stack+0x5c/0x80
[2277765.557307]  panic+0xe7/0x2a9
[2277765.557308]  ? secondary_startup_64_no_verify+0xbc/0xcb
[2277765.557308]  nmi_panic.cold.9+0xc/0xc
[2277765.557308]  watchdog_overflow_callback.cold.7+0x5c/0x70
[2277765.557309]  __perf_event_overflow+0x52/0xf0
[2277765.557309]  handle_pmi_common+0x204/0x2a0
[2277765.557309]  ? __set_pte_vaddr+0x32/0x50
[2277765.557309]  ? __native_set_fixmap+0x24/0x30
[2277765.557310]  ? ghes_copy_tofrom_phys+0xd3/0x1c0
[2277765.557310]  intel_pmu_handle_irq+0xbf/0x160
[2277765.557310]  perf_event_nmi_handler+0x2d/0x50
[2277765.557311]  nmi_handle+0x63/0x110
[2277765.557311]  default_do_nmi+0x49/0x100
[2277765.557311]  do_nmi+0x183/0x1e0
[2277765.557311]  end_repeat_nmi+0x16/0x6f
[2277765.557312] RIP: 0010:mwait_idle+0x61/0x80
[2277765.557312] Code: 48 8b 04 25 40 5c 01 00 48 89 d1 0f 01 c8 48 8b 00 a8 08 75 17 e9 07 00 00 00 0f 00 2d 7a 3a 4b 00 31 c0 48 89 c1 fb 0f 01 c9 <eb> 07 fb 66 0f 1f 44 00 00 65 48 8b 04 25 40 5c 01 00 f0 80 60 02
[2277765.557313] RSP: 0018:ffffa7dd4c943ea8 EFLAGS: 00000246
[2277765.557313] RAX: 0000000000000000 RBX: 0000000000000004 RCX: 0000000000000000
[2277765.557313] RDX: 0000000000000000 RSI: 0000000000000004 RDI: ffff9866df91d640
[2277765.557314] RBP: 0000000000000004 R08: ffffffffaf406040 R09: 0000000000000000
[2277765.557314] R10: 0000000000000000 R11: 0000000000000001 R12: ffffffffffffffff
[2277765.557314] R13: 0000000000000000 R14: 0000000000000000 R15: ffff985000c31ec0
[2277765.557315]  ? mwait_idle+0x61/0x80
[2277765.557315]  ? mwait_idle+0x61/0x80
[2277765.557315]  </NMI>
[2277765.557315]  default_idle_call+0x40/0xf0
[2277765.557316]  do_idle+0x1f4/0x260
[2277765.557316]  cpu_startup_entry+0x6f/0x80
[2277765.557316]  start_secondary+0x199/0x1e0
[2277765.557317]  secondary_startup_64_no_verify+0xc2/0xcb

Environment

  • Red Hat Enterprise Linux
  • Lenovo ThinkSystem

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In

New to Red Hat?

Learn more about Red Hat subscriptions

Using a Red Hat product through a public cloud?

How to access this content