IOCHK error causes hpwdt to invoke an NMI panic. At that time, the exception %rip shows an invalid address within the range of the mapped memory region that is vmalloc-ed by a 3rd party module named "vboxdrv".

Solution Unverified - Updated -

Issue

  • IOCHK error causes hpwdt to invoke an NMI panic. At that time, the exception %rip shows an invalid address within the range of the mapped memory region that is vmalloc-ed by a 3rd party module named "vboxdrv".
[5076850.120271] Kernel panic - not syncing: 00: An NMI occurred. Depending on your system the reason for the NMI is logged in any one of the following resources:
                 1. Integrated Management Log (IML)
                 2. OA Syslog
                 3. OA Forward Progress Log
                 4. iLO Event Log
[5076850.121486] CPU: 0 PID: 52417 Comm: EMT Kdump: loaded Tainted: P        W  OE  ------------   3.10.0-1160.6.1.el7.x86_64 #1
[5076850.121829] Hardware name: HP ProLiant BL460c Gen8, BIOS I31 05/24/2019
[5076850.122004] Call Trace:
[5076850.122177]  <NMI>  [<ffffffff96781400>] dump_stack+0x19/0x1b
[5076850.122364]  [<ffffffff9677a958>] panic+0xe8/0x21f
[5076850.122554]  [<ffffffff9609af0f>] nmi_panic+0x3f/0x40
[5076850.122732]  [<ffffffffc03b73df>] hpwdt_pretimeout+0x6f/0xb0 [hpwdt]
[5076850.122909]  [<ffffffff9678c93c>] nmi_handle.isra.0+0x8c/0x150
[5076850.123086]  [<ffffffff9678b1bb>] ? save_paranoid+0xfb/0x140
[5076850.123262]  [<ffffffff9678cbc4>] do_nmi+0x1c4/0x460
[5076850.123439]  [<ffffffff96030e71>] ? iommu_shutdown_noop+0x1/0x10
[5076850.123616]  [<ffffffff9678bd9c>] end_repeat_nmi+0x1e/0x81
[5076850.123791]  <EOE>  [<ffffffff960e70f8>] ? enqueue_task_fair+0x208/0x6c0
[5076850.123976]  [<ffffffff960de1d5>] ? sched_clock_cpu+0x85/0xc0
[5076850.124154]  [<ffffffff9603bf30>] ? send_sigtrap+0xa0/0xa0
[5076850.124336]  [<ffffffff96184254>] ? filter_match_preds_cb+0x124/0x180
[5076850.124519]  [<ffffffff960d71f0>] ? check_preempt_curr+0x80/0xa0
[5076850.124702]  [<ffffffff96183b88>] ? walk_pred_tree+0x58/0x110
[5076850.124884]  [<ffffffff96183ca0>] ? filter_match_preds+0x60/0xa0
[5076850.125068]  [<ffffffff961a849c>] ? perf_tp_event_match.isra.81+0x4c/0x60
[5076850.125246]  [<ffffffff967931b8>] ? fetch_memory_u64+0x58/0xd0
[5076850.125433]  [<ffffffffc0999848>] ? VBoxHost_RTThreadCtxHookEnable+0x28/0x40 [vboxdrv]
[5076850.125781]  [<ffffffffc0984c5b>] ? supdrvIOCtlFast+0x5b/0xb0 [vboxdrv]
[5076850.125962]  [<ffffffffc0980535>] ? VBoxDrvLinuxIOCtl_6_1_8+0x55/0x230 [vboxdrv]
[5076850.126302]  [<ffffffff9603bf35>] ? syscall_trace_enter+0x5/0x210
[5076850.126479]  [<ffffffff9603bf30>] ? send_sigtrap+0xa0/0xa0
[5076850.126657]  [<ffffffff96263320>] ? do_vfs_ioctl+0x3a0/0x5b0
[5076850.126834]  [<ffffffff96798bd4>] ? ftrace_regs_call+0x5/0x81
[5076850.127009]  [<ffffffff962635d1>] ? SyS_ioctl+0xa1/0xc0
[5076850.127185]  [<ffffffff9613e4a4>] ? __audit_syscall_entry+0xb4/0x110
[5076850.127361]  [<ffffffff96795226>] ? tracesys+0xa6/0xcc
crash> bt -p
PID: 52417  TASK: ffff92ce6c686300  CPU: 0   COMMAND: "EMT"
 #0 [ffff92ce9e608c70] machine_kexec at ffffffff96066294
 #1 [ffff92ce9e608cd0] __crash_kexec at ffffffff961225b2
 #2 [ffff92ce9e608da0] panic at ffffffff9677a963
 #3 [ffff92ce9e608e20] nmi_panic at ffffffff9609af0f
 #4 [ffff92ce9e608e30] hpwdt_pretimeout at ffffffffc03b73df [hpwdt]
 #5 [ffff92ce9e608e58] nmi_handle at ffffffff9678c93c
 #6 [ffff92ce9e608eb0] do_nmi at ffffffff9678cbc4
 #7 [ffff92ce9e608ef0] end_repeat_nmi at ffffffff9678bd9c
    [exception RIP: no symbolic reference]
    RIP: ffffffffc000cd20  RSP: ffff92c98e1fbae0  RFLAGS: 00000002
    RAX: ffffea9e0143e360  RBX: ffffea9e0143e320  RCX: dead000000000200
    RDX: ffffea9e0114bfe0  RSI: 0000000000000001  RDI: ffffea9e0143e320
    RBP: ffff8cc6a611fc58   R8: ffffea9e0143e320   R9: 0000000000000000
    R10: ffff8cc6dfb76f80  R11: ffffffffffffffff  R12: ffff8cc6a6933190
    R13: 0000000000000006  R14: ffffea9e0143e320  R15: ffff8cc6dffc9800
    ORIG_RAX: ffffffffffffffff  CS: 0010  SS: 0018
--- <NMI exception stack> ---
 #9 [ffff92c98e1fbb40] enqueue_task_fair at ffffffff960e70f8
#10 [ffff92c98e1fbbb0] walk_pred_tree at ffffffff96183b88
#11 [ffff92c98e1fbc00] filter_match_preds at ffffffff96183ca0
#12 [ffff92c98e1fbc30] perf_tp_event_match at ffffffff961a849c
#13 [ffff92c98e1fbcf0] fetch_memory_u64 at ffffffff967931b8
#14 [ffff92c98e1fbd68] VBoxHost_RTThreadCtxHookEnable at ffffffffc0999848 [vboxdrv]
#15 [ffff92c98e1fbe10] supdrvIOCtlFast at ffffffffc0984c5b [vboxdrv]
#16 [ffff92c98e1fbe20] VBoxDrvLinuxIOCtl_6_1_8 at ffffffffc0980535 [vboxdrv]
#17 [ffff92c98e1fbe80] do_vfs_ioctl at ffffffff96263320
#18 [ffff92c98e1fbf00] sys_ioctl at ffffffff962635d1
#19 [ffff92c98e1fbf50] tracesys at ffffffff96795226 (via system_call)
    RIP: 00007f85ba892307  RSP: 00007f859fb78d38  RFLAGS: 00000246
    RAX: ffffffffffffffda  RBX: 00007f85bb008000  RCX: ffffffffffffffff
    RDX: 0000000000000000  RSI: 00000000000056c0  RDI: 0000000000000007
    RBP: 00007f859fb78d40   R8: 00000000bafae201   R9: 0000000000000000
    R10: 0000000000000001  R11: 0000000000000246  R12: 00007f85baff3000
    R13: 00007f85bb008000  R14: ffffffffefffffff  R15: 00007f859fb78e1b
    ORIG_RAX: 0000000000000010  CS: 0033  SS: 002b

Environment

  • Red Hat Enterprise Linux 7.9 (kernel-3.10.0-1160.6.1.el7)
  • HPE ProLiant BL460c Gen8
  • A 3rd party module named "vboxdrv" is installed and loaded

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In

New to Red Hat?

Learn more about Red Hat subscriptions

Using a Red Hat product through a public cloud?

How to access this content