The kernel crashes due to a double fault on a RHEL guest running on VMware ESXi

Solution Verified

Issue

  • The kernel crashed due to a double fault on a RHEL guest running on VMware ESXi:
[3715163.400007] BUG: unable to handle kernel paging request at ffffffffb5c01130
[3715163.400035] PGD 80a13067 P4D 80a13067 PUD 80a14063 PMD 0 
[3715163.400053] Thread overran stack, or stack corrupted
[3715163.400067] Oops: 0010 [#1] SMP NOPTI
[3715163.400080] CPU: 1 PID: 0 Comm: swapper/1 Kdump: loaded Not tainted 4.18.0-425.10.1.el8_7.x86_64 #1
[3715163.400104] Hardware name: VMware, Inc. VMware Virtual Platform/440BX Desktop Reference Platform, BIOS 6.00 11/12/2020
[3715163.400130] RIP: 0010:0xffffffffb5c01130
[3715163.400146] Code: Unable to access opcode bytes at RIP 0xffffffffb5c01106.
[3715163.400163] RSP: 0018:fffffe000003af10 EFLAGS: 00010046
[3715163.400178] RAX: 0000000080004000 RBX: 0000000000000001 RCX: 0000000000000020
[3715163.400196] RDX: 0000000000000001 RSI: ffffffff94ac41c0 RDI: ffffa0ecc2b02064
[3715163.400214] RBP: ffffa0ecc2b02064 R08: 0000000000000001 R09: ffffa0ecc2b02000
[3715163.400232] R10: 0000000000000454 R11: ffffa0eef1e69c04 R12: 0000000000000001
[3715163.400269] R13: ffffffff94ac41c0 R14: 0000000000000001 R15: 0000000000000001
[3715163.400292] FS:  0000000000000000(0000) GS:ffffa0eef1e40000(0000) knlGS:0000000000000000
[3715163.400312] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[3715163.400328] CR2: ffffffffb5c01106 CR3: 000000012a34e000 CR4: 0000000000350ee0
[3715163.400363] Call Trace:
[3715163.400373]  <#DF>
[3715163.400382]  </#DF>
[3715163.400390] WARNING: stack recursion on stack type 5
[3715163.400390] Modules linked in: [...]
[3715163.400575] CR2: ffffffffb5c01130

PID: 0        TASK: ffffa0ecc1ea0000  CPU: 1    COMMAND: "swapper/1"
 #0 [fffffe000003ac38] machine_kexec at ffffffff92e68863
 #1 [fffffe000003ac90] __crash_kexec at ffffffff92fafc4d
 #2 [fffffe000003ad58] crash_kexec at ffffffff92fb0b81
 #3 [fffffe000003ad70] oops_end at ffffffff92e28661
 #4 [fffffe000003ad90] no_context at ffffffff92e79be3
 #5 [fffffe000003ade8] __bad_area_nosemaphore at ffffffff92e79f44
 #6 [fffffe000003ae30] do_page_fault at ffffffff92e7a7f7
 #7 [fffffe000003ae60] page_fault at ffffffff9380114e
    [exception RIP: unknown or invalid address]
    RIP: ffffffffb5c01130  RSP: fffffe000003af10  RFLAGS: 00010046
    RAX: 0000000080004000  RBX: 0000000000000001  RCX: 0000000000000020
    RDX: 0000000000000001  RSI: ffffffff94ac41c0  RDI: ffffa0ecc2b02064
    RBP: ffffa0ecc2b02064   R8: 0000000000000001   R9: ffffa0ecc2b02000
    R10: 0000000000000454  R11: ffffa0eef1e69c04  R12: 0000000000000001
    R13: ffffffff94ac41c0  R14: 0000000000000001  R15: 0000000000000001
    ORIG_RAX: ffffffffffffffff  CS: 0010  SS: 0018
    [exception RIP: unknown or invalid address]
    RIP: ffffffffb5c01130  RSP: fffffe000003a010  RFLAGS: 00010046
    RAX: ffffffffb5c00d90  RBX: 0000000000000010  RCX: 0000000000000010
    RDX: 0000000000010046  RSI: fffffe000003afd0  RDI: 0000000000000018
    RBP: ffffffffb5c01130   R8: 0000000000000010   R9: 0000000000000018
    R10: fffffe000003afa0  R11: 0000000000010046  R12: 0000000000000010
    R13: 0000000000000018  R14: fffffe000003af70  R15: 0000000000010046
    ORIG_RAX: 0000000000000000  CS: 0010  SS: 0018
WARNING: possibly bogus exception frame
--- <DOUBLEFAULT exception stack> ---
    [ DOUBLEFAULT exception stack recursion: prior stack location overwritten ]
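
The error code on the "Oops: 0010" line can be decoded with the standard x86 page-fault error-code bits. The following is a minimal sketch, not part of the original diagnosis: the helper name decode_pf_error_code is hypothetical, and the values are copied from the log above. It only shows that the fault was a kernel-mode instruction fetch from a non-present page and that the faulting address equals RIP. It draws no conclusion about the root cause.

```python
# Hardware-defined x86 page-fault error-code bits:
# (mask, meaning when set, meaning when clear or None to omit).
PF_BITS = [
    (0x01, "protection violation", "page not present"),
    (0x02, "write access", "read access"),
    (0x04, "user mode", "kernel mode"),
    (0x08, "reserved bit set in page table", None),
    (0x10, "instruction fetch", None),
]


def decode_pf_error_code(code):
    """Hypothetical helper: return a readable list for a page-fault error code."""
    meanings = []
    for mask, if_set, if_clear in PF_BITS:
        if code & mask:
            meanings.append(if_set)
        elif if_clear is not None:
            meanings.append(if_clear)
    return meanings


# Values copied from the kernel log above.
oops_code = int("0010", 16)        # "Oops: 0010 [#1] SMP NOPTI"
fault_addr = 0xFFFFFFFFB5C01130    # "unable to handle kernel paging request at ..."
rip = 0xFFFFFFFFB5C01130           # "RIP: 0010:0xffffffffb5c01130"

print(decode_pf_error_code(oops_code))
print("fault address equals RIP:", fault_addr == rip)
```

For this log the decoded list contains "instruction fetch", which matches the faulting address being identical to RIP: the CPU tried to execute code at an address that is not mapped.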

Environment

  • Red Hat Enterprise Linux 8.7.z, kernel-4.18.0-425.10.1.el8_7
  • Dell PowerEdge R7525, BIOS version 2.10.2
  • 2x AMD EPYC 7542 32-Core (0x8301052), 32x DDR4 3200 MHz 64 GB
  • VMware ESXi, 7.0.3 L, 21424296
    • This list is not exhaustive. Similar issues can occur because of a hypervisor malfunction or faulty hardware even when the server, processor, or OS model or version differs.
