Unexplained Page Fault triggers kernel panic on ASRockRack hardware

Solution Unverified - Updated -

Issue

  • The kernel panics because of an unexplained page fault (memory request). The panic may be logged as any of the following, among others:
    • A BUG: unable to handle kernel paging request at <address> panic message
    • A general protection fault (GPF) panic message
    • A double fault occurring on two CPUs (can occur prior to the actual panic)
    • A CPU Hard Lockup
  • The exception instruction pointer (RIP) has been seen in, but is not limited to, the functions below:
    • memcpy_erms()
    • clear_page_erms()
    • copy_page()
    • dup_fd()
    • do_double_fault()
    • tick_nohz_next_event() (seen during hard lockup)
  • A kernel warning at arch/x86/kernel/alternative.c, with RIP in __text_poke(), may also be logged at boot time or shortly after; its absence does not rule out this issue
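
As a rough aid for triage, the signatures listed above can be searched for in a saved kernel log. The following is a minimal sketch, not a Red Hat tool: the function name scan_kernel_log and the pattern names are hypothetical, and the patterns are kept loose on the assumption that exact kernel wording differs between releases.

```python
import re

# Signatures taken from the Issue list above. Exact kernel wording can vary
# between releases, so these patterns are deliberately loose (assumption).
PANIC_PATTERNS = {
    "paging_request": re.compile(r"BUG: unable to handle kernel paging request at"),
    "general_protection_fault": re.compile(r"general protection fault", re.I),
    "double_fault": re.compile(r"double fault", re.I),
    "hard_lockup": re.compile(r"hard lockup", re.I),
    "text_poke_warning": re.compile(r"arch/x86/kernel/alternative\.c"),
}

# Functions the article reports seeing in RIP (not an exhaustive list).
RIP_FUNCTIONS = (
    "memcpy_erms", "clear_page_erms", "copy_page", "dup_fd",
    "do_double_fault", "tick_nohz_next_event", "__text_poke",
)
RIP_PATTERN = re.compile(
    r"RIP:.*?\b(" + "|".join(map(re.escape, RIP_FUNCTIONS)) + r")\b"
)


def scan_kernel_log(text):
    """Return the signature names and RIP functions found in a kernel log."""
    hits = {"signatures": set(), "rip_functions": set()}
    for line in text.splitlines():
        for name, pattern in PANIC_PATTERNS.items():
            if pattern.search(line):
                hits["signatures"].add(name)
        match = RIP_PATTERN.search(line)
        if match:
            hits["rip_functions"].add(match.group(1))
    return hits
```

The input could be, for example, the text of dmesg output or a vmcore-dmesg.txt file written by kdump. A match only indicates that the log resembles the symptoms described here; it does not confirm the root cause.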

Environment

  • ASRockRack hardware
    • For example: SO103A_G11/SPC621D8HM3/1U3
  • Red Hat Enterprise Linux (RHEL) 8 (seen mostly on this release, but not exclusive to it)
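
To check whether a system matches the hardware listed above, the board vendor and name can be read from the DMI files that the kernel exposes under /sys/class/dmi/id. The sketch below is illustrative only: the helper names are hypothetical, and the assumption that the vendor string contains "ASRockRack" (ignoring case and spaces) is not confirmed by this article.

```python
from pathlib import Path


def read_dmi_board(dmi_dir="/sys/class/dmi/id"):
    """Return (board_vendor, board_name); a field is None if it cannot be read."""
    def read(field):
        try:
            return (Path(dmi_dir) / field).read_text().strip()
        except OSError:
            return None
    return read("board_vendor"), read("board_name")


def looks_like_asrockrack(vendor):
    # Assumption: the vendor string contains "ASRockRack", ignoring case
    # and spaces. Check the actual string reported by the system.
    return vendor is not None and "asrockrack" in vendor.replace(" ", "").lower()
```

Calling read_dmi_board() with no argument reads the live system; passing a directory makes it possible to run the same check against files copied from another host.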
