ProLiant DL380 Gen10 is experiencing memory corruption or random crashes with 3.10.0-862.el7

Solution Verified - Updated -

Issue

  • Several ProLiant DL380 servers have been resetting suddenly and unexpectedly, with nothing logged in the OS logs, since the kernel was upgraded from RHEL 7.3 to RHEL 7.5 (3.10.0-862.el7).
  • The reboots do not generate any messages, and vmcore files are rarely captured.
  • The vmcore files that were captured suggest possible memory corruption, which is not caught even with slub_debug=zp.
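
To confirm whether slub_debug is actually active on a running system, the kernel command line can be inspected. The following is a minimal sketch, not part of the original solution: the function name slub_debug_flags is hypothetical, and it only reports what was passed at boot (here the flags z and p, which request red zoning and poisoning).

```shell
#!/bin/sh
# Hypothetical helper (not from the article): print the value of any
# slub_debug= option found in a kernel command line string.
slub_debug_flags() {
    # $1 is deliberately unquoted so the command line splits on whitespace.
    for arg in $1; do
        case "$arg" in
            slub_debug=*) printf '%s\n' "${arg#slub_debug=}" ;;
        esac
    done
}

# Report the setting for the running kernel, if /proc/cmdline is readable.
if [ -r /proc/cmdline ]; then
    flags=$(slub_debug_flags "$(cat /proc/cmdline)")
    if [ -n "$flags" ]; then
        echo "slub_debug is set: $flags"
    else
        echo "slub_debug is not set on the kernel command line"
    fi
fi
```

The sketch only handles the slub_debug=<flags> form; a bare slub_debug with no value would not be reported.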

Environment

  • Red Hat Enterprise Linux 7.5 (3.10.0-862.el7.x86_64)
  • HPE/ProLiant DL380 Gen10 or BL920s Gen9
  • UEFI boot loader
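
A quick way to check whether a host matches this environment is sketched below. It is an illustration only: the function names is_affected_kernel and boot_mode are hypothetical, and the kernel match covers just the 3.10.0-862.el7 build named above, since the article does not state whether later errata kernels are affected.

```shell
#!/bin/sh
# Hypothetical helper: succeed if the given kernel release string is the
# 3.10.0-862.el7 build named in this article (for example
# 3.10.0-862.el7.x86_64). Later errata builds are intentionally not matched.
is_affected_kernel() {
    case "$1" in
        3.10.0-862.el7*) return 0 ;;
        *) return 1 ;;
    esac
}

# Hypothetical helper: print "uefi" if the system booted through UEFI
# firmware (the kernel exposes /sys/firmware/efi in that case), else "bios".
boot_mode() {
    if [ -d /sys/firmware/efi ]; then
        echo uefi
    else
        echo bios
    fi
}

release=$(uname -r)
if is_affected_kernel "$release"; then
    echo "kernel: $release (matches 3.10.0-862.el7)"
else
    echo "kernel: $release (does not match 3.10.0-862.el7)"
fi
echo "boot mode: $(boot_mode)"

# The DMI product name identifies the server model where the file is present.
if [ -r /sys/class/dmi/id/product_name ]; then
    echo "model: $(cat /sys/class/dmi/id/product_name)"
fi
```

A host is in scope only if all three items line up with the list above: the kernel build, a UEFI boot, and one of the listed HPE models.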
