System crashes at region_chg() with corrupted list for alloc_huge_page in Red Hat Enteprise Linux 7

Solution In Progress - Updated -

Issue

  • System crashes with corrupted list.
------------[ cut here ]------------
WARNING: at lib/list_debug.c:62 __list_del_entry+0x82/0xd0()
list_del corruption. next->prev should be ffff881d2a35a160, but was ffff883f237befa0
[...]
CPU: 13 PID: 114773 Comm: java Not tainted 3.10.0-229.el7.x86_64 #1
[...]
Call Trace:
 [<ffffffff81603f36>] dump_stack+0x19/0x1b
 [<ffffffff8106e28b>] warn_slowpath_common+0x6b/0xb0
 [<ffffffff8106e32c>] warn_slowpath_fmt+0x5c/0x80
 [<ffffffff812ed4f2>] __list_del_entry+0x82/0xd0
 [<ffffffff812ed54d>] list_del+0xd/0x30
 [<ffffffff8119a18e>] region_add+0x9e/0xe0
 [<ffffffff8119b7dc>] alloc_huge_page+0x28c/0x490
 [<ffffffff8119dc04>] hugetlb_fault+0x404/0x910
 [<ffffffff81609259>] ? schedule+0x29/0x70
 [<ffffffff81182d68>] handle_mm_fault+0xbb8/0xd60
 [<ffffffff812e2974>] ? call_rwsem_down_read_failed+0x14/0x30
 [<ffffffff8160f1e6>] __do_page_fault+0x156/0x520
 [<ffffffff810d4f12>] ? do_futex+0x172/0x5b0
 [<ffffffff810b1c4c>] ? update_curr+0xcc/0x150
 [<ffffffff810ad726>] ? __dequeue_entity+0x26/0x40
 [<ffffffff81012609>] ? __switch_to+0x179/0x4a0
 [<ffffffff8160f5ca>] do_page_fault+0x1a/0x70
 [<ffffffff8160b808>] page_fault+0x28/
  • System crashes in region_chg().
BUG: unable to handle kernel NULL pointer dereference at 0000000000000008
IP: [<ffffffff8119a2a7>] region_chg+0xd7/0x100
PGD 1c32a0f067 PUD 1badd74067 PMD 0 
Oops: 0000 [#1] SMP 
[...]
CPU: 11 PID: 114768 Comm: java Tainted: G        W   --------------   3.10.0-229.el7.x86_64 #1
[...]
Call Trace:
 [<ffffffff8119a336>] vma_needs_reservation+0x66/0xb0
 [<ffffffff8119de8f>] hugetlb_fault+0x68f/0x910
 [<ffffffff81182d68>] handle_mm_fault+0xbb8/0xd60
 [<ffffffff812e2974>] ? call_rwsem_down_read_failed+0x14/0x30
 [<ffffffff8160f1e6>] __do_page_fault+0x156/0x520
 [<ffffffff810d4f12>] ? do_futex+0x172/0x5b0
 [<ffffffff810b1c4c>] ? update_curr+0xcc/0x150
 [<ffffffff810ad726>] ? __dequeue_entity+0x26/0x40
 [<ffffffff81012609>] ? __switch_to+0x179/0x4a0
 [<ffffffff8160f5ca>] do_page_fault+0x1a/0x70
 [<ffffffff8160b808>] page_fault+0x28/0x30

Environment

  • Red Hat Enterprise Linux 7
  • prior kernel-3.10.0-327.el7

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase of over 48,000 articles and solutions.

Current Customers and Partners

Log in for full access

Log In