System crashed with 'BUG: Bad page state in process xxx' errors
Issue
- A RHEL 7.7 system running IBM Spectrum Scale (GPFS) crashed with multiple page corruption errors and call traces:
    BUG: Bad page state in process snmpd pfn:116f4e8
    page:ffffca6205bd3a00 count:1 mapcount:0 mapping: (null) index:0x0
    page flags: 0x2fffff00000000()
    page dumped because: nonzero _count
    [...]
    CPU: 18 PID: 28505 Comm: snmpd Kdump: loaded Tainted: G OE ------------ T 3.10.0-1062.18.1.el7.x86_64 #1
    Hardware name: Dell Inc. PowerEdge R730/072T6D, BIOS 2.11.0 11/02/2019
    Call Trace:
    [<ffffffffadb7b416>] dump_stack+0x19/0x1b
    [<ffffffffadb7602a>] bad_page.part.75+0xdc/0xf9
    [<ffffffffad5c7ee5>] get_page_from_freelist+0x785/0xaa0
    [<ffffffffad5dfa38>] ? zone_statistics+0x88/0xa0
    [<ffffffffad5c8366>] __alloc_pages_nodemask+0x166/0x450
    [<ffffffffadb778b6>] kmalloc_large_node+0x5f/0x80
    [<ffffffffad6287c9>] __kmalloc_node_track_caller+0x229/0x290
    [<ffffffffada3916d>] ? __alloc_skb+0x8d/0x2d0
    [<ffffffffada380f1>] __kmalloc_reserve.isra.32+0x31/0x90
    [<ffffffffada3913d>] ? __alloc_skb+0x5d/0x2d0
    [<ffffffffada3916d>] __alloc_skb+0x8d/0x2d0
    [<ffffffffada8ace9>] netlink_dump+0x239/0x2b0
    [...]
    BUG: unable to handle kernel paging request at 0000100000000008
    IP: [<ffffffffc086d348>] clone_endio+0x28/0x120 [dm_mod]
    PGD 0
    Oops: 0000 [#1] SMP
    [...]
    CPU: 17 PID: 0 Comm: swapper/17 Kdump: loaded Tainted: G B OE ------------ T 3.10.0-1062.18.1.el7.x86_64 #1
    Hardware name: Dell Inc. PowerEdge R730/072T6D, BIOS 2.11.0 11/02/2019
    task: ffff892593a25230 ti: ffff892593a34000 task.ti: ffff892593a34000
    RIP: 0010:[<ffffffffc086d348>] [<ffffffffc086d348>] clone_endio+0x28/0x120 [dm_mod]
    RSP: 0018:ffff8944bf403d20 EFLAGS: 00010246
    [...]
    Call Trace:
    <IRQ>
    [<ffffffffad689a97>] bio_endio+0x67/0xb0
    [<ffffffffad751670>] blk_update_request+0x90/0x360
    [<ffffffffad8e8584>] scsi_end_request+0x34/0x1e0
    [<ffffffffad8e88f8>] scsi_io_completion+0x168/0x6a0
    [<ffffffffad8ddcdc>] scsi_finish_command+0xdc/0x140
    [<ffffffffad8e7e42>] scsi_softirq_done+0x132/0x160
    [<ffffffffad758f96>] blk_done_softirq+0x96/0xc0
    [<ffffffffad4a5435>] __do_softirq+0xf5/0x280
    [<ffffffffadb9142c>] call_softirq+0x1c/0x30
    [<ffffffffad42f715>] do_softirq+0x65/0xa0
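To check whether a saved kernel log contains the same kind of "Bad page state" reports as the excerpt above, a small script along these lines may help. It is only a sketch: the function name `find_bad_pages` and the `SAMPLE` text are made up for illustration, the line format is assumed from the excerpt above and may differ on other kernel versions, and the script assumes one log record per line (as in raw `dmesg` output) and a process name without spaces.

```python
import re

# Summary line printed for a corrupted page; format assumed from the
# excerpt in this article (process name, then the page frame number in hex).
BAD_PAGE_RE = re.compile(
    r"BUG: Bad page state in process (\S+)\s+pfn:([0-9a-fA-F]+)"
)
# Follow-up line stating why the page was dumped.
REASON_RE = re.compile(r"page dumped because: (.+)")


def find_bad_pages(log_text):
    """Return one dict per 'Bad page state' report found in log_text."""
    events = []
    for line in log_text.splitlines():
        match = BAD_PAGE_RE.search(line)
        if match:
            events.append(
                {"comm": match.group(1), "pfn": int(match.group(2), 16), "reason": None}
            )
            continue
        match = REASON_RE.search(line)
        # Attach the reason to the most recent report that still lacks one.
        if match and events and events[-1]["reason"] is None:
            events[-1]["reason"] = match.group(1).strip()
    return events


# Hypothetical sample, copied from the first report in the excerpt above.
SAMPLE = """\
BUG: Bad page state in process snmpd  pfn:116f4e8
page:ffffca6205bd3a00 count:1 mapcount:0 mapping:          (null) index:0x0
page flags: 0x2fffff00000000()
page dumped because: nonzero _count
"""

if __name__ == "__main__":
    for event in find_bad_pages(SAMPLE):
        print(event["comm"], hex(event["pfn"]), event["reason"])
```

Grouping the results by `comm` or `pfn` can show whether the reports cluster around one process or are spread across unrelated ones.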
Environment
- Red Hat Enterprise Linux 7.7
- kernel 3.10.0-1062.18.1
- IBM Spectrum Scale (GPFS) 5.0.4-2