MFI MPT linked list corruption in slab cache using megaraid sas followed by system panic in Red Hat Enterprise Linux
Issue
First sign of a problem is this message in the log:
WARNING: at lib/list_debug.c:51 list_del+0x8d/0xa0() (Not tainted)
Hardware name: UCSC-C240-M3S
list_del corruption. next->prev should be ffff8806c58bf000, but was ffff8806c58be545
Modules linked in: fuse autofs4 bonding 8021q garp stp llc ipv6 uinput microcode iTCO_wdt iTCO_vendor_support power_meter sg ses enclosure i2c_i801 lpc_ich mfd_core bnx2 igb dca i2c_algo_bit i2c_core ptp pps_core ext4 jbd2 mbcache sd_mod crc_t10dif megaraid_sas wmi dm_mirror dm_region_hash dm_log dm_mod [last unloaded: scsi_wait_scan]
Pid: 13997, comm: <commandname> Not tainted 2.6.32-431.23.3.el6.x86_64 #1
Call Trace:
[<ffffffff81071b37>] ? warn_slowpath_common+0x87/0xc0
[<ffffffff81071c26>] ? warn_slowpath_fmt+0x46/0x50
[<ffffffff811bf1fc>] ? __getblk+0x2c/0x2a0
[<ffffffff81294f5d>] ? list_del+0x8d/0xa0
[<ffffffff8116ead5>] ? cache_alloc_refill+0x145/0x240
[<ffffffffa008c097>] ? ext4_htree_store_dirent+0x37/0x140 [ext4]
[<ffffffff8116fa59>] ? __kmalloc+0x1a9/0x220
[<ffffffffa008c097>] ? ext4_htree_store_dirent+0x37/0x140 [ext4]
[<ffffffffa009a998>] ? htree_dirblock_to_tree+0x128/0x190 [ext4]
[<ffffffffa009b348>] ? ext4_htree_fill_tree+0xa8/0x260 [ext4]
[<ffffffff8114b0da>] ? handle_mm_fault+0x22a/0x300
[<ffffffffa008c3c7>] ? ext4_readdir+0x127/0x700 [ext4]
[<ffffffff8104a98c>] ? __do_page_fault+0x1ec/0x480
[<ffffffff8119f760>] ? filldir+0x0/0xe0
[<ffffffff8119f760>] ? filldir+0x0/0xe0
[<ffffffff8119f9e0>] ? vfs_readdir+0xc0/0xe0
[<ffffffff8119fb69>] ? sys_getdents+0x89/0xf0
[<ffffffff8152bb05>] ? page_fault+0x25/0x30
[<ffffffff8100b072>] ? system_call_fastpath+0x16/0x1b
Environment
- Red Hat Enterprise Linux 6.5, 6.6
- Cisco UCSC-C240-M3S
- LSI 9271-8i MegaRAID SAS HBA
- Firmware package version 2.0.(1b), 2.0.(3d)
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.