RHEL6: kernel crashes in __link_path_walk because d_revalidate is 0, mvfs filesystem on NFS root
Issue
- kernel crashes with 'Bad RIP value' and RIP
0000000000000000
in__link_path_walk
- We get this bug upon some move and rm operations on local storage.
mvfs: Ok: INFO: MVFS version 7.1.2.10 (Thu Feb 21 00:53:03 2013)
(c) Copyright IBM Corp. 1990, 2013. All Rights Reserved.
MVFS initialized--adapter built at $Date: 2013-08-30.16:24:46 (UTC) $
mvfs: Ok: INFO: MVFS: Setting mvfs_largeinit to 23 based on physical memory size of 24594660KB
BUG: unable to handle kernel NULL pointer dereference at (null)
IP: [<(null)>] (null)
PGD 31982e067 PUD 318b6f067 PMD 0
Oops: 0010 [#1] SMP
last sysfs file: /sys/devices/pci0000:00/0000:00:01.0/0000:05:00.0/host0/target0:0:0/0:0:0:0/block/sda/uevent
CPU 0
Modules linked in: mvfs(U) mptctl mptbase nfsd exportfs autofs4 dm_mod power_meter radeon ttm drm_kms_helper drm i2c_algo_bit i2c_core sg serio_raw iTCO_wdt iTCO_vendor_support hpilo hpwdt i7core_edac edac_core shpchp ext3 jbd mbcache tg3 e1000 e1000e igb dca ptp pps_core bonding ipv6 8021q garp stp llc sr_mod cdrom sd_mod crc_t10dif pata_acpi ata_generic ata_piix hpsa bnx2 nfs lockd fscache auth_rpcgss nfs_acl sunrpc [last unloaded: scsi_wait_scan]
Pid: 29478, comm: rm Not tainted 2.6.32-358.14.1.el6.x86_64 #1 HP ProLiant DL380 G7
RIP: 0010:[<0000000000000000>] [<(null)>] (null)
RSP: 0018:ffff880257bf9ca0 EFLAGS: 00010202
RAX: ffff880257a5cbc0 RBX: ffff880319a41001 RCX: ffff880257bf9dd8
RDX: 0000000000000000 RSI: ffff880257bf9dd8 RDI: ffff8802599c58c0
RBP: ffff880257bf9d58 R08: 0000000000000016 R09: 0000000000000000
R10: 00007ffffad134f0 R11: 0000000000000246 R12: ffff880257bf9dd8
R13: 00000000ffffff9c R14: 0000000000000000 R15: 00002b2cc2790114
FS: 00002b2cc2796ba0(0000) GS:ffff880028200000(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 0000000000000000 CR3: 00000006187d2000 CR4: 00000000000007f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Process rm (pid: 29478, threadinfo ffff880257bf8000, task ffff880319e41540)
Stack:
ffffffff81190c94 ffff880257bf9d88 ffffffff811436f7 0000000000000000
<d> 00000000000084d0 0000000000000000 ffffffff81ac33e0 0000000000000000
<d> ffff88031623ce08 ffff880257bf9d18 ffffffff811609ea 80000001a857b067
Call Trace:
[<ffffffff81190c94>] ? __link_path_walk+0x664/0x1030
[<ffffffff811436f7>] ? handle_pte_fault+0xf7/0xb50
[<ffffffff811609ea>] ? alloc_pages_current+0xaa/0x110
[<ffffffff8104bac7>] ? pte_alloc_one+0x37/0x50
[<ffffffff811918ea>] path_walk+0x6a/0xe0
[<ffffffff81191abb>] do_path_lookup+0x5b/0xa0
[<ffffffff81192747>] user_path_at+0x57/0xa0
[<ffffffff81186b3c>] vfs_fstatat+0x3c/0x80
[<ffffffff81186bee>] vfs_lstat+0x1e/0x20
[<ffffffff81186c14>] sys_newlstat+0x24/0x50
[<ffffffff815137be>] ? do_page_fault+0x3e/0xa0
[<ffffffff81510b75>] ? page_fault+0x25/0x30
[<ffffffff8100b072>] system_call_fastpath+0x16/0x1b
Code: Bad RIP value.
RIP [<(null)>] (null)
RSP <ffff880257bf9ca0>
CR2: 0000000000000000
Environment
- Red Hat Enterprise Linux 6
- seen on kernel 2.6.32-358.14.1.el6
- IBM mvfs
- IBM ClearCase 7.1.2.10
- NFS root
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.