RHEL5: NFS4 client hung, with "state recovery failed on NFSv4 server 192.168.1.103 with error 10008"
Issue
- Hung system with the following messages in the log
Error: state recovery failed on NFSv4 server 192.168.1.103 with error 2
printk: 6906 messages suppressed.
Error: state recovery failed on NFSv4 server 192.168.1.103 with error 2
printk: 6167 messages suppressed.
Error: state recovery failed on NFSv4 server 192.168.1.103 with error 10008
- Following the above messages, hung_task timeout backtraces occur, and a hung_task panic resulted
INFO: task sh:5800 blocked for more than 120 seconds.
"echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
sh D ffff81000101d4a0 0 5800 5798 (NOTLB)
ffff8107a9d83c58 0000000000000086 0000000000000292 0000000000000282
ffffffff8009b6bb 0000000000000003 ffff8107a9b457f0 ffff8107bfc040c0
00000017f0c885f3 000000000000167f ffff8107a9b459d8 0000000388684d33
Call Trace:
[<ffffffff8009b6bb>] recalc_sigpending+0xe/0x25
[<ffffffff80063c63>] __mutex_lock_slowpath+0x60/0x9b
[<ffffffff886d80ea>] :nfs:nfs_permission+0x1bd/0x1ce
[<ffffffff80063cad>] .text.lock.mutex+0xf/0x14
[<ffffffff8000d09a>] do_lookup+0xf5/0x24b
[<ffffffff8000a2a3>] __link_path_walk+0x9f4/0xf39
[<ffffffff8000ebb3>] link_path_walk+0x45/0xb8
[<ffffffff8000ce24>] do_path_lookup+0x294/0x311
[<ffffffff80023daf>] __path_lookup_intent_open+0x56/0x97
[<ffffffff8003c53c>] open_exec+0x24/0xc0
[<ffffffff8003f116>] do_execve+0x46/0x1ed
[<ffffffff800553e5>] sys_execve+0x36/0x4c
[<ffffffff8005d4e3>] stub_execve+0x67/0xb0
Environment
- Red Hat Enterprise Linux 5 (NFS4 client)
- seen on kernel 2.6.18-402.el5
- NFS4
- NFS server
- Seen with AIX 6.1.0.0
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.