RHEL NFS server crashes due to corruption in the del_recall_lru list
Issue
- A RHEL 6 NFS server may log a message similar to:
list_add corruption. next->prev should be prev (ffff880818ab3df0), but was ffff88078a1a48d0. (next=ffff88078a1a48d0).
- The server may then panic soon afterward with a message similar to:
BUG: soft lockup - CPU#9 stuck for 67s! [nfsd4:5319]
- The process triggering the panic will be 'nfsd4' (also known as the 'laundromat thread').
- A RHEL 5 NFS server may log a message similar to:
list_add corruption. prev->next should be ffffffff88593e10, but was ffff811362f1b648
and immediately panic. The process triggering the panic will be either 'nfsd4' (also known as the 'laundromat thread') or one of the main 'nfsd' threads, in which case the nfsd_break_deleg_cb() function will probably appear in the backtrace.
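To check whether a saved kernel log contains the messages described above, a small script along the following lines may help. This is a minimal sketch: the regular expressions are modeled only on the sample messages in this article, the names SIGNATURES and find_signatures are hypothetical, and a match is not proof of this specific problem, since list_add corruption can have other causes.

```python
import re

# Patterns modeled on the sample messages in this article (an assumption:
# the exact wording can vary between kernel versions).
SIGNATURES = {
    "list_add_corruption": re.compile(
        r"list_add corruption\. (next->prev|prev->next) should be"
    ),
    "nfsd4_soft_lockup": re.compile(
        r"BUG: soft lockup - CPU#\d+ stuck for \d+s! \[nfsd4:\d+\]"
    ),
}


def find_signatures(lines):
    """Return (line_number, signature_name) pairs for matching log lines."""
    hits = []
    for number, line in enumerate(lines, start=1):
        for name, pattern in SIGNATURES.items():
            if pattern.search(line):
                hits.append((number, name))
    return hits
```

The function accepts any iterable of strings, so an open file object for a saved copy of the kernel log can be passed to it directly.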
Environment
- Red Hat Enterprise Linux 6
- Seen on RHEL 6.1 and RHEL 6.3 kernels. Kernels up to and including RHEL 6.5 are believed to be affected.
- Red Hat Enterprise Linux 5
- Seen on the RHEL 5.10 kernel. Other kernel versions may be affected as well.