RHEL6.5: NFSv4 mounts hung while trying "touch" or "cat" files, nfs4 state manager in nfs4_do_reclaim -> msleep loop

Solution Unverified - Updated -

Issue

  • System has high load, mostly if not all due to blocked / D-state processes.
  • An nfs4 state manager thread is stuck running indefinitely in a loop with an msleep from nfs4_do_reclaim, similar to the following partial backtrace
 #3 [ffff8804375fdde8] msleep at ffffffff810863a0
 #4 [ffff8804375fddf8] nfs4_do_reclaim at ffffffffa05d3112 [nfs]
 #5 [ffff8804375fde88] nfs4_run_state_manager at ffffffffa05d330f [nfs]
  • NFSv4 mounts getting hung while trying to "touch" or "cat" files on Red Hat Enterprise Linux 6.5, with the following backtrace
 PID: 9577   TASK: ffff880438079540  CPU: 0   COMMAND: "sh"
 #0 [ffff8803256677d8] schedule at ffffffff815278c2
 #1 [ffff8803256678a0] nfs_wait_bit_killable at ffffffffa05a4f72 [nfs]
 #2 [ffff8803256678b0] __wait_on_bit at ffffffff81528b6f
 #3 [ffff880325667900] out_of_line_wait_on_bit at ffffffff81528c18
 #4 [ffff880325667970] nfs4_wait_clnt_recover at ffffffffa05d1972 [nfs]
 #5 [ffff880325667990] nfs4_client_recover_expired_lease at ffffffffa05d1d60 [nfs]
 #6 [ffff8803256679b0] _nfs4_do_open at ffffffffa05bfac5 [nfs]
 #7 [ffff880325667a90] nfs4_do_open at ffffffffa05bffe5 [nfs]
 #8 [ffff880325667b30] nfs4_atomic_open at ffffffffa05c00f8 [nfs]
 #9 [ffff880325667b50] nfs_open_revalidate at ffffffffa05a1d6c [nfs]
#10 [ffff880325667bf0] do_lookup at ffffffff811988f6
#11 [ffff880325667c50] __link_path_walk at ffffffff81199354
#12 [ffff880325667d30] path_walk at ffffffff81199e6a
#13 [ffff880325667d70] filename_lookup at ffffffff8119a07b
#14 [ffff880325667db0] do_filp_open at ffffffff8119b554
#15 [ffff880325667f20] do_sys_open at ffffffff81185d29
#16 [ffff880325667f70] sys_open at ffffffff81185e40
#17 [ffff880325667f80] system_call_fastpath at ffffffff8100b072
    RIP: 000000316dcdb540  RSP: 00007fff7bb42b30  RFLAGS: 00010206
    RAX: 0000000000000002  RBX: ffffffff8100b072  RCX: 00000000025ce000
    RDX: 000000000068732e  RSI: 0000000000000000  RDI: 00000000025cdde0
    RBP: 00000000025cdde0   R8: 0000000000000002   R9: 0000000000000010
    R10: 0000000000000008  R11: 0000000000000246  R12: ffffffff81185e40
    R13: ffff880325667f78  R14: 0000000000000000  R15: 0000000000000000
    ORIG_RAX: 0000000000000002  CS: 0033  SS: 002b

Environment

  • Red Hat Enterprise Linux 6.5 (NFS client)
    • kernel prior to `kernel-2.6.32-431.37.1.el6
  • NFSv4 with delegations enabled
  • NFS Server
    • any NFS4 server with delegations enabled may be affected
    • seen on EMC Storage VNX 5300 (NFS server), NAS version:7.1.71-1
    • seen on IBM Storage N6210

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase of over 48,000 articles and solutions.

Current Customers and Partners

Log in for full access

Log In
Close

Welcome! Check out the Getting Started with Red Hat page for quick tours and guides for common tasks.