RHEL7: NFSv4 client loops with WRITE / NFS4ERR_STALE_STATEID - if NFS server restarts multiple times within the grace period

Solution Verified - Updated -

Issue

  • NFS client can't execute open/write systemcall against NFS filesystem after NFS service restart
  • While NFS client is repeatedly executing I/O sequence (open/write/read/close) against a file that exists on the NFS filesystem, NFS server restart.
    • When executing open systemcall, EIO error occurs, and when executing write systemcall, EAGAIN error occurs
    • We must restart NFS client machine to recover
  • NFS4 server restarts multiple times within the grace period results in NFS client hangs
  • If NFS server restarts at interval longer than the NFS4 grace period, this problem does not occur

Environment

  • Red Hat Enterprise 7 (NFS client)
    • kernel prior to 3.10.0-229.11.1.el7
    • seen on kernels 3.10.0-123.el7 and 3.10.0-229.el7
  • NFS4
  • NFS server
    • Seen with RHEL6u6 kernel-2.6.32-504.el6, nfs-utils-1.2.3-54.el6

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In

New to Red Hat?

Learn more about Red Hat subscriptions

Using a Red Hat product through a public cloud?

How to access this content