RHV - Storage Domains down and no host is the SPM due to improper permissions in export domain

Solution In Progress - Updated -

Issue

  • None of the hosts in the cluster are set as the SPM.
  • [Errno 13] Permission Denied errors are appearing in RHV-M.
  • All storage domains are down, and DC is unresponsive.

There may be messages similar to:

VDSM example.local command HSMGetTaskStatusVDS failed: [Errno 13] Permission denied
VDSM example.local command HSMGetAllTasksStatusesVDS failed: Not SPM
VDSM example.local command HSMGetTaskStatusVDS failed: [Errno 13] Permission denied
VDSM example.local command HSMGetAllTasksStatusesVDS failed: Not SPM
VDSM example.local command HSMGetTaskStatusVDS failed: [Errno 13] Permission denied

Error from vdsm.log:

2018-03-02 15:01:21,654-0300 ERROR (monitor/83aeade) [storage.Monitor] Error checking domain xxxxxxxxxxxxxxxxxx (monitor:426)
Traceback (most recent call last):
  File "/usr/share/vdsm/storage/monitor.py", line 407, in _checkDomainStatus
    self.domain.selftest()
  File "/usr/share/vdsm/storage/sdc.py", line 50, in __getattr__
    return getattr(self.getRealDomain(), attrName)
  File "/usr/share/vdsm/storage/sdc.py", line 53, in getRealDomain
    return self._cache._realProduce(self._sdUUID)
  File "/usr/share/vdsm/storage/sdc.py", line 136, in _realProduce
    domain = self._findDomain(sdUUID)
  File "/usr/share/vdsm/storage/sdc.py", line 153, in _findDomain
    return findMethod(sdUUID)
  File "/usr/share/vdsm/storage/nfsSD.py", line 126, in findDomain
    return NfsStorageDomain(NfsStorageDomain.findDomainPath(sdUUID))
  File "/usr/share/vdsm/storage/fileSD.py", line 369, in __init__
    validateFileSystemFeatures(manifest.sdUUID, manifest.mountpoint)
  File "/usr/share/vdsm/storage/fileSD.py", line 98, in validateFileSystemFeatures
    oop.getProcessPool(sdUUID).directTouch(testFilePath)
  File "/usr/lib/python2.7/site-packages/vdsm/storage/outOfProcess.py", line 320, in directTouch
    ioproc.touch(path, flags, mode)
  File "/usr/lib/python2.7/site-packages/ioprocess/__init__.py", line 577, in touch
    self.timeout)
  File "/usr/lib/python2.7/site-packages/ioprocess/__init__.py", line 461, in _sendCommand
    raise OSError(errcode, errstr)
OSError: [Errno 13] Permission denied

Permissions are incorrect, the correct permissions should be vdsm:kvm (36:36):

/rhev/data-center/mnt:
total 21
drwxr-xr-x.  3 nobody nobody    4 Dec  6 16:36 10.0.0.1:_mnt_export_xxxxxxxxxxxxx
drwxr-xr-x.  3 nobody nobody    4 Dec  5 12:06 10.0.0.1:_mnt_iso_xxxxxxxxxxxxx
drwxr-xr-x. 10 vdsm   kvm    4096 Jan  9 09:11 blockSD

/rhev/data-center/mnt/10.0.0.1:_mnt_export/xxxxxxxxxxxxx:
total 1
drwxr-xr-x. 5 nobody nobody 5 Sep 14 22:45 xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx
-rwxr-xr-x. 1 nobody nobody 0 Feb  8 17:13 __DIRECT_IO_TEST__

Environment

  • Red Hat Virtualization 4.0
  • Red Hat Virtualization 4.1

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In

New to Red Hat?

Learn more about Red Hat subscriptions

Using a Red Hat product through a public cloud?

How to access this content