After upgrading RHEVM to 3.1, sanlock errors prevent any host from becoming SPM

Solution Verified - Updated -

Issue

  • RHEV upgraded from 3.0 to 3.1
  • After changing the compatibility mode to 3.1, and the storage is contending.
  • Each host contends for SPM status, and fails
  • All virtual guests are still running fine
  • /var/log/sanlock.log shows the following errors:
74181 [3041]: s315 lockspace eeb51e7a-da6e-4b43-8789-49e1b3c142e1:3:/dev/eeb51e7a-da6e-4b43-8789-49e1b3c142e1/ids:0
74201 [24039]: s315 wdmd_connect failed -111
74201 [24039]: s315 create_watchdog failed -1
74202 [3041]: s315 add_lockspace fail result -203
74323 [3042]: s316 lockspace eeb51e7a-da6e-4b43-8789-49e1b3c142e1:3:/dev/eeb51e7a-da6e-4b43-8789-49e1b3c142e1/ids:0
74343 [24593]: s316 wdmd_connect failed -111
74343 [24593]: s316 create_watchdog failed -1
74344 [3042]: s316 add_lockspace fail result -203
  • /var/log/messages log on the hosts show the following:
hpasrd[2981]: Starting with poll 1 and timeout 600.
hpasrd[2981]: Setting the watchdog timer.
hpasrd[2981]: Using watchdog HP iLO2+ HW Watchdog Timer.
rhnsd[3003]: Red Hat Network Services Daemon starting up, check in interval 60 minutes.
wdmd[3023]: wdmd started S0 H1 G179
wdmd[3023]: no /dev/watchdog, load a watchdog driver
  • Seeing these errors in /var/log/vdsm/vdsm.log:
3c8f8535-1a32-49f5-9fe1-e5d1e31122cc::ERROR::2013-07-22 12:04:19,191::sp::312::Storage.StoragePool::(startSpm) Unexpected error
Traceback (most recent call last):
  File "/usr/share/vdsm/storage/sp.py", line 270, in startSpm
    self._upgradePool(expectedDomVersion, __securityOverride=True)
  File "/usr/share/vdsm/storage/securable.py", line 68, in wrapper
    return f(self, *args, **kwargs)
  File "/usr/share/vdsm/storage/sp.py", line 410, in _upgradePool
    self._convertDomain(self.masterDomain, str(targetDomVersion))
  File "/usr/share/vdsm/storage/sp.py", line 1032, in _convertDomain
    domain.getRealDomain(), isMsd, targetFormat)
  File "/usr/share/vdsm/storage/imageRepository/formatConverter.py", line 360, in convert
    converter(repoPath, hostId, imageRepo, isMsd)
  File "/usr/share/vdsm/storage/imageRepository/formatConverter.py", line 126, in v3DomainConverter
    newClusterLock.acquireHostId(hostId, async=False)
  File "/usr/share/vdsm/storage/clusterlock.py", line 188, in acquireHostId
    raise se.AcquireHostIdFailure(self._sdUUID, e)
AcquireHostIdFailure: Cannot acquire host id: ('eeb51e7a-da6e-4b43-8789-49e1b3c142e1', SanlockException(-203, 'Sanlock lockspace add failure', 'Sanlock exception'))
3c8f8535-1a32-49f5-9fe1-e5d1e31122cc::ERROR::2013-07-22 12:04:19,192::sp::313::Storage.StoragePool::(startSpm) failed: Cannot acquire host id: ('eeb51e7a-da6e-4b43-8789-49e1b3c142e1', SanlockException(-203, 'Sanlock lockspace add failure', 'Sanlock exception'))

Environment

  • Red Hat Enterprise Virtualization 3.1
  • kernel-2.6.32-358.11.1.el6.x86_64
  • vdsm-4.10.2-22.0.el6ev

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In

New to Red Hat?

Learn more about Red Hat subscriptions

Using a Red Hat product through a public cloud?

How to access this content