VMs become non-responsive during a Live Merge
Issue
-
VMs become non-responsive during a
Live Merge(snapshot deletion). -
During this time they cannot be accessed remotely, e.g. via
ssh, howeverpingis successful. -
This may last a few minutes, however has been seen to be much longer.
-
The vdsm logs show timeouts, e.g.
Thread-1108850::ERROR::2016-05-31 18:05:43,221::utils::739::root::(wrapper) Unhandled exception
Traceback (most recent call last):
File "/usr/lib/python2.7/site-packages/vdsm/utils.py", line 736, in wrapper
return f(*a, **kw)
File "/usr/share/vdsm/virt/vm.py", line 5262, in run
self.tryPivot()
File "/usr/share/vdsm/virt/vm.py", line 5231, in tryPivot
ret = self.vm._dom.blockJobAbort(self.drive.name, flags)
File "/usr/share/vdsm/virt/virdomain.py", line 76, in f
raise toe
TimeoutError: Timed out during operation: cannot acquire state change lock (held by remoteDispatchDomainBlockJobAbort)
- Attempting to check the
blockjobstatus also times out;
# virsh -r blockjob VM-A vda
error: Timed out during operation: cannot acquire state change lock (held by remoteDispatchDomainBlockJobAbort)
- As do other attempts to access via the qemu monitor;
# virsh -r list
Id Name State
----------------------------------------------------
10 VM-A running
# virsh qemu-monitor-command --hmp 10 info status
Please enter your authentication name: vdsm@rhevh
Please enter your password:
error: Timed out during operation: cannot acquire state change lock (held by remoteDispatchDomainBlockJobAbort)
pstackshows one of the threads of theqemu-kvmprocess with the following stack frames;
Thread 1 (Thread 0x7ffa76fdec40 (LWP 118925)):
#0 0x00007ffa772cdf9c in aio_bh_poll ()
#1 0x00007ffa772dd039 in aio_dispatch_clients ()
#2 0x00007ffa772dd58b in aio_poll_clients ()
#3 0x00007ffa772d19c4 in bdrv_drain_one ()
#4 0x00007ffa772d2c14 in bdrv_drain_all ()
#5 0x00007ffa772d842f in bdrv_close ()
#6 0x00007ffa772d86b7 in bdrv_unref ()
#7 0x00007ffa77310213 in mirror_exit ()
#8 0x00007ffa772db26c in block_job_defer_to_main_loop_bh ()
#9 0x00007ffa772cdfc4 in aio_bh_poll ()
#10 0x00007ffa772dd039 in aio_dispatch_clients ()
#11 0x00007ffa772cde3e in aio_ctx_dispatch ()
#12 0x00007ffa7551379a in g_main_context_dispatch () from /lib64/libglib-2.0.so.0
#13 0x00007ffa772dbeb8 in main_loop_wait ()
#14 0x00007ffa770dac0e in main ()
Environment
- Red Hat Enterprise Virtualization (RHEV) 3.5, 3.6
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.