Why did the monitor operation for an LVM resource time out after a LUN was removed, causing the pacemaker resource to fail?


Issue

  • Why did the monitor operation for an LVM resource time out after a LUN was removed, causing the pacemaker resource to fail?
Mar 19 07:06:58 node1 kernel: [3256390.121481]  rport-1:0-7: blocked FC remote port time out: removing target and saving binding
Mar 19 07:06:58 node1 kernel: [3256390.185502]  rport-2:0-7: blocked FC remote port time out: removing target and saving binding
Mar 19 07:14:56 node1 kernel: [3256867.818241] sd 1:0:1:1: rejecting I/O to offline device
Mar 19 07:14:56 node1 kernel: [3256867.818563] device-mapper: multipath: Failing path 8:32.
Mar 19 07:14:56 node1 kernel: [3256867.834262] sd 2:0:2:1: rejecting I/O to offline device
Mar 19 07:14:56 node1 kernel: [3256867.834559] sd 2:0:2:1: rejecting I/O to offline device
Mar 19 07:14:56 node1  kernel: [3256867.834855] device-mapper: multipath: Failing path 8:224.
Mar 19 07:14:56 node1 kernel: [3256867.856798] device-mapper: multipath: Failing path 8:16.
Mar 19 07:15:08 node1 kernel: [3256879.866484] device-mapper: multipath: Failing path 8:208.
Mar 19 07:16:51 node1 kernel: [3256983.255841]  rport-1:0-1: blocked FC remote port time out: removing target and saving binding
Mar 19 07:16:51 node1 kernel: [3256983.256180]  rport-2:0-2: blocked FC remote port time out: removing target and saving binding
Mar 19 07:16:51 node1 kernel: [3256983.256683] sd 1:0:1:0: alua: Detached
Mar 19 07:16:51 node1 kernel: [3256983.257272] sd 2:0:2:0: alua: Detached
Mar 19 07:16:51 node1 kernel: [3256983.258123] sd 1:0:1:1: alua: Detached
Mar 19 07:16:51 node1 kernel: [3256983.258814] sd 2:0:2:1: alua: Detached
Mar 19 07:16:51 node1 kernel: [3256983.261176] device-mapper: multipath: Failing path 8:208.
Mar 19 07:16:51 node1 kernel: [3256983.292422] device-mapper: multipath: Failing path 8:32.
[....]
Mar 19 07:36:36 node1 kernel: [3258168.638270] qla2xxx [0000:10:00.1]-801c:2: Abort command issued nexus=2:2:1 --  1 2002.
Mar 19 07:36:36 node1 kernel: [3258168.638917] qla2xxx [0000:10:00.0]-801c:1: Abort command issued nexus=1:1:1 --  1 2002.
Mar 19 07:36:40 node1 lrmd[2807]:  warning: PKm11ha1_LVM_monitor_10000 process (PID 13826) timed out
Mar 19 07:36:40 node1 lrmd[2807]:  warning: PKm11ha1_LVM_monitor_10000 process (PID 13826) timed out
Mar 19 07:36:40 node1 lrmd[2807]:  warning: PKm11ha1_LVM_monitor_10000:13826 - timed out after 30000ms
Mar 19 07:36:40 node1 lrmd[2807]:  warning: PKm11ha1_LVM_monitor_10000:13826 - timed out after 30000ms
Mar 19 07:36:40 node1 crmd[2810]:    error: Operation PKm11ha1_LVM_monitor_10000: Timed Out (node=node1, call=173, timeout=30000ms)
Mar 19 07:36:40 node1 crmd[2810]:    error: Operation PKm11ha1_LVM_monitor_10000: Timed Out (node=node1, call=173, timeout=30000ms)
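
When reviewing an excerpt like the one above, it can help to reduce the log to the two facts that matter here: which multipath paths were failed, and which resource operations timed out and with what timeout. The following is only a minimal sketch, assuming the message formats match the lines shown in this article; the function name `summarize` and the regular expressions are illustrative and not part of pacemaker or device-mapper-multipath, and the script does not diagnose the root cause.

```python
import re

# Assumed formats, copied from the log lines quoted in this article.
PATH_FAIL_RE = re.compile(r"device-mapper: multipath: Failing path (?P<dev>\d+:\d+)\.")
TIMEOUT_RE = re.compile(
    r"lrmd\[\d+\]:\s+warning:\s+(?P<op>\S+):(?P<pid>\d+) - timed out after (?P<ms>\d+)ms"
)

def summarize(lines):
    """Return (failed multipath paths, timed-out operations) found in syslog lines."""
    failed_paths = set()
    timeouts = set()
    for line in lines:
        match = PATH_FAIL_RE.search(line)
        if match:
            failed_paths.add(match.group("dev"))
            continue
        match = TIMEOUT_RE.search(line)
        if match:
            # Store the operation name with its timeout in milliseconds.
            timeouts.add((match.group("op"), int(match.group("ms"))))
    return sorted(failed_paths), sorted(timeouts)

# A few lines taken from the excerpt above.
sample = [
    "Mar 19 07:14:56 node1 kernel: [3256867.818563] device-mapper: multipath: Failing path 8:32.",
    "Mar 19 07:14:56 node1 kernel: [3256867.856798] device-mapper: multipath: Failing path 8:16.",
    "Mar 19 07:16:51 node1 kernel: [3256983.292422] device-mapper: multipath: Failing path 8:32.",
    "Mar 19 07:36:40 node1 lrmd[2807]:  warning: PKm11ha1_LVM_monitor_10000:13826 - timed out after 30000ms",
    "Mar 19 07:36:40 node1 lrmd[2807]:  warning: PKm11ha1_LVM_monitor_10000:13826 - timed out after 30000ms",
]

paths, timeouts = summarize(sample)
print("Failed paths:", paths)
print("Timed-out operations:", timeouts)
```

Repeated messages are collapsed into sets, so the duplicated lrmd warnings in the excerpt count once.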

Environment

  • Red Hat Enterprise Linux Server 6 (with the High Availability Add-On)
  • Red Hat Enterprise Linux Server 7 (with the High Availability Add-On)
  • pacemaker
