LVM resource fails probe operation in Pacemaker cluster with many LUNs

Solution Verified - Updated -

Issue

  • An ocf:heartbeat:LVM resource times out on its probe operation as soon as it is created. Its stop operation then times out as well, and the node is rebooted.
  • The stderr from the LVM monitor operation contains many errors like the following. (errno 24 is EMFILE, "Too many open files.")
2019-09-05T11:08:53.530219-04:00 node-1 lrmd[41824]:  notice: lvm_rsc_monitor_0:29333:stderr [   Device open /dev/mapper/mpathamd 253:1023 failed errno 24 ]
2019-09-05T11:08:53.530357-04:00 node-1 lrmd[41824]:  notice: lvm_rsc_monitor_0:29333:stderr [   Device open /dev/mapper/mpathamd 253:1023 failed errno 24 ]
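
As a rough diagnostic aid, the sketch below decodes errno 24 and compares the current process's open-file limit with the number of device-mapper nodes on the system. It is an illustration only, not part of the verified solution. The helper names (describe_errno, exceeds_limit, count_mapper_devices) are hypothetical, and the idea that each device under /dev/mapper consumes one file descriptor at the same time is an assumption, not something the log output confirms.

```python
import errno
import os
import resource


def describe_errno(code):
    """Return 'NAME: message' for an errno value (24 maps to EMFILE)."""
    name = errno.errorcode.get(code, "UNKNOWN")
    return f"{name}: {os.strerror(code)}"


def exceeds_limit(device_count, soft_limit):
    """True if holding one descriptor per device would reach the soft limit.

    Assumption: every device is open simultaneously. Descriptors already
    in use (stdin, stdout, stderr, and so on) are not counted here.
    """
    if soft_limit == resource.RLIM_INFINITY:
        return False
    return device_count >= soft_limit


def count_mapper_devices(path="/dev/mapper"):
    """Count entries under /dev/mapper, skipping the 'control' node."""
    if not os.path.isdir(path):
        return 0
    return sum(1 for name in os.listdir(path) if name != "control")


if __name__ == "__main__":
    # Limits of this Python process; a cluster daemon may run with others.
    soft, hard = resource.getrlimit(resource.RLIMIT_NOFILE)
    devices = count_mapper_devices()
    print(describe_errno(24))
    print(f"soft limit={soft} hard limit={hard} mapper devices={devices}")
    print("over limit" if exceeds_limit(devices, soft) else "within limit")
```

The limits printed are those of the shell that runs the script. The process that executes the resource agent may have different limits, so treat the comparison as a hint rather than a diagnosis.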

Environment

  • Red Hat Enterprise Linux 7 (with the High Availability Add-on)
  • Many LUNs (i.e., more than 1000)
  • ocf:heartbeat:LVM resource
