LVM resource fails probe operation in Pacemaker cluster with many LUNs

Solution Verified - Updated -

Issue

  • ocf:heartbeat:LVM resource times out on its probe operation as soon as it is created. Then it times out on its stop, and the node is rebooted.
  • The stderr from the LVM monitor operation contains many errors like the following. (errno 24 translates to "too many open files.")
2019-09-05T11:08:53.530219-04:00 node-1 lrmd[41824]:  notice: lvm_rsc_monitor_0:29333:stderr [   Device open /dev/mapper/mpathamd 253:1023 failed errno 24 ]
2019-09-05T11:08:53.530357-04:00 node-1 lrmd[41824]:  notice: lvm_rsc_monitor_0:29333:stderr [   Device open /dev/mapper/mpathamd 253:1023 failed errno 24 ]

Environment

  • Red Hat Enterprise Linux 7 (with the High Availability Add-on)
  • Many LUNs (i.e., more than 1000)
  • ocf:heartbeat:LVM resource

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase of over 48,000 articles and solutions.

Current Customers and Partners

Log in for full access

Log In