How do I recover failed non-critical clustered resources?
Issue
- A cluster service has several resources defined as non-critical (marked with __independent_subtree="2"). One of the non-critical resources in the service has failed, and the service is marked as Partial [P] in clustat output:

  $ clustat
  Cluster Status for test_cluster @ Fri May 31 18:58:00 2013
  Member Status: Quorate

   Member Name      ID   Status
   ------ ----      ---- ------
   node1            1    Online, Local, rgmanager
   node2            2    Online, rgmanager

   Service Name     Owner (Last)     State
   ------- ----     ----- ------     -----
   service:S1       node1            started [P]   <---
- Is it possible to restart only the failed, non-critical resources in a cluster service? Below is the affected service configuration:

  <service autostart="1" domain="fd-1" exclusive="0" name="S1" recovery="relocate">
    <lvm ref="lvm-1">
      <fs ref="fs-1"/>
    </lvm>
    <lvm __independent_subtree="2" ref="lvm-2">    <---- Currently failed, non-critical resource
      <fs ref="fs-2"/>
    </lvm>
  </service>
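To spot services in this state without reading the table by eye, the clustat output can be filtered for the [P] flag. The following is only a minimal sketch: it assumes the service rows look like the clustat output in the Issue section (a first column beginning with "service:" and a literal "[P]" in the State column), and partial_services is a hypothetical helper name, not part of rgmanager.

```shell
#!/bin/sh
# Hypothetical helper: print the names of services that clustat flags as partial.
# Reads clustat output on stdin, so it can also be run against saved output.
# Assumption: service rows start with "service:" and carry "[P]" in the State column.
partial_services() {
    awk '$1 ~ /^service:/ && /\[P\]/ { print $1 }'
}
```

On a cluster node this could be run as `clustat | partial_services`; an empty result would mean no service matched the assumed row layout with the [P] flag.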
Environment
- Red Hat Enterprise Linux Server 5.5+ (with the High Availability or Resilient Storage Add Ons)
- Red Hat Enterprise Linux Server 6.1+ (with the High Availability or Resilient Storage Add Ons)
- Resource Manager: RGManager
- One or more resources defined as non-critical (marked with __independent_subtree="2")