How do I recover failed non-critical clustered resources?

Solution Verified

Issue

  • A cluster service has several resources defined as non-critical (marked with __independent_subtree="2"). One of the non-critical resources in the service has failed, and the service is marked as Partial [P] in the clustat output.

    $ clustat
    Cluster Status for test_cluster @ Fri May 31 18:58:00 2013
    Member Status: Quorate
    
     Member Name            ID   Status
     ------ ----            ---- ------
     node1                  1 Online, Local, rgmanager
     node2                  2 Online, rgmanager
    
     Service Name            Owner (Last)   State         
     ------- ----            ----- ------   -----
     service:S1              node1          started    [P]  <---       
    
  • Is it possible to restart only the failed, non-critical resources in a cluster service? Below is the affected service configuration:

    <service autostart="1" domain="fd-1" exclusive="0" name="S1" recovery="relocate">
        <lvm ref="lvm-1">
            <fs ref="fs-1"/>
        </lvm>
        <lvm __independent_subtree="2" ref="lvm-2">                    <!-- Currently failed, non-critical resource -->
            <fs ref="fs-2"/>
        </lvm>
    </service>
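
To check which resources in a service are marked non-critical, the configuration above can be scanned for the __independent_subtree="2" attribute. The following is a minimal sketch, not part of rgmanager: the names `SERVICE_XML` and `non_critical_resources` are hypothetical, and the XML is a well-formed copy of the service definition shown above.

```python
import xml.etree.ElementTree as ET

# Hypothetical copy of the service definition from the Issue section.
SERVICE_XML = """
<service autostart="1" domain="fd-1" exclusive="0" name="S1" recovery="relocate">
    <lvm ref="lvm-1">
        <fs ref="fs-1"/>
    </lvm>
    <lvm __independent_subtree="2" ref="lvm-2">
        <fs ref="fs-2"/>
    </lvm>
</service>
"""


def non_critical_resources(service_xml):
    """Return (tag, ref) pairs for resources marked __independent_subtree="2".

    Illustrative helper only; it inspects the configuration text and does
    not query or change the running cluster.
    """
    root = ET.fromstring(service_xml)
    return [
        (elem.tag, elem.get("ref"))
        for elem in root.iter()
        if elem.get("__independent_subtree") == "2"
    ]


print(non_critical_resources(SERVICE_XML))  # prints [('lvm', 'lvm-2')]
```

For this service, only the lvm-2 subtree (with its child fs-2) is non-critical, so its failure leaves the service running in the Partial state rather than triggering the relocate recovery policy.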
    

Environment

  • Red Hat Enterprise Linux Server 5.5+ (with the High Availability or Resilient Storage Add-Ons)
  • Red Hat Enterprise Linux Server 6.1+ (with the High Availability or Resilient Storage Add-Ons)
  • Resource Manager: RGManager
  • One or more resources defined as non-critical (marked with __independent_subtree="2")
