Cluster is failing to start

Hi,

If I check clustat on node1, it shows node1 online and node2 offline. If I check clustat on node2, it shows node2 online and node1 offline.

<?xml version="1.0"?>
<cluster config_version="7" name="eccprd">
        <clusternodes>
                <clusternode name="cgceccprd1.test.net" nodeid="1">
                        <fence>
                                <method name="ucs-node1"/>
                        </fence>
                </clusternode>
                <clusternode name="cgceccprd2.test.net" nodeid="2">
                        <fence>
                                <method name="ucs-node2"/>
                        </fence>
                </clusternode>
        </clusternodes>
        <cman expected_votes="1" two_node="1"/>
        <rm>
                <resources>
                        <ip address="172.22.10.230" sleeptime="10"/>
                </resources>
                <service exclusive="1" name="eccsapmnt" recovery="relocate">
                        <ip ref="172.22.10.230"/>
                </service>
        </rm>
        <fencedevices>
                <fencedevice agent="fence_cisco_ucs" ipaddr="172.22.90.61" login="admin" name="ucs-node1" passwd="duc2Cisco"/>
                <fencedevice agent="fence_cisco_ucs" ipaddr="172.22.90.59" login="admin" name="ucs-node2" passwd="duc2Cisco"/>
        </fencedevices>
</cluster>
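
One thing that may be worth checking in a configuration like the one above is whether every `<method>` under `<fence>` contains a `<device>` element that refers to one of the `<fencedevice>` entries by name. The following is only a minimal Python sketch of such a check, not part of any cluster tooling: the function name `find_fence_problems` is hypothetical, the sample string is a trimmed copy of the posted file (the `<rm>` section, logins and passwords are left out), and it inspects the XML structure only, not whether the fence agent itself can reach the UCS manager.

```python
import xml.etree.ElementTree as ET

# Trimmed copy of the posted cluster.conf (assumption: only the parts
# relevant to fencing are kept; credentials and <rm> are omitted).
SAMPLE_CONF = """<?xml version="1.0"?>
<cluster config_version="7" name="eccprd">
  <clusternodes>
    <clusternode name="cgceccprd1.test.net" nodeid="1">
      <fence>
        <method name="ucs-node1"/>
      </fence>
    </clusternode>
    <clusternode name="cgceccprd2.test.net" nodeid="2">
      <fence>
        <method name="ucs-node2"/>
      </fence>
    </clusternode>
  </clusternodes>
  <cman expected_votes="1" two_node="1"/>
  <fencedevices>
    <fencedevice agent="fence_cisco_ucs" ipaddr="172.22.90.61" name="ucs-node1"/>
    <fencedevice agent="fence_cisco_ucs" ipaddr="172.22.90.59" name="ucs-node2"/>
  </fencedevices>
</cluster>
"""


def find_fence_problems(conf_xml):
    """Return a list of human-readable problems in the fence sections.

    Hypothetical helper for illustration: it flags nodes without a fence
    method, methods without <device> children, and devices whose name does
    not match any <fencedevice>.
    """
    root = ET.fromstring(conf_xml)
    defined = {fd.get("name") for fd in root.findall("./fencedevices/fencedevice")}
    problems = []
    for node in root.findall("./clusternodes/clusternode"):
        node_name = node.get("name")
        methods = node.findall("./fence/method")
        if not methods:
            problems.append(f"{node_name}: no fence method defined")
        for method in methods:
            devices = method.findall("device")
            if not devices:
                problems.append(
                    f"{node_name}: method {method.get('name')!r} has no <device> entries"
                )
            for device in devices:
                if device.get("name") not in defined:
                    problems.append(
                        f"{node_name}: device {device.get('name')!r} is not a defined fencedevice"
                    )
    return problems


if __name__ == "__main__":
    for problem in find_fence_problems(SAMPLE_CONF):
        print(problem)
```

Run against the trimmed sample, the sketch reports one problem per node, because both `<method>` elements are empty. To check a real file, the contents of the cluster.conf on the node could be read and passed to the same function.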

When I try to start the cluster on node1, I get these messages in /var/log/messages:

 tail -f -n 0 /var/log/messages
Sep 18 06:06:02 cgceccprd1 modcluster: Starting service: eccsapmnt on node 
Sep 18 06:06:08 cgceccprd1 modcluster: Starting service: eccsapmnt on node cgceccprd1.test.net


But the service does not start. In luci, both nodes are shown as online, but clustat shows something different.

The main error I am getting in /var/log/messages is:

Sep 18 03:35:48 cgceccprd1 fenced[8424]: fencing node cgceccprd2.test.net still retrying
Sep 18 04:06:16 cgceccprd1 fenced[8424]: fencing node cgceccprd2.test.net still retrying
Sep 18 04:36:45 cgceccprd1 fenced[8424]: fencing node cgceccprd2.test.net still retrying
Sep 18 05:07:14 cgceccprd1 fenced[8424]: fencing node cgceccprd2.test.net still retrying
Sep 18 05:37:42 cgceccprd1 fenced[8424]: fencing node cgceccprd2.test.net still retrying

These messages are from node1. I am getting the same message on node2, saying:

cgceccprd2 fenced[8424]: fencing node cgceccprd1.test.net still retrying
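
To see how regularly fenced repeats the "still retrying" message, the gaps between the node1 log lines can be computed. This is a small Python sketch under stated assumptions: the helper name `retry_gaps` is made up, the regular expression only matches lines in the exact shape shown above, and because syslog timestamps carry no year, an arbitrary placeholder year is supplied before parsing.

```python
import re
from datetime import datetime

# The five fenced lines from node1, copied from the log excerpt above.
LOG_LINES = [
    "Sep 18 03:35:48 cgceccprd1 fenced[8424]: fencing node cgceccprd2.test.net still retrying",
    "Sep 18 04:06:16 cgceccprd1 fenced[8424]: fencing node cgceccprd2.test.net still retrying",
    "Sep 18 04:36:45 cgceccprd1 fenced[8424]: fencing node cgceccprd2.test.net still retrying",
    "Sep 18 05:07:14 cgceccprd1 fenced[8424]: fencing node cgceccprd2.test.net still retrying",
    "Sep 18 05:37:42 cgceccprd1 fenced[8424]: fencing node cgceccprd2.test.net still retrying",
]

# Matches "<Mon> <day> <HH:MM:SS> <host> fenced[<pid>]: fencing node <name> still retrying".
FENCED_RE = re.compile(
    r"^(\w{3}\s+\d+ \d\d:\d\d:\d\d) \S+ fenced\[\d+\]: fencing node \S+ still retrying"
)


def retry_gaps(lines, year=2000):
    """Return the gaps in seconds between consecutive 'still retrying' lines.

    The year is a placeholder (assumption): syslog lines do not include one,
    and only differences between timestamps are used here.
    """
    stamps = []
    for line in lines:
        match = FENCED_RE.match(line)
        if match:
            stamps.append(
                datetime.strptime(f"{year} {match.group(1)}", "%Y %b %d %H:%M:%S")
            )
    return [int((later - earlier).total_seconds())
            for earlier, later in zip(stamps, stamps[1:])]


if __name__ == "__main__":
    for gap in retry_gaps(LOG_LINES):
        print(f"{gap} s (about {gap / 60:.1f} min)")
```

For the five lines above, the gaps come out at 1828, 1829, 1829 and 1828 seconds, so the message repeats roughly every 30 minutes over the two hours shown.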

Please help me solve this issue.

Regards,

Ben

Responses