Oracle HA cluster in RHEL 5.5

Latest response

Hi,

 

I've 2 HP DL380 G6 server with Redhat Enterprise Linux Advance Edition 5.5 x64 intalled (both the nodes having Fiber HBA cards and connected with SAN switches). I've assigned 5 common storage LUNs to both these servers on GFS2 filesystem and their mount points are /oradata, /data1, /index, /redo and /arch. I've setup 2 node Active / Passive cluster using Redhat cluster suite. I've Set the priority 1 to node 1 and 2 to node 2 in failover domain. The same volume and virtual ip ( or can say cluster ip for applications) will be available at one node at a time. In case of failure of node 1, the cluster resources (i.e. VIP, SAN vols) will switchover to node 2. As soon as the node 1 start working and get on the cluster resources are shifted back to node 1. This setup is tested and working fine.

Now, I've to configure the RHEL cluster in this way so that Oracle DB instance(s) should failover along with the other resources (vip and san vols).

Both servers having oracle 11g R2 Enterprise (non-RAC) installed on both servers. We're using GFS2 filesystem over CLVM on shared SAN volumes.

I want to clear few things that We're going to implement in this scenario.

 

1. we're not going to use Oracle RAC. We've configured OS level clustering in Active / Passive mode.
2. I want to start the Oracle DB services automatically by cluster service on Primary node while it will keep stopped in secondary node.
3. when the cluster services will shift from Primary to Secondary server, it should start the DB services on the other node, but before shifting to other node the DB services should stopped properly on the Primary node. For testing this, I've purposely stopped the rgmanager service (i.e. service rgmanager stop), which immediately starting to failover the cluster services and resources to secondary node. But what I've observed that the Oracle DB service still running on primary and the resources shifted to secondary node except those volumes which are still accessed by oracle or underlaying services (i.e. /index, /redo, /arch) on primary node. Rest volumes shifted to 2nd node. Later the cluster service failed to start (verified by "clustat" command).

 

I want such script which should be understandable and manageagle by cluster resource manager. It should work on both scenarios...

 

1. In case, 1st node get down completely. It should shift all the cluster resources (i.e. vip, san vols and oracle db service).
2. In case, if there any os dependent of application/db dependent service fails or not responds, cluster should be intelligent enough to understand that and failover the cluster services from 1st node to 2nd node properly and fence it (suppose, if i stop the rgmanager or oracle listener service, it should understand this and failover the cluster services to 2nd node and then fence it). But it's not happening.

 

Does anybody get any success in this. Pls help me to accomplish the same (either using conga or system-config-cluster).

 

Regards

Arunabh

Responses