Why ASCS is reported as 'not running' when ERS groups moves out of node where also ASCS group was running?

Solution In Progress - Updated -

Issue

Running ASCS and ERS instances via separate SAPInstance reosurces in 2 resource groups causes the ASCS to be stopped when ERS group moves out of the node that was running both ASCS and ERS resource group.

## Before move
   Resource Group: ASCS_group
       fs_RH1_ASCS        (ocf::heartbeat:Filesystem):    Started node-1
       nc_RH1_ASCS        (ocf::heartbeat:anything):      Started node-1
       vip_RH1_ASCS       (ocf::heartbeat:IPaddr2):       Started node-1
       rsc_sap_RH1_ASCS00 (ocf::heartbeat:SAPInstance):   Started node-1
   Resource Group: ERS_group
       fs_RH1_ERS (ocf::heartbeat:Filesystem):    Started node-1
       nc_RH1_ERS (ocf::heartbeat:anything):      Started node-1
       vip_RH1_ERS        (ocf::heartbeat:IPaddr2):       Started node-1
       rsc_sap_RH1_ERS10  (ocf::heartbeat:SAPInstance):   Started node-1 
## After move
   Resource Group: ASCS_group
       fs_RH1_ASCS        (ocf::heartbeat:Filesystem):    Started node-1
       nc_RH1_ASCS        (ocf::heartbeat:anything):      Started node-1
       vip_RH1_ASCS       (ocf::heartbeat:IPaddr2):       Started node-1
       rsc_sap_RH1_ASCS00 (ocf::heartbeat:SAPInstance):   Started node-1
   Resource Group: ERS_group
       fs_RH1_ERS (ocf::heartbeat:Filesystem):    Started node-2
       nc_RH1_ERS (ocf::heartbeat:anything):      Started node-2
       vip_RH1_ERS        (ocf::heartbeat:IPaddr2):       Started node-2
       rsc_sap_RH1_ERS10  (ocf::heartbeat:SAPInstance):   Started node-2 
Failed actions:
* rsc_sap_RH1_ASCS00_monitor_120 on node-1 'not running' (7)

Cluster logs are reports that during move of ERS the ASCS processed were stopped because they were accessing the ERS directories.

Filesystem(rh1_fs_ers10)[27942]:    INFO: Running stop for nfs-storage:ERS10 on /usr/sap/RH1/ERS10 
Filesystem(rh1_fs_ers10)[27942]:    INFO: Trying to unmount /usr/sap/RH1/ERS10 
Filesystem(rh1_fs_ers10)[27942]:    ERROR: Couldn't unmount /usr/sap/RH1/ERS10; trying cleanup with TERM 
Filesystem(rh1_fs_ers10)[27942]:    INFO: sending signal TERM to: rh1adm   14232     1  0 02:42 ?        Ssl    0:00 /usr/sap/RH1/ERS10/exe/sapstartsrv pf=/sapmnt/RH1/profile/RH1_ERS10_rh1-ers -D -u rh1adm    
Filesystem(rh1_fs_ers10)[27942]:    INFO: sending signal TERM to: rh1adm   21604     1  0 02:51 ?        Ssl    0:00 /usr/sap/RH1/ASCS00/exe/sapstartsrv pf=/sapmnt/RH1/profile/RH1_ASCS00_rh1-ascs -D -u rh1adm 
Filesystem(rh1_fs_ers10)[27942]:    INFO: sending signal TERM to: rh1adm   21786     1  0 02:51 ?        Ss     0:00 sapstart pf=/sapmnt/RH1/profile/RH1_ASCS00_rh1-ascs 
Filesystem(rh1_fs_ers10)[27942]:    INFO: sending signal TERM to: rh1adm   21806 21786  0 02:51 ?        Ss     0:00 ms.sapRH1_ASCS00 pf=/usr/sap/RH1/SYS/profile/RH1_ASCS00_rh1-ascs 
Filesystem(rh1_fs_ers10)[27942]:    INFO: sending signal TERM to: rh1adm   21810 21786  0 02:51 ?        Ssl    0:00 en.sapRH1_ASCS00 pf=/usr/sap/RH1/SYS/profile/RH1_ASCS00_rh1-ascs 
Filesystem(rh1_fs_ers10)[27942]:    INFO: unmounted /usr/sap/RH1/ERS10 successfully 
[12867] node-1       lrmd:   notice: operation_finished:     rh1_fs_ers10_stop_0:27942:stderr [ umount.nfs4: /usr/sap/RH1/ERS10: device is busy ] 

Environment

  • Red Hat Enterprise Linux 7 With High Availability Add-on
  • SAPInstance resources managing ASCS and ERS using resource groups (not Master/Slave)

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In

New to Red Hat?

Learn more about Red Hat subscriptions

Using a Red Hat product through a public cloud?

How to access this content