An SAPHana resource returns the wrong primary status for a node when the hdbnsutil -sr_state command doesn't respond

Solution In Progress - Updated -

Issue

  • A message like the following is logged, where node_status does not match the node's true current status, and then Pacemaker initiates a failover.

    ACT: Using getParameter.py as fallback - node_status=<wrong_status>
    *or*
    ACT: hdbnsutil not answering - using global.ini as fallback - node_status=<wrong_status>
    
  • Secondary node does not get promoted after SAP HANA binaries are lost on the primary node in a High Availability cluster.

  • HANA is in stopped state on both nodes of a Pacemaker cluster after losing the hdb binaries on the primary node.
  • When the cluster is in maintenance mode, HANA continues to run on both nodes after the hdb binaries are removed from one node. When the cluster is not in maintenance mode, the SAPHana resource agent takes action to stop the HANA instances.
  • An SAPHanaTopology resource logs the following, where srmode does not match the node's true current SR mode.

    ACT: hdbnsutil not answering - using global.ini as fallback - srmode=<wrong_mode>
    

Environment

  • Red Hat Enterprise Linux 6 for SAP Solutions
  • Red Hat Enterprise Linux 7 for SAP Solutions
  • Red Hat Enterprise Linux 8 for SAP Solutions
  • SAP HANA System Replication in Scale-Up with Pacemaker

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In