node with fence_scsi fails to start cman with 'unfence failed' error messages in RHEL 6 when not using clvmd or the fence_scsi devices attribute

Solution Verified - Updated -

Issue

  • The node configured with scsi fencing is unable to join cluster and shows unfence failed error messages:

    Apr 23 15:58:29 node1 corosync[2326]:   [TOTEM ] A processor joined or left the membership and a new membership was formed.
    Apr 23 15:58:29 node1 corosync[2326]:   [QUORUM] Members[2]: 1 2
    Apr 23 15:58:29 node1 corosync[2326]:   [QUORUM] Members[2]: 1 2
    Apr 23 15:58:29 node1 corosync[2326]:   [CPG   ] chosen downlist: sender r(0) ip(10.65.211.76) ; members(old:1 left:0)
    Apr 23 15:58:29 node1 corosync[2326]:   [MAIN  ] Completed service synchronization, ready to provide service.
    Apr 23 15:58:29 node1 dlm_controld[2412]: dlm_controld 3.0.12.1 started
    Apr 23 15:58:29 node1 gfs_controld[2459]: gfs_controld 3.0.12.1 started
    Apr 23 15:58:30 node1 fence_node[2502]: unfence node1 failed            <----------
    Apr 23 15:58:30 node1 kernel: dlm: closing connection to node 2
    Apr 23 15:58:30 node1 kernel: dlm: closing connection to node 1
    Apr 23 15:58:31 node1 corosync[2326]:   [SERV  ] Unloading all Corosync service engines.
    Apr 23 15:58:31 node1 corosync[2326]:   [SERV  ] Service engine unloaded: corosync extended virtual synchrony service
    Apr 23 15:58:31 node1 corosync[2326]:   [SERV  ] Service engine unloaded: corosync configuration service
    Apr 23 15:58:31 node1 corosync[2326]:   [SERV  ] Service engine unloaded: corosync cluster closed process group service v1.01
    Apr 23 15:58:31 node1 corosync[2326]:   [SERV  ] Service engine unloaded: corosync cluster config database access v1.01
    Apr 23 15:58:31 node1 corosync[2326]:   [SERV  ] Service engine unloaded: corosync profile loading service
    Apr 23 15:58:31 node1 corosync[2326]:   [SERV  ] Service engine unloaded: openais checkpoint service B.01.01
    Apr 23 15:58:31 node1 corosync[2326]:   [SERV  ] Service engine unloaded: corosync CMAN membership service 2.90
    Apr 23 15:58:31 node1 corosync[2326]:   [SERV  ] Service engine unloaded: corosync cluster quorum service v0.1
    Apr 23 15:58:31 node1 corosync[2326]:   [MAIN  ] Corosync Cluster Engine exiting with status 0 at main.c:1894.
    

Environment

  • Red Hat Enterprise Linux Server 6 with the High Availability and/or Resilient Storage Add Ons
  • fence_scsi
    • No devices attribute is specified on the fence_scsi <fencedevice/>
  • The cluster is not utilizing clvmd, or has no volume groups with the clustered attribute set

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase of over 48,000 articles and solutions.

Current Customers and Partners

Log in for full access

Log In
Close

Welcome! Check out the Getting Started with Red Hat page for quick tours and guides for common tasks.