RHEL host unresponsive after receiving 'did_error = 12' from qla2x00_done()

Solution Unverified - Updated -

Environment

  • Red Hat Enterprise Linux 4

  • kernel 2.6.9-89.0.16

Issue

  • The node became unresponsive to network and console logins, and the console showed the following message:
scsi(1:0:2) qla2x00_done: did_error = 12. comp-scsi= 0x0-0x0 pid=9240538729

Resolution

  • did_error = 12 corresponds to the following definition in the kernel source:
    #define DID_IMM_RETRY   0x0c    /* Retry without decrementing retry count  */
  • Generally, this SCSI host status keeps transient errors, such as a BUFFER UNDERRUN, from taking a device offline or, in the device-mapper-multipath case, from failing a path. Such errors do not always indicate a problem on the array or switch itself; the cause may simply be a bad cable.
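
To illustrate how the number in the console message maps to a symbolic name, here is a minimal sketch in C. It assumes the host byte codes 0x00 through 0x0c as listed in the Linux kernel's include/scsi/scsi.h; the exact set may differ between kernel versions. The helper names host_byte_name() and parse_did_error() are hypothetical and are not part of the kernel or the qla2xxx driver.

```c
#include <stdio.h>
#include <string.h>

/* Host byte codes, assumed to match include/scsi/scsi.h (0x00 to 0x0c). */
static const char *const host_byte_names[] = {
    "DID_OK",          /* 0x00 */
    "DID_NO_CONNECT",  /* 0x01 */
    "DID_BUS_BUSY",    /* 0x02 */
    "DID_TIME_OUT",    /* 0x03 */
    "DID_BAD_TARGET",  /* 0x04 */
    "DID_ABORT",       /* 0x05 */
    "DID_PARITY",      /* 0x06 */
    "DID_ERROR",       /* 0x07 */
    "DID_RESET",       /* 0x08 */
    "DID_BAD_INTR",    /* 0x09 */
    "DID_PASSTHROUGH", /* 0x0a */
    "DID_SOFT_ERROR",  /* 0x0b */
    "DID_IMM_RETRY",   /* 0x0c */
};

/* Hypothetical helper: map a did_error value to its symbolic name. */
const char *host_byte_name(int code)
{
    int count = (int)(sizeof(host_byte_names) / sizeof(host_byte_names[0]));

    if (code < 0 || code >= count)
        return "UNKNOWN";
    return host_byte_names[code];
}

/*
 * Hypothetical helper: extract the number that follows "did_error = "
 * in a console line. Returns -1 if the line does not contain it.
 */
int parse_did_error(const char *line)
{
    const char *p = strstr(line, "did_error = ");
    int code;

    if (p == NULL || sscanf(p, "did_error = %d", &code) != 1)
        return -1;
    return code;
}
```

Passing the console line from the Issue section to parse_did_error() and the result to host_byte_name() yields the name defined above, DID_IMM_RETRY.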

Please review all physical interconnects between the Storage Array and HBA for any faulty hardware.

This solution is part of Red Hat’s fast-track publication program, providing a huge library of solutions that Red Hat engineers have created while supporting our customers. To give you the knowledge you need the instant it becomes available, these articles may be presented in a raw and unedited form.
