Server is showing 'SCSI error: return code = 0x06050000' in the logs and presenting disk issues
Issue
- One of our clustered servers started having disk issues.
- When checking the logs, we found the following messages constantly repeated themselves:
Jul 27 04:57:21 host kernel: Result: hostbyte=DID_ABORT driverbyte=DRIVER_TIMEOUT,SUGGEST_OK
Jul 27 04:57:21 host kernel: Buffer I/O error on device dm-17, logical block 17696630
Jul 27 04:57:21 host kernel: lost page write due to I/O error on dm-17
Jul 27 04:57:21 host kernel: Buffer I/O error on device dm-17, logical block 17696631
Jul 27 04:57:21 host kernel: lost page write due to I/O error on dm-17
Jul 27 04:57:21 host kernel: Buffer I/O error on device dm-17, logical block 17696632
Jul 27 04:57:21 host kernel: lost page write due to I/O error on dm-17
Jul 27 04:57:21 host kernel: sd 6:0:0:0: timing out command, waited 360s
Jul 27 04:57:21 host kernel: sd 6:0:0:0: Unhandled error code
Jul 27 04:57:21 host kernel: sd 6:0:0:0: SCSI error: return code = 0x06050000
Jul 27 04:57:21 host kernel: Result: hostbyte=DID_ABORT driverbyte=DRIVER_TIMEOUT,SUGGEST_OK
Jul 27 04:57:21 host kernel: sd 6:0:0:0: timing out command, waited 360s
Jul 27 04:57:21 host kernel: sd 6:0:0:0: Unhandled error code
Jul 27 04:57:21 host kernel: sd 6:0:0:0: SCSI error: return code = 0x06050000
Jul 27 04:57:21 host kernel: Result: hostbyte=DID_ABORT driverbyte=DRIVER_TIMEOUT,SUGGEST_OK
Jul 27 04:57:21 host kernel: sd 6:0:0:0: timing out command, waited 360s
Jul 27 04:57:21 host kernel: sd 6:0:0:0: Unhandled error code
Jul 27 04:57:21 host kernel: sd 6:0:0:0: SCSI error: return code = 0x06050000
- As a consequence, the server stopped responding and was eventually fenced by the other node.
Environment
- Red Hat Enterprise Linux (RHEL) 5
- Red Hat Enterprise Linux (RHEL) 6
- External storage in use, such as iSCSI or SAN
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.