Server becomes unresponsive with 'megasas: waiting for X commands to complete' errors in the logs
Issue
-
Server become unresponsive after following error messages in the logs:
Jun 10 23:07:51 host1 Server Administrator: Storage Service EventID: 2266 Controller log file entry: Physical Disk 1:0:4 Controller 0, Connector 1 Jun 10 23:07:51 host1 kernel: sd 0:2:1:0: megasas: RESET -2405454574 cmd=2a retries=0 [...] Jun 10 23:07:56 host1 kernel: megasas: [ 5]waiting for 2 commands to complete Jun 10 23:08:01 host1 kernel: megasas: [10]waiting for 2 commands to complete Jun 10 23:09:49 host1 kernel: megasas: [15]waiting for 2 commands to complete Jun 10 23:09:49 host1 kernel: megasas: [20]waiting for 2 commands to complete [...] Jun 10 23:09:49 host1 kernel: megasas: [115]waiting for 2 commands to complete Jun 10 23:09:49 host1 kernel: megasas: reset successful Jun 10 23:09:49 host1 kernel: sd 0:2:1:0: timing out command, waited 360s Jun 10 23:09:49 host1 kernel: sd 0:2:1:0: SCSI error: return code = 0x06000000 Jun 10 23:09:49 host1 kernel: end_request: I/O error, dev sdb, sector 419717895 Jun 10 23:09:49 host1 kernel: Buffer I/O error on device dm-12, logical block 35881 Jun 10 23:09:49 host1 kernel: lost page write due to I/O error on dm-12 Jun 10 23:09:49 host1 kernel: Aborting journal on device dm-12. Jun 10 23:09:50 host1 Server Administrator: Storage Service EventID: 2266 Controller log file entry: Physical Disk 1:0:4 Controller 0, Connector 1 [...]
Environment
- Red Hat Enterprise Linux 5
- Red Hat Enterprise Linux 6
- LSI Logic / Symbios Logic MegaRAID SAS 1078 controller
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.