[ixgbe] Intel 82599 network drops connections with "Received unrecoverable ECC Err, please reboot' message
Issue
- NIC stops working with the following error seen in the /var/log/messages file and/or dmesg output:
Nov 28 20:29:03 localhost kernel: ixgbe 0000:05:00.0: eth12: Received unrecoverable ECC Err, please reboot
- It may or may not also include the a Tx Unit Hang error:
Nov 28 20:29:08 localhost kernel: ixgbe 0000:05:00.0: eth12: Detected Tx Unit Hang
Nov 28 20:29:08 localhost kernel: Tx Queue <16>
Nov 28 20:29:08 localhost kernel: TDH, TDT <d7c>, <d94>
Nov 28 20:29:08 localhost kernel: next_to_use <d94>
Nov 28 20:29:08 localhost kernel: next_to_clean <d7c>
Nov 28 20:29:08 localhost kernel: tx_buffer_info[next_to_clean]
Nov 28 20:29:08 localhost kernel: time_stamp <111922672>
Nov 28 20:29:08 localhost kernel: jiffies <1119237da>
Environment
- Red Hat Enterprise Linux 6
- ixgbe module
- Intel Niantic (82599) NIC
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.