Why the server got rebooted unexpectedly generating NIC firmware traces ?
Issue
- Server got rebooted and below are the logs before reboot.
Mar 28 06:58:27 HOSTNAME last message repeated 7 times
Mar 28 06:58:27 HOSTNAME kernel: NETDEV WATCHDOG: eth0: transmit timed out
Mar 28 06:58:28 HOSTNAME snmpd[6154]: Connection from UDP: [127.0.0.1]:53803
Mar 28 06:58:29 HOSTNAME snmpd[6154]: Connection from UDP: [127.0.0.1]:53803
Mar 28 06:58:29 HOSTNAME kernel: nx_nic: Flash Version: Firmware[4.0.555], BIOS[2.1.0]
Mar 28 06:58:29 HOSTNAME kernel: nx_nic: No memory on card. Load Cut through.
Mar 28 06:58:29 HOSTNAME kernel: Unable to handle kernel NULL pointer dereference at 0000000000000000 RIP:
Mar 28 06:58:29 HOSTNAME kernel: [<ffffffff800074ee>] kmem_cache_free+0x54/0x1e3
Mar 28 06:58:29 HOSTNAME kernel: PGD 0
Mar 28 06:58:29 HOSTNAME kernel: Oops: 0000 [1] SMP
Mar 28 06:58:29 HOSTNAME kernel: last sysfs file: /class/firmware/0000:04:00.0/loading
Mar 28 06:58:29 HOSTNAME kernel: CPU 13
Mar 28 06:58:29 HOSTNAME kernel: Modules linked in: mptctl mptbase nfs nfs_acl lockd sunrpc be2iscsi(U) ib_iser rdma_cm ib_cm iw_cm ib_sa ib_mad ib_addr iscsi_tcp xfrm_nalgo crypto_api uio cxgb3i libcxgbi iw_cxgb3 ib_core cxgb3 8021q libiscsi_tcp libiscsi2 scsi_transport_iscsi2 scsi_transport_iscsi dm_multipath scsi_dh video backlight sbs power_meter hwmon i2c_ec i2c_core dell_wmi wmi button battery asus_acpi acpi_memhotplug ac parport_pc lp parport joydev sr_mod cdrom sg i7core_edac edac_mc tpm_tis hpilo tpm nx_nic(U) shpchp serio_raw tpm_bios pcspkr dm_raid45 dm_message dm_region_hash dm_mem_cache dm_snapshot dm_zero dm_mirror dm_log dm_mod ata_piix libata cciss sd_mod scsi_mod ext3 jbd uhci_hcd ohci_hcd ehci_hcd
Mar 28 06:58:29 HOSTNAME kernel: Pid: 28045, comm: firmware_helper Tainted: G ---- 2.6.18-274.el5 #1
Mar 28 06:58:29 HOSTNAME kernel: RIP: 0010:[<ffffffff800074ee>] [<ffffffff800074ee>] kmem_cache_free+0x54/0x1e3
Mar 28 06:58:29 HOSTNAME kernel: RSP: 0018:ffff8108226eded0 EFLAGS: 00010246
Mar 28 06:58:29 HOSTNAME kernel: RAX: 0000000000000000 RBX: ffff810821c03800 RCX: 00000000000fe000
Mar 28 06:58:29 HOSTNAME kernel: RDX: 0000000000000000 RSI: 000001bc80000000 RDI: 00000000000007f0
Mar 28 06:58:29 HOSTNAME kernel: RBP: 0000000000000018 R08: c000003e00000001 R09: 0000000000000028
Mar 28 07:12:08 HOSTNAME syslogd 1.4.1: restart.
Mar 28 07:12:08 HOSTNAME kernel: klogd 1.4.1, log source = /proc/kmsg started.
Mar 28 07:12:08 HOSTNAME kernel: Linux version 2.6.18-274.el5 (mockbuild@x86-002.build.bos.redhat.com) (gcc version 4.1.2 20080704 (Red Hat 4.1.2-51)) #1 SMP Fri Jul 8 17:36:59 EDT 2011
- What has caused to generated above logs which resulted in system to become unresponsive and then reboot ?
Environment
- Red Hat Enterprise Linux 5
kernel-2.6.18-274.el5
- firmware-version: 4.0.555
nx_nic
network card module
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.