"EDAC i5000 MC0: FATAL ERRORS Found!!! 1st FATAL Err Reg= 0x4" is output

Solution Verified - Updated -

Issue

  • A Server suddenly went down, we need root cause for this in Red Hat Enterprise Linux 4.
EDAC i5000 MC0: FATAL ERRORS Found!!! 1st FATAL Err Reg= 0x4 
EDAC i5000 MC0: >Tmid Thermal event with intelligent throttling disabled 
EDAC MC0: UE row 2, channel-a= 2 channel-b= 3 labels "-": 
(Branch=1 DRAM-Bank=3 RDWR=Write RAS=14656 CAS=0 FATAL Err=0x4) 
Kernel panic - not syncing: UE row 2, channel-a= 2 channel-b= 3 labels "-": 
(Branch=1 DRAM-Bank=3 RDWR=Write RAS=14656 CAS=0 FATAL Err=0x4) 

----------- [cut here ] --------- [please bite here ] --------- 
Kernel BUG at panic:75 
invalid operand: 0000 [1] SMP 
CPU 3 
Modules linked in: hangcheck_timer sg mptctl mptbase ipmi_devintf ipmi_si 
ipmi_msghandler dell_rbu md5 ipv6 netconsole netdump oracleasm(U) autofs4 i2c_dev 
i2c_core ocfs2(U) debugfs(U) ocfs2_dlmfs(U) ocfs2_dlm(U) ocfs2_nodemanager(U) configfs(U) 
emcpdm(U) emcpgpx(U) emcpmpx(U) emcp(U) button battery ac uhci_hcd ehci_hcd i5000_edac edac_mc 
hw_random igb bnx2 dm_snapshot dm_zero dm_mirror ext3 jbd dm_mod qla2xxx(U) qla2xxx_conf(U) 
megaraid_sas(U) ata_piix libata sd_mod scsi_mod 
Pid: 3446, comm: kedac Tainted: P      2.6.9-78.0.1.ELsmp 
RIP: 0010:[<ffffffff801385e2>] <ffffffff801385e2>{panic+211} 
RSP: 0018:000001042eb01bb8  EFLAGS: 00010286 
RAX: 0000000000000090 RBX: ffffffffa02409d0 RCX: 0000000000000246 
RDX: 0000000000009e28 RSI: 0000000000000246 RDI: ffffffff803f64c0 
RBP: 000001042eb01d78 R08: 000000000000000d R09: ffffffffa02409d0 
R10: 000001042effa000 R11: 0000000000000000 R12: 0000000000000180 
R13: 000001042eb01ca0 R14: 0000000000000003 R15: 0000000000000002 
FS:  0000000000000000(0000) GS:ffffffff8050d400(0000) knlGS:0000000000000000 
CS:  0010 DS: 0018 ES: 0018 CR0: 000000008005003b 
CR2: 00000000006dfe88 CR3: 0000000037d54000 CR4: 00000000000006e0 
Process kedac (pid: 3446, threadinfo 000001042eb00000, task 000001042f36f7f0) 
Stack: 0000003000000030 000001042eb01c98 000001042eb01bd8 0000000000000000 
       0000000000000002 0000000000000002 0000000000000002 0000000000000003 
       000001042eb01ca0 000001042eb01e18 
Call Trace:<ffffffff801f017f>{vsnprintf+1406} <ffffffffa023fa07>{:edac_mc:edac_mc_handle_fbd_ue+322} 
       <ffffffffa023fa30>{:edac_mc:edac_mc_handle_fbd_ue+363} 
       <ffffffff8013002d>{ia32_setup_arg_pages+167} <ffffffff801f4bcb>{pci_bus_read_config_dword+104} 
       <ffffffffa02482c7>{:i5000_edac:i5000_check_error+307} 
       <ffffffff80318308>{thread_return+0} <ffffffff80318360>{thread_return+88} 
       <ffffffff8014c5a0>{keventd_create_kthread+0} <ffffffff80318e47>{schedule_timeout+396} 
       <ffffffff8014c5a0>{keventd_create_kthread+0} <ffffffffa023fd29>{:edac_mc:check_mc_devices+82} 
       <ffffffffa023fd43>{:edac_mc:edac_kernel_thread+0} <ffffffffa023fd4c>{:edac_mc:edac_kernel_thread+9} 
       <ffffffff8014c577>{kthread+200} <ffffffff80110fd3>{child_rip+8} 
       <ffffffff8014c5a0>{keventd_create_kthread+0} <ffffffff8014c4af>{kthread+0} 
       <ffffffff80110fcb>{child_rip+0} 

Code: 0f 0b d8 cc 32 80 ff ff ff ff 4b 00 31 ff e8 77 bd fe ff e8 
RIP <ffffffff801385e2>{panic+211} RSP <000001042eb01bb8>
  • The following messages are output in Red Hat Enterprise Linux 5.
kernel: EDAC i5000 MC0:FATAL ERRORS Found!!! 1st FATAL Err Reg= 0x4
kernel: EDAC MC0: UE row 2, channel-a= 0 channel-b= 1 labels "-": (Branch=0 DRAM-Bank=3 RDWR=Read RAS=41 CAS=0 FATAL Err=0x4)

Environment

  • Red Hat Enterprise Linux 4.7
  • Red Hat Enterprise Linux 5.2
  • EDAC

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In

New to Red Hat?

Learn more about Red Hat subscriptions

Using a Red Hat product through a public cloud?

How to access this content