possible hang condition in GHES code on sandy bridge

Solution Unverified - Updated -

Issue

  • The error is exhibited error with EDAC_DEBUG enabled:
EDAC DEBUG: in drivers/edac/sb_edac.c, line at 911: SAD interleave #0: 0
EDAC DEBUG: in drivers/edac/sb_edac.c, line at 919: mc#0: Error detected on SAD#0: address 0x0000000000000000 <
0x000000043fffffff, Interleave [8:6]XOR[18:16]
EDAC DEBUG: in drivers/edac/sb_edac.c, line at 943: SAD interleave index: 0 (wayness 1) = CPU socket 0
EDAC DEBUG: in drivers/edac/sb_edac.c, line at 1043: TAD#0: address 0x0000000000000000 < 0x00000000bfffffff, socket
interleave 0, channel interleave 4 (offset 0x00000000), index 0, base ch: 0, ch mask: 0x01
EDAC MC0: CE - no information available: Can't discover the memory rank for ch addr 0x00000000
not that one appears to get 0x0 as the address
EDAC DEBUG: in drivers/edac/sb_edac.c, line at 911: SAD interleave #0: 1
EDAC DEBUG: in drivers/edac/sb_edac.c, line at 919: mc#1: Error detected on SAD#1: address 0x00000006b5dc3700 <
0x000000083fffffff, Interleave [8:6]XOR[18:16]
EDAC DEBUG: in drivers/edac/sb_edac.c, line at 943: SAD interleave index: 0 (wayness 1) = CPU socket 1
EDAC DEBUG: in drivers/edac/sb_edac.c, line at 1043: TAD#0: address 0x00000006b5dc3700 < 0x000000083fffffff, socket
interleave 0, channel interleave 4 (offset 0x440000000), index 2, base ch: 2, ch mask: 0x04
EDAC MC1: CE - no information available: Can't discover the memory rank for ch addr 0x4ebb86c0
  • This one gets a more valid address but still cant decode rank.
  • You never see the error decoded back to the dimm.

Environment

  • Red Hat Enterprise Linux 6

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In

New to Red Hat?

Learn more about Red Hat subscriptions

Using a Red Hat product through a public cloud?

How to access this content