RHEL5 server panics frequently on same logical processor with invalid register values

Solution Unverified - Updated -

Issue

  • A RHEL5 server panics frequently on the same logical processor, with invalid register values in the oops:
Unable to handle kernel paging request at 000000000029f77e RIP: 
 [<ffffffff80010490>] number+0xd8/0x1d5
PGD c1b220067 PUD c1319e067 PMD 0 
Oops: 0000 [1] SMP 
last sysfs file: /devices/pci0000:00/0000:00:07.0/0000:05:00.1/irq
CPU 20 
Modules linked in: mpt2sas scsi_transport_sas mptctl mptbase autofs4 ipmi_devintf ipmi_si ipmi_msghandler nfs nfs_acl lockd sunrpc acpi_cpufreq freq_table mperf bonding ipv6 xfrm_nalgo crypto_api ipt_REJECT xt_tcpudp xt_state ip_conntrack nfnetlink xt_multiport iptable_filter ip_tables x_tables dm_multipath scsi_dh video backlight sbs power_meter hwmon i2c_ec dell_wmi wmi button battery asus_acpi acpi_memhotplug ac parport_pc lp parport joydev sg ixgbe tpm_tis i2c_i801 pcspkr tpm 8021q e1000e i2c_core i7core_edac tpm_bios serio_raw edac_mc dca dm_raid45 dm_message dm_region_hash dm_mem_cache dm_snapshot dm_zero dm_mirror dm_log dm_mod ahci libata shpchp megaraid_sas sd_mod scsi_mod ext3 jbd uhci_hcd ohci_hcd ehci_hcd
Pid: 28448, comm: ps Tainted: G     ---- 2.6.18-308.el5 #1
RIP: 0010:[<ffffffff80010490>]  [<ffffffff80010490>] number+0xd8/0x1d5
RSP: 0018:ffff810c12d2ba18  EFLAGS: 00010246
RAX: 004189373ef9db23 RBX: 000000000000000a RCX: 0000000000000002
RDX: 000000007ffffffe RSI: ffff810c937fafff RDI: ffff810c137fb056
RBP: 0000000000000000 R08: 00000000ffffffff R09: 0000000000000020
R10: 004189373ef9db23 R11: 0000000000000000 R12: ffffffff8029f780
R13: 00000000ffffffff R14: 0000000000000000 R15: 000000000000000a
FS:  00002aed6ec87f80(0000) GS:ffff810c3fd7bc40(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 000000000029f77e CR3: 0000000c13e47000 CR4: 00000000000006a0
Process ps (pid: 28448, threadinfo ffff810c12d2a000, task ffff810c23e44040)
Stack:  3536383332353135 0000000000000000 0000000000000000 ffff810c28126c00
 ffff810c23e44040 0000000000000014 ffff810c28126c00 ffff810c28126c00
 0000000010000042 ffff810c28126c00 ffff810c12d2bb38 ffff810c12d2be68
Call Trace:
 [<ffffffff8001a9c9>] vsnprintf+0x5df/0x627
 [<ffffffff8003d806>] lock_timer_base+0x1b/0x3c
 [<ffffffff8004729c>] sprintf+0x51/0x59
 [<ffffffff800ece83>] free_poll_entry+0x11/0x1a
 [<ffffffff8002f488>] do_sys_poll+0x334/0x360
 [<ffffffff8001ec36>] __pollwait+0x0/0xe2
 [<ffffffff8008ee72>] default_wake_function+0x0/0xe
 [<ffffffff8003f8df>] memcpy_toiovec+0x36/0x66
 [<ffffffff8001e9d6>] do_task_stat+0x79a/0x7e1
 [<ffffffff8002cb45>] mntput_no_expire+0x19/0x89
 [<ffffffff8000eb98>] link_path_walk+0xac/0xb8
 [<ffffffff8010c132>] proc_info_read+0x5f/0xb9
 [<ffffffff8000b72f>] vfs_read+0xcb/0x171
 [<ffffffff80011d49>] sys_read+0x45/0x6e
 [<ffffffff8005d116>] system_call+0x7e/0x83


Code: 41 8a 04 14 88 04 0c 48 ff c1 4d 85 d2 75 e1 41 89 ca 45 39 
RIP  [<ffffffff80010490>] number+0xd8/0x1d5
 RSP <ffff810c12d2ba18>
  • The server becomes stable when the physical processor containing the affected logical CPU is disabled.
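
The "invalid register values" can be checked against the oops itself. The sketch below is an illustration, not part of the original solution. It assumes that the Code: dump starts at the faulting instruction and that its first bytes (41 8a 04 14) decode to a one-byte load from R12 + RDX. Both are a reading of the dump rather than something the article states. Under that assumption, the faulting address in CR2 should equal R12 + RDX truncated to 64 bits. The helper name parse_registers is hypothetical.

```python
import re

MASK64 = (1 << 64) - 1  # x86_64 effective addresses wrap at 64 bits

# Register lines copied verbatim from the oops above.
OOPS_REGS = """\
RDX: 000000007ffffffe RSI: ffff810c937fafff RDI: ffff810c137fb056
R10: 004189373ef9db23 R11: 0000000000000000 R12: ffffffff8029f780
CR2: 000000000029f77e CR3: 0000000c13e47000 CR4: 00000000000006a0
"""


def parse_registers(text):
    """Return {name: value} for every 'NAME: <16 hex digits>' pair in the text."""
    pairs = re.findall(r"\b([A-Z][A-Z0-9]{1,2}): ([0-9a-f]{16})\b", text)
    return {name: int(value, 16) for name, value in pairs}


regs = parse_registers(OOPS_REGS)

# Assumed addressing mode of the faulting load: base R12 plus index RDX.
effective = (regs["R12"] + regs["RDX"]) & MASK64

print(f"R12 + RDX = {effective:016x}")
print(f"CR2       = {regs['CR2']:016x}")
print("match" if effective == regs["CR2"] else "mismatch")
```

If the two values match, the fault is explained by RDX alone: R12 holds a kernel-space address, while an index of 0x7ffffffe pushes the sum past the 64-bit boundary into an unmapped low address. Running the same comparison on each captured oops is one way to see whether a register on the affected CPU is repeatedly implausible.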

Environment

  • Red Hat Enterprise Linux 5
