RHEL5 server panics frequently on the same logical processor with invalid register values
Issue
- A RHEL5 server panics frequently on the same logical processor, and the oops shows invalid register values:
Unable to handle kernel paging request at 000000000029f77e RIP:
[<ffffffff80010490>] number+0xd8/0x1d5
PGD c1b220067 PUD c1319e067 PMD 0
Oops: 0000 [1] SMP
last sysfs file: /devices/pci0000:00/0000:00:07.0/0000:05:00.1/irq
CPU 20
Modules linked in: mpt2sas scsi_transport_sas mptctl mptbase autofs4 ipmi_devintf ipmi_si ipmi_msghandler nfs nfs_acl lockd sunrpc acpi_cpufreq freq_table mperf bonding ipv6 xfrm_nalgo crypto_api ipt_REJECT xt_tcpudp xt_state ip_conntrack nfnetlink xt_multiport iptable_filter ip_tables x_tables dm_multipath scsi_dh video backlight sbs power_meter hwmon i2c_ec dell_wmi wmi button battery asus_acpi acpi_memhotplug ac parport_pc lp parport joydev sg ixgbe tpm_tis i2c_i801 pcspkr tpm 8021q e1000e i2c_core i7core_edac tpm_bios serio_raw edac_mc dca dm_raid45 dm_message dm_region_hash dm_mem_cache dm_snapshot dm_zero dm_mirror dm_log dm_mod ahci libata shpchp megaraid_sas sd_mod scsi_mod ext3 jbd uhci_hcd ohci_hcd ehci_hcd
Pid: 28448, comm: ps Tainted: G ---- 2.6.18-308.el5 #1
RIP: 0010:[<ffffffff80010490>] [<ffffffff80010490>] number+0xd8/0x1d5
RSP: 0018:ffff810c12d2ba18 EFLAGS: 00010246
RAX: 004189373ef9db23 RBX: 000000000000000a RCX: 0000000000000002
RDX: 000000007ffffffe RSI: ffff810c937fafff RDI: ffff810c137fb056
RBP: 0000000000000000 R08: 00000000ffffffff R09: 0000000000000020
R10: 004189373ef9db23 R11: 0000000000000000 R12: ffffffff8029f780
R13: 00000000ffffffff R14: 0000000000000000 R15: 000000000000000a
FS: 00002aed6ec87f80(0000) GS:ffff810c3fd7bc40(0000) knlGS:0000000000000000
CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 000000000029f77e CR3: 0000000c13e47000 CR4: 00000000000006a0
Process ps (pid: 28448, threadinfo ffff810c12d2a000, task ffff810c23e44040)
Stack: 3536383332353135 0000000000000000 0000000000000000 ffff810c28126c00
ffff810c23e44040 0000000000000014 ffff810c28126c00 ffff810c28126c00
0000000010000042 ffff810c28126c00 ffff810c12d2bb38 ffff810c12d2be68
Call Trace:
[<ffffffff8001a9c9>] vsnprintf+0x5df/0x627
[<ffffffff8003d806>] lock_timer_base+0x1b/0x3c
[<ffffffff8004729c>] sprintf+0x51/0x59
[<ffffffff800ece83>] free_poll_entry+0x11/0x1a
[<ffffffff8002f488>] do_sys_poll+0x334/0x360
[<ffffffff8001ec36>] __pollwait+0x0/0xe2
[<ffffffff8008ee72>] default_wake_function+0x0/0xe
[<ffffffff8003f8df>] memcpy_toiovec+0x36/0x66
[<ffffffff8001e9d6>] do_task_stat+0x79a/0x7e1
[<ffffffff8002cb45>] mntput_no_expire+0x19/0x89
[<ffffffff8000eb98>] link_path_walk+0xac/0xb8
[<ffffffff8010c132>] proc_info_read+0x5f/0xb9
[<ffffffff8000b72f>] vfs_read+0xcb/0x171
[<ffffffff80011d49>] sys_read+0x45/0x6e
[<ffffffff8005d116>] system_call+0x7e/0x83
Code: 41 8a 04 14 88 04 0c 48 ff c1 4d 85 d2 75 e1 41 89 ca 45 39
RIP [<ffffffff80010490>] number+0xd8/0x1d5
RSP <ffff810c12d2ba18>
- The server becomes stable when the physical processor that contains the affected logical CPU is disabled.
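As a rough cross-check of the register values in the oops above, the sketch below recomputes the faulting address from R12 and RDX. It rests on an assumption that the article does not state: that the first bytes of the Code line (41 8a 04 14) decode to a one-byte load from the address R12 + RDX. The function name `effective_address` is purely illustrative.

```python
# Register values copied verbatim from the oops above.
R12 = 0xFFFFFFFF8029F780  # assumed to be the base address of the load
RDX = 0x000000007FFFFFFE  # assumed to be the index added to that base
CR2 = 0x000000000029F77E  # faulting address reported by the kernel

MASK64 = (1 << 64) - 1    # x86_64 address arithmetic wraps at 64 bits


def effective_address(base, index):
    """Return base + index, truncated to 64 bits (hypothetical helper)."""
    return (base + index) & MASK64


fault = effective_address(R12, RDX)
print(hex(fault))    # 0x29f77e
print(fault == CR2)  # True
```

Under that assumption, the wrapped sum matches CR2 exactly, so the bad address appears to come from the value in RDX rather than from R12. An index of 0x7ffffffe looks implausible for a digit lookup while formatting a base-10 number (RBX holds 0xa), which is consistent with the register contents being invalid rather than the kernel data being corrupted.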
Environment
- Red Hat Enterprise Linux 5