Oops on dump_trace() with several 'CPU stuck' messages for idle CPUs
Issue
- Oops on dump_trace() with several 'CPU stuck' messages for idle CPUs
general protection fault: 0000 [1] SMP
last sysfs file: /devices/pci0000:00/0000:00:03.0/class
CPU 3
Modules linked in: autofs4 nfs lockd fscache nfs_acl sunrpc dm_multipath scsi_dh video backlight sbs i2c_ec button battery asus_acpi acpi_memhotplug ac parport_pc lp parport e1000(U) floppy i2c_amd756 i2c_core ide_cd serio_raw amd_rng k8_edac edac_mc k8temp cdrom tg3 hwmon libphy pcspkr dm_raid45 dm_message dm_region_hash dm_mem_cache dm_snapshot dm_zero dm_mirror dm_log dm_mod usb_storage shpchp cciss sd_mod scsi_mod ext3 jbd uhci_hcd ohci_hcd ehci_hcd
Pid: 6022, comm: cinfo Tainted: G 2.6.18-128.1.6.el5 #1
RIP: 0010:[<ffffffff80063132>] [<ffffffff80063132>] thread_return+0xfd/0xfe
RSP: 0018:000000000c47c91a EFLAGS: 00054646
RAX: 0000000000000010 RBX: 2f36363238786c2f RCX: 00000000c0000100
RDX: ffff810303841400 RSI: ffff8103e3049040 RDI: ffff8103e3049040
RBP: 6465732f6e69622f R08: ffff8103dcf46000 R09: 000000000000003c
R10: ffff8103d132f740 R11: ffff8100abca4718 R12: 4f4d542a006e6962
R13: 5f41000000005455 R14: 4f4d542a223d7a5f R15: 3d5f000000005455
FS: 00002b4168aaedc0(0000) GS:ffff8103fff3a9c0(0000) knlGS:00000000f7d95b90
CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 0000000000722220 CR3: 000000039d192000 CR4: 00000000000006e0
Process cinfo (pid: 6022, threadinfo ffff8103dcf46000, task ffff8103e3049040)
Stack: c8d1000000000c00 c8f1000000000c47 0000000000000c47 c980000000000000
048e000000000c47 0000000000000000 c880000000000000 022b000000000c47
0002000000000000 c84048454c440000 c940000000000c47 0000000000000c47
Call Trace:
Unable to handle kernel paging request at 000000000c4d0000 RIP:
[<ffffffff8006b8b4>] dump_trace+0x206/0x23a
PGD 3f2dae067 PUD 390d77067 PMD 398754067 PTE 0
Oops: 0000 [2] SMP
last sysfs file: /devices/pci0000:00/0000:00:03.0/class
CPU 3
Modules linked in: autofs4 nfs lockd fscache nfs_acl sunrpc dm_multipath scsi_dh video backlight sbs i2c_ec button battery asus_acpi acpi_memhotplug ac parport_pc lp parport e1000(U) floppy i2c_amd756 i2c_core ide_cd serio_raw amd_rng k8_edac edac_mc k8temp cdrom tg3 hwmon libphy pcspkr dm_raid45 dm_message dm_region_hash dm_mem_cache dm_snapshot dm_zero dm_mirror dm_log dm_mod usb_storage shpchp cciss sd_mod scsi_mod ext3 jbd uhci_hcd ohci_hcd ehci_hcd
Pid: 6022, comm: cinfo Tainted: G 2.6.18-128.1.6.el5 #1
RIP: 0010:[<ffffffff8006b8b4>] [<ffffffff8006b8b4>] dump_trace+0x206/0x23a
RSP: 0018:000000000c47c6c8 EFLAGS: 00050006
RAX: 0000000000000000 RBX: 000000000c4cfffa RCX: 0000000000004f04
RDX: 0000000000000000 RSI: 0000000000040012 RDI: ffffffff802fb850
RBP: 0003000000000c45 R08: ffffffff8006b8b4 R09: 000000000000003c
R10: ffffffff803d9520 R11: 0000000000000000 R12: 000000000c47c868
R13: 0000000000000000 R14: ffffffff802f0680 R15: ffff8100f578ffc0
FS: 00002b4168aaedc0(0000) GS:ffff8103fff3a9c0(0000) knlGS:00000000f7d95b90
CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 000000000c4d0000 CR3: 000000039d192000 CR4: 00000000000006e0
Process cinfo (pid: 6022, threadinfo ffff8103dcf46000, task ffff8103e3049040)
Stack: 0000000300000000 0000000000040006 000000000000000d 000000000c47c91a
000000000000000c 000000000c47c868 0000000000000000 ffff8100f578ffc0
ffff8100f578bfc0 ffffffff8006b91c 000000000c47c97a 0000000000000000
Call Trace:
[<ffffffff8006b91c>] show_trace+0x34/0x47
[<ffffffff8006ba21>] _show_stack+0xdb/0xea
[<ffffffff8006babd>] show_registers+0x8d/0x100
[<ffffffff800651c6>] __die+0xad/0xff
[<ffffffff8006bc17>] die+0x32/0x44
[<ffffffff80065657>] do_general_protection+0xfe/0x107
[<ffffffff8005dde9>] error_exit+0x0/0x84
[<ffffffff80063132>] thread_return+0xfd/0xfe
Code: 48 8b 2b 48 89 ef e8 68 08 03 00 85 c0 74 0a 48 89 ee 4c 89
RIP [<ffffffff8006b8b4>] dump_trace+0x206/0x23a
RSP <000000000c47c6c8>
- The log contains a lot of 'CPU stuck' messages for idle CPUs, and some hangs in tg3 code.
BUG: soft lockup - CPU#2 stuck for 10s! [swapper:0]
Call Trace:
[<ffffffff80048d26>] cpu_idle+0x95/0xb8
[<ffffffff80076c3c>] start_secondary+0x45a/0x469
BUG: soft lockup - CPU#2 stuck for 10s! [swapper:0]
Call Trace:
<IRQ> [<ffffffff881ad074>] :tg3:tg3_wait_for_event_ack+0x27/0x33
[<ffffffff881ba3af>] :tg3:tg3_timer+0x73c/0x7e8
[<ffffffff80094e11>] run_timer_softirq+0x133/0x1af
[<ffffffff80011fbc>] __do_softirq+0x89/0x133
[<ffffffff8005e2fc>] call_softirq+0x1c/0x28
[<ffffffff8006cada>] do_softirq+0x2c/0x85
[<ffffffff8006b120>] poll_idle+0x0/0x19
[<ffffffff8005dc8e>] apic_timer_interrupt+0x66/0x6c
<EOI> [<ffffffff8006b134>] poll_idle+0x14/0x19
[<ffffffff80048d26>] cpu_idle+0x95/0xb8
[<ffffffff80076c3c>] start_secondary+0x45a/0x469
Environment
- Red Hat Enterprise Linux 5
- kernel-2.6.18-128.1.6.el5
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.