Oops on dump_trace() with several 'CPU stuck' messages for idle CPUs
Issue
- Oops on dump_trace() with several 'CPU stuck' messages for idle CPUs
general protection fault: 0000 [1] SMP
last sysfs file: /devices/pci0000:00/0000:00:03.0/class
CPU 3
Modules linked in: autofs4 nfs lockd fscache nfs_acl sunrpc dm_multipath scsi_dh video backlight sbs i2c_ec button battery asus_acpi acpi_memhotplug ac parport_pc lp parport e1000(U) floppy i2c_amd756 i2c_core ide_cd serio_raw amd_rng k8_edac edac_mc k8temp cdrom tg3 hwmon libphy pcspkr dm_raid45 dm_message dm_region_hash dm_mem_cache dm_snapshot dm_zero dm_mirror dm_log dm_mod usb_storage shpchp cciss sd_mod scsi_mod ext3 jbd uhci_hcd ohci_hcd ehci_hcd
Pid: 6022, comm: cinfo Tainted: G 2.6.18-128.1.6.el5 #1
RIP: 0010:[<ffffffff80063132>] [<ffffffff80063132>] thread_return+0xfd/0xfe
RSP: 0018:000000000c47c91a EFLAGS: 00054646
RAX: 0000000000000010 RBX: 2f36363238786c2f RCX: 00000000c0000100
RDX: ffff810303841400 RSI: ffff8103e3049040 RDI: ffff8103e3049040
RBP: 6465732f6e69622f R08: ffff8103dcf46000 R09: 000000000000003c
R10: ffff8103d132f740 R11: ffff8100abca4718 R12: 4f4d542a006e6962
R13: 5f41000000005455 R14: 4f4d542a223d7a5f R15: 3d5f000000005455
FS: 00002b4168aaedc0(0000) GS:ffff8103fff3a9c0(0000) knlGS:00000000f7d95b90
CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 0000000000722220 CR3: 000000039d192000 CR4: 00000000000006e0
Process cinfo (pid: 6022, threadinfo ffff8103dcf46000, task ffff8103e3049040)
Stack: c8d1000000000c00 c8f1000000000c47 0000000000000c47 c980000000000000
048e000000000c47 0000000000000000 c880000000000000 022b000000000c47
0002000000000000 c84048454c440000 c940000000000c47 0000000000000c47
Call Trace:
Unable to handle kernel paging request at 000000000c4d0000 RIP:
[<ffffffff8006b8b4>] dump_trace+0x206/0x23a
PGD 3f2dae067 PUD 390d77067 PMD 398754067 PTE 0
Oops: 0000 [2] SMP
last sysfs file: /devices/pci0000:00/0000:00:03.0/class
CPU 3
Modules linked in: autofs4 nfs lockd fscache nfs_acl sunrpc dm_multipath scsi_dh video backlight sbs i2c_ec button battery asus_acpi acpi_memhotplug ac parport_pc lp parport e1000(U) floppy i2c_amd756 i2c_core ide_cd serio_raw amd_rng k8_edac edac_mc k8temp cdrom tg3 hwmon libphy pcspkr dm_raid45 dm_message dm_region_hash dm_mem_cache dm_snapshot dm_zero dm_mirror dm_log dm_mod usb_storage shpchp cciss sd_mod scsi_mod ext3 jbd uhci_hcd ohci_hcd ehci_hcd
Pid: 6022, comm: cinfo Tainted: G 2.6.18-128.1.6.el5 #1
RIP: 0010:[<ffffffff8006b8b4>] [<ffffffff8006b8b4>] dump_trace+0x206/0x23a
RSP: 0018:000000000c47c6c8 EFLAGS: 00050006
RAX: 0000000000000000 RBX: 000000000c4cfffa RCX: 0000000000004f04
RDX: 0000000000000000 RSI: 0000000000040012 RDI: ffffffff802fb850
RBP: 0003000000000c45 R08: ffffffff8006b8b4 R09: 000000000000003c
R10: ffffffff803d9520 R11: 0000000000000000 R12: 000000000c47c868
R13: 0000000000000000 R14: ffffffff802f0680 R15: ffff8100f578ffc0
FS: 00002b4168aaedc0(0000) GS:ffff8103fff3a9c0(0000) knlGS:00000000f7d95b90
CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
CR2: 000000000c4d0000 CR3: 000000039d192000 CR4: 00000000000006e0
Process cinfo (pid: 6022, threadinfo ffff8103dcf46000, task ffff8103e3049040)
Stack: 0000000300000000 0000000000040006 000000000000000d 000000000c47c91a
000000000000000c 000000000c47c868 0000000000000000 ffff8100f578ffc0
ffff8100f578bfc0 ffffffff8006b91c 000000000c47c97a 0000000000000000
Call Trace:
[<ffffffff8006b91c>] show_trace+0x34/0x47
[<ffffffff8006ba21>] _show_stack+0xdb/0xea
[<ffffffff8006babd>] show_registers+0x8d/0x100
[<ffffffff800651c6>] __die+0xad/0xff
[<ffffffff8006bc17>] die+0x32/0x44
[<ffffffff80065657>] do_general_protection+0xfe/0x107
[<ffffffff8005dde9>] error_exit+0x0/0x84
[<ffffffff80063132>] thread_return+0xfd/0xfe
Code: 48 8b 2b 48 89 ef e8 68 08 03 00 85 c0 74 0a 48 89 ee 4c 89
RIP [<ffffffff8006b8b4>] dump_trace+0x206/0x23a
RSP <000000000c47c6c8>
- The log contains a lot of 'CPU stuck' messages for idle CPUs, and some hangs in tg3 code.
BUG: soft lockup - CPU#2 stuck for 10s! [swapper:0]
Call Trace:
[<ffffffff80048d26>] cpu_idle+0x95/0xb8
[<ffffffff80076c3c>] start_secondary+0x45a/0x469
BUG: soft lockup - CPU#2 stuck for 10s! [swapper:0]
Call Trace:
<IRQ> [<ffffffff881ad074>] :tg3:tg3_wait_for_event_ack+0x27/0x33
[<ffffffff881ba3af>] :tg3:tg3_timer+0x73c/0x7e8
[<ffffffff80094e11>] run_timer_softirq+0x133/0x1af
[<ffffffff80011fbc>] __do_softirq+0x89/0x133
[<ffffffff8005e2fc>] call_softirq+0x1c/0x28
[<ffffffff8006cada>] do_softirq+0x2c/0x85
[<ffffffff8006b120>] poll_idle+0x0/0x19
[<ffffffff8005dc8e>] apic_timer_interrupt+0x66/0x6c
<EOI> [<ffffffff8006b134>] poll_idle+0x14/0x19
[<ffffffff80048d26>] cpu_idle+0x95/0xb8
[<ffffffff80076c3c>] start_secondary+0x45a/0x469
Environment
- Red Hat Enterprise Linux 5
- kernel-2.6.18-128.1.6.el5
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase of over 48,000 articles and solutions.
Welcome! Check out the Getting Started with Red Hat page for quick tours and guides for common tasks.
