内核由于 copy_from_kernel_nofault()中的一个捕获的异常触发的一个错误 MCE 而崩溃了,其中 pagefault_disable()有效
Issue
- 内核由于 copy_from_kernel_nofault()中的一个捕获的异常触发的一个错误 MCE 而崩溃了。
[4847907.480421] mce: [Hardware Error]: CPU 6: Machine Check Exception: 5 Bank 1: bf80000000200401
[4847907.481951] mce: [Hardware Error]: RIP !INEXACT! 33:<000000000041b44e>
[4847907.483385] mce: [Hardware Error]: TSC 1122db98c46121c ADDR fee00340 MISC 86 PPIN 87907f512e31b079
[4847907.484815] mce: [Hardware Error]: PROCESSOR 0:50654 TIME 1739820952 SOCKET 0 APIC 22 microcode 2007006
[4847907.486234] mce: [Hardware Error]: Run the above through 'mcelog --ascii'
[4847907.513347] mce: [Hardware Error]: Machine check: Processor context corrupt
[4847907.514747] Kernel panic - not syncing: Fatal machine check
PID: 903233 TASK: ffff9daad8a6a380 CPU: 6 COMMAND: "kubelet"
#0 [ffffb27ea5c8bc60] machine_kexec at ffffffff9c86c767
#1 [ffffb27ea5c8bcb8] __crash_kexec at ffffffff9c9c58ca
#2 [ffffb27ea5c8bd78] panic at ffffffff9d2dfe3e
#3 [ffffb27ea5c8bdf8] mce_panic.cold at ffffffff9d2dab6a
#4 [ffffb27ea5c8be38] do_machine_check at ffffffff9d32f800
#5 [ffffb27ea5c8bf38] noist_exc_machine_check at ffffffff9d32f97a
#6 [ffffb27ea5c8bf50] asm_exc_machine_check at ffffffff9d400bff
RIP: 000000000041b44e RSP: 00007ff6fb7fdc58 RFLAGS: 00000206
RAX: 000000c01ea0f5c0 RBX: 000000c01c13c030 RCX: 0000000000000000
RDX: 00007ff81409f1e8 RSI: 000000c01ea0f501 RDI: 0000000000000006
RBP: 00007ff6fb7fdc80 R8: 0000000000000000 R9: 00000000028d3590
R10: 000000c01ea0f5c0 R11: 0000000000000002 R12: 00007ff6fb7fdd10
R13: 000000000000000f R14: 000000c0228369c0 R15: 0000000006ca01a0
ORIG_RAX: ffffffffffffffff CS: 0033 SS: 002b
PID: 9986 TASK: ffff9daa80350000 CPU: 42 COMMAND: "kubelet"
#0 [fffffe00008bde58] crash_nmi_callback at ffffffff9c85ea5e
#1 [fffffe00008bde60] nmi_handle at ffffffff9c8298db
#2 [fffffe00008bdea8] default_do_nmi at ffffffff9d32ea80
#3 [fffffe00008bdec8] exc_nmi at ffffffff9d32ec8d
#4 [fffffe00008bdef0] end_repeat_nmi at ffffffff9d4015eb
[exception RIP: __const_udelay+13]
RIP: ffffffff9cdc523d RSP: fffffe00008c1e08 RFLAGS: 00000202
RAX: 01122db992a95b82 RBX: 00000000004c26c7 RCX: 000000000000002a
RDX: 00000000002dc6c0 RSI: 000000000000002a RDI: 00000000000010c7
RBP: fffffe00008c1f58 R8: 000000000000002a R9: 0000000000000bb9
R10: 000000000000003e R11: 0000000000000004 R12: 0000000000000001
R13: 0000000000000002 R14: fffffe00008c1e58 R15: 0000000000000002
ORIG_RAX: ffffffffffffffff CS: 0010 SS: 0018
#5 [fffffe00008c1e08] __const_udelay at ffffffff9cdc523d
#6 [fffffe00008c1e08] wait_for_panic at ffffffff9c8432ed
#7 [fffffe00008c1e18] mce_timed_out at ffffffff9c843cc8
#8 [fffffe00008c1e30] do_machine_check at ffffffff9d32f3f4
#9 [fffffe00008c1f30] exc_machine_check at ffffffff9d32f905
#10 [fffffe00008c1f50] asm_exc_machine_check at ffffffff9d400bea
[exception RIP: copy_from_kernel_nofault+62]
RIP: ffffffff9cae2c8e RSP: ffffb27d39d6fd70 RFLAGS: 00000202
RAX: ffffffffffffffff RBX: ffffffffff5fc34b RCX: 0000000000000010
RDX: 0000000000000008 RSI: 0000000000000008 RDI: ffffffffff5fc34b
RBP: ffffb27d39d6fdf0 R8: 0000000000000001 R9: 0000000000000000
R10: 0000000000000001 R11: ffff9daa80350010 R12: 0000000000000008
R13: 00000000f2c0f300 R14: 0000000000000000 R15: ffffb27d39d6fe78
ORIG_RAX: ffffffffffffffff CS: 0010 SS: 0018
#11 [ffffb27d39d6fd70] copy_from_kernel_nofault at ffffffff9cae2c8e
#12 [ffffb27d39d6fd88] bpf_probe_read_kernel at ffffffff9ca4d258
#13 [ffffb27d39d6fe00] copy_from_kernel_nofault at ffffffff9cae2c6d
#14 [ffffb27d39d6fe20] bpf_probe_read_kernel at ffffffff9ca4d258
#15 [ffffb27d39d6fe88] bpf_trace_run2 at ffffffff9ca4e1d6
#16 [ffffb27d39d6feb8] syscall_exit_work at ffffffff9c999740
#17 [ffffb27d39d6fed0] syscall_exit_to_user_mode at ffffffff9d330d09
#18 [ffffb27d39d6fee0] do_syscall_64 at ffffffff9d32d169
#19 [ffffb27d39d6ff50] entry_SYSCALL_64_after_hwframe at ffffffff9d4000dc
RIP: 00000000004279ae RSP: 00007ff8557f9c90 RFLAGS: 00000202
RAX: 000000c01e581e60 RBX: 0000000000000090 RCX: 0000000000028000
RDX: 000000c01e581e60 RSI: 0000000000000090 RDI: 0000000000000012
RBP: 00007ff8557f9d10 R8: 0000000000000068 R9: 0000000000000000
R10: 0000000000000015 R11: 0000000000000028 R12: 0000000000000020
R13: 0000000000000009 R14: 000000c0019821a0 R15: 0000000000000001
ORIG_RAX: ffffffffffffffff CS: 0033 SS: 002b
Environment
- Red Hat Enterprise Linux 9
- Red Hat OpenShift Container Platform 4.14
- Red Hat Advanced Cluster Security 4.5.2 (Collector 版本:3.19.2)
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.