The kernel crashes in read_tsc() due to a possible single bit-flip error

Solution Verified - Updated -

Issue

  • The system panicked inside the read_tsc() function. The following variants of the issue have been seen so far:
  • Example 1: The kernel crashes in read_tsc() due to a possible single bit-flip error.

    [906165.686714] BUG: unable to handle kernel paging request at ffffbfffbd035470
    [906165.693799] IP: [<ffffffffbd035470>] read_tsc+0x0/0x20
    [906165.699041] PGD 0 
    [906165.701167] Oops: 0010 [#1] SMP 
    [906165.704523] Modules linked in: mpt2sas raid_class scsi_transport_sas mptctl mptbase bonding 8021q garp mrp stp llc dell_rbu vfat fat xfs libcrc32c iTCO_wdt iTCO_vendor_support dell_smbios dell_wmi_descriptor dcdbas i10nm_edac nfit libnvdimm coretemp intel_rapl iosf_mbi kvm irqbypass crc32_pclmul ghash_clmulni_intel aesni_intel lrw gf128mul glue_helper ablk_helper cryptd pcspkr i2c_i801 sg mei_me mei wmi ipmi_si ipmi_devintf ipmi_msghandler pinctrl_lewisburg pinctrl_intel tpm_crb acpi_power_meter onload(OE) sfc_char(OE) sfc_resource(OE) sfc_affinity(OE) binfmt_misc auth_rpcgss sunrpc ip_tables ext4 mbcache jbd2 megaraid_sas crc32c_intel sfc(OE) mdio mtd i2c_algo_bit drm_kms_helper tg3 syscopyarea sysfillrect sysimgblt fb_sys_fops ttm drm ptp pps_core drm_panel_orientation_quirks sd_mod crc_t10dif
    [906165.776519]  crct10dif_generic crct10dif_pclmul crct10dif_common ahci libahci libata dm_mirror dm_region_hash dm_log dm_mod
    [906165.786566] CPU: 9 PID: 1779 Comm: user9 Kdump: loaded Tainted: G           OE  ------------   3.10.0-1160.62.1.el7.x86_64 #1
    [906165.797920] Hardware name: Dell Inc. PowerEdge R750/0WT8Y6, BIOS 1.5.4 12/17/2021
    [906165.805466] task: ffff96ffec4a9080 ti: ffff96ff87bc0000 task.ti: ffff96ff87bc0000
    [906165.813015] RIP: 0010:[<ffffffffbd035470>]  [<ffffffffbd035470>] read_tsc+0x0/0x20
    [906165.820675] RSP: 0000:ffff97013f303f10  EFLAGS: 00010007
    [906165.826062] RAX: ffffffffbd035470 RBX: ffffffffbdc2f200 RCX: 00033a1855f6dd80
    [906165.833264] RDX: 0000000000000000 RSI: 00033a1855f6dd80 RDI: ffffffffbdc2f200
    [906165.840468] RBP: ffff97013f303f38 R08: 00033a1855f6dd80 R09: ffff97013f303de0
    [906165.847669] R10: 000000000000b3ff R11: 0000000000000005 R12: 00000000000ddc0c
    [906165.854871] R13: 0000000000000000 R14: 000000006c44c73c R15: ffff97013f315ad8
    [906165.862074] FS:  00007f25bf3f4700(0000) GS:ffff97013f300000(0000) knlGS:0000000000000000
    [906165.870229] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    [906165.876047] CR2: ffffbfffbd035470 CR3: 0000003e4231c000 CR4: 0000000000760fe0
    [906165.883249] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
    [906165.890453] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
    [906165.897655] PKRU: 55555554
    [906165.900447] Call Trace:
    [906165.902982]  <IRQ> 
    [906165.904999]  [<ffffffffbd107d82>] ? ktime_get+0x52/0xe0
    [906165.910497]  [<ffffffffbd10e499>] clockevents_program_event+0x39/0x100
    [906165.917093]  [<ffffffffbd110173>] tick_program_event+0x23/0x30
    [906165.923000]  [<ffffffffbd0ca952>] hrtimer_interrupt+0xf2/0x1d0
    [906165.928921]  [<ffffffffbd05ce2b>] local_apic_timer_interrupt+0x3b/0x60
    [906165.935520]  [<ffffffffbd79ea23>] smp_apic_timer_interrupt+0x43/0x60
    [906165.941941]  [<ffffffffbd79afba>] apic_timer_interrupt+0x16a/0x170
    [906165.948191]  <EOI> 
    [906165.950206] Code: 
    [906165.952333] d1 00 48 89 e5 5d c3 0f 1f 40 00 55 8b 05 c9 c0 d1 00 48 89 e5 5d c3 0f 1f 40 00 55 b8 ed ff ff ff 48 89 e5 5d c3 0f 1f 44 00 00 <55> 48 89 e5 0f ae e8 0f 31 89 c0 48 c1 e2 20 48 09 c2 48 89 d0 
    [906165.970881] RIP  [<ffffffffbd035470>] read_tsc+0x0/0x20
    [906165.976208]  RSP <ffff97013f303f10>
    [906165.979777] CR2: ffffbfffbd035470
    
  • Example 2: The system panicked inside read_tsc() because of an access to an invalid memory address.

    [767391.203303] BUG: unable to handle kernel paging request at 000000003449869a
    [767391.287656] PGD 0 
    [767391.312726] Oops: 0002 [#1] SMP NOPTI
    [767391.357555] CPU: 1 PID: 0 Comm: swapper/1 Kdump: loaded Not tainted  4.18.0-305.49.1.el8_4.x86_64 #1
    [767391.466867] Hardware name: Cisco Systems Inc UCSC-C240-M6SX/UCSC-C240-M6SX, BIOS C240M6.4.3.2d.0.0825231000 08/25/2023
    [767391.595942] RIP: 0010:read_tsc+0x1/0x10
    [767391.642854] Code: 90 90 90 90 90 90 90 90 90 8b 05 86 18 7f 01 c3 66 0f 1f 84 00 00 00 00 00 c3 66 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 40 00 0f <ae> e8 0f 31 48 c1 e2 20 48 09 d0 c3 0f 1f 00 8b 05 4a fc 7e 01 83
    [767391.868646] RSP: 0018:ff6d8adb8007cf98 EFLAGS: 00010046
    [767391.932195] RAX: ffffffffb7a2a3f0 RBX: 0002b9f133966809 RCX: 000000003449869a
    [767392.018624] RDX: ff1366b187c7ebe0 RSI: ffffffffb7b70ae6 RDI: ffffffffb9027d20
    [767392.105057] RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000
    [767392.191488] R10: 0000000000000000 R11: 0000000000000000 R12: 000000005b78d044
    [767392.277920] R13: ffffffffb99e5100 R14: 0000000000000000 R15: 0000000000000000
    [767392.364350] FS:  0000000000000000(0000) GS:ff1366efff640000(0000) knlGS:0000000000000000
    [767392.462219] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    [767392.532010] CR2: 000000003449869a CR3: 000000751c210002 CR4: 0000000000773ee0
    [767392.618439] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
    [767392.704871] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
    [767392.791301] PKRU: 55555554
    [767392.824694] Call Trace:
    [767392.854966]  <IRQ>
    [767392.880041]  ? tick_irq_enter+0x29/0xa0
    [767392.926955]  ? irq_enter+0x51/0x60
    [767392.968665]  ? smp_apic_timer_interrupt+0x1c/0x130
    [767393.027013]  ? apic_timer_interrupt+0xf/0x20
    [767393.079122]  </IRQ>
    [767393.105237]  ? cpuidle_enter_state+0xd9/0x3c0
    [767393.158384]  ? cpuidle_enter_state+0xb4/0x3c0
    [767393.211536]  ? cpuidle_enter+0x2c/0x40
    [767393.257407]  ? do_idle+0x234/0x260
    [767393.299126]  ? cpu_startup_entry+0x6f/0x80
    [767393.349167]  ? start_secondary+0x199/0x1e0
    [767393.399198]  ? secondary_startup_64_no_verify+0xc2/0xcb
    [767393.462747] Modules linked in: overlay 8021q garp mrp sch_ingress bonding tun nls_utf8 isofs cdrom nf_log_ipv6 nf_log_ipv4 nf_log_common nft_limit nft_counter xt_LOG xt_limit xt_multiport xt_comment xt_state xt_conntrack nft_compat nf_tables nfnetlink vfat fat intel_rapl_msr intel_rapl_common i10nm_edac nfit libnvdimm x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm rapl ipmi_ssif intel_uncore pcspkr isst_if_mbox_pci ses enclosure vfio_pci scsi_transport_sas vfio_virqfd irqbypass isst_if_mmio isst_if_common joydev mei_me mei ioatdma wmi acpi_ipmi ipmi_si ipmi_devintf ipmi_msghandler acpi_cpufreq acpi_power_meter acpi_pad vfio_iommu_type1 vfio openvswitch nf_conncount nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 br_netfilter bridge auth_rpcgss stp llc xfs libcrc32c dm_multipath i40iw ib_uverbs ib_core sd_mod t10_pi sg crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel megaraid_sas ahci libahci ixgbe mgag200 igb libata drm_kms_helper syscopyarea sysfillrect
    [767393.462785]  sysimgblt fb_sys_fops drm i40e i2c_algo_bit dca mdio sunrpc dm_mirror dm_region_hash dm_log dm_mod be2iscsi bnx2i cnic uio cxgb4i cxgb4 libcxgbi libcxgb qla4xxx iscsi_boot_sysfs iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi fuse 
    [767394.771345] CR2: 000000003449869a
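
The "single bit-flip" reading of Example 1 can be checked directly from the trace: the faulting address in CR2 and the instruction pointer in RIP should differ in exactly one bit. The following Python sketch is illustrative only and is not part of any Red Hat tooling. The helper names flipped_bits and decode_page_fault_error are hypothetical, the addresses and Oops codes are copied from the traces above, and the error-code bit layout is assumed to be the standard x86 page-fault error code.

```python
def flipped_bits(expected, observed):
    """Return the positions (0 = least significant) of the bits that differ."""
    diff = expected ^ observed
    return [pos for pos in range(diff.bit_length()) if (diff >> pos) & 1]


def decode_page_fault_error(code):
    """Decode the low five bits of an x86 page-fault error code.

    Assumption: the value after "Oops:" is this error code printed in hex.
    """
    return {
        "protection_violation": bool(code & 0x01),  # clear: page not present
        "write": bool(code & 0x02),                 # clear: read access
        "user_mode": bool(code & 0x04),             # clear: kernel mode
        "reserved_bit": bool(code & 0x08),
        "instruction_fetch": bool(code & 0x10),
    }


# Values copied from the Example 1 trace.
rip = 0xFFFFFFFFBD035470  # RIP: read_tsc+0x0
cr2 = 0xFFFFBFFFBD035470  # CR2: address that could not be paged in

print("differing bits:", flipped_bits(rip, cr2))  # prints: differing bits: [46]

# "Oops: 0010" (Example 1) and "Oops: 0002" (Example 2).
print("Example 1:", decode_page_fault_error(0x0010))
print("Example 2:", decode_page_fault_error(0x0002))
```

Under these assumptions, the two addresses in Example 1 differ only in bit 46, and the error code 0010 marks a kernel-mode instruction fetch from a page that is not present, which is consistent with a corrupted fetch address rather than a software bug in read_tsc(). In Example 2, the error code 0002 marks a kernel-mode write to a page that is not present, and CR2 matches the value held in RCX.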
    

Environment

  • Red Hat Enterprise Linux 7.9
  • Red Hat Enterprise Linux 8.4
  • Dell PowerEdge R750
  • Cisco UCS C-Series
