Multiple invocations to NMI crash shootdown when crashing cause a list_add double add error and inability to capture a vmcore file due to a missing backport of 26044aff37a5

Solution Verified - Updated -

Issue

  • Multiple invocations to NMI crash shootdown when crashing cause a list_add double add error and inability to capture a vmcore file due to a missing backport of 26044aff37a5
[926344.782067] kernel BUG at mm/slub.c:373!
[926344.784624] invalid opcode: 0000 [#1] SMP NOPTI
[926344.786859] CPU: 18 PID: 260957 Comm: mcast-receiver- Tainted: G           OE    --------- -  - 4.18.0-372.82.1.el8_6.x86_64 #1
[926344.792390] Hardware name: Microsoft Corporation Virtual Machine/Virtual Machine, BIOS Hyper-V UEFI Release v4.1 11/28/2023
[926344.797736] RIP: 0010:__slab_free+0x19e/0x340
[926344.799896] Code: 1f 44 00 00 eb 9c 41 f7 46 08 00 0d 21 00 0f 85 16 ff ff ff 4d 85 ed 0f 85 0d ff ff ff 80 4c 24 5b 80 45 31 ff e9 57 ff ff ff <0f> 0b 49 3b 54 24 28 75 c4 49 89 5c 24 20 49 89 4c 24 28 49 0f ba
[926344.809020] RSP: 0018:ffffa64b9fca3c60 EFLAGS: 00010246
[926344.811587] RAX: ffff954ccc717400 RBX: ffff954ccc717000 RCX: ffff954ccc717000
[926344.815053] RDX: 000000008010000f RSI: fffff5659031c400 RDI: ffff9549c0005500
[926344.818572] RBP: ffffa64b9fca3cf8 R08: 0000000000000001 R09: ffffffff9c1f75d7
[926344.822144] R10: ffff954ccc717000 R11: 0000000000000001 R12: fffff5659031c400
[926344.825659] R13: ffff954ccc717000 R14: ffff9549c0005500 R15: 0000000000000244
[926344.829100] FS:  00007f5a9cde5700(0000) GS:ffff958899c80000(0000) knlGS:0000000000000000
[926344.833268] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[926344.836037] CR2: 00007f33e92c8fd8 CR3: 0000001cea80a006 CR4: 0000000000370ee0
[926344.839420] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[926344.842909] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[926344.846376] Call Trace:
[926344.847637]  ? f_sys_recvfrom_x+0x69/0x110 [sysdigcloud_probe]
[926344.850622]  ? __skb_try_recv_from_queue+0x17a/0x1b0
[926344.853033]  ? __consume_stateless_skb+0x17/0x60
[926344.855230]  kfree+0x1d8/0x200
[926344.856720]  __consume_stateless_skb+0x17/0x60
[926344.858838]  udpv6_recvmsg+0x2b2/0x770
[926344.860641]  inet6_recvmsg+0x5b/0xd0
[926344.862433]  __sys_recvfrom+0xc3/0x180
[926344.864320]  ? syscall_trace_enter+0x1ff/0x2d0
[926344.866419]  __x64_sys_recvfrom+0x24/0x30
[926344.868322]  do_syscall_64+0x5b/0x1b0
[926344.870071]  entry_SYSCALL_64_after_hwframe+0x61/0xc6
[926344.872516] RIP: 0033:0x7f5b355a7dfa
[926344.874230] Code: 48 83 ec 18 4c 89 4c 24 08 e8 42 f9 ff ff 4c 8b 4c 24 08 4d 89 f8 45 89 f2 89 c5 4c 89 ea 4c 89 e6 89 df b8 2d 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 3a 89 ef 48 89 44 24 08 e8 72 f9 ff ff 48 8b
[926344.882919] RSP: 002b:00007f5a9cdd4590 EFLAGS: 00000293 ORIG_RAX: 000000000000002d
[926344.886424] RAX: ffffffffffffffda RBX: 000000000000002b RCX: 00007f5b355a7dfa
[926344.889733] RDX: 0000000000010000 RSI: 00007f5a28010fb0 RDI: 000000000000002b
[926344.893355] RBP: 0000000000000000 R08: 00007f5a9cdd46b0 R09: 00007f5a9cdd46a8
[926344.896693] R10: 0000000000000000 R11: 0000000000000293 R12: 00007f5a28010fb0
[926344.900554] R13: 0000000000010000 R14: 0000000000000000 R15: 00007f5a9cdd46b0
[926344.904399] Modules linked in: tcp_diag udp_diag inet_diag rpcsec_gss_krb5 auth_rpcgss nfsv4 nfs lockd grace fscache md4 cmac nls_utf8 cifs libarc4 dns_resolver unix_diag sysdigcloud_probe(OE) binfmt_misc veth xt_addrtype nf_conntrack_netlink ipt_REJECT nf_reject_ipv4 xt_nat xt_CT xt_LOG nf_log_syslog ip6t_MASQUERADE ipt_MASQUERADE xt_mark xt_conntrack xt_comment nft_compat nft_counter nft_chain_nat nf_tables overlay geneve ip6_udp_tunnel udp_tunnel nfnetlink_cttimeout nfnetlink openvswitch nf_conncount nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 ext4 mbcache jbd2 rpcrdma intel_rapl_msr intel_rapl_common nfit sunrpc libnvdimm kvm_intel kvm irqbypass rapl hyperv_fb hv_balloon hv_utils pcspkr joydev ip_tables xfs libcrc32c rdma_ucm ib_srpt ib_isert iscsi_target_mod target_core_mod dm_multipath ib_iser ib_umad rdma_cm ib_ipoib iw_cm ib_cm mlx5_ib ib_uverbs ib_core mlx5_core psample mlxfw pci_hyperv pci_hyperv_intf sd_mod t10_pi sg hv_storvsc serio_raw hv_netvsc scsi_transport_fc
[926344.904468]  hid_hyperv hyperv_keyboard crct10dif_pclmul crc32_pclmul crc32c_intel hv_vmbus ghash_clmulni_intel dm_mirror dm_region_hash dm_log dm_mod be2iscsi bnx2i cnic uio cxgb4i cxgb4 tls libcxgbi libcxgb qla4xxx iscsi_boot_sysfs iscsi_tcp libiscsi_tcp libiscsi ipmi_devintf scsi_transport_iscsi ipmi_msghandler fuse
[926344.963387] Red Hat flags: eBPF/event
[926344.965836] ---[ end trace 4ebf0e1f13203f47 ]---
[926344.969202] RIP: 0010:__slab_free+0x19e/0x340
[926344.972350] Code: 1f 44 00 00 eb 9c 41 f7 46 08 00 0d 21 00 0f 85 16 ff ff ff 4d 85 ed 0f 85 0d ff ff ff 80 4c 24 5b 80 45 31 ff e9 57 ff ff ff <0f> 0b 49 3b 54 24 28 75 c4 49 89 5c 24 20 49 89 4c 24 28 49 0f ba
[926344.983278] RSP: 0018:ffffa64b9fca3c60 EFLAGS: 00010246
[926344.987226] RAX: ffff954ccc717400 RBX: ffff954ccc717000 RCX: ffff954ccc717000
[926344.991923] RDX: 000000008010000f RSI: fffff5659031c400 RDI: ffff9549c0005500
[926344.996692] RBP: ffffa64b9fca3cf8 R08: 0000000000000001 R09: ffffffff9c1f75d7
[926345.001229] R10: ffff954ccc717000 R11: 0000000000000001 R12: fffff5659031c400
[926345.006283] R13: ffff954ccc717000 R14: ffff9549c0005500 R15: 0000000000000244
[926345.011150] FS:  00007f5a9cde5700(0000) GS:ffff958899c80000(0000) knlGS:0000000000000000
[926345.016181] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[926345.020068] CR2: 00007f33e92c8fd8 CR3: 0000001cea80a006 CR4: 0000000000370ee0
[926345.024669] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[926345.029246] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[926345.033824] Kernel panic - not syncing: Fatal exception
[926345.038845] Kernel Offset: 0x1aa00000 from 0xffffffff81000000 (relocation range: 0xffffffff80000000-0xffffffffbfffffff)
[926345.072368] Rebooting in 10 seconds..
[926355.075834] list_add double add: new=ffffffff9d22eb40, prev=ffffffff9d228ee8, next=ffffffff9d22eb40.
[926355.081048] ------------[ cut here ]------------
[926355.084024] kernel BUG at lib/list_debug.c:31!
[926355.086835] invalid opcode: 0000 [#2] SMP NOPTI
[926355.089899] CPU: 18 PID: 260957 Comm: mcast-receiver- Tainted: G      D    OE    --------- -  - 4.18.0-372.82.1.el8_6.x86_64 #1
[926355.096930] Hardware name: Microsoft Corporation Virtual Machine/Virtual Machine, BIOS Hyper-V UEFI Release v4.1 11/28/2023
[926355.102690] RIP: 0010:__list_add_valid+0x45/0x50
[926355.106015] Code: 00 48 39 c7 74 0f 48 39 d7 74 0a b8 01 00 00 00 e9 40 6d 72 00 48 89 f2 4c 89 c1 48 89 fe 48 c7 c7 60 82 b2 9c e8 5f 9f c8 ff <0f> 0b 66 0f 1f 84 00 00 00 00 00 48 8b 07 48 8b 57 08 48 b9 00 01
[926355.116195] RSP: 0018:ffffa64b9fca3a00 EFLAGS: 00010046
[926355.119500] RAX: 0000000000000058 RBX: ffffffff9d22eb40 RCX: 0000000000000007
[926355.123552] RDX: 0000000000000000 RSI: 00000000ffffc69e RDI: ffff958899c96790
[926355.127809] RBP: 0000000000000008 R08: 0000000000000000 R09: c0000000ffffc69e
[926355.131896] R10: 000000000003f360 R11: ffffa64b9fca3820 R12: ffffffff9d228ee8
[926355.136166] R13: ffffffff9d228ee0 R14: 0000000000000046 R15: ffffffff9d22eb40
[926355.140191] FS:  00007f5a9cde5700(0000) GS:ffff958899c80000(0000) knlGS:0000000000000000
[926355.144729] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[926355.148167] CR2: 00007f33e92c8fd8 CR3: 0000001cea80a006 CR4: 0000000000370ee0
[926355.152201] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[926355.156276] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[926355.160254] Call Trace:
[926355.162182]  __register_nmi_handler+0xde/0x120
[926355.165191]  nmi_shootdown_cpus+0x3f/0xa0
[926355.167803]  native_machine_emergency_restart+0x224/0x280
[926355.171015]  panic+0x242/0x2ac
[926355.173152]  oops_end.cold.10+0x18/0x18
[926355.175670]  do_trap+0xe7/0x110
[926355.177809]  ? __slab_free+0x19e/0x340
[926355.180191]  do_invalid_op+0x36/0x40
[926355.182497]  ? __slab_free+0x19e/0x340
[926355.185281]  invalid_op+0x14/0x20
[926355.187486] RIP: 0010:__slab_free+0x19e/0x340
[926355.190114] Code: 1f 44 00 00 eb 9c 41 f7 46 08 00 0d 21 00 0f 85 16 ff ff ff 4d 85 ed 0f 85 0d ff ff ff 80 4c 24 5b 80 45 31 ff e9 57 ff ff ff <0f> 0b 49 3b 54 24 28 75 c4 49 89 5c 24 20 49 89 4c 24 28 49 0f ba
[926355.199878] RSP: 0018:ffffa64b9fca3c60 EFLAGS: 00010246
[926355.202926] RAX: ffff954ccc717400 RBX: ffff954ccc717000 RCX: ffff954ccc717000
[926355.206971] RDX: 000000008010000f RSI: fffff5659031c400 RDI: ffff9549c0005500
[926355.210842] RBP: ffffa64b9fca3cf8 R08: 0000000000000001 R09: ffffffff9c1f75d7
[926355.215030] R10: ffff954ccc717000 R11: 0000000000000001 R12: fffff5659031c400
[926355.219138] R13: ffff954ccc717000 R14: ffff9549c0005500 R15: 0000000000000244
[926355.223080]  ? __consume_stateless_skb+0x17/0x60
[926355.225918]  ? f_sys_recvfrom_x+0x69/0x110 [sysdigcloud_probe]
[926355.229213]  ? __skb_try_recv_from_queue+0x17a/0x1b0
[926355.232120]  ? __consume_stateless_skb+0x17/0x60
[926355.234933]  kfree+0x1d8/0x200
[926355.237043]  __consume_stateless_skb+0x17/0x60
[926355.239784]  udpv6_recvmsg+0x2b2/0x770
[926355.242153]  inet6_recvmsg+0x5b/0xd0
[926355.244597]  __sys_recvfrom+0xc3/0x180
[926355.247010]  ? syscall_trace_enter+0x1ff/0x2d0
[926355.249786]  __x64_sys_recvfrom+0x24/0x30
[926355.252319]  do_syscall_64+0x5b/0x1b0
[926355.254657]  entry_SYSCALL_64_after_hwframe+0x61/0xc6
[926355.257569] RIP: 0033:0x7f5b355a7dfa
[926355.259773] Code: 48 83 ec 18 4c 89 4c 24 08 e8 42 f9 ff ff 4c 8b 4c 24 08 4d 89 f8 45 89 f2 89 c5 4c 89 ea 4c 89 e6 89 df b8 2d 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 3a 89 ef 48 89 44 24 08 e8 72 f9 ff ff 48 8b
[926355.269280] RSP: 002b:00007f5a9cdd4590 EFLAGS: 00000293 ORIG_RAX: 000000000000002d
[926355.273230] RAX: ffffffffffffffda RBX: 000000000000002b RCX: 00007f5b355a7dfa
[926355.277065] RDX: 0000000000010000 RSI: 00007f5a28010fb0 RDI: 000000000000002b
[926355.280864] RBP: 0000000000000000 R08: 00007f5a9cdd46b0 R09: 00007f5a9cdd46a8
[926355.284616] R10: 0000000000000000 R11: 0000000000000293 R12: 00007f5a28010fb0
[926355.288338] R13: 0000000000010000 R14: 0000000000000000 R15: 00007f5a9cdd46b0
[926355.292109] Modules linked in: tcp_diag udp_diag inet_diag rpcsec_gss_krb5 auth_rpcgss nfsv4 nfs lockd grace fscache md4 cmac nls_utf8 cifs libarc4 dns_resolver unix_diag sysdigcloud_probe(OE) binfmt_misc veth xt_addrtype nf_conntrack_netlink ipt_REJECT nf_reject_ipv4 xt_nat xt_CT xt_LOG nf_log_syslog ip6t_MASQUERADE ipt_MASQUERADE xt_mark xt_conntrack xt_comment nft_compat nft_counter nft_chain_nat nf_tables overlay geneve ip6_udp_tunnel udp_tunnel nfnetlink_cttimeout nfnetlink openvswitch nf_conncount nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 ext4 mbcache jbd2 rpcrdma intel_rapl_msr intel_rapl_common nfit sunrpc libnvdimm kvm_intel kvm irqbypass rapl hyperv_fb hv_balloon hv_utils pcspkr joydev ip_tables xfs libcrc32c rdma_ucm ib_srpt ib_isert iscsi_target_mod target_core_mod dm_multipath ib_iser ib_umad rdma_cm ib_ipoib iw_cm ib_cm mlx5_ib ib_uverbs ib_core mlx5_core psample mlxfw pci_hyperv pci_hyperv_intf sd_mod t10_pi sg hv_storvsc serio_raw hv_netvsc scsi_transport_fc
[926355.292178]  hid_hyperv hyperv_keyboard crct10dif_pclmul crc32_pclmul crc32c_intel hv_vmbus ghash_clmulni_intel dm_mirror dm_region_hash dm_log dm_mod be2iscsi bnx2i cnic uio cxgb4i cxgb4 tls libcxgbi libcxgb qla4xxx iscsi_boot_sysfs iscsi_tcp libiscsi_tcp libiscsi ipmi_devintf scsi_transport_iscsi ipmi_msghandler fuse
[926355.347337] Red Hat flags: eBPF/event
[926355.349582] ---[ end trace 4ebf0e1f13203f48 ]---
[926355.352202] RIP: 0010:__slab_free+0x19e/0x340
[926355.354728] Code: 1f 44 00 00 eb 9c 41 f7 46 08 00 0d 21 00 0f 85 16 ff ff ff 4d 85 ed 0f 85 0d ff ff ff 80 4c 24 5b 80 45 31 ff e9 57 ff ff ff <0f> 0b 49 3b 54 24 28 75 c4 49 89 5c 24 20 49 89 4c 24 28 49 0f ba
[926355.364064] RSP: 0018:ffffa64b9fca3c60 EFLAGS: 00010246
[926355.367166] RAX: ffff954ccc717400 RBX: ffff954ccc717000 RCX: ffff954ccc717000
[926355.370905] RDX: 000000008010000f RSI: fffff5659031c400 RDI: ffff9549c0005500
[926355.374570] RBP: ffffa64b9fca3cf8 R08: 0000000000000001 R09: ffffffff9c1f75d7
[926355.378294] R10: ffff954ccc717000 R11: 0000000000000001 R12: fffff5659031c400
[926355.381939] R13: ffff954ccc717000 R14: ffff9549c0005500 R15: 0000000000000244
[926355.385652] FS:  00007f5a9cde5700(0000) GS:ffff958899c80000(0000) knlGS:0000000000000000
[926355.389729] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[926355.392848] CR2: 00007f33e92c8fd8 CR3: 0000001cea80a006 CR4: 0000000000370ee0
[926355.396605] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[926355.400268] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[926355.403916] Kernel panic - not syncing: Fatal exception
[926355.406951] Kernel Offset: 0x1aa00000 from 0xffffffff81000000 (relocation range: 0xffffffff80000000-0xffffffffbfffffff)
[926355.412649] hyperv_fb: Unable to send packet via vmbus; error -11
[926355.412650] hyperv_fb: Unable to send packet via vmbus; error -11
[926355.412651] hyperv_fb: Unable to send packet via vmbus; error -11
[926355.412651] hyperv_fb: Unable to send packet via vmbus; error -11
[926355.412652] hyperv_fb: Unable to send packet via vmbus; error -11
[926355.412652] hyperv_fb: Unable to send packet via vmbus; error -11
[926355.412652] hyperv_fb: Unable to send packet via vmbus; error -11
[926355.412653] hyperv_fb: Unable to send packet via vmbus; error -11
[926355.412653] hyperv_fb: Unable to send packet via vmbus; error -11
[926355.412654] hyperv_fb: Unable to send packet via vmbus; error -11
[926355.452162] Rebooting in 10 seconds..

Environment

  • Red Hat Enterprise Linux 8.6.z
  • Red Hat Enterprise Linux 8.7.z
  • Red Hat Enterprise Linux 8.8.z
  • Red Hat Enterprise Linux 8.9.z
  • Red Hat Enterprise Linux 8.10.z
  • Red Hat Enterprise Linux CoreOS (RHCOS) shipped with RHOCP 4.12

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In

New to Red Hat?

Learn more about Red Hat subscriptions

Using a Red Hat product through a public cloud?

How to access this content