RHEL 7 for Power (ppc64le) SR-IOV interface goes down with error: kernel BUG at drivers/net/ethernet/emulex/benet/be_main.c:2802!

Solution Unverified - Updated -

Issue

  • An Emulex be2net-based SR-IOV interface on a PPC64 host stops responding causes the following backtrace:

    Mar 16 16:21:24 localhost kernel: ------------[ cut here ]------------
    Mar 16 16:21:24 localhost kernel: kernel BUG at drivers/net/ethernet/emulex/benet/be_main.c:2802!
    Mar 16 16:21:24 localhost kernel: Oops: Exception in kernel mode, sig: 5 [#1]
    Mar 16 16:21:24 localhost kernel: SMP NR_CPUS=2048 NUMA pSeries
    Mar 16 16:21:24 localhost kernel: Modules linked in: tracedev(OE) sctp_diag sctp dccp_diag dccp tcp_diag udp_diag inet_diag unix_diag af_packet_diag netlink_diag ip6t_rpfilter ipt_REJECT nf_reject_ipv4 ip6t_REJECT nf_reject_ipv6 xt_conntrack ip_set nfnetlink ebtable_nat ebtable_broute bridge stp llc ip6table_nat nf_conntrack_ipv6 nf_defrag_ipv6 nf_nat_ipv6 ip6table_mangle ip6table_security ip6table_raw iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat nf_conntrack iptable_mangle iptable_security iptable_raw ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter pseries_rng sg be2net nfsd auth_rpcgss nfs_acl lockd grace sunrpc binfmt_misc ip_tables xfs libcrc32c dm_service_time sd_mod crc_t10dif crct10dif_generic crct10dif_common ibmvscsi scsi_transport_srp ibmveth scsi_tgt dm_multipath dm_mirror
    Mar 16 16:21:24 localhost kernel: dm_region_hash dm_log dm_mod [last unloaded: mmfslinux]
    Mar 16 16:21:24 localhost kernel: CPU: 32 PID: 14820 Comm: ip Tainted: G        W  OE  ------------   3.10.0-693.17.1.el7.ppc64le #1
    Mar 16 16:21:24 localhost kernel: task: c00000151ca19620 ti: c0000014d4250000 task.ti: c0000014d4250000
    Mar 16 16:21:24 localhost kernel: NIP: d000000013a38d60 LR: d000000013a38d44 CTR: c000000000826a70
    Mar 16 16:21:24 localhost kernel: REGS: c0000014d4252e70 TRAP: 0700   Tainted: G        W  OE  ------------    (3.10.0-693.17.1.el7.ppc64le)
    Mar 16 16:21:24 localhost kernel: MSR: 8000000100029033 <SF,EE,ME,IR,DR,RI,LE>  CR: 28222824  XER: 00000000
    Mar 16 16:21:24 localhost kernel: CFAR: d000000013a3643c SOFTE: 1 #012GPR00: d000000013a38d44 c0000014d42530f0 d000000013a5ee28 0000000000000004 #012GPR04: 0000000000000000 0000000000000021 00000000000005c2 00000000000005c2 #012GPR08: 0000000000000000 ffffffffffffe000 0000000000000001 d000000013a4fb28 #012GPR12: c000000000826a70 c000000007b32000 00003fffeca708b8 0000000000000000 #012GPR16: 0000000000000000 c0000014fa7ae410 c000001dd0e05c78 c000001dfe0a8098 #012GPR20: 0000000000000000 c000001dd0f50000 0000000000000000 c000001dd0e05c00 #012GPR24: c000001dd0e05c08 c000001dd0e00000 00000000000005be c000001dd0e05c08 #012GPR28: c000001dd0e00900 c000001dd0e05c18 0000000000000000 c000001dd0e05c00 
    Mar 16 16:21:24 localhost kernel: NIP [d000000013a38d60] be_close+0x8a0/0x990 [be2net]
    Mar 16 16:21:24 localhost kernel: LR [d000000013a38d44] be_close+0x884/0x990 [be2net]
    Mar 16 16:21:24 localhost kernel: Call Trace:
    Mar 16 16:21:24 localhost kernel: [c0000014d42530f0] [d000000013a38d44] be_close+0x884/0x990 [be2net] (unreliable)
    Mar 16 16:21:24 localhost kernel: [c0000014d42531d0] [c00000000082bdc4] __dev_close+0x1a4/0x260
    Mar 16 16:21:24 localhost kernel: [c0000014d4253290] [c00000000083238c] __dev_change_flags+0x3fc/0x520
    Mar 16 16:21:24 localhost kernel: [c0000014d4253330] [c0000000008324e4] dev_change_flags+0x34/0x150
    Mar 16 16:21:24 localhost kernel: [c0000014d4253380] [c00000000084a600] do_setlink+0x400/0xfe0
    Mar 16 16:21:24 localhost kernel: [c0000014d42534e0] [c00000000084e43c] rtnl_newlink+0x8ec/0xa80
    Mar 16 16:21:24 localhost kernel: [c0000014d42537d0] [c000000000848cf4] rtnetlink_rcv_msg+0xd4/0x300
    Mar 16 16:21:24 localhost kernel: [c0000014d4253850] [c00000000087c8a8] netlink_rcv_skb+0x128/0x140
    Mar 16 16:21:24 localhost kernel: [c0000014d4253890] [c000000000848bfc] rtnetlink_rcv+0x3c/0x60
    Mar 16 16:21:24 localhost kernel: [c0000014d42538c0] [c00000000087c468] netlink_unicast+0x628/0x6d0
    Mar 16 16:21:24 localhost kernel: [c0000014d4253960] [c00000000087da00] netlink_sendmsg+0x3b0/0x830
    Mar 16 16:21:24 localhost kernel: [c0000014d4253a40] [c0000000007f6d7c] sock_sendmsg+0xec/0x140
    Mar 16 16:21:24 localhost kernel: [c0000014d4253bb0] [c0000000007f849c] ___sys_sendmsg+0x36c/0x380
    Mar 16 16:21:24 localhost kernel: [c0000014d4253d50] [c0000000007fd844] SyS_sendmsg+0xc4/0x130
    Mar 16 16:21:24 localhost kernel: [c0000014d4253dd0] [c0000000007fe5bc] SyS_socketcall+0x3ac/0x430
    Mar 16 16:21:24 localhost kernel: [c0000014d4253e30] [c00000000000a184] system_call+0x38/0xe4
    Mar 16 16:21:24 localhost kernel: Instruction dump: 
    Mar 16 16:21:24 localhost kernel: 7d4ad214 7fe4fb78 7d454838 78a50420 4bffd529 7d20e828 7d234850 7d20e92d 
    Mar 16 16:21:24 localhost kernel: 40c2fff4 60420000 813f0018 7d2907b4 <0b090000> b2df40d0 935b000c 935b0008 
    Mar 16 16:21:24 localhost kernel: ---[ end trace cc5a527518abf256 ]---
    
  • The error may be preceded by iommu_free: invalid entry warnings:

    Mar 16 16:21:24 localhost kernel: iommu_free_check: 7155 callbacks suppressed
    Mar 16 16:21:24 localhost kernel: iommu_free: invalid entry
    Mar 16 16:21:24 localhost kernel: #011entry     = 0x0
    Mar 16 16:21:24 localhost kernel: #011dma_addr  = 0x0
    Mar 16 16:21:24 localhost kernel: #011Table     = 0xc000001dd17e4800
    Mar 16 16:21:24 localhost kernel: #011bus#      = 0x1
    Mar 16 16:21:24 localhost kernel: #011size      = 0x80000
    Mar 16 16:21:24 localhost kernel: #011startOff  = 0x800000000000
    Mar 16 16:21:24 localhost kernel: #011index     = 0x70040018
    Mar 16 16:21:24 localhost kernel: ------------[ cut here ]------------
    Mar 16 16:21:24 localhost kernel: WARNING: CPU: 32 PID: 14820 at arch/powerpc/kernel/iommu.c:368 iommu_unmap_page+0x24c/0x260
    Mar 16 16:21:24 localhost kernel: Modules linked in: tracedev(OE) sctp_diag sctp dccp_diag dccp tcp_diag udp_diag inet_diag unix_diag af_packet_diag netlink_diag ip6t_rpfilter ipt_REJECT nf_reject_ipv4 ip6t_REJECT nf_reject_ipv6 xt_conntrack ip_set nfnetlink ebtable_nat ebtable_broute bridge stp llc ip6table_nat nf_conntrack_ipv6 nf_defrag_ipv6 nf_nat_ipv6 ip6table_mangle ip6table_security ip6table_raw iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat nf_conntrack iptable_mangle iptable_security iptable_raw ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter pseries_rng sg be2net nfsd auth_rpcgss nfs_acl lockd grace sunrpc binfmt_misc ip_tables xfs libcrc32c dm_service_time sd_mod crc_t10dif crct10dif_generic crct10dif_common ibmvscsi scsi_transport_srp ibmveth scsi_tgt dm_multipath dm_mirror
    Mar 16 16:21:24 localhost kernel: dm_region_hash dm_log dm_mod [last unloaded: mmfslinux]
    Mar 16 16:21:24 localhost kernel: CPU: 32 PID: 14820 Comm: ip Tainted: G        W  OE  ------------   3.10.0-693.17.1.el7.ppc64le #1
    Mar 16 16:21:24 localhost kernel: task: c00000151ca19620 ti: c0000014d4250000 task.ti: c0000014d4250000
    Mar 16 16:21:24 localhost kernel: NIP: c00000000004b48c LR: c00000000004b488 CTR: 0000000000655170
    Mar 16 16:21:24 localhost kernel: REGS: c0000014d4252d60 TRAP: 0700   Tainted: G        W  OE  ------------    (3.10.0-693.17.1.el7.ppc64le)
    Mar 16 16:21:24 localhost kernel: MSR: 8000000100029033 <SF,EE,ME,IR,DR,RI,LE>  CR: 48222828  XER: 20000000
    Mar 16 16:21:24 localhost kernel: CFAR: c0000000009dfaf4 SOFTE: 1 #012GPR00: c00000000004b488 c0000014d4252fe0 c00000000120ef00 0000000000000017 #012GPR04: 0000000000000001 0000000000000000 0000000000000600 0000000000000001 #012GPR08: 0000000000000001 0000000000000007 0000000000000006 00000002000b0004 #012GPR12: 0000000000002200 c000000007b32000 00003fffeca708b8 0000000000000000 #012GPR16: 0000000000000000 c0000014fa7ae410 c000001dd0e05c78 c000001dfe0a8098 #012GPR20: 0000000000000000 c000001dd0f50000 0000000000000000 c000001dd0e05c00 #012GPR24: 00000000000005be 00000000000005c1 c000001dd0e00900 0000000000000001 #012GPR28: 0000000000000000 0000000000000000 ffff800000000000 c000001dd17e4800
    Mar 16 16:21:24 localhost kernel: NIP [c00000000004b48c] iommu_unmap_page+0x24c/0x260
    Mar 16 16:21:24 localhost kernel: LR [c00000000004b488] iommu_unmap_page+0x248/0x260
    Mar 16 16:21:24 localhost kernel: Call Trace:
    Mar 16 16:21:24 localhost kernel: [c0000014d4252fe0] [c00000000004b488] iommu_unmap_page+0x248/0x260 (unreliable)
    Mar 16 16:21:24 localhost kernel: [c0000014d4253070] [c000000000049060] dma_iommu_unmap_page+0x20/0x40
    Mar 16 16:21:24 localhost kernel: [c0000014d4253090] [d000000013a363b8] be_tx_compl_process+0x158/0x280 [be2net]
    Mar 16 16:21:24 localhost kernel: [c0000014d42530f0] [d000000013a38d44] be_close+0x884/0x990 [be2net]
    Mar 16 16:21:24 localhost kernel: [c0000014d42531d0] [c00000000082bdc4] __dev_close+0x1a4/0x260
    Mar 16 16:21:24 localhost kernel: [c0000014d4253290] [c00000000083238c] __dev_change_flags+0x3fc/0x520
    Mar 16 16:21:24 localhost kernel: [c0000014d4253330] [c0000000008324e4] dev_change_flags+0x34/0x150
    Mar 16 16:21:24 localhost kernel: [c0000014d4253380] [c00000000084a600] do_setlink+0x400/0xfe0
    Mar 16 16:21:24 localhost kernel: [c0000014d42534e0] [c00000000084e43c] rtnl_newlink+0x8ec/0xa80
    Mar 16 16:21:24 localhost kernel: [c0000014d42537d0] [c000000000848cf4] rtnetlink_rcv_msg+0xd4/0x300
    Mar 16 16:21:24 localhost kernel: [c0000014d4253850] [c00000000087c8a8] netlink_rcv_skb+0x128/0x140
    Mar 16 16:21:24 localhost kernel: [c0000014d4253890] [c000000000848bfc] rtnetlink_rcv+0x3c/0x60
    Mar 16 16:21:24 localhost kernel: [c0000014d42538c0] [c00000000087c468] netlink_unicast+0x628/0x6d0
    Mar 16 16:21:24 localhost kernel: [c0000014d4253960] [c00000000087da00] netlink_sendmsg+0x3b0/0x830
    Mar 16 16:21:24 localhost kernel: [c0000014d4253a40] [c0000000007f6d7c] sock_sendmsg+0xec/0x140
    Mar 16 16:21:24 localhost kernel: [c0000014d4253bb0] [c0000000007f849c] ___sys_sendmsg+0x36c/0x380
    Mar 16 16:21:24 localhost kernel: [c0000014d4253d50] [c0000000007fd844] SyS_sendmsg+0xc4/0x130
    Mar 16 16:21:24 localhost kernel: [c0000014d4253dd0] [c0000000007fe5bc] SyS_socketcall+0x3ac/0x430
    Mar 16 16:21:24 localhost kernel: [c0000014d4253e30] [c00000000000a184] system_call+0x38/0xe4
    Mar 16 16:21:24 localhost kernel: Instruction dump:
    Mar 16 16:21:24 localhost kernel: 48994641 60000000 e89f0028 3c62ff9a 3863e4e0 4899462d 60000000 e89f0038
    Mar 16 16:21:24 localhost kernel: 3c62ff9a 3863e4f8 48994619 60000000 <0fe00000> 4bfffe9c 60000000 60000000
    Mar 16 16:21:24 localhost kernel: ---[ end trace cc5a527518abf254 ]---
    

Environment

  • Red Hat Enterprise Linux 7 for Power
  • Emulex be2net SR-IOV interfaces

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In

New to Red Hat?

Learn more about Red Hat subscriptions

Using a Red Hat product through a public cloud?

How to access this content