cxgb4: Badness at mm/page_alloc.c:2139 and RTNETLINK answers: Cannot allocate memory.

Solution In Progress - Updated -

Issue

  • When booting the 642 kernel, cxgb4 functions are seen in a mm WARN_ON and trying to bring up the interfaces results in RTNETLINK answers: Cannot allocate memory:

    Jun 12 14:23:26 localhost kernel: Chelsio T4/T5 Network Driver - version 2.0.0-ko
    Jun 12 14:23:26 localhost kernel: cxgb4 0004:01:00.0: enabling device (0140 -> 0142)
    Jun 12 14:23:26 localhost kernel: cxgb4 0004:01:00.1: enabling device (0140 -> 0142)
    Jun 12 14:23:26 localhost kernel: cxgb4 0004:01:00.4: enabling device (0140 -> 0142)
    Jun 12 14:23:26 localhost kernel: cxgb4 0004:01:00.4: ibm,query-pe-dma-windows(53) 10000 8000000 20000201 returned 0
    Jun 12 14:23:26 localhost kernel: cxgb4 0004:01:00.4: ibm,create-pe-dma-window(54) 10000 8000000 20000201 c 25 returned 0 (liobn = 0x70000001 starting addr = 100000 0)
    Jun 12 14:23:26 localhost kernel: cxgb4 0004:01:00.4: Using 64-bit direct DMA at offset 10000000000000
    Jun 12 14:23:26 localhost kernel: cxgb4 0004:01:00.4: Using 64-bit direct DMA at offset 10000000000000
    Jun 12 14:23:26 localhost kernel: cxgb4 0004:01:00.4: firmware: requesting cxgb4/t4fw-1.14.4.0.bin
    Jun 12 14:23:26 localhost kernel: cxgb4 0004:01:00.4: Coming up as MASTER: Initializing adapter
    Jun 12 14:23:26 localhost kernel: cxgb4 0004:01:00.4: firmware: requesting cxgb4/t4-config.txt
    Jun 12 14:23:26 localhost kernel: cxgb4 0004:01:00.4: mbox 4: 70000000c1f0203 0 0 0 0 0 0 0
    Jun 12 14:23:26 localhost kernel: cxgb4 0004:01:00.4: Successfully configured using Firmware Configuration File "Firmware Default", version 0x0, computed checksum 0x0
    Jun 12 14:23:26 localhost kernel: cxgb4 0004:01:00.4: max_ordird_qp 255 max_ird_adapter 589824
    Jun 12 14:23:26 localhost kernel: cxgb4 0004:01:00.4: eth4: Chelsio T422-CR rev 2 1000/10GBASE-R SFP+ RNIC PCIe x8 5 GT/s MSI-X
    Jun 12 14:23:26 localhost kernel: cxgb4 0004:01:00.4: eth4: S/N: PT38110240, P/N: 110113840B0
    Jun 12 14:23:26 localhost kernel: cxgb4 0004:01:00.4: eth5: Chelsio T422-CR rev 2 1000/10GBASE-R SFP+ RNIC PCIe x8 5 GT/s MSI-X
    Jun 12 14:23:26 localhost kernel: cxgb4 0004:01:00.4: eth5: S/N: PT38110240, P/N: 110113840B0
    Jun 12 14:23:26 localhost kernel: cxgb4 0004:01:00.4: eth6: Chelsio T422-CR rev 2 100/1000BASE-T SGMII RNIC PCIe x8 5 GT/s MSI-X
    Jun 12 14:23:26 localhost kernel: cxgb4 0004:01:00.4: eth6: S/N: PT38110240, P/N: 110113840B0
    Jun 12 14:23:26 localhost kernel: cxgb4 0004:01:00.4: eth7: Chelsio T422-CR rev 2 100/1000BASE-T SGMII RNIC PCIe x8 5 GT/s MSI-X
    Jun 12 14:23:26 localhost kernel: cxgb4 0004:01:00.4: eth7: S/N: PT38110240, P/N: 110113840B0
    
    Jun 12 14:23:26 localhost kernel: ------------[ cut here ]------------
    Jun 12 14:23:26 localhost kernel: Badness at mm/page_alloc.c:2139
    Jun 12 14:23:26 localhost kernel: NIP: c00000000017193c LR: c00000000017183c CTR: 0000000000000009
    Jun 12 14:23:26 localhost kernel: REGS: c000001f02f9e830 TRAP: 0700   Not tainted  (2.6.32-642.el6.ppc64)
    Jun 12 14:23:26 localhost kernel: MSR: 8000000000029032 <EE,ME,CE,IR,DR>  CR: 24042444  XER: 00000020
    Jun 12 14:23:26 localhost kernel: TASK = c000001f06c3d7e0[2415] 'ip' THREAD: c000001f02f9c000 CPU: 9
    Jun 12 14:23:26 localhost kernel: GPR00: 0000000000000001 c000001f02f9eab0 c000000000f9f3b0 0000000000000000 
    Jun 12 14:23:26 localhost kernel: GPR04: 0000000000000001 0000000000000000 c000001f02f9e970 00000000000001b8 
    Jun 12 14:23:26 localhost kernel: GPR08: c000001f06c3d7e0 c000000001478080 0000000000000001 c00000000112991c 
    Jun 12 14:23:26 localhost kernel: GPR12: 0000000044042428 c000000001083b80 0000000000000000 ffffffffffffffef 
    Jun 12 14:23:26 localhost kernel: GPR16: c000000f9f2ce120 c000000000660678 0000000000000000 0000000000000004 
    Jun 12 14:23:26 localhost kernel: GPR20: 0000000000000000 00000000000200d0 c000000fbfff9288 0000000000000001 
    Jun 12 14:23:26 localhost kernel: GPR24: 0000000000000001 c00000000105d798 0000000000000000 c000000fbfff9280 
    Jun 12 14:23:26 localhost kernel: GPR28: 0000000000000011 0000000000000000 c000000000f30bc8 00000000000000d0 
    Jun 12 14:23:26 localhost kernel: NIP [c00000000017193c] .__alloc_pages_nodemask+0x25c/0x930
    Jun 12 14:23:26 localhost kernel: LR [c00000000017183c] .__alloc_pages_nodemask+0x15c/0x930
    Jun 12 14:23:26 localhost kernel: Call Trace:
    Jun 12 14:23:26 localhost kernel: [c000001f02f9eab0] [c00000000017183c] .__alloc_pages_nodemask+0x15c/0x930 (unreliable)
    Jun 12 14:23:26 localhost kernel: [c000001f02f9ec60] [c000000000033914] .dma_direct_alloc_coherent+0xa4/0x184
    Jun 12 14:23:26 localhost kernel: [c000001f02f9ed00] [d00000000db8e020] .alloc_ring+0x90/0x190 [cxgb4]
    Jun 12 14:23:26 localhost kernel: [c000001f02f9edb0] [d00000000db8e260] .t4_sge_alloc_ofld_txq+0x70/0x1e0 [cxgb4]
    Jun 12 14:23:26 localhost kernel: [c000001f02f9ee80] [d00000000db77fc0] .cxgb_up+0x330/0xcc0 [cxgb4]
    Jun 12 14:23:26 localhost kernel: [c000001f02f9ef60] [d00000000db78ef0] .cxgb_open+0x40/0x110 [cxgb4]
    Jun 12 14:23:26 localhost kernel: [c000001f02f9eff0] [c00000000050a624] .dev_open+0x134/0x1a0
    Jun 12 14:23:26 localhost kernel: [c000001f02f9f080] [c000000000509050] .dev_change_flags+0xe0/0x240
    Jun 12 14:23:26 localhost kernel: [c000001f02f9f120] [c00000000051bb48] .do_setlink+0x1a8/0xa20
    Jun 12 14:23:26 localhost kernel: [c000001f02f9f230] [c00000000051dbac] .rtnl_newlink+0x4bc/0x620
    Jun 12 14:23:26 localhost kernel: [c000001f02f9f4b0] [c00000000051d684] .rtnetlink_rcv_msg+0x344/0x3b0
    Jun 12 14:23:26 localhost kernel: [c000001f02f9f580] [c000000000540d50] .netlink_rcv_skb+0x110/0x140
    Jun 12 14:23:26 localhost kernel: [c000001f02f9f610] [c00000000051dda8] .rtnetlink_rcv+0x28/0x50
    Jun 12 14:23:26 localhost kernel: [c000001f02f9f6a0] [c000000000540758] .netlink_unicast+0x418/0x490
    Jun 12 14:23:26 localhost kernel: [c000001f02f9f770] [c000000000541620] .netlink_sendmsg+0x2f0/0x4c0
    Jun 12 14:23:26 localhost kernel: [c000001f02f9f870] [c0000000004e881c] .sock_sendmsg+0x14c/0x1a0
    Jun 12 14:23:26 localhost kernel: [c000001f02f9fa90] [c0000000004e8cec] .__sys_sendmsg+0x34c/0x360
    Jun 12 14:23:26 localhost kernel: [c000001f02f9fca0] [c0000000004e8fac] .SyS_sendmsg+0x4c/0xb0
    Jun 12 14:23:26 localhost kernel: [c000001f02f9fd70] [c0000000004e71b4] .SyS_socketcall+0x1a4/0x3a0
    Jun 12 14:23:26 localhost kernel: [c000001f02f9fe30] [c000000000008564] syscall_exit+0x0/0x40
    Jun 12 14:23:26 localhost kernel: Instruction dump:
    Jun 12 14:23:26 localhost kernel: f90100d0 9a810090 9a810091 409d00b4 7be9bfe3 40e2ff24 e93e8018 3d290002 
    Jun 12 14:23:26 localhost kernel: 80090110 7c000034 5400d97e 78000020 <0b000000> 2fa00000 41feff00 38000001 
    

Environment

  • Red Hat Enterprise Linux 6.8
  • kernel-2.6.32-642.el6
  • IBM Power hardware

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In

New to Red Hat?

Learn more about Red Hat subscriptions

Using a Red Hat product through a public cloud?

How to access this content