Server panicked with "BUG: unable to handle kernel NULL pointer dereference at (null)" involving the third-party module rdma_cm

Solution Unverified

Issue

The server panics and the kernel log shows the following messages:

"BUG: unable to handle kernel NULL pointer dereference at           (null)"
[11784649.062488] Modules linked in: tcp_diag udp_diag inet_diag unix_diag af_packet_diag netlink_diag iptable_filter mmfs26(OE) mmfslinux(OE) tracedev(OE) ext4 mbcache jbd2 nvmeibs(POE) nfsv3 nfs_acl nvmeibc(POE) rpcsec_gss_krb5 nfsv4 dns_resolver nfs lockd grace fscache nvmeib_common_mlx5_public(OE) nvmeib_common_mlx4_public(OE) nvmeib_common_public(OE) nvmeib_common(POE) bonding mlx5_fpga_tools(OE) mlx4_ib(OE) mlx4_en(OE) mlx4_core(OE) ib_isert(OE) iscsi_target_mod target_core_mod ib_srpt(OE) ib_srp(OE) scsi_transport_srp(OE) ib_ucm(OE) rdma_ucm(OE) ib_iser(OE) ib_umad(OE) rdma_cm(OE) iw_cm(OE) libiscsi scsi_transport_iscsi ib_ipoib(OE) ib_cm(OE) intel_powerclamp coretemp intel_rapl iosf_mbi mlx5_ib(OE) ib_uverbs(OE) ib_core(OE) kvm ipmi_ssif irqbypass crc32_pclmul ghash_clmulni_intel aesni_intel lrw
[11784649.069923]  gf128mul glue_helper ablk_helper cryptd pcspkr ses enclosure mei_me sg lpc_ich mei ipmi_si hpilo hpwdt wmi ipmi_devintf ipmi_msghandler acpi_power_meter binfmt_misc knem(OE) auth_rpcgss sunrpc ip_tables xfs libcrc32c dm_snapshot dm_bufio uas usb_storage sd_mod crc_t10dif crct10dif_generic mgag200 i2c_algo_bit drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops ttm mlx5_core(OE) crct10dif_pclmul crct10dif_common crc32c_intel mlxfw(OE) drm serio_raw vfio_mdev(OE) vfio_iommu_type1 nvme smartpqi vfio tg3 nvme_core mdev(OE) scsi_transport_sas mlx_compat(OE) devlink ptp pps_core drm_panel_orientation_quirks dm_mirror dm_region_hash dm_log dm_mod
[11784649.077289] CPU: 35 PID: 21580 Comm: kworker/u72:0 Kdump: loaded Tainted: P        W  OE  ------------   3.10.0-1062.9.1.el7.x86_64 #1
[11784649.079692] Hardware name: HPE ProLiant DL380 Gen10/ProLiant DL380 Gen10, BIOS U30 03/09/2020
[11784649.080906] Workqueue: rdma_cm cma_work_handler [rdma_cm]
[11784649.082099] task: ffff9f9de26c41c0 ti: ffff9f8f37be8000 task.ti: ffff9f8f37be8000
[11784649.083286] RIP: 0010:[<ffffffffbbcd375d>]  [<ffffffffbbcd375d>] __wake_up_common+0x4d/0x120
[11784649.084469] RSP: 0018:ffff9f8f37bebd70  EFLAGS: 00010086
[11784649.085634] RAX: ffffffffffffffe8 RBX: ffff9f41834fae58 RCX: 0000000000000000
[11784649.086790] RDX: 0000000000000000 RSI: 0000000000000003 RDI: ffff9f41834fae58
[11784649.087931] RBP: ffff9f8f37bebdb8 R08: 0000000000000000 R09: 0000000000000000
[11784649.089060] R10: 0000000000000000 R11: 000000000000254c R12: ffff9f41834fae50
[11784649.090175] R13: ffff9f41834fae60 R14: 0000000000000000 R15: 0000000000000900
[11784649.091272] FS:  0000000000000000(0000) GS:ffff9ff8bfa40000(0000) knlGS:0000000000000000
[11784649.092362] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[11784649.093434] CR2: 0000000000000000 CR3: 000000a4f029e000 CR4: 00000000007607e0
[11784649.094498] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[11784649.095540] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[11784649.096563] PKRU: 00000000
[11784649.097564] Call Trace:
[11784649.098548]  [<ffffffffbbcd39cf>] complete+0x3f/0x60
[11784649.099529]  [<ffffffffc08da118>] cma_deref_id+0x28/0x30 [rdma_cm]
[11784649.100490]  [<ffffffffc08deb67>] cma_work_handler+0x47/0xa0 [rdma_cm]
[11784649.101435]  [<ffffffffbbcbe21f>] process_one_work+0x17f/0x440
[11784649.102362]  [<ffffffffbbcbf336>] worker_thread+0x126/0x3c0
[11784649.103271]  [<ffffffffbbcbf210>] ? manage_workers.isra.26+0x2a0/0x2a0
[11784649.104167]  [<ffffffffbbcc61f1>] kthread+0xd1/0xe0
[11784649.105042]  [<ffffffffbbcc6120>] ? insert_kthread_work+0x40/0x40
[11784649.105902]  [<ffffffffbc38dd1d>] ret_from_fork_nospec_begin+0x7/0x21
[11784649.106749]  [<ffffffffbbcc6120>] ? insert_kthread_work+0x40/0x40
[11784649.107580] Code: 89 55 c4 89 4d d0 4c 89 45 c8 74 0a 41 f6 01 04 0f 85 a8 00 00 00 49 8b 45 08 48 83 e8 18 48 8d 50 18 49 83 c5 08 49 39 d5 74 5b <48> 8b 48 18 31 db 4c 8d 79 e8 66 0f 1f 84 00 00 00 00 00 4c 8b 
[11784649.109312] RIP  [<ffffffffbbcd375d>] __wake_up_common+0x4d/0x120
[11784649.110137]  RSP <ffff9f8f37bebd70>
[11784649.110935] CR2: 0000000000000000
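
The register dump above can be cross-checked against the NULL address reported in CR2. The following is a minimal sketch, assuming the instruction bytes marked `<48> 8b 48 18` in the Code: line decode to a load from `RAX + 0x18` (a reading of the opcode bytes, not something the log states directly). The helper name `fault_address` is hypothetical.

```python
# Sketch: recompute the faulting address from the register dump.
# Assumption: the faulting instruction reads 8 bytes from RAX + 0x18.

MASK64 = (1 << 64) - 1  # x86_64 address arithmetic wraps at 64 bits


def fault_address(base: int, displacement: int) -> int:
    """Return base + displacement with 64-bit wraparound."""
    return (base + displacement) & MASK64


rax = 0xFFFFFFFFFFFFFFE8  # RAX from the register dump
cr2 = 0x0000000000000000  # CR2 from the register dump

addr = fault_address(rax, 0x18)
print(hex(addr))    # 0x0
print(addr == cr2)  # True
```

RAX holds -0x18, so the computed address wraps to 0, which matches CR2. One possible reading is that a list pointer loaded just before the fault was already NULL; that is an interpretation of the dump, not a confirmed root cause.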

Environment

  • Red Hat Enterprise Linux 7
  • Third-party kernel module [rdma_cm]
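
Third-party modules can be picked out of the "Modules linked in:" line by the taint flags printed in parentheses after each module name (P = proprietary, O = out-of-tree, E = unsigned). The following is a minimal sketch; the function name `flagged_modules` is hypothetical and the sample string is a shortened excerpt of the log in the Issue section.

```python
import re

# Per-module taint flags as printed by the kernel after a module name.
FLAG_MEANINGS = {"P": "proprietary", "O": "out-of-tree", "E": "unsigned"}

# Matches tokens such as "rdma_cm(OE)" or "nvmeibs(POE)".
MODULE_RE = re.compile(r"^([A-Za-z0-9_]+)\(([A-Z]+)\)$")


def flagged_modules(modules_line: str) -> dict:
    """Map module name -> flag string for every module that carries flags."""
    found = {}
    for token in modules_line.split():
        match = MODULE_RE.match(token)
        if match:
            found[match.group(1)] = match.group(2)
    return found


# Shortened excerpt of the "Modules linked in:" line from the log.
sample = "iptable_filter mmfs26(OE) ext4 nvmeibs(POE) rdma_cm(OE) ib_core(OE) kvm"

mods = flagged_modules(sample)
for name, flags in mods.items():
    meaning = ", ".join(FLAG_MEANINGS.get(f, "other") for f in flags)
    print(f"{name}: {meaning}")
```

In this excerpt, rdma_cm is reported as out-of-tree and unsigned, which is consistent with it being a third-party build rather than the module shipped with the Red Hat kernel.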
