Why does system panic with emcp module ?

Solution Verified - Updated -

Issue

  • After upgrading to kernel 3.10.0-1127.el7.8.2.x86_64 or 3.10.0-1160.62.1.el7.x86_64 the server crashes with PowerPath:
[ 3641.477105] general protection fault: 0000 [#1] SMP 
[ 3641.477129] Modules linked in: team_mode_activebackup 8021q garp mrp stp llc team dell_rbu emcpdm(POE) emcpgpx(POE) emcpmpx(POE) emcp(POE) dell_smbios iTCO_wdt iTCO_vendor_support dell_wmi_descriptor dcdbas skx_edac intel_powerclamp coretemp intel_rapl iosf_mbi kvm_intel kvm irqbypass crc32_pclmul ghash_clmulni_intel aesni_intel ipmi_ssif pcspkr sg i2c_i801 lpc_ich mei_me mei wmi ipmi_si ipmi_devintf ipmi_msghandler acpi_power_meter acpi_pad binfmt_misc ip_tables xfs libcrc32c sd_mod crc_t10dif qla2xxx mgag200 i2c_algo_bit drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops ttm ahci crct10dif_pclmul nvme_fc drm nvme_fabrics libahci nvme_core megaraid_sas i40e libata scsi_transport_fc scsi_tgt ptp pps_core drm_panel_orientation_quirks nfit libnvdimm dm_mirror dm_region_hash dm_log authenc
[ 3641.477364]  cmac zlib wp512 twofish_generic twofish_avx_x86_64 twofish_x86_64_3way twofish_x86_64 twofish_common tea sha512_ssse3 sha512_generic serpent_avx2 serpent_avx_x86_64 serpent_sse2_x86_64 serpent_generic seed salsa20_generic salsa20_x86_64 rmd320 rmd256 rmd160 rmd128 michael_mic md4 khazad ghash_generic gcm fcrypt drbg dm_crypt dm_mod des_generic deflate cts crypto_null crct10dif_generic crct10dif_common crc32c_intel ccm cast6_avx_x86_64 cast6_generic cast_common camellia_generic camellia_aesni_avx2 camellia_aesni_avx_x86_64 ablk_helper cryptd camellia_x86_64 xts lrw gf128mul glue_helper blowfish_generic blowfish_x86_64 blowfish_common arc4 ansi_cprng [last unloaded: emcpioc]
[ 3641.477556] CPU: 24 PID: 0 Comm: swapper/24 Kdump: loaded Tainted: P           OE  ------------   3.10.0-1127.8.2.el7.x86_64 #1
[ 3641.477583] Hardware name: Dell Inc. PowerEdge R740/00WGD1, BIOS 2.6.4 04/09/2020
[ 3641.477602] task: ffff990a1d6d62a0 ti: ffff990a1d6d8000 task.ti: ffff990a1d6d8000
[ 3641.477622] RIP: 0010:[<ffffffff9e37da10>]  [<ffffffff9e37da10>] __blk_add_trace+0x20/0x340
[ 3641.477649] RSP: 0018:ffff99c5feb03ba0  EFLAGS: 00010286
[ 3641.477663] RAX: 9000000000000080 RBX: ffff9a85f67814b8 RCX: 0000000000000000
[ 3641.477682] RDX: 00000000a368db00 RSI: ffff9a85d73a8060 RDI: 9000000000000080
[ 3641.477700] RBP: ffff99c5feb03c00 R08: 0000000001800008 R09: 000000000000000f
[ 3641.477718] R10: 9000000000000080 R11: ffff9a85d73a8060 R12: ffff99c2a368db00
[ 3641.477735] R13: ffff990a1d6d62a0 R14: 0000000000000000 R15: 0000000000000000
[ 3641.477753] FS:  0000000000000000(0000) GS:ffff99c5feb00000(0000) knlGS:0000000000000000
[ 3641.477772] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 3641.477787] CR2: 0000000000c90000 CR3: 000000b7d6df4000 CR4: 00000000007607e0
[ 3641.477805] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 3641.477823] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[ 3641.477840] PKRU: 00000000
[ 3641.477847] Call Trace:
[ 3641.477856]  <IRQ> 
[ 3641.477866]  [<ffffffff9e3c1517>] ? mempool_free_slab+0x17/0x20
[ 3641.477884]  [<ffffffff9e3c192f>] ? mempool_free+0x4f/0xa0
[ 3641.477900]  [<ffffffff9e37dd84>] blk_add_trace_rq.isra.10+0x54/0x90
[ 3641.477927]  [<ffffffffc11ead39>] PowerPlatformTopIodone+0x1b9/0x300 [emcp]
[ 3641.477949]  [<ffffffffc11eae9e>] PowerTopIodone+0x1e/0x130 [emcp]
[ 3641.477969]  [<ffffffffc11eb124>] PowerProcessTopIodonePirps+0x64/0xe0 [emcp]
[ 3641.477989]  [<ffffffffc11eb44c>] PowerBottomIodoneNew+0x2ac/0x5d0 [emcp]
[ 3641.478009]  [<ffffffffc11ebb83>] PowerPlatformBottomIodone+0xc3/0x1f0 [emcp]
[ 3641.478030]  [<ffffffff9e48cbec>] bio_endio+0x8c/0x130
[ 3641.478047]  [<ffffffff9e554b10>] blk_update_request+0x90/0x370
[ 3641.478065]  [<ffffffff9e6ec274>] scsi_end_request+0x34/0x1e0
[ 3641.478081]  [<ffffffff9e6ec5e8>] scsi_io_completion+0x168/0x720
[ 3641.478813]  [<ffffffff9e6e18dc>] scsi_finish_command+0xdc/0x140
[ 3641.479471]  [<ffffffff9e6ebb30>] scsi_softirq_done+0x130/0x160
[ 3641.480127]  [<ffffffff9e55c496>] blk_done_softirq+0x96/0xc0
[ 3641.480753]  [<ffffffff9e2a5695>] __do_softirq+0xf5/0x280
[ 3641.481370]  [<ffffffff9e99642c>] call_softirq+0x1c/0x30
[ 3641.481977]  [<ffffffff9e22f715>] do_softirq+0x65/0xa0
[ 3641.482544]  [<ffffffff9e2a5a15>] irq_exit+0x105/0x110
[ 3641.483094]  [<ffffffff9e2596b9>] smp_call_function_single_interrupt+0x39/0x40
[ 3641.483636]  [<ffffffff9e9951aa>] call_function_single_interrupt+0x16a/0x170
[ 3641.484184]  <EOI> 
[ 3641.484195]  [<ffffffff9e7c6177>] ? cpuidle_enter_state+0x57/0xd0
[ 3641.485233]  [<ffffffff9e7c62ce>] cpuidle_idle_call+0xde/0x230
[ 3641.485748]  [<ffffffff9e237c6e>] arch_cpu_idle+0xe/0xc0
[ 3641.486225]  [<ffffffff9e301c2a>] cpu_startup_entry+0x14a/0x1e0
[ 3641.486688]  [<ffffffff9e25a517>] start_secondary+0x1f7/0x270
[ 3641.487155]  [<ffffffff9e2000d5>] start_cpu+0x5/0x14
[ 3641.487598] Code: fe ff ff 0f 1f 84 00 00 00 00 00 55 49 89 fa 49 89 f3 48 89 e5 41 57 41 56 41 55 65 4c 8b 2c 25 c0 0e 01 00 41 54 53 48 83 ec 38 <83> 3f 02 89 55 d0 44 89 4d cc 44 0f b6 3d 36 2c be 00 0f 85 7d 
[ 3641.488553] RIP  [<ffffffff9e37da10>] __blk_add_trace+0x20/0x340
[ 3641.489002]  RSP <ffff99c5feb03ba0>
  • On RHEL 6 system panic because of 'emcp' module with following traces:
PID: 8001   TASK: ffff880108bb6aa0  CPU: 0   COMMAND: "oracle"
 #0 [ffff880028207e90] crash_nmi_callback at ffffffff8102d3a6
 #1 [ffff880028207ea0] notifier_call_chain at ffffffff81513705
 #2 [ffff880028207ee0] atomic_notifier_call_chain at ffffffff8151376a
 #3 [ffff880028207ef0] notify_die at ffffffff8109cc1e
 #4 [ffff880028207f20] do_nmi at ffffffff815113cb
 #5 [ffff880028207f50] nmi at ffffffff81510c90
    [exception RIP: _spin_lock_irqsave+0x2f]
    RIP: ffffffff815103ef  RSP: ffff880028203b20  RFLAGS: 00000097
    RAX: 0000000000002c93  RBX: ffff88090b863620  RCX: 0000000000002c92
    RDX: 0000000000000202  RSI: 000000000145b708  RDI: ffff88090ec2fe70
    RBP: ffff880028203b20   R8: 0000000000000001   R9: 0000000000000001
    R10: ffff88090bfe7f80  R11: 0000000000000400  R12: 0000000007800030
    R13: 0000000000000078  R14: 0000000000000000  R15: ffff88090ec2fe70
    ORIG_RAX: ffffffffffffffff  CS: 0010  SS: 0018
--- <NMI exception stack> ---
 #6 [ffff880028203b20] _spin_lock_irqsave at ffffffff815103ef
 #7 [ffff880028203b28] PowerPlatformTopIodone at ffffffffa03876df [emcp]
 #8 [ffff880028203b88] PowerTopIodone at ffffffffa038f1a9 [emcp]
 #9 [ffff880028203bb8] PowerProcessTopIodonePirps at ffffffffa038f347 [emcp]
#10 [ffff880028203bf8] PowerBottomIodoneNew at ffffffffa038f700 [emcp]
#11 [ffff880028203c48] PowerPlatformBottomIodone at ffffffffa038fbf7 [emcp]
#12 [ffff880028203c98] bio_endio at ffffffff811b92cd
#13 [ffff880028203ca8] req_bio_endio at ffffffff8125ad2b
#14 [ffff880028203cd8] blk_update_request at ffffffff8125c7d7
#15 [ffff880028203d38] blk_update_bidi_request at ffffffff8125cb87
#16 [ffff880028203d68] blk_end_bidi_request at ffffffff8125dfcf
#17 [ffff880028203d98] blk_end_request at ffffffff8125e070
#18 [ffff880028203da8] scsi_io_completion at ffffffff8137aa1f
#19 [ffff880028203e28] scsi_finish_command at ffffffff81371a52
#20 [ffff880028203e68] scsi_softirq_done at ffffffff8137b195
#21 [ffff880028203e98] blk_done_softirq at ffffffff81263715
#22 [ffff880028203ec8] __do_softirq at ffffffff81076fb1
#23 [ffff880028203f38] call_softirq at ffffffff8100c1cc
#24 [ffff880028203f50] do_softirq at ffffffff8100de05
#25 [ffff880028203f70] irq_exit at ffffffff81076d95
#26 [ffff880028203f80] do_IRQ at ffffffff815171c5

or

PID: 21461  TASK: ffff880ffbaa2040  CPU: 13  COMMAND: "oracle"
 #0 [ffff88085c4a7e90] crash_nmi_callback at ffffffff810297e6
 #1 [ffff88085c4a7ea0] notifier_call_chain at ffffffff814efc75
 #2 [ffff88085c4a7ee0] atomic_notifier_call_chain at ffffffff814efcda
 #3 [ffff88085c4a7ef0] notify_die at ffffffff81096c3e
 #4 [ffff88085c4a7f20] do_nmi at ffffffff814ed8f3
 #5 [ffff88085c4a7f50] nmi at ffffffff814ed200
    [exception RIP: _spin_lock_irq+40]
    RIP: ffffffff814ec998  RSP: ffff880fa7a0d8f8  RFLAGS: 00000093
    RAX: 0000000000007bff  RBX: ffff881025c86f68  RCX: ffff880b8e68e4c0
    RDX: 0000000000007bfd  RSI: ffff880fa7a0d920  RDI: ffff881028a99c70
    RBP: ffff880fa7a0d8f8   R8: 0000000000000246   R9: 0000000000000000
    R10: 0000000000000000  R11: 0000000000000000  R12: 0000000000000000
    R13: 0000000000000005  R14: 0000000000004000  R15: 0000000007800060
    ORIG_RAX: ffffffffffffffff  CS: 0010  SS: 0018
--- <NMI exception stack> ---
 #6 [ffff880fa7a0d8f8] _spin_lock_irq at ffffffff814ec998
 #7 [ffff880fa7a0d900] __make_request at ffffffff81250390
 #8 [ffff880fa7a0d980] emcp_mrf at ffffffffa026947d [emcp]
 #9 [ffff880fa7a0d9a0] generic_make_request at ffffffff8124ea7e
#10 [ffff880fa7a0da70] submit_bio at ffffffff8124eddd
#11 [ffff880fa7a0dac0] dio_bio_submit at ffffffff811b0f3c
#12 [ffff880fa7a0daf0] __blockdev_direct_IO_newtrunc at ffffffff811b1921
#13 [ffff880fa7a0dbd0] __blockdev_direct_IO at ffffffff811b1e7e
#14 [ffff880fa7a0dc50] blkdev_direct_IO at ffffffff811af4c7
#15 [ffff880fa7a0dc90] generic_file_aio_read at ffffffff81111f4b
#16 [ffff880fa7a0dd70] blkdev_aio_read at ffffffff811aea13
#17 [ffff880fa7a0ddc0] do_sync_read at ffffffff811766da
#18 [ffff880fa7a0def0] vfs_read at ffffffff81176fc5
#19 [ffff880fa7a0df30] sys_pread64 at ffffffff811772f2
#20 [ffff880fa7a0df80] system_call_fastpath at ffffffff8100b072

Environment

  • Red Hat Enterprise Linux (RHEL)
    • 6
    • 7
  • EMC powerpath software

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In

New to Red Hat?

Learn more about Red Hat subscriptions

Using a Red Hat product through a public cloud?

How to access this content