Server intermittently crashing with RIP 'lpfc_sli_validate_fcp_iocb+0x74/0xc0 [lpfc]'

Solution In Progress - Updated -

Issue

  • During the database restoration activity, system crashed with General Protection Fault and call traces as seen below:

    general protection fault: 0000 [#1] SMP 
    [...]
    scsi_tgt scsi_transport_sas crct10dif_common dm_mirror dm_region_hash dm_log dm_mod [last unloaded: oracleasm]
    CPU: 8 PID: 892 Comm: scsi_eh_2 Kdump: loaded Tainted: P        W  OE  ------------ T 3.10.0-862.3.2.el7.x86_64 #1
    Hardware name: HP ProLiant BL460c Gen9, BIOS I36 09/12/2016
    task: ffff8bf7b61b0fd0 ti: ffff8be935f78000 task.ti: ffff8be935f78000
    RIP: 0010:[<ffffffffc045a6d4>]  [<ffffffffc045a6d4>] lpfc_sli_validate_fcp_iocb+0x74/0xc0 [lpfc]
    RSP: 0018:ffff8be935f7bc68  EFLAGS: 00010097
    RAX: 0000000000000001 RBX: 0000000000000001 RCX: 0000000000000014
    RDX: 0000000000000001 RSI: 6572662f67726f2f RDI: ffff8bbe99ec0470
    RBP: ffff8be935f7bc70 R08: 0000000000000000 R09: ffff8bf7b7a9ef00
    R10: 000000000000189a R11: 0000000000000001 R12: ffff8bf7b69d3740
    R13: ffff8bf7b6030000 R14: 0000000000000000 R15: 0000000000000f4f
    FS:  0000000000000000(0000) GS:ffff8bf7bf600000(0000) knlGS:0000000000000000
    CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
    CR2: 00007fe0f5002000 CR3: 00000030f600e000 CR4: 00000000003607e0
    DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
    DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
    Call Trace:
     [<ffffffffc04642c8>] lpfc_sli_sum_iocb+0x78/0xc0 [lpfc]
     [<ffffffffc04ae86e>] lpfc_reset_flush_io_context+0x2e/0x190 [lpfc]
     [<ffffffffc04afedc>] lpfc_device_reset_handler+0x16c/0x220 [lpfc]
     [<ffffffffb069d30d>] scsi_try_bus_device_reset+0x2d/0x60
     [<ffffffffb069f50f>] scsi_eh_ready_devs+0x4ef/0xc60
     [<ffffffffb06a0f6c>] scsi_error_handler+0x56c/0x8b0
     [<ffffffffb06a0a00>] ? scsi_eh_get_sense+0x250/0x250
     [<ffffffffb02bb161>] kthread+0xd1/0xe0
     [<ffffffffb02bb090>] ? insert_kthread_work+0x40/0x40
     [<ffffffffb0920677>] ret_from_fork_nospec_begin+0x21/0x21
     [<ffffffffb02bb090>] ? insert_kthread_work+0x40/0x40
    [1896190.457188] Code: c2 48 c7 c6 30 21 4e c0 48 c7 c7 f8 84 4e c0 31 c0 e8 08 e5 4a f0 b8 01 00 00 00 c9 c3 66 2e 0f 1f 84 00 00 00 00 00 48 8b 77 a8 <48> 8b 36 48 85 f6 74 a8 66 3b 56 38 75 a2 48 8b 7f e0 48 89 4d 
    [1896190.459160] RIP  [<ffffffffc045a6d4>] lpfc_sli_validate_fcp_iocb+0x74/0xc0 [lpfc]
    [1896190.460140]  RSP <ffff8be935f7bc68>
    

Environment

  • Red Hat Enterprise Linux 7.5
  • kernel-3.10.0-862.3.2.el7
  • Emulex OneConnect OCe14000, FCoE Initiator
  • Oracle DB

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase of over 48,000 articles and solutions.

Current Customers and Partners

Log in for full access

Log In
Close

Welcome! Check out the Getting Started with Red Hat page for quick tours and guides for common tasks.