Server crash rebooted "general protection fault: 0000 [#1] SMP" due to third party kernel module nvidia.

Solution Verified - Updated -

Issue

[2712745.419007] general protection fault: 0000 [#1] SMP 
[2712745.419045] Modules linked in: cmac nls_utf8 cifs ccm cts rpcsec_gss_krb5 nfsv4 dns_resolver nfs lockd grace fscache xt_CHECKSUM ipt_MASQUERADE nf_nat_masquerade_ipv4 tun ip6t_rpfilter ip6t_REJECT nf_reject_ipv6 ipt_REJECT nf_reject_ipv4 xt_conntrack ebtable_nat ebtable_broute bridge stp llc ip6table_nat nf_conntrack_ipv6 nf_defrag_ipv6 nf_nat_ipv6 ip6table_mangle ip6table_security ip6table_raw iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat iptable_mangle iptable_security iptable_raw nf_conntrack ip_set ebtable_filter ebtables ip6table_filter devlink ip6_tables iptable_filter ampnetworkflow(OE) ampfsm(OE) nvidia_drm(POE) nvidia_modeset(POE) nvidia_uvm(OE) nvidia(POE) amd64_edac_mod edac_mce_amd kvm_amd kvm irqbypass crc32_pclmul ghash_clmulni_intel vfat fat aesni_intel lrw gf128mul
[2712745.419366]  glue_helper ablk_helper cryptd ses pcspkr enclosure joydev sg k10temp ccp hpwdt hpilo i2c_piix4 ipmi_si ipmi_devintf ipmi_msghandler acpi_cpufreq acpi_power_meter binfmt_misc auth_rpcgss sunrpc ip_tables xfs libcrc32c sd_mod crc_t10dif crct10dif_generic mgag200 i2c_algo_bit drm_kms_helper syscopyarea sysfillrect sysimgblt crct10dif_pclmul crct10dif_common fb_sys_fops ttm crc32c_intel tg3 smartpqi drm ptp scsi_transport_sas drm_panel_orientation_quirks pps_core wmi uas usb_storage dm_mirror dm_region_hash dm_log dm_mod fuse
[2712745.419585] CPU: 27 PID: 86790 Comm: chrome Kdump: loaded Tainted: P           OEL ------------   3.10.0-1160.31.1.el7.x86_64 #1
[2712745.419627] Hardware name: HPE ProLiant DL385 Gen10/ProLiant DL385 Gen10, BIOS A40 06/07/2018
[2712745.419657] task: ffff96dac2883580 ti: ffff96d8d3f18000 task.ti: ffff96d8d3f18000
[2712745.419683] RIP: 0010:[<ffffffffc1d70820>]  [<ffffffffc1d70820>] _nv035831rm+0xb0/0xe0 [nvidia]
[2712745.419885] RSP: 0018:ffff96d8d3f1bb58  EFLAGS: 00010202
[2712745.419906] RAX: 0000000000000001 RBX: ffff975619243008 RCX: ffff96acd3a01170
[2712745.419932] RDX: 6b6b6b6b6b6b6b6b RSI: 6b6b6b6b00000000 RDI: ffff96b89407ad28
[2712745.419957] RBP: ffff96b89407ad28 R08: 0000000000000020 R09: ffff96b89407ad30
[2712745.419983] R10: 0000000000000000 R11: ffff96acd3a02538 R12: ffff96f2d0c35460
[2712745.420009] R13: 6b6b6b6b00000000 R14: ffff96b89407ada0 R15: ffff975619243008
[2712745.420036] FS:  00007fd5a28eab80(0000) GS:ffff96db3fac0000(0000) knlGS:0000000000000000
[2712745.420065] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[2712745.420087] CR2: 00007f7c43e99c40 CR3: 00000093d21a2000 CR4: 00000000003407e0
[2712745.420113] Call Trace:
[2712745.420277]  [<ffffffffc1d6e4ee>] ? _nv014653rm+0x2ee/0x770 [nvidia]  <<---
[2712745.420453]  [<ffffffffc1d6ce43>] ? _nv037672rm+0xb3/0x150 [nvidia]  <<---
[2712745.420627]  [<ffffffffc1d6d177>] ? _nv037671rm+0x297/0x4e0 [nvidia]  <<---
[2712745.420802]  [<ffffffffc1d6d510>] ? _nv037666rm+0x60/0x70 [nvidia]  <<---
[2712745.420976]  [<ffffffffc1d6d63b>] ? _nv037667rm+0x7b/0xb0 [nvidia]  <<---
[2712745.421111]  [<ffffffffc168eb60>] ? _nv036043rm+0x40/0xe0 [nvidia]  <<---
[2712745.421268]  [<ffffffffc1faaf18>] ? _nv000699rm+0x68/0x80 [nvidia]  <<---
[2712745.421424]  [<ffffffffc1fabfda>] ? rm_cleanup_file_private+0xea/0x160 [nvidia]  <<---
[2712745.421538]  [<ffffffffc160be77>] ? nvidia_close+0x137/0x310 [nvidia]
[2712745.421652]  [<ffffffffc161c06f>] ? nvidia_frontend_close+0x2f/0x50 [nvidia]  <<---
[2712745.421683]  [<ffffffffa185037c>] ? __fput+0xec/0x230
[2712745.421704]  [<ffffffffa18505ae>] ? ____fput+0xe/0x10
[2712745.421727]  [<ffffffffa16c296b>] ? task_work_run+0xbb/0xe0
[2712745.421753]  [<ffffffffa16a1924>] ? do_exit+0x2d4/0xa30
[2712745.421774]  [<ffffffffa16a20ff>] ? do_group_exit+0x3f/0xa0
[2712745.421796]  [<ffffffffa16a2174>] ? SyS_exit_group+0x14/0x20
[2712745.421824]  [<ffffffffa1d96226>] ? tracesys+0xa6/0xcc
[2712745.421843] Code: 48 89 c2 48 89 ef 48 8d b1 48 01 00 00 4c 89 e9 e8 d6 5b ff ff 66 0f 1f 44 00 00 48 89 ef e8 38 5c ff ff 84 c0 74 8a 48 8b 75 00 <48> 39 5e 08 75 ea 4c 39 26 75 e5 49 8b 44 24 20 48 8d b8 48 01 
[2712745.421973] RIP  [<ffffffffc1d70820>] _nv035831rm+0xb0/0xe0 [nvidia]   <<---
[2712745.422146]  RSP <ffff96d8d3f1bb58>

Environment

  • Red Hat Enterprise Linux 7
  • Third-party kernel module [nvidia]

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In

New to Red Hat?

Learn more about Red Hat subscriptions

Using a Red Hat product through a public cloud?

How to access this content