A great number of tasks calling percpu_rwsem_wait() or __rt_mutex_slowlock() are getting stuck for a long time

Solution Unverified

Issue

  • A large number of tasks calling percpu_rwsem_wait() or __rt_mutex_slowlock() remain blocked for a long time, and the kernel logs hung task messages such as the following:
    [...]
[288914.308281] INFO: task systemd:1 blocked for more than 600 seconds.
[288914.308287]       Tainted: G        W  OE    --------- -  - 4.18.0-372.69.1.rt7.227.el8_6.x86_64 #1
[288914.308289] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[288914.308290] task:systemd         state:D stack:    0 pid:    1 ppid:     0 flags:0x00080000
[288914.308295] Call Trace:
[288914.308299]  __schedule+0x37b/0x8e0
[288914.308307]  ? proc_cgroup_show+0x30/0x290
[288914.308313]  schedule+0x6c/0x120
[288914.308316]  rt_mutex_slowlock_block.isra.18+0x9c/0x170
[288914.308320]  rt_mutex_slowlock.constprop.21+0xd4/0x140
[288914.308324]  proc_cgroup_show+0x4a/0x290
[288914.308328]  proc_single_show+0x4e/0xa0
[288914.308333]  seq_read+0x163/0x420
[288914.308370]  vfs_read+0x91/0x150
[288914.308375]  ksys_read+0x52/0xc0
[288914.308377]  do_syscall_64+0x87/0x1b0
[288914.308382]  entry_SYSCALL_64_after_hwframe+0x61/0xc6
[288914.308386] RIP: 0033:0x7fa1f4b2c534
[288914.308394] Code: Unable to access opcode bytes at RIP 0x7fa1f4b2c50a.
[288914.308395] RSP: 002b:00007fff5eb59870 EFLAGS: 00000246 ORIG_RAX: 0000000000000000
[288914.308398] RAX: ffffffffffffffda RBX: 000000000000002c RCX: 00007fa1f4b2c534
[288914.308400] RDX: 0000000000000400 RSI: 0000559669ed6bf0 RDI: 000000000000002c
[288914.308401] RBP: 0000559669ed6bf0 R08: 0000000000000000 R09: 0000000000000000
[288914.308402] R10: 0000000000000000 R11: 0000000000000246 R12: 0000000000000400
[288914.308403] R13: 00007fa1f4dc83a0 R14: 00000000000007ff R15: 000055966a027340
[288914.308527] INFO: task systemd-journal:1540 blocked for more than 600 seconds.
[288914.308529]       Tainted: G        W  OE    --------- -  - 4.18.0-372.69.1.rt7.227.el8_6.x86_64 #1
[288914.308530] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[288914.308531] task:systemd-journal state:D stack:    0 pid: 1540 ppid:     1 flags:0x00080100
[288914.308534] Call Trace:
[288914.308536]  __schedule+0x37b/0x8e0
[288914.308549]  schedule+0x6c/0x120
[288914.308551]  rt_mutex_slowlock_block.isra.18+0x9c/0x170
[288914.308555]  rt_mutex_slowlock.constprop.21+0xd4/0x140
[288914.308559]  proc_cgroup_show+0x4a/0x290
[288914.308562]  proc_single_show+0x4e/0xa0
[288914.308565]  seq_read+0x163/0x420
[288914.308568]  vfs_read+0x91/0x150
[288914.308571]  ksys_read+0x52/0xc0
[288914.308573]  do_syscall_64+0x87/0x1b0
[288914.308576]  entry_SYSCALL_64_after_hwframe+0x61/0xc6
[288914.308578] RIP: 0033:0x7f398e1d1534
[288914.308581] Code: Unable to access opcode bytes at RIP 0x7f398e1d150a.
[288914.308582] RSP: 002b:00007ffd9911bbe0 EFLAGS: 00000246 ORIG_RAX: 0000000000000000
[288914.308583] RAX: ffffffffffffffda RBX: 0000000000000018 RCX: 00007f398e1d1534
[288914.308585] RDX: 0000000000000400 RSI: 000055c3e20608b0 RDI: 0000000000000018
[288914.308586] RBP: 000055c3e20608b0 R08: 0000000000000000 R09: 0000000000000000
[288914.308587] R10: 0000000000000000 R11: 0000000000000246 R12: 0000000000000400
[288914.308588] R13: 00007f398e46d3a0 R14: 00000000000007ff R15: 000055c3e2003030
[288914.308598] INFO: task ovs-vswitchd:2285 blocked for more than 600 seconds.
[288914.308600]       Tainted: G        W  OE    --------- -  - 4.18.0-372.69.1.rt7.227.el8_6.x86_64 #1
[288914.308601] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[288914.308601] task:ovs-vswitchd    state:D stack:    0 pid: 2285 ppid:     1 flags:0x00080080
[288914.308604] Call Trace:
[288914.308605]  __schedule+0x37b/0x8e0
[288914.308608]  schedule+0x6c/0x120
[288914.308611]  percpu_rwsem_wait+0x13f/0x170
[288914.308615]  ? __percpu_rwsem_trylock.part.7+0x70/0x70
[288914.308619]  __percpu_down_read+0x64/0x90
[288914.308622]  cgroup_can_fork+0x1e3/0x220
[288914.308626]  copy_process+0x1232/0x1d30
[288914.308633]  _do_fork+0x8b/0x3a0
[288914.308637]  ? syscall_trace_enter+0x1db/0x300
[288914.308640]  ? __audit_syscall_exit+0x258/0x2c0
[288914.308644]  do_syscall_64+0x87/0x1b0
[288914.308647]  entry_SYSCALL_64_after_hwframe+0x61/0xc6
[288914.308649] RIP: 0033:0x7f4e94bfcdc5
[288914.308652] Code: Unable to access opcode bytes at RIP 0x7f4e94bfcd9b.
[288914.308652] RSP: 002b:00007ffe2b582f98 EFLAGS: 00003206 ORIG_RAX: 0000000000000038
[288914.308654] RAX: ffffffffffffffda RBX: 00007f4e96e5ec40 RCX: 00007f4e94bfcdc5
[288914.308655] RDX: 00007f4e96e5ef10 RSI: 00007f4e96e5cff0 RDI: 00000000003d0f00
[288914.308657] RBP: 00007ffe2b5830f0 R08: 00007f4e96e5ec40 R09: 00007f4e96e5ec40
[288914.308658] R10: 00007f4e96e5ef10 R11: 0000000000003206 R12: 00007ffe2b58303e
[288914.308659] R13: 00007ffe2b58303f R14: 00007ffe2b5830f0 R15: 00007f4e96e5d000
    [...]
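
When many tasks are reported at once, it can help to group the hung task messages by the function each task is blocked in. The following is a minimal sketch, not a supported tool. It assumes the message layout shown in the excerpt above. The names summarize_hung_tasks and strip_suffix are hypothetical helpers introduced for illustration, and dropping compiler-generated suffixes such as ".isra.18" is an assumption made only to ease grouping.

```python
import re
from collections import Counter

# Hung-task header, e.g.
# "[288914.308281] INFO: task systemd:1 blocked for more than 600 seconds."
HEADER = re.compile(
    r"INFO: task (?P<comm>.+):(?P<pid>\d+) blocked for more than \d+ seconds"
)
# Call-trace frame, e.g. "]  schedule+0x6c/0x120" or "]  ? foo+0x30/0x290".
FRAME = re.compile(
    r"\]\s+(?P<unreliable>\? )?(?P<sym>[\w.]+)\+0x[0-9a-f]+/0x[0-9a-f]+"
)
# Scheduler frames sit on top of every blocked task, so they are skipped.
SCHED_FRAMES = {"__schedule", "schedule"}


def strip_suffix(symbol):
    """Drop compiler suffixes (assumption): 'f.isra.18' becomes 'f'."""
    return symbol.split(".")[0]


def summarize_hung_tasks(log_text):
    """Return (comm, pid, blocking_function) for each hung-task report.

    blocking_function is the first reliable, non-scheduler frame of the
    call trace, or None if no such frame was found.
    """
    results = []
    current = None
    for line in log_text.splitlines():
        header = HEADER.search(line)
        if header:
            current = [header.group("comm"), int(header.group("pid")), None]
            results.append(current)
            continue
        frame = FRAME.search(line)
        if current is None or current[2] is not None or frame is None:
            continue
        if frame.group("unreliable"):
            continue  # frames marked "?" are not reliable
        symbol = strip_suffix(frame.group("sym"))
        if symbol not in SCHED_FRAMES:
            current[2] = symbol
    return [tuple(entry) for entry in results]


if __name__ == "__main__":
    # Lines copied from the excerpt above, shortened to the relevant frames.
    excerpt = "\n".join([
        "[288914.308281] INFO: task systemd:1 blocked for more than 600 seconds.",
        "[288914.308299]  __schedule+0x37b/0x8e0",
        "[288914.308307]  ? proc_cgroup_show+0x30/0x290",
        "[288914.308313]  schedule+0x6c/0x120",
        "[288914.308316]  rt_mutex_slowlock_block.isra.18+0x9c/0x170",
        "[288914.308598] INFO: task ovs-vswitchd:2285 blocked for more than 600 seconds.",
        "[288914.308605]  __schedule+0x37b/0x8e0",
        "[288914.308608]  schedule+0x6c/0x120",
        "[288914.308611]  percpu_rwsem_wait+0x13f/0x170",
    ])
    tasks = summarize_hung_tasks(excerpt)
    for comm, pid, func in tasks:
        print(f"{comm} (pid {pid}) blocked in {func}")
    print(Counter(func for _, _, func in tasks))
```

The complete kernel log (for example, saved dmesg output) can be passed to the function as a single string. The resulting counts only show where tasks are waiting; they do not identify which task holds the lock.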

Environment

  • Red Hat OpenShift Container Platform 4.12
    • Red Hat CoreOS (Red Hat Enterprise Linux 8 for Real Time)
