Rebooting while mdraid device is resyncing can stall reboot in RHEL 5

Solution Verified - Updated -

Issue

  • If the md raid is in state "resync" when the reboot command is run then it stalls for several hours with messages like this :

    "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
    reboot        D ffffffff801546d1     0 10776      1                 307 (NOTLB)
     ffff810371ef7d98 0000000000000086 ffff81037e815068 ffffffff80050b3a
     ffff81037e815068 0000000000000008 ffff81037c5037a0 ffff81067fc18820
     0000020817efb32f 0000000000005802 ffff81037c503988 0000000d00000282
    Call Trace:
     [<ffffffff80050b3a>] blk_put_request+0x2d/0x45
     [<ffffffff880c57d5>] :megaraid_sas:megasas_issue_blocked_cmd+0x7a/0x94
     [<ffffffff800a2e5d>] autoremove_wake_function+0x0/0x2e
     [<ffffffff880c60a1>] :megaraid_sas:megasas_flush_cache+0x69/0x7c
     [<ffffffff880c6255>] :megaraid_sas:megasas_shutdown+0x17/0x25
     [<ffffffff801d064b>] device_shutdown+0x56/0x88
     [<ffffffff8009d545>] kernel_restart+0x9/0x46
     [<ffffffff8009d6cf>] sys_reboot+0x146/0x1c7
     [<ffffffff8003ad50>] hrtimer_try_to_cancel+0x4a/0x53
     [<ffffffff8005a12f>] hrtimer_cancel+0xc/0x16
     [<ffffffff80063ce9>] do_nanosleep+0x47/0x70
     [<ffffffff8005a01c>] hrtimer_nanosleep+0x58/0x118
     [<ffffffff800a4e22>] hrtimer_wakeup+0x0/0x22
     [<ffffffff8001df0b>] sigprocmask+0xb7/0xdb
     [<ffffffff800549ca>] sys_nanosleep+0x4c/0x62
     [<ffffffff8005d116>] system_call+0x7e/0x83
    

The only way to reboot is a hardware reset of the server.

Environment

  • Red Hat Enterprise Linux (RHEL) 5
  • Software (md) RAID backing /
  • One or more megaraid_sas drives in the RAID array

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In

New to Red Hat?

Learn more about Red Hat subscriptions

Using a Red Hat product through a public cloud?

How to access this content