Apparent lock-up under high workload with the real-time kernel

Solution Verified - Updated -

Issue

  • Development system
  • We run all processes that are normally distributed to more machines (for performance reason) on this one machine.
  • The system is heavily loaded, but is running quite well for most of the time. Then suddenly the load average rushed in to the 1000-3000 rage and the machine gets unusable. When that happens even a normal ps hangs and one of our nagios shell script executes a simple ps and than hangs forever. Also the later running nagios scripts keep hanging and this way creating the huge amount of processes, but this is only because we didn't catch the hanging server fast enough. If we detect it fast enough the are not so many nagios processes, but still the machine hangs.

Environment

  • Red Hat Enterprise MRG Realtime 1.x and 2.x
  • kernel-rt

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase of over 48,000 articles and solutions.

Current Customers and Partners

Log in for full access

Log In
Close

Welcome! Check out the Getting Started with Red Hat page for quick tours and guides for common tasks.