Why did the server crash
This server was recently upgraded to rhel6.4 (last Friday), and it crashed twice today, with the same symptom. Will upload sosreport and vmcore soon.
What can be done to prevent this situation
Overall there is general consternation about an enterprise (class?) OS allowing users to DOS via (simple) mallocing. Yes there is a sloppy OOM killer as an option, as well as disallowing/controlling over-commitment, but those solutions have shortcomings that kill their utility in our environment. Given this was RHEL6, I was hoping for a little bit of resiliency for the system to hobble along, at least for awhile, such that an operator could login and kill offending processes, if need be.
Red Hat Enterprise Linux
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.