Production nodes become unresponsive
Issue
- Tomcat nodes running RHEL 6.4 on VMware ESX 5 enter a cascading failure state: once one node becomes unresponsive, many others start to fail. The nodes are clustered using Apache and mod_jk.
- Attempts to connect to one of the servers during the event failed. The server responded to ping and accepted an SSH connection, but immediately closed that connection without any message.
- Logging in at the system console and entering a login name resulted in a hang, with no password prompt.
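
The SSH symptom above (connection accepted, then closed with no message) can be told apart from an unreachable host with a simple TCP probe. The following is a minimal sketch, not part of the original report: the function name `probe_ssh_banner` and its result labels are hypothetical, and it assumes the connection is dropped before the server sends its SSH identification string (the line beginning with `SSH-`). It also simplifies by expecting that string to be the first data received.

```python
import socket


def probe_ssh_banner(host, port=22, timeout=5.0):
    """Classify how an SSH port responds to a plain TCP connection.

    Hypothetical helper for illustration. Returns one of:
    'unreachable', 'closed_without_banner', 'no_banner_timeout',
    'banner_received', 'unexpected_data'.
    """
    try:
        # TCP handshake; fails if the port is closed or the host is down.
        sock = socket.create_connection((host, port), timeout=timeout)
    except OSError:
        return "unreachable"

    with sock:
        sock.settimeout(timeout)
        try:
            # A healthy SSH server sends its identification string first.
            data = sock.recv(256)
        except socket.timeout:
            # Connection stays open but the server never speaks.
            return "no_banner_timeout"
        except OSError:
            # For example, a connection reset by the peer.
            return "closed_without_banner"

    if not data:
        # Empty read: the peer closed the connection without sending anything.
        return "closed_without_banner"
    if data.startswith(b"SSH-"):
        return "banner_received"
    return "unexpected_data"


if __name__ == "__main__":
    import sys

    target = sys.argv[1] if len(sys.argv) > 1 else "127.0.0.1"
    print(target, probe_ssh_banner(target))
```

Under these assumptions, a node in the state described here would be expected to report `closed_without_banner` while still answering ping, whereas a healthy node reports `banner_received`.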
Environment
- Red Hat Enterprise Linux, version 6.4
- VMware ESX 5
- Apache with mod_jk
