tomcat resource frequently timing out on start and showing as stopped in pcs output in a RHEL 7 High Availability cluster with pacemaker
Issue
lrmdshows timeouts for atomcatresource during start operations, andpcsreports it as stopped
Oct 21 05:35:33 node1 lrmd[1769]: warning: child_timeout_callback: tomcat_start_0 process (PID 459) timed out
Oct 21 05:35:33 node1 lrmd[1769]: warning: operation_finished: tomcat_start_0:459 - timed out after 60000ms
Oct 21 05:35:33 node1 crmd[1772]: error: process_lrm_event: LRM operation tomcat_start_0 (222) Timed Out (timeout=60000ms)
pcsshows a "Failed action" for atomcatresource with "status=Timed Out"
Failed actions:
tomcat_start_0 on node1.example.com 'unknown error' (1): call=222, status=Timed Out, last-rc-change='Tue Oct 21 05:34:33 2014', queued=60002ms, exec=0ms
- Periodically,
tomcatseems to disassociate itself frompacemaker.tomcatis seen as running inpsoutput but it shows stopped inpcsstatus. Sometimes,tomcatcompletely fails.
Environment
- Red Hat Enterprise Linux (RHEL) 7 with the High Availabililty Add On
pacemaker- One or more
ocf:heartbeat:tomcatresources defined in the CIB
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.