In MRG 1.2, HA Schedd primary and failover are running at same time
Issue
- Failover schedds startup despite the primary schedds still running.
- Using NFS for locks.
- Error messages similar to these are found:
ProcAPI: Unexpected short scan on /proc/14873/stat, errno: 3.
condor_write(): Socket closed when trying to write 1842 bytes to collector <collector hostname here>, fd is 8
Buf::write(): condor_write() failed
GetLock warning: Expired lock found '/condor/share/schedd/SCHEDD_S0.lock'
Started process "/usr/sbin/condor_schedd", pid and pgroup = 18937
Environment
- MRG Grid 1.2
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase of over 48,000 articles and solutions.
Welcome! Check out the Getting Started with Red Hat page for quick tours and guides for common tasks.
