In MRG 1.2, HA Schedd primary and failover are running at same time
Issue
- Failover schedds startup despite the primary schedds still running.
- Using NFS for locks.
- Error messages similar to these are found:
ProcAPI: Unexpected short scan on /proc/14873/stat, errno: 3.
condor_write(): Socket closed when trying to write 1842 bytes to collector <collector hostname here>, fd is 8
Buf::write(): condor_write() failed
GetLock warning: Expired lock found '/condor/share/schedd/SCHEDD_S0.lock'
Started process "/usr/sbin/condor_schedd", pid and pgroup = 18937
Environment
- MRG Grid 1.2
Subscriber exclusive content
A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.