In MRG 1.2, HA Schedd primary and failover are running at same time

Solution Verified - Updated -

Issue

  • Failover schedds startup despite the primary schedds still running.
  • Using NFS for locks.
  • Error messages similar to these are found:
ProcAPI: Unexpected short scan on /proc/14873/stat, errno: 3.
condor_write(): Socket closed when trying to write 1842 bytes to collector <collector hostname here>, fd is 8
Buf::write(): condor_write() failed
GetLock warning: Expired lock found '/condor/share/schedd/SCHEDD_S0.lock'
Started process "/usr/sbin/condor_schedd", pid and pgroup = 18937

Environment

  • MRG Grid 1.2

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase of over 48,000 articles and solutions.

Current Customers and Partners

Log in for full access

Log In
Close

Welcome! Check out the Getting Started with Red Hat page for quick tours and guides for common tasks.