In MRG 1.2, HA Schedd primary and failover are running at same time

Solution Verified - Updated -

Issue

  • Failover schedds startup despite the primary schedds still running.
  • Using NFS for locks.
  • Error messages similar to these are found:
ProcAPI: Unexpected short scan on /proc/14873/stat, errno: 3.
condor_write(): Socket closed when trying to write 1842 bytes to collector <collector hostname here>, fd is 8
Buf::write(): condor_write() failed
GetLock warning: Expired lock found '/condor/share/schedd/SCHEDD_S0.lock'
Started process "/usr/sbin/condor_schedd", pid and pgroup = 18937

Environment

  • MRG Grid 1.2

Subscriber exclusive content

A Red Hat subscription provides unlimited access to our knowledgebase, tools, and much more.

Current Customers and Partners

Log in for full access

Log In

New to Red Hat?

Learn more about Red Hat subscriptions

Using a Red Hat product through a public cloud?

How to access this content