How to have a post-processing job in Condor DAGMan run at the end of a batch, regardless of parent job failures?

Solution Verified - Updated -

Issue

Consider a very simple DAG with two levels:

JOB A a.condor
RETRY A 5
JOB B a.condor
RETRY B 5
JOB C a.condor
RETRY C 5
JOB PostPROCESS pp.condor
PARENT A B C CHILD PostPROCESS
  • Jobs A - C are processing jobs: each does something that sometimes fails, hence RETRY 5.
  • PostPROCESS is a post-processing job that aggregates the results from A - C and produces a report.

Some of the jobs A - C may still fail after all 5 retries. The PostPROCESS job should run even in this case.

In other words, PostPROCESS should be just a synchronization point: it is executed regardless of the result of jobs A - C, but only after each of A - C has been given its chance to run (including its retries).

How can one achieve this?
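
One possible approach, given here only as a minimal sketch: declare PostPROCESS as a FINAL node instead of a child of A - C. This assumes a DAGMan version that supports the FINAL keyword, which may not hold for every MRG Grid release. DAGMan starts the FINAL node once the rest of the DAG has either completed or can make no further progress, whether or not earlier nodes failed.

```
# Processing nodes, unchanged: each is retried up to 5 times
JOB A a.condor
RETRY A 5
JOB B a.condor
RETRY B 5
JOB C a.condor
RETRY C 5
# FINAL replaces both the JOB line and the PARENT ... CHILD line for PostPROCESS.
# A FINAL node cannot take part in PARENT/CHILD relationships.
FINAL PostPROCESS pp.condor
```

With this layout the success or failure of the DAG as a whole follows the FINAL node, so pp.condor would have to report any failures of A - C itself if they should remain visible.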

Environment

  • MRG Grid (any version)
