25.06.2015 Views

Administering Platform LSF - SAS


Chapter 31

Configuring Job Controls

RESUME action

Change a suspended job from the SSUSP, USUSP, or PSUSP state to the RUN state. The default action is to send the signal SIGCONT.

LSF invokes the RESUME action when:

◆ The user or LSF administrator issues a bresume command to the job
◆ Load conditions on the execution host satisfy all of:
  ❖ The resume conditions of the queue, as specified by the RESUME_COND parameter in lsb.queues
  ❖ The scheduling thresholds of the queue and the execution host
◆ A closed run window of the queue opens again
◆ A preempted job finishes

TERMINATE action

Terminate a job. This usually causes the job to change to EXIT status. The default action is to send SIGINT first, then send SIGTERM 10 seconds after SIGINT, then send SIGKILL 10 seconds after SIGTERM. The delay between signals allows user programs to catch the signals and clean up before the job terminates.
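As an illustration of how a user program might use that delay, here is a minimal Python sketch of a job that catches SIGINT and SIGTERM and stops at the next safe point. The names `ShutdownFlag` and `run_job` are hypothetical and are not part of LSF; the sketch assumes a UNIX execution host and that the handlers are installed from the program's main thread.

```python
import signal
import time


class ShutdownFlag:
    """Remembers that a catchable termination signal has arrived."""

    def __init__(self):
        self.requested = False
        self.signals_seen = []

    def install(self):
        # SIGKILL cannot be caught, so only SIGINT and SIGTERM are handled.
        for sig in (signal.SIGINT, signal.SIGTERM):
            signal.signal(sig, self._handle)

    def _handle(self, signum, frame):
        # Keep the handler small: record the signal and let the main loop react.
        self.requested = True
        self.signals_seen.append(signal.Signals(signum).name)


def run_job(flag, max_steps=100, step_seconds=0.01):
    """Hypothetical work loop; returns the number of steps completed."""
    steps = 0
    while steps < max_steps and not flag.requested:
        time.sleep(step_seconds)  # stand-in for one unit of real work
        steps += 1
    # Clean-up (flush output, remove temporary files) would go here.
    # It has to finish before the uncatchable SIGKILL arrives.
    return steps
```

A job script would create one `ShutdownFlag`, call `install()` before starting work, and pass the flag to its work loop. Polling a flag rather than cleaning up inside the handler keeps the handler short and lets the loop stop at a consistent point.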

To override the 10-second interval, use the parameter JOB_TERMINATE_INTERVAL in the lsb.params file. See the Platform LSF Reference for information about the lsb.params file.
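The effect of the interval on the default signal sequence can be worked through with a small Python sketch. The function `termination_schedule` is hypothetical and only models the timeline; it assumes that the configured interval applies to both gaps (SIGINT to SIGTERM, and SIGTERM to SIGKILL), which is how the description above reads.

```python
def termination_schedule(interval_seconds=10):
    """Return (signal name, seconds after the action starts) pairs.

    Assumption: the same interval separates each pair of consecutive signals.
    """
    if interval_seconds < 0:
        raise ValueError("interval must not be negative")
    return [
        ("SIGINT", 0),
        ("SIGTERM", interval_seconds),
        ("SIGKILL", 2 * interval_seconds),
    ]


# Example: the timeline for an interval of 30 seconds.
for name, offset in termination_schedule(30):
    print(f"{name} at t+{offset}s")
```

Under that assumption, a job has twice the interval between the first catchable signal and SIGKILL, so clean-up code should be sized to fit within that window.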

LSF invokes the TERMINATE action when:

◆ The user or LSF administrator issues a bkill or brequeue command to the job
◆ The TERMINATE_WHEN parameter in the queue definition (lsb.queues) causes a SUSPEND action to be redirected to TERMINATE
◆ The job reaches its CPULIMIT, MEMLIMIT, RUNLIMIT or PROCESSLIMIT

If the execution of an action is in progress, no further actions are initiated unless it is the TERMINATE action. A TERMINATE action is issued for all job states except PEND.
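One way to read this rule is as a small decision function. The Python sketch below is an interpretation of the paragraph above, not LSF's implementation; the function name `may_initiate` and its arguments are hypothetical, and it makes no claim about state checks for actions other than TERMINATE.

```python
def may_initiate(requested_action, job_state, action_in_progress=None):
    """Return True if a newly requested job control action would be started."""
    if requested_action == "TERMINATE":
        # TERMINATE is issued in every job state except PEND, and it is
        # not blocked by an action that is already executing.
        return job_state != "PEND"
    # Any other action is held back while another action is still executing.
    return action_in_progress is None
```

For example, `may_initiate("RESUME", "USUSP", action_in_progress="SUSPEND")` is false, while `may_initiate("TERMINATE", "USUSP", action_in_progress="SUSPEND")` is true.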

Windows job control actions

On Windows, actions equivalent to the UNIX signals have been implemented to do the default job control actions. Job control messages replace the SIGINT and SIGTERM signals, but only customized applications will be able to process them. Termination is implemented by the TerminateProcess() system call.

See Using the Platform LSF SDK for more information about LSF signal handling on Windows.
