25.06.2015 Views

Administering Platform LSF - SAS

Administering Platform LSF - SAS

Administering Platform LSF - SAS

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Job Scheduling and Dispatch<br />

Job Scheduling and Dispatch<br />

Scheduling policies<br />

First-Come, First-<br />

Served (FCFS)<br />

scheduling<br />

Fairshare<br />

scheduling and<br />

other policies<br />

Scheduling and dispatch<br />

Submitted jobs sit in queues until they are scheduled and dispatched to a host<br />

for execution. When a job is submitted to <strong>LSF</strong>, many factors control when and<br />

where the job starts to run:<br />

◆ Active time window of the queue or hosts<br />

◆ Resource requirements of the job<br />

◆ Availability of eligible hosts<br />

◆ Various job slot limits<br />

◆ Job dependency conditions<br />

◆ Fairshare constraints<br />

◆ Load conditions<br />

By default, jobs in a queue are dispatched in first-come, first-served (FCFS)<br />

order. This means that jobs are dispatched according to their order in the<br />

queue. Since jobs are ordered according to job priority, this does not<br />

necessarily mean that jobs will be dispatched in the order of submission. The<br />

order of jobs in the queue can also be modified by the user or administrator.<br />

If a fairshare scheduling policy has been specified for the queue or if host<br />

partitions have been configured, jobs are dispatched in accordance with these<br />

policies instead. To solve diverse problems, <strong>LSF</strong> allows multiple scheduling<br />

policies in the same cluster. <strong>LSF</strong> has several queue scheduling policies such as<br />

exclusive, preemptive, fairshare, and hierarchical fairshare.<br />

Jobs are scheduled at regular intervals (5 seconds by default, configured by the<br />

parameter JOB_SCHEDULING_INTERVAL in lsb.params). Once jobs are<br />

scheduled, they can be immediately dispatched to hosts.<br />

To prevent overloading any host, <strong>LSF</strong> waits a short time between dispatching<br />

jobs to the same host. The delay is configured by the JOB_ACCEPT_INTERVAL<br />

parameter in lsb.params or lsb.queues; the default is 60 seconds. If<br />

JOB_ACCEPT_INTERVAL is set to zero, more than one job can be started on a<br />

host at a time.<br />

46<br />

<strong>Administering</strong> <strong>Platform</strong> <strong>LSF</strong>

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!