25.06.2015 Views

Administering Platform LSF - SAS

Administering Platform LSF - SAS

Administering Platform LSF - SAS

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Reserving Processors<br />

Reserving Processors<br />

About processor reservation<br />

When parallel jobs have to compete with sequential jobs for resources, job<br />

slots that become available are likely to be taken immediately by a sequential<br />

job. Parallel jobs need multiple job slots to be available before they can be<br />

dispatched. If the cluster is always busy, a large parallel job could be pending<br />

indefinitely. The more processors a parallel job requires, the worse the<br />

problem is.<br />

Processor reservation solves this problem by reserving job slots as they become<br />

available, until there are enough reserved job slots to run the parallel job.<br />

You might want to configure processor reservation if your cluster has a lot of<br />

sequential jobs that compete for resources with parallel jobs.<br />

How processor reservation works<br />

Processor reservation is disabled by default.<br />

If processor reservation is enabled, and a parallel job cannot be dispatched<br />

because there are not enough job slots to satisfy its minimum processor<br />

requirements, the job slots that are currently available will be reserved and<br />

accumulated.<br />

A reserved job slot is unavailable to any other job. To avoid deadlock situations<br />

in which the system reserves job slots for multiple parallel jobs and none of<br />

them can acquire sufficient resources to start, a parallel job will give up all its<br />

reserved job slots if it has not accumulated enough to start within a specified<br />

time. The reservation time starts from the time the first slot is reserved. When<br />

the reservation time expires, the job cannot reserve any slots for one<br />

scheduling cycle, but then the reservation process can begin again.<br />

Configuring processor reservation<br />

Syntax<br />

To enable processor reservation, set SLOT_RESERVE in lsb.queues and<br />

specify the reservation time (a job cannot hold any reserved slots after its<br />

reservation time expires).<br />

SLOT_RESERVE=MAX_RESERVE_TIME[n].<br />

where n is an integer by which to multiply MBD_SLEEP_TIME.<br />

MBD_SLEEP_TIME is defined in lsb.params; the default value is 60 seconds.<br />

Example<br />

Begin Queue<br />

.<br />

PJOB_LIMIT=1<br />

SLOT_RESERVE = MAX_RESERVE_TIME[5]<br />

.<br />

End Queue<br />

In this example, if MBD_SLEEP_TIME is 60 seconds, a job can reserve job slots<br />

for 5 minutes. If MBD_SLEEP_TIME is 30 seconds, a job can reserve job slots<br />

for 5 *30= 150 seconds, or 2.5 minutes.<br />

450<br />

<strong>Administering</strong> <strong>Platform</strong> <strong>LSF</strong>

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!