25.06.2015 Views

Administering Platform LSF - SAS

Administering Platform LSF - SAS

Administering Platform LSF - SAS

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Chapter 34<br />

Running Parallel Jobs<br />

Allowing Jobs to Use Reserved Job Slots<br />

About backfill scheduling<br />

How backfilling works<br />

Example<br />

By default, a reserved job slot cannot be used by another job. To make better<br />

use of resources and improve performance of <strong>LSF</strong>, you can configure backfill<br />

scheduling. Backfill scheduling allows other jobs to use the reserved job slots,<br />

as long as the other jobs will not delay the start of another job. Backfilling,<br />

together with processor reservation, allows large parallel jobs to run while not<br />

underutilizing resources.<br />

In a busy cluster, processor reservation helps to schedule large parallel jobs<br />

sooner. However, by default, reserved processors remain idle until the large<br />

job starts. This degrades the performance of <strong>LSF</strong> because the reserved<br />

resources are idle while jobs are waiting in the queue.<br />

Backfill scheduling allows the reserved job slots to be used by small jobs that<br />

can run and finish before the large job starts. This improves the performance<br />

of <strong>LSF</strong> because it increases the utilization of resources.<br />

For backfill scheduling, <strong>LSF</strong> assumes that a job will run until its run limit<br />

expires. Backfill scheduling works most efficiently when all the jobs in the<br />

cluster have a run limit.<br />

Since jobs with a shorter run limit have more chance of being scheduled as<br />

backfill jobs, users who specify appropriate run limits in a backfill queue will<br />

be rewarded by improved turnaround time.<br />

Once the big parallel job has reserved sufficient job slots, <strong>LSF</strong> calculates the<br />

start time of the big job, based on the run limits of the jobs currently running<br />

in the reserved slots. <strong>LSF</strong> cannot backfill if the big job is waiting for a job that<br />

has no run limit defined.<br />

If <strong>LSF</strong> can backfill the idle job slots, only jobs with run limits that expire before<br />

the start time of the big job will be allowed to use the reserved job slots. <strong>LSF</strong><br />

cannot backfill with a job that has no run limit.<br />

<strong>Administering</strong> <strong>Platform</strong> <strong>LSF</strong> 453

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!