25.06.2015 Views

Administering Platform LSF - SAS

Administering Platform LSF - SAS

Administering Platform LSF - SAS

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Performance Tuning for Interactive Batch Jobs<br />

Performance Tuning for Interactive Batch Jobs<br />

Types of load conditions<br />

Scheduling<br />

conditions<br />

Suspending<br />

conditions<br />

Resuming<br />

conditions<br />

<strong>LSF</strong> is often used on systems that support both interactive and batch users. On<br />

one hand, users are often concerned that load sharing will overload their<br />

workstations and slow down their interactive tasks. On the other hand, some<br />

users want to dedicate some machines for critical batch jobs so that they have<br />

guaranteed resources. Even if all your workload is batch jobs, you still want to<br />

reduce resource contentions and operating system overhead to maximize the<br />

use of your resources.<br />

Numerous parameters can be used to control your resource allocation and to<br />

avoid undesirable contention.<br />

Since interferences are often reflected from the load indices, <strong>LSF</strong> responds to<br />

load changes to avoid or reduce contentions. <strong>LSF</strong> can take actions on jobs to<br />

reduce interference before or after jobs are started. These actions are triggered<br />

by different load conditions. Most of the conditions can be configured at both<br />

the queue level and at the host level. Conditions defined at the queue level<br />

apply to all hosts used by the queue, while conditions defined at the host level<br />

apply to all queues using the host.<br />

These conditions, if met, trigger the start of more jobs. The scheduling<br />

conditions are defined in terms of load thresholds or resource requirements.<br />

At the queue level, scheduling conditions are configured as either resource<br />

requirements or scheduling load thresholds, as described in lsb.queues. At<br />

the host level, the scheduling conditions are defined as scheduling load<br />

thresholds, as described in lsb.hosts.<br />

These conditions affect running jobs. When these conditions are met, a<br />

SUSPEND action is performed to a running job.<br />

At the queue level, suspending conditions are defined as STOP_COND as<br />

described in lsb.queues or as suspending load threshold. At the host level,<br />

suspending conditions are defined as stop load threshold as described in<br />

lsb.hosts.<br />

These conditions determine when a suspended job can be resumed. When<br />

these conditions are met, a RESUME action is performed on a suspended job.<br />

At the queue level, resume conditions are defined as by RESUME_COND in<br />

lsb.queues, or by the loadSched thresholds for the queue if RESUME_COND<br />

is not defined.<br />

404<br />

<strong>Administering</strong> <strong>Platform</strong> <strong>LSF</strong>

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!