25.06.2015 Views

Administering Platform LSF - SAS

Administering Platform LSF - SAS

Administering Platform LSF - SAS

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Chapter 32<br />

Interactive Jobs with bsub<br />

Types of load indices<br />

Paging rate (pg)<br />

Interactive idle<br />

time (it)<br />

To effectively reduce interference between jobs, correct load indices should be<br />

used properly. Below are examples of a few frequently used parameters.<br />

The paging rate (pg) load index relates strongly to the perceived interactive<br />

performance. If a host is paging applications to disk, the user interface feels<br />

very slow.<br />

The paging rate is also a reflection of a shortage of physical memory. When an<br />

application is being paged in and out frequently, the system is spending a lot<br />

of time performing overhead, resulting in reduced performance.<br />

The paging rate load index can be used as a threshold to either stop sending<br />

more jobs to the host, or to suspend an already running batch job to give<br />

priority to interactive users.<br />

This parameter can be used in different configuration files to achieve different<br />

purposes. By defining paging rate threshold in lsf.cluster.cluster_name,<br />

the host will become busy from LIM’s point of view; therefore, no more jobs<br />

will be advised by LIM to run on this host.<br />

By including paging rate in queue or host scheduling conditions, jobs can be<br />

prevented from starting on machines with a heavy paging rate, or can be<br />

suspended or even killed if they are interfering with the interactive user on the<br />

console.<br />

A job suspended due to pg threshold will not be resumed even if the resume<br />

conditions are met unless the machine is interactively idle for more than<br />

PG_SUSP_IT seconds.<br />

Strict control can be achieved using the idle time (it) index. This index<br />

measures the number of minutes since any interactive terminal activity.<br />

Interactive terminals include hard wired ttys, rlogin and lslogin sessions,<br />

and X shell windows such as xterm. On some hosts, LIM also detects mouse<br />

and keyboard activity.<br />

This index is typically used to prevent batch jobs from interfering with<br />

interactive activities. By defining the suspending condition in the queue as<br />

it50, a job from this queue will be suspended if the machine is not<br />

interactively idle and the paging rate is higher than 50 pages per second.<br />

Furthermore, by defining the resuming condition as it>5 && pg

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!