25.06.2015 Views

Administering Platform LSF - SAS

Administering Platform LSF - SAS

Administering Platform LSF - SAS

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

The Checkpoint Directory<br />

The Checkpoint Directory<br />

A checkpoint directory must be specified for every checkpointable job and is<br />

used to store the files to restart a job. The directory must be writable by the<br />

job owner. To restart the job on another host (job migration), the directory<br />

must be accessible by both hosts. <strong>LSF</strong> does not delete the checkpoint files;<br />

checkpoint file maintenance is the user’s responsibility.<br />

<strong>LSF</strong> writes the checkpoint file in a directory named with the job ID of the job<br />

being checkpointed under the checkpoint directory. This allows <strong>LSF</strong> to<br />

checkpoint multiple jobs to the same checkpoint directory. For example, when<br />

you specify a checkpoint directory called my_dir and when job 123 is<br />

checkpointed, <strong>LSF</strong> will save the checkpoint file in:<br />

my_dir/123/<br />

When <strong>LSF</strong> restarts a checkpointed job, it renames the checkpoint directory<br />

using the job ID of the new job and creates a symbolic link from the old<br />

checkpoint directory to the new one. For example, if a job with job ID 123 is<br />

restarted with job ID 456 the checkpoint directory will be renamed to:<br />

my_dir/456/<br />

314<br />

<strong>Administering</strong> <strong>Platform</strong> <strong>LSF</strong>

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!