23.07.2014 Views

Lustre 1.6 Operations Manual

Lustre 1.6 Operations Manual

Lustre 1.6 Operations Manual

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Description<br />

If an MDS or an OST becomes corrupt, you can run a distributed check on the<br />

filesystem to determine what sort of problems exist.<br />

1. Run 'e2fsck -f' on the individual MDS / OST that had problems to fix any local<br />

filesystem damage.<br />

It is a very good idea to run this e2fsck under "script" so you have a log of<br />

whatever changes it made to the filesystem (in case this is needed later). After<br />

this is complete, you can bring the filesystem up if necessary to reduce the outage<br />

window.<br />

2. Run a full e2fsck of the MDS to create a database for lfsck.<br />

The -n option is critical for a mounted filesystem, otherwise you might corrupt<br />

your filesystem. The mdsdb file can grow fairly large, depending on the number<br />

of files in the filesystem (10 GB or more for millions of files, though the actual file<br />

size is larger because the file is sparse). It is fastest if this is written to a local<br />

filesystem because of the seeking and small writes. Depending on the number of<br />

files, this step can take several hours to complete. In the following example,<br />

/tmp/mdsdb is the database file.<br />

e2fsck -n -v --mdsdb /tmp/mdsdb /dev/{mdsdev}<br />

Chapter 28 User Utilities (man1) 28-13

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!