23.07.2014 Views

Lustre 1.6 Operations Manual

Lustre 1.6 Operations Manual

Lustre 1.6 Operations Manual

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

21.3 Options to Format MDT and OST<br />

Filesystems<br />

The backing filesystems on an MDT and OSTs are independent of one another, so the<br />

formatting parameters for them should not be same. The size of the MDT backing<br />

filesystem depends solely on how many inodes you want in the entire <strong>Lustre</strong><br />

filesystem. This is not related to the size of the aggregate OST space.<br />

21.3.1 Planning for Inodes<br />

Each time you create a file on a <strong>Lustre</strong> filesystem, it consumes one inode on the MDT<br />

and one inode for each OST object that the file is striped over. Normally, it is based<br />

on the default stripe count option -c; but this may change on a per-file basis. In<br />

ext3/ldiskfs filesystems, inodes are pre-allocated, so creating a new file does not<br />

consume any of the free blocks. However, this also means that the format-time<br />

options should be conservative, as it is not possible to increase the number of inodes<br />

after the filesystem is formatted. If there is a shortage of inodes or space on the OSTs,<br />

it is possible to add OSTs to the filesystem.<br />

To be on the safe side, plan for 4 KB per inode on the MDT (the default). For the<br />

OST, the amount of space taken by each object depends entirely upon the usage<br />

pattern of the users/applications running on the system. <strong>Lustre</strong>, by necessity,<br />

defaults to a very conservative estimate for the object size (16 KB per object). You can<br />

almost always increase this value for filesystem installations. Many <strong>Lustre</strong><br />

filesystems have average file sizes over 1 MB per object.<br />

21.3.2 Calculating MDT Size<br />

When calculating the MDT size, the only important factor is the average size of files<br />

to be stored in the filesystem. If the average file size is, for example, 5 MB and you<br />

have 100 TB of usable OST space, then you need at least (100 TB * 1024 GB/TB * 1024<br />

MB/GB / 5 MB/inode) = 20 million inodes. Sun recommends that you have twice<br />

the minimum (40 million inodes in this example). At the default 4 KB per inode, this<br />

works out to only 160 GB of space for the MDT.<br />

Conversely, if you have a very small average file size (4 KB for example), <strong>Lustre</strong> is<br />

not very efficient. This is because you consume as much space on the MDT as on the<br />

OSTs. This is not a very common configuration for <strong>Lustre</strong>.<br />

21-4 <strong>Lustre</strong> <strong>1.6</strong> <strong>Operations</strong> <strong>Manual</strong> • September 2008

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!