23.07.2014 Views

Lustre 1.6 Operations Manual


27.4 Failures Running a Client and OST on the Same Machine

There are inherent problems if a client and OST share the same machine (and the same memory pool). When the client tries to relieve memory pressure, it needs memory to be available to the OST. If the client is experiencing memory pressure, then so is the OST. Because both draw on a single memory pool, the OST may not get the memory it needs to help the client free memory, and the result is deadlock.

Running a client and an OST on the same machine can cause these failures:

■ If the client holds dirty filesystem data in memory and is under memory pressure, a kernel thread flushes the dirty pages to the filesystem, which writes to the local OST. To complete the write, the OST needs to allocate memory. That allocation blocks while waiting for the same kernel thread to complete the write and free some memory. This is a deadlock condition.

■ If the node hosting both a client and an OST crashes, the OST waits for the client that was mounted on that node to recover. However, because that client has crashed as well, the OST considers it to be a new client and blocks it from mounting until recovery completes.

As a result, running an OST and a client on the same machine can cause a double failure and prevent a complete recovery.
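The memory deadlock described in this section can be illustrated with a small toy model. This is only a sketch under simplifying assumptions and is not Lustre code: the names `MemoryPool` and `flush_dirty_pages`, and the page counts used, are hypothetical and chosen for illustration. The model assumes that dirty pages stay pinned until the OST completes the write, and that the OST must allocate a few pages before it can complete it.

```python
from dataclasses import dataclass


@dataclass
class MemoryPool:
    """Hypothetical fixed-size pool of pages on one machine."""
    total_pages: int
    used_pages: int = 0

    @property
    def free_pages(self) -> int:
        return self.total_pages - self.used_pages


def flush_dirty_pages(client_pool, ost_pool, dirty_pages, ost_alloc_pages):
    """Model one flush of the client's dirty pages to an OST.

    Returns "completed", "deadlock", or "ost-blocked".
    """
    # Dirty data is pinned in client memory until the write completes.
    client_pool.used_pages += dirty_pages

    if ost_pool.free_pages >= ost_alloc_pages:
        # The OST can allocate, the write completes, dirty pages are freed.
        client_pool.used_pages -= dirty_pages
        return "completed"

    if ost_pool is client_pool:
        # Same machine, same pool: the OST waits for memory that is only
        # freed once the write it is servicing has finished.
        return "deadlock"

    # Separate machine: the OST is short of memory for its own reasons,
    # which is outside this model.
    return "ost-blocked"


# Client and OST on the same machine share one pool of 100 pages.
shared = MemoryPool(total_pages=100)
same_machine = flush_dirty_pages(shared, shared, dirty_pages=98, ost_alloc_pages=4)

# Client and OST on separate machines each have their own pool.
client = MemoryPool(total_pages=100)
ost = MemoryPool(total_pages=100)
separate_machines = flush_dirty_pages(client, ost, dirty_pages=98, ost_alloc_pages=4)

print(same_machine)       # deadlock
print(separate_machines)  # completed
```

In the shared case only 2 pages remain free once 98 are dirty, so the 4-page OST allocation can never be satisfied. With separate pools the same workload completes, because the OST allocation does not compete with the client's dirty pages.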

27.5 Improving Lustre Metadata Performance While Using Large Directories

Metadata performance while using large directories can be improved by:

■ Adding more RAM to the MDS: more memory translates into bigger caches, thereby increasing metadata performance.

■ Patching the core kernel on the MDS with the 3G/1G patch (if not running a 64-bit kernel), which increases the available kernel address space. This translates into support for bigger caches on the MDS.
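Before acting on either suggestion, it can help to check how much RAM the MDS has and whether its kernel is already 64-bit. The following is a minimal sketch, not a Lustre tool. It assumes a Linux MDS where `/proc/meminfo` contains a `MemTotal:` line and where `platform.machine()` reflects the running kernel's architecture. The helper names and the list of 64-bit machine strings are assumptions made for illustration, and the list is not exhaustive.

```python
import platform

# Assumed, non-exhaustive set of machine names that indicate a 64-bit kernel.
MACHINES_64BIT = {"x86_64", "amd64", "ia64", "ppc64", "ppc64le", "aarch64", "s390x"}


def parse_mem_total_kib(meminfo_text):
    """Return the MemTotal value (in kB as reported) or None if absent."""
    for line in meminfo_text.splitlines():
        if line.startswith("MemTotal:"):
            return int(line.split()[1])
    return None


def is_64bit_kernel(machine):
    """Heuristic check based on the machine string (e.g. from uname -m)."""
    return machine in MACHINES_64BIT


def address_space_advice(machine):
    """Summarize whether the 3G/1G patch suggestion is relevant."""
    if is_64bit_kernel(machine):
        return "64-bit kernel: the 3G/1G patch does not apply"
    return "not a known 64-bit kernel: the 3G/1G patch may apply"


if __name__ == "__main__":
    try:
        with open("/proc/meminfo") as f:
            mem_total = parse_mem_total_kib(f.read())
    except OSError:
        mem_total = None  # not Linux, or /proc is unavailable
    print("MemTotal (kB):", mem_total)
    print(address_space_advice(platform.machine()))
```

Run on the MDS itself, this prints the total memory and a one-line note on the address-space suggestion. Because the architecture check is a heuristic, confirm the result with `uname -m` before deciding whether to patch the kernel.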

Chapter 27 Lustre Operating Tips 27-5
