23.07.2014 Views

Lustre 1.6 Operations Manual

Lustre 1.6 Operations Manual

Lustre 1.6 Operations Manual

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Or from the device label at any time:<br />

# e2label /dev/sda<br />

lustre-MDT0000<br />

20.1.2 <strong>Lustre</strong> Timeouts/ Debugging<br />

<strong>Lustre</strong> uses two types of timeouts.<br />

■ LND timeouts that ensure point-to-point communications complete in finite time<br />

in the presence of failures. These timeouts are logged with the S_LND flag set.<br />

They may not be printed as console messages, so you should check the <strong>Lustre</strong> log<br />

for D_NETERROR messages, or enable printing of D_NETERROR messages to the<br />

console (echo + neterror > /proc/sys/lnet/printk).<br />

Congested routers can be a source of spurious LND timeouts. To avoid this,<br />

increase the number of LNET router buffers to reduce back-pressure and/or<br />

increase LND timeouts on all nodes on all connected networks. You should also<br />

consider increasing the total number of LNET router nodes in the system so that<br />

the aggregate router bandwidth matches the aggregate server bandwidth.<br />

■ <strong>Lustre</strong> timeouts that ensure <strong>Lustre</strong> RPCs complete in finite time in the presence of<br />

failures. These timeouts should always be printed as console messages. If <strong>Lustre</strong><br />

timeouts are not accompanied by LNET timeouts, then you need to increase the<br />

lustre timeout on both servers and clients.<br />

Specific <strong>Lustre</strong> timeouts are described below.<br />

/proc/sys/lustre/timeout<br />

This is the time period that a client waits for a server to complete an RPC (default<br />

100s). Servers wait half of this time for a normal client RPC to complete and a<br />

quarter of this time for a single bulk request (read or write of up to 1 MB) to<br />

complete. The client pings recoverable targets (MDS and OSTs) at one quarter of the<br />

timeout, and the server waits one and a half times the timeout before evicting a<br />

client for being "stale."<br />

Note – <strong>Lustre</strong> sends periodic ‘PING’ messages to servers with which it had no<br />

communication for a specified period of time. Any network activity on the<br />

filesystem that triggers network traffic toward servers also works as a health check.<br />

/proc/sys/lustre/ldlm_timeout<br />

This is the time period for which a server will wait for a client to reply to an initial<br />

AST (lock cancellation request) where default is 20s for an OST and 6s for an MDS.<br />

If the client replies to the AST, the server will give it a normal timeout (half of the<br />

client timeout) to flush any dirty data and release the lock.<br />

Chapter 20 <strong>Lustre</strong>Proc 20-3

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!