23.07.2014 Views

Lustre 1.6 Operations Manual


Parameter    Description

at_early_margin
Sets how far before the deadline Lustre sends an early reply. Default value is 5 †.

at_extra
Sets the incremental amount of time that a server asks for with each early reply. The server does not know how much time the RPC will take, so it asks for a fixed value. Default value is 30 ‡. When a server finds a queued request about to time out (and needs to send an early reply out), the server adds the at_extra value (up to its estimate). If the time expires, the Lustre client enters recovery status and reconnects to restore normal status.
If you see multiple early replies for the same RPC asking for multiple 30-second increases, change the at_extra value to a larger number to cut down on the number of early replies sent and, therefore, on network load.

ldlm_enqueue_min
Sets the minimum lock enqueue time. Default value is 100. The ldlm_enqueue time is the larger of two values: the measured enqueue estimate (influenced by the at_min and at_max parameters) multiplied by a weighting factor, and the ldlm_enqueue_min setting. Previously, LDLM lock enqueue timeouts were based on the obd_timeout value; now they have a dedicated minimum value. Lock enqueue timeouts increase as the measured enqueue times increase (similar to adaptive timeouts).

* In future releases, the default will be 600 (adaptive timeouts will be enabled).

† This default was chosen as a reasonable time in which to send a reply from the point at which it was sent.

‡ This default was chosen as a balance between sending too many early replies for the same RPC and overestimating the actual completion time.
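The arithmetic behind the at_extra and ldlm_enqueue_min descriptions can be sketched in a few lines of shell. This is only an illustration, not the Lustre implementation. The function names are hypothetical, the weighting factor is passed in as an assumed integer because its real value is not given here, and the early-reply count is a simplification that ignores at_early_margin and the cap at the server's estimate.

```shell
#!/bin/sh
# Illustrative arithmetic only; this does not query or change a Lustre node.

# ldlm_enqueue_timeout ESTIMATE WEIGHT MIN
# Prints max(ESTIMATE * WEIGHT, MIN). WEIGHT is a hypothetical integer
# stand-in for the weighting factor, whose real value is not given here.
ldlm_enqueue_timeout() {
    scaled=$(( $1 * $2 ))
    if [ "$scaled" -gt "$3" ]; then
        echo "$scaled"
    else
        echo "$3"
    fi
}

# early_replies_needed OVERRUN AT_EXTRA
# Prints how many early replies cover OVERRUN extra seconds when each one
# asks for AT_EXTRA more seconds (rounded up). Simplified: ignores
# at_early_margin and the cap at the server's estimate.
early_replies_needed() {
    echo $(( ($1 + $2 - 1) / $2 ))
}

ldlm_enqueue_timeout 40 2 100    # scaled estimate (80) is below the minimum
ldlm_enqueue_timeout 70 2 100    # scaled estimate (140) exceeds the minimum
early_replies_needed 90 30       # at_extra=30: several early replies
early_replies_needed 90 90       # larger at_extra: fewer early replies
```

Under these assumptions, raising at_extra from 30 to 90 reduces the early replies for a 90-second overrun from three to one, which is the reduction in network load described for at_extra.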

In Lustre 1.6.5, adaptive timeouts are disabled by default.² To enable adaptive timeouts, do one of the following:

■ At compile time, rebuild Lustre with:

--enable-adaptive-timeouts

■ At run time, set at_max to 600 on all nodes:

$ echo 600 > /sys/module/ptlrpc/at_max

■ In modprobe.conf, add:

options ptlrpc at_max=600

The modprobe.conf line should be added on all nodes before Lustre modules are loaded.

To disable adaptive timeouts, at run time, set at_max to 0 on all nodes.<br />

$ echo 0 > /sys/module/ptlrpc/at_max<br />
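To confirm which state a node is in after one of the changes above, a small check along the following lines may help. It is a sketch that relies only on the rule stated here: an at_max of 0 disables adaptive timeouts and a non-zero value such as 600 enables them. The function name at_status and the AT_MAX_FILE override are hypothetical, and the default path is copied from the commands above and may differ on other releases.

```shell
#!/bin/sh
# at_status FILE
# Reads an at_max value from FILE and prints whether adaptive timeouts
# would be enabled (non-zero) or disabled (0), per the rule above.
at_status() {
    value=$(cat "$1") || return 1
    if [ "$value" -eq 0 ]; then
        echo "disabled (at_max=$value)"
    else
        echo "enabled (at_max=$value)"
    fi
}

# Path taken from the commands above; it may differ on other releases.
AT_MAX_FILE=${AT_MAX_FILE:-/sys/module/ptlrpc/at_max}
if [ -r "$AT_MAX_FILE" ]; then
    at_status "$AT_MAX_FILE"
else
    echo "cannot read $AT_MAX_FILE (are the Lustre modules loaded?)" >&2
fi
```

Because the setting must match on all nodes, the same check would need to be run on each node in turn.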

2. In Lustre 1.8, adaptive timeouts will be enabled by default.

Chapter 20 LustreProc 20-7
