23.07.2014 Views

Lustre 1.6 Operations Manual

Lustre 1.6 Operations Manual

Lustre 1.6 Operations Manual

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

ME/MD Queue Length<br />

The ptllnd uses a single portal set by the portal module parameter (default value of<br />

9) for both message and bulk buffers. Message buffers are always attached with<br />

PTL_INS_AFTER and match anything sent with "message" matchbits. Bulk buffers<br />

are always attached with PTL_INS_BEFORE and match only specific matchbits for<br />

that particular bulk transfer.<br />

This scheme assumes that the majority of ME / MDs posted are for "message"<br />

buffers, and that the overhead of searching through the preceding "bulk" buffers is<br />

acceptable. Since the number of "bulk" buffers posted at any time is also dependent<br />

on the bulk transfer breakpoint set by max_msg_size, this seems like an issue<br />

worth measuring at scale.<br />

TX Descriptors<br />

The ptllnd has a pool of so-called "tx descriptors", which it uses not only for<br />

outgoing messages, but also to hold state for bulk transfers requested by incoming<br />

messages. This pool should scale with the total number of peers.<br />

To enable the building of the Portals LND (ptllnd.ko) configure with this option:<br />

./configure --with-portals=<br />

Variable<br />

ntx<br />

(256)<br />

concurrent_peers<br />

(1152)<br />

peer_hash_table_size<br />

(101)<br />

cksum<br />

(0)<br />

timeout<br />

(50)<br />

portal<br />

(9)<br />

rxb_npages<br />

(64 * #cpus)<br />

Description<br />

Total number of messaging descriptors.<br />

Maximum number of concurrent peers. Peers that attempt to<br />

connect beyond the maximum are not allowed.<br />

Number of hash table slots for the peers. This number should scale<br />

with concurrent_peers. The size of the peer hash table is set by the<br />

module parameter peer_hash_table_size which defaults to a value<br />

of 101. This number should be prime to ensure the peer hash table<br />

is populated evenly. It is advisable to increase this value to 1001 for<br />

~10000 peers.<br />

Set to non-zero to enable message (not RDMA) checksums for<br />

outgoing packets. Incoming packets are always check-summed if<br />

necessary, independent of this value.<br />

Amount of time (in seconds) that a request can linger in a peersactive<br />

queue before the peer is considered dead.<br />

Portal ID to use for the ptllnd traffic.<br />

Number of pages in an RX buffer.<br />

31-16 <strong>Lustre</strong> <strong>1.6</strong> <strong>Operations</strong> <strong>Manual</strong> • September 2008

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!