23.07.2014 Views

Lustre 1.6 Operations Manual


Information on the <strong>Lustre</strong> Networking (LNET) protocol<br />

<strong>Lustre</strong> layers the socket LND (socklnd) protocol above TCP/IP. Every LNET<br />

message is an lnet_hdr_t sent in little-endian (LE) byte order, followed by<br />

'payload_length' bytes of opaque payload data. There are four types of messages.<br />

■ PUT - request to send data contained in the payload<br />

■ ACK - response to a PUT with ack_wmd != LNET_WIRE_HANDLE_NONE<br />

■ GET - request to fetch data<br />

■ REPLY - response to a GET with data in the payload<br />

Typically, ACK and GET messages have 0 bytes of payload.<br />
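
The framing described above (a little-endian header followed by 'payload_length' bytes of opaque payload) can be sketched as follows. This is a minimal illustration, not the real wire format: the two-field header, the numeric type codes, and the helper names pack_message and unpack_message are assumptions made for the example, and the actual lnet_hdr_t carries more fields.

```python
import struct

# Hypothetical, simplified header for illustration only; the real
# lnet_hdr_t has more fields. "<" selects little-endian byte order.
HDR = struct.Struct("<II")  # (msg_type, payload_length)

# Illustrative numeric codes; assumed, not taken from the Lustre source.
MSG_TYPES = {0: "ACK", 1: "PUT", 2: "GET", 3: "REPLY"}


def pack_message(msg_type, payload=b""):
    """Build one message: LE header, then the opaque payload bytes."""
    return HDR.pack(msg_type, len(payload)) + payload


def unpack_message(buf):
    """Split one message off the front of buf.

    Returns (type_name, payload, remaining_bytes).
    """
    if len(buf) < HDR.size:
        raise ValueError("truncated header")
    msg_type, payload_length = HDR.unpack_from(buf, 0)
    end = HDR.size + payload_length
    if len(buf) < end:
        raise ValueError("truncated payload")
    return MSG_TYPES.get(msg_type, "UNKNOWN"), buf[HDR.size:end], buf[end:]


# A PUT carrying data, followed by a GET with a 0-byte payload.
stream = pack_message(1, b"hello") + pack_message(2)
name, payload, rest = unpack_message(stream)
```

Because the header states the payload length, a reader can find the start of the next message without inspecting the payload itself.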

Explanation of: '... previously skipped # similar messages' in <strong>Lustre</strong> logs<br />

Unlike syslog, which suppresses only exactly identical lines, <strong>Lustre</strong> suppresses<br />

console messages when there are bursts of messages from the same line of code, even<br />

if they are not sequential. This avoids reporting the same event repeatedly for<br />

different clients, or in cases where two or more messages are repeated.<br />

All messages are kept in the <strong>Lustre</strong> kernel debug log, so running "lctl dk" at that time<br />

would show all of them, provided the debug log has not wrapped.<br />

Printing a large number of messages to the kernel console can dramatically slow<br />

down the system. Because console output happens with IRQs disabled, and the<br />

console itself may be slow, a large number of messages severely impacts overall<br />

system performance.<br />

For example:<br />

<strong>Lustre</strong>Error: 559:0:(genops.c:1292:obd_export_evict_by_nid()) evicting b155f37b-b426-ccc2-f0a9-bfbf00000000 at adminstrative request<br />

<strong>Lustre</strong>Error: 559:0:(genops.c:1292:obd_export_evict_by_nid()) previously skipped 2 similar messages<br />

In this case, messages count as 'similar' when they come from the same line of<br />

source code, whether or not their text matches. Therefore, this is expected output<br />

when more than one client is evicted.<br />
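
The behavior described above, where suppression is keyed on the source line instead of on the message text, can be sketched as follows. This is a simplified illustration and not Lustre's actual implementation: the CallSiteLimiter class, the fixed time interval, and the client names are assumptions made for the example.

```python
class CallSiteLimiter:
    """Rate-limit console messages per (file, line) call site.

    Hypothetical sketch: messages are grouped by where they were
    logged from, never by their text.
    """

    def __init__(self, interval=1.0):
        self.interval = interval  # assumed minimum gap between prints
        self.state = {}           # site -> (next_allowed_time, skipped)
        self.console = []         # stands in for the kernel console

    def log(self, site, text, now):
        next_allowed, skipped = self.state.get(site, (0.0, 0))
        if now < next_allowed:
            # Still inside a burst from this call site: count it, print nothing.
            self.state[site] = (next_allowed, skipped + 1)
            return
        self.console.append(text)
        if skipped:
            self.console.append(
                "previously skipped %d similar messages" % skipped)
        self.state[site] = (now + self.interval, 0)


lim = CallSiteLimiter(interval=1.0)
site = ("genops.c", 1292)
lim.log(site, "evicting client-a", 0.0)        # printed
lim.log(site, "evicting client-b", 0.1)        # suppressed, different text
lim.log(site, "evicting client-c", 0.2)        # suppressed, different text
lim.log(("other.c", 10), "unrelated", 0.3)     # other call site, printed
lim.log(site, "evicting client-d", 1.5)        # printed with skip count
```

In this sketch the three evictions name different clients, yet two of them are suppressed because they share a call site, while the message from the other call site in between is unaffected.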

Appendix D <strong>Lustre</strong> Knowledge Base D-29
