23.07.2014 Views

Lustre 1.6 Operations Manual

Lustre 1.6 Operations Manual

Lustre 1.6 Operations Manual

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

CHAPTER 8<br />

Failover<br />

This chapter describes failover in a <strong>Lustre</strong> system and includes the following<br />

sections:<br />

■ What is Failover?<br />

■ OST Failover<br />

■ MDS Failover<br />

■ Configuring MDS and OSTs for Failover<br />

■ Setting Up Failover with Heartbeat V1<br />

■ Using MMP<br />

■ Setting Up Failover with Heartbeat V2<br />

■ Considerations with Failover Software and Solutions<br />

8.1 What is Failover?<br />

We say a computer system is Highly Available when the services it provides are<br />

available with minimum downtime. In a highly-available system, if a failure<br />

condition occurs, such as loss of a server or a network or software fault, the services<br />

provided remain unaffected. Generally, we measure availability by the percentage of<br />

time the system is required to be available.<br />

Availability is accomplished by providing replicated hardware and/or software, so<br />

failure of the system will be covered by a paired system. The concept of “failover” is<br />

the method of switching an application and its resources to a standby server when<br />

the primary system fails or is unavailable. Failover should be automatic and, in most<br />

cases, completely application-transparent.<br />

8-1

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!