23.07.2014 Views

Lustre 1.6 Operations Manual


What extra resources are required for automated failover?

To automate failover with Lustre, you need power management software, remote power control equipment, and cluster management software.

Power Management Software

PowerMan, by the Lawrence Livermore National Laboratory, is a tool that manipulates remote power control (RPC) devices from a central location. PowerMan natively supports several RPC varieties. Expect-like configurability simplifies the addition of new devices. For more information about PowerMan, go to:

http://www.llnl.gov/linux/powerman.html

Other power management software is available, but PowerMan is the best we have used so far and the one with which we are most familiar.
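As an illustration of driving RPC devices from a central location, the following minimal Python sketch builds and optionally runs a PowerMan client command. It assumes the `powerman` client is installed and on the PATH, and that it accepts the `--on`, `--off`, and `--cycle` options followed by a target list. The node names and the helper functions `powerman_command` and `run_powerman` are hypothetical and are not part of PowerMan or Lustre.

```python
import subprocess

# Assumed mapping from a logical action to a powerman client option.
POWERMAN_FLAGS = {
    "on": "--on",
    "off": "--off",
    "cycle": "--cycle",
}


def powerman_command(action, targets, binary="powerman"):
    """Return the argv list for a powerman call (hypothetical helper)."""
    if action not in POWERMAN_FLAGS:
        raise ValueError("unsupported action: %r" % (action,))
    if not targets:
        raise ValueError("at least one target node is required")
    # Several targets are passed as one comma-separated host list.
    return [binary, POWERMAN_FLAGS[action], ",".join(targets)]


def run_powerman(action, targets):
    """Run the command; raises CalledProcessError if powerman fails."""
    return subprocess.run(powerman_command(action, targets), check=True)


if __name__ == "__main__":
    # "oss1" is a placeholder node name; only print the command here.
    print(" ".join(powerman_command("off", ["oss1"])))
```

Keeping command construction separate from execution makes it possible to check the exact command line before it is allowed to power off a real node.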

Power Equipment

A multi-port, Ethernet-addressable RPC is relatively inexpensive. For recommended products, see the list of supported hardware on the PowerMan website.

If you can afford them, Linux Networx ICEboxes are very good tools. They combine remote power control and a remote serial console in a single unit.

Cluster Management Software

Lustre customers have successfully deployed two cluster management packages. Both are open source and free to download.

■ Heartbeat<br />

The Heartbeat program is one of the core components of the High-Availability Linux (Linux-HA) project. Heartbeat is highly portable and runs on every known Linux platform, as well as FreeBSD and Solaris.

For information, see: http://linux-ha.org/heartbeat/

To download, see: http://linux-ha.org/download/

■ Red Hat Cluster Manager (CluManager)

Red Hat Cluster Manager allows administrators to connect separate systems (called members or nodes) together to create failover clusters that ensure application availability and data integrity under several failure conditions.

Administrators can use Red Hat Cluster Manager with database applications, file sharing services, web servers, and more.
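To show how the three resources described above fit together, here is a minimal Python sketch of one failover decision: a monitor notices that a peer has stopped sending heartbeats, powers the peer off through the power management layer, and only then starts the service on the backup node. This is a generic illustration and does not describe how Heartbeat or Red Hat Cluster Manager are implemented. The class `FailoverMonitor` and the `fence` and `takeover` callbacks are hypothetical names introduced for this example.

```python
import time


class FailoverMonitor:
    """Toy failover monitor: fence the dead peer first, then take over."""

    def __init__(self, timeout, fence, takeover, clock=time.monotonic):
        self.timeout = timeout    # seconds of silence before failover
        self.fence = fence        # callback that powers off the failed node
        self.takeover = takeover  # callback that starts the service locally
        self.clock = clock        # injectable clock, useful for testing
        self.last_seen = clock()
        self.failed_over = False

    def heartbeat(self):
        """Record that the peer node is still alive."""
        self.last_seen = self.clock()

    def check(self):
        """Fail over once if the peer has been silent for too long."""
        if self.failed_over:
            return False
        if self.clock() - self.last_seen < self.timeout:
            return False
        # Power off the peer before taking over, so that it cannot keep
        # writing to shared storage while the backup node mounts it.
        self.fence()
        self.takeover()
        self.failed_over = True
        return True
```

In a real deployment the `fence` callback would call the power management software, and the `takeover` callback would be handled by the cluster management software.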

Appendix D Lustre Knowledge Base D-17
