09.11.2012 Views

Redpaper - IBM Redbooks



These kinds of systems will operate with some amount of planned and unplanned outages. There are two significant aspects in this definition:

– The system should survive a single failure, but a second failure may result in a loss of service.
– The detection of a fault and the triggering of an action to recover from it should be automatic, that is, require no manual intervention.
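The two aspects above can be illustrated with a minimal Python sketch. It is not taken from the Redpaper: the `Node` and `Cluster` classes, the node names, and the `check_and_recover` method are hypothetical, and a real cluster manager would detect faults through heartbeats or similar probes rather than a flag set by hand.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Node:
    """Hypothetical cluster node; `healthy` stands in for a real health probe."""
    name: str
    healthy: bool = True


class Cluster:
    """Hypothetical two-node cluster with one active node at a time."""

    def __init__(self, nodes: List[Node]) -> None:
        self.nodes = nodes
        self.active: Optional[Node] = nodes[0] if nodes else None

    def check_and_recover(self) -> bool:
        """Run one monitoring pass and return True if service is available.

        Detection and recovery happen in the same pass, with no manual step.
        """
        if self.active is not None and self.active.healthy:
            return True
        # Fault detected: fail over to the first healthy node, if any remains.
        self.active = next((n for n in self.nodes if n.healthy), None)
        return self.active is not None


cluster = Cluster([Node("primary"), Node("standby")])

cluster.nodes[0].healthy = False            # first failure
after_first = cluster.check_and_recover()   # service survives on the standby
active_after_first = cluster.active.name

cluster.nodes[1].healthy = False            # second failure
after_second = cluster.check_and_recover()  # no node left: loss of service
```

With two nodes, the first failure is absorbed by the automatic failover, while the second leaves no healthy node to take over, which matches the single-failure guarantee described above.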

Figure 1-1 illustrates the relationship between the components of systems availability.

[Figure 1-1 labels: Continuous Availability; Concurrency; Redundancy; Systems Management; Reliable, Robust and Resilient Technologies; High Availability + Continuous Operation]

Figure 1-1 High availability + Continuous Operation = Continuous Availability

There are times when the term fault tolerance is used mistakenly when the terms high availability or continuous availability are meant. Fault tolerance describes systems that, in the event of a failure, can substitute a replacement component for the failed component in a matter of a few milliseconds. This kind of achievement is supported by components that have redundant sub-components, error checking and correction for data, retry capabilities for basic operations, alternate paths for I/O requests, and so forth. However, there may also be a single point of failure that, despite the fault tolerance, can cause a component to fail. Similarly, if one important component in a system is not fault-tolerant, then the system is not fault-tolerant even though all other components are.
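Two of the mechanisms named above, retry for basic operations and alternate paths for I/O requests, can be sketched in software as follows. This is only an illustration under stated assumptions: `send_with_failover`, `broken_path`, and `good_path` are hypothetical names, and fault-tolerant hardware performs the equivalent substitution far below the application level and within milliseconds.

```python
from typing import Callable, Optional, Sequence

# A "path" is any callable that writes a payload and returns a status string.
Path = Callable[[bytes], str]


def send_with_failover(paths: Sequence[Path], payload: bytes,
                       retries_per_path: int = 2) -> str:
    """Try each I/O path in order, retrying each one before moving on."""
    last_error: Optional[Exception] = None
    for path in paths:
        for _ in range(retries_per_path):
            try:
                return path(payload)
            except IOError as exc:
                last_error = exc  # remember the fault, then retry or switch
    raise IOError("all I/O paths failed") from last_error


def broken_path(payload: bytes) -> str:
    """Hypothetical adapter that always fails."""
    raise IOError("adapter A is down")


def good_path(payload: bytes) -> str:
    """Hypothetical adapter that always succeeds."""
    return f"wrote {len(payload)} bytes via adapter B"


# With a redundant path, the failure of adapter A is masked from the caller.
result = send_with_failover([broken_path, good_path], b"data")

# With only one path, that path is a single point of failure.
try:
    send_with_failover([broken_path], b"data")
    single_path_failed = False
except IOError:
    single_path_failed = True
```

The second call shows the caveat from the paragraph above: retries alone do not help when the only available path is the one that has failed.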

Specifically, HA refers to a level of service that provides availability in the event of a single, non-catastrophic component failure. Transaction capacity may

Chapter 1. Introduction and technical overview 3
