11.01.2013 Views

IBM AIX Continuous Availability Features - IBM Redbooks

IBM AIX Continuous Availability Features - IBM Redbooks

IBM AIX Continuous Availability Features - IBM Redbooks

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

3.1 <strong>AIX</strong> Reliability, <strong>Availability</strong>, and Serviceability component<br />

hierarchy<br />

The advanced continuous availability tools and features can be assimilated as Reliability,<br />

<strong>Availability</strong>, and Serviceability (RAS) tools or facilities. These facilities were developed to<br />

enhance <strong>AIX</strong> capabilities for investigating application, component or subsystem failures and<br />

problems—without needing to perform a complete system dump or stop the application or<br />

system.<br />

A RAS component hierarchy is used by some features. This divides the system into a<br />

resource hierarchy, and allows individual RAS commands to be directed to very specific parts<br />

of the system. The RAS features that exploit the RAS component hierarchy are runtime<br />

checking, component trace, and component dump. This grouping hierarchy is illustrated in<br />

Figure 3-1.<br />

Figure 3-1 Component Reliability, <strong>Availability</strong>, and Serviceability<br />

3.1.1 First Failure Data Capture feature<br />

Lightweight memory trace is used to provide information for First Failure Data Capture<br />

(FFDC). It was introduced in <strong>AIX</strong> Version V5.3 ML3. The set of First Failure Data Capture<br />

features was further expanded in <strong>AIX</strong> V5.3 TL5 and <strong>AIX</strong> 6.1. They include:<br />

► Lightweight memory trace (LMT)<br />

► Run-time error checking (RTEC)<br />

► Component Trace (CT)<br />

► Live Dump<br />

These features are enabled by default at levels that provide valuable First Failure Data<br />

Capture information with minimal performance impact. To enable or disable all four advanced<br />

First Failure Data Capture features, enter the following command:<br />

/usr/lib/ras/ffdcctrl -o ffdc=enabled -o bosboot=no<br />

56 <strong>IBM</strong> <strong>AIX</strong> <strong>Continuous</strong> <strong>Availability</strong> <strong>Features</strong><br />

RAS Components<br />

Reliability <strong>Availability</strong> Serviceability<br />

Runtime Error Checking<br />

CT Private<br />

Buffers<br />

Trace<br />

Light<br />

Weight<br />

Memory<br />

Trace<br />

…<br />

Component RAS …<br />

Component Trace Component Dump<br />

System<br />

Trace<br />

System<br />

Dump<br />

Live Dump

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!