11.01.2013 Views

IBM AIX Continuous Availability Features - IBM Redbooks

IBM AIX Continuous Availability Features - IBM Redbooks

IBM AIX Continuous Availability Features - IBM Redbooks

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Component Trace uses mechanisms similar to system trace. Existing TRCHKxx and<br />

TRCGEN macros can be replaced with Component Trace macros to trace into system trace<br />

buffers or memory trace mode private buffers. These macros are CT_HOOKx and CT_GEN,<br />

located in /usr/include/sys/ras_trace.h. Once recorded, Component Trace events can be<br />

retrieved by using the ctctrl command. Extraction using the ctctrl command is relevant<br />

only to in-memory tracing. Component Trace events can also be present in a system trace.<br />

The trcrpt command is used in both cases to process the events.<br />

2.3.10 Lightweight Memory Trace (LMT)<br />

2.3.11 ProbeVue<br />

2.3.12 Error logging<br />

Lightweight Memory Trace is an efficient, default-on per CPU, in-memory kernel trace. It is<br />

built upon the trace function that already exists in kernel subsystems, and is of most use for<br />

those who have <strong>AIX</strong> source-code access or a deep understanding of <strong>AIX</strong> internals.<br />

LMT provides system trace information for First Failure Data Capture (FFDC). It is a constant<br />

kernel trace mechanism that records software events occurring during system operation. The<br />

system activates LMT at initialization, then tracing runs continuously. Recorded events are<br />

saved into per-processor memory trace buffers. There are two memory trace buffers for each<br />

processor—one to record common events, and one to record rare events. The memory trace<br />

buffers can be extracted from system dumps accessed on a live system by service personnel.<br />

The trace records look like traditional <strong>AIX</strong> system trace records. The extracted memory trace<br />

buffers can be viewed with the trcrpt command, with formatting as defined in the /etc/trcfmt<br />

file.<br />

For further details about LMT, refer to 3.2, “Lightweight memory trace” on page 57.<br />

<strong>AIX</strong> V6.1 provides a new dynamic tracing facility that can help to debug complex system or<br />

application code. This dynamic tracing facility is introduced via a new tracing command,<br />

probevue, that allows a developer or system administrator to dynamically insert trace probe<br />

points in existing code without having to recompile the code. ProbeVue is described in detail<br />

in 3.8, “ProbeVue” on page 111. To show or change the ProbeVue configuration, use the<br />

following command:<br />

smitty probevue<br />

Troubleshooting system problems is an important and challenging task for system<br />

administrators. <strong>AIX</strong> provides an error logging facility for the runtime recording of hardware<br />

and software failures in an error log. This error log can be used for informational purposes, or<br />

for fault detection and corrective actions.<br />

The purpose of error logging is to collect and record data related to a failure so that it can be<br />

subsequently analyzed to determine the cause of the problem. The information recorded in<br />

the error log enables the customer and the service provider to rapidly isolate problems,<br />

retrieve failure data, and take corrective action.<br />

Error logging is automatically started by the rc.boot script during system initialization. Error<br />

logging is automatically stopped by the shutdown script during system shutdown.<br />

The error logging process begins when the <strong>AIX</strong> operating system module detects an error.<br />

The error-detecting segment of code then sends error information to either the errsave kernel<br />

30 <strong>IBM</strong> <strong>AIX</strong> <strong>Continuous</strong> <strong>Availability</strong> <strong>Features</strong>

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!