28.06.2014 Views

Sun Fire V445 Server Administration Guide - SCN Research

Sun Fire V445 Server Administration Guide - SCN Research

Sun Fire V445 Server Administration Guide - SCN Research

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

If the kernel hangs and the watchdog times out, ALOM reports and logs the event<br />

and performs one of three user configurable actions.<br />

■ xir: this is the default action and will cause the server to capture cpu register<br />

and memory contents to the dump-device using the firmware level sync<br />

command. In the event of the sync hanging, ALOM falls back to a hard reset<br />

after 15 minutes.<br />

Note – Do not confuse this OpenBoot sync command with the Solaris OS sync<br />

command, which results in I/O writes of buffered data to the disk drives, prior to<br />

unmounting file systems.<br />

■<br />

■<br />

Reset: this is a hard reset and results in a rapid system recovery but diagnostic<br />

data regarding the hang is not stored, and file system damage may result.<br />

None - this will result in the system being left in the hung state indefinitely after<br />

the watchdog timeout has been reported.<br />

For more information, see the sys_autorestart section of the ALOM Online Help.<br />

About Automatic System Restoration<br />

Note – Automatic System Restoration (ASR) is not the same as Automatic <strong>Server</strong><br />

Restart, which the <strong>Sun</strong> <strong>Fire</strong> <strong>V445</strong> server also supports.<br />

Automatic System Restoration (ASR) consists of self-test features and an autoconfiguring<br />

capability to detect failed hardware components and unconfigure them.<br />

By doing this, the server is able to resume operating after certain nonfatal hardware<br />

faults or failures have occured.<br />

If a component is one that is monitored by ASR, and the server is capable of<br />

operating without it, the server will automatically reboot if that component should<br />

develop a fault or fail.<br />

ASR monitors the following components:<br />

■ Memory modules<br />

■ PCI cards<br />

If a fault is detected during the power-on sequence, the faulty component is<br />

disabled. If the system remains capable of functioning, the boot sequence continues.<br />

Chapter 8 Diagnostics 195

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!