28.12.2012 Views

z/VM: System Messages and Codes - CP - z/VM - IBM



an isolated event that will not repeat itself. However, there is a risk in varying the failed processor back online because the error may repeat itself. And if the error does repeat, there is no guarantee that the error will again occur at a point where normal operation can continue; the system might terminate next time.

HCP9102E MACHINE CHECKS: 2ND ERROR WHILE HANDLING 1ST.

Explanation: Before the machine-check handler finished handling one machine check, another machine check (or possibly a check-stop) occurred.

System Action: System operation will stop. The termination may or may not be followed by an automatic software re-IPL.

Operator Response: Watch for a following message to see if an automatic software re-IPL occurs. If it does not occur, consider notifying system support personnel to look at the problem while the system is down, instead of manually re-IPLing immediately.

HCP9103E THE MCIC IS NOT VALID. (CAUSE OF INTERRUPTION IS MISSING.)

Explanation: Message HCP9101I describes the machine check that had an invalid MCIC (machine check interrupt code). The bits in the first 2 bytes of the MCIC identify a cause or type of error, and at least one of these bits must be a 1-bit. In this case, however, none of the expected bits was present. The architecture does not permit this; therefore, there is not a proper machine check to work with.

System Action: System operation stops. There is not any automatic software re-IPL afterward.

Operator Response: Take a stand-alone dump and contact your system support personnel.

Programmer Response: Contact your IBM Support Center personnel to diagnose and correct the hardware problem.
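
The validity rule in the HCP9103E explanation (at least one bit in the first 2 bytes of the MCIC must be 1) can be illustrated with a minimal Python sketch. This is not CP code. It assumes the MCIC is handled as a 64-bit integer whose first byte is the most significant one; the names `MCIC_CAUSE_MASK` and `mcic_is_valid` are hypothetical and chosen only for this example.

```python
# Assumption: the MCIC is a 64-bit value and its "first 2 bytes" are the
# 16 most significant bits. The mask name is hypothetical.
MCIC_CAUSE_MASK = 0xFFFF_0000_0000_0000


def mcic_is_valid(mcic: int) -> bool:
    """Return True if at least one bit in the first 2 bytes is a 1-bit."""
    if not 0 <= mcic < (1 << 64):
        raise ValueError("MCIC must fit in 64 bits")
    return (mcic & MCIC_CAUSE_MASK) != 0


if __name__ == "__main__":
    # Only low-order bits set: no cause bit present, the HCP9103E situation.
    print(mcic_is_valid(0x0000_FFFF_FFFF_FFFF))
    # Highest-order bit set: a cause is identified.
    print(mcic_is_valid(0x8000_0000_0000_0000))
```

The check deliberately ignores the remaining 6 bytes, because the explanation above ties validity only to the cause bits in the first 2 bytes.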

HCP9104I The storage page at xxxxxxxx will be taken offline if possible.

Explanation: This message relates to the machine check that message HCP9101I describes.

That machine check identifies a storage error at the page address given in the present message.

Message HCP9101I precedes this message.

System Action: The system has noted that the page is in error. If the page is part of the paging pool, it is taken offline as soon as it becomes available and is not used again. However, even if it is a part of the paging pool, there is no guarantee that it can be taken offline any time soon; for example, it may be locked in storage for an indefinite time before becoming available to be varied offline. In the meantime, the page can cause more errors.

Operator Response: None.

HCP9105D Machine-check warning: A power or cooling failure is imminent.

Explanation: Message HCP9101I precedes this message and identifies a machine check.

Message HCP9105D indicates there is a machine-check warning condition present in that machine check. This indicates that system power or cooling is failing.

System Action: The system action is uncertain. The hardware may fail almost immediately, or in seconds or minutes, or possibly never. Meanwhile, system operation continues. The warning condition is sent to virtual machines to give them a chance to checkpoint themselves, if possible.

Operator Response: If the hardware does not fail instantly, send a message to all users warning them to prepare for the failure.

Once this message is issued, the system runs disabled for further warning conditions. If the system is still running after several minutes, you may wish to determine if the problem has gone away or if it is still there. To do so, enter:

SET MODE RECORD

This will enable the system once again for warning conditions. If the problem is still present, you should immediately receive this same message again.
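
The disable-and-re-enable behavior described in the HCP9105D operator response can be pictured as a simple latch. The Python sketch below is a toy model only, not how CP implements it. The class `WarningLatch` and its methods are hypothetical names, and the message text is abbreviated.

```python
class WarningLatch:
    """Toy model: report one warning, then stay disabled until re-enabled."""

    def __init__(self) -> None:
        self.enabled = True          # system starts enabled for warnings
        self.messages: list[str] = []

    def warning_condition(self) -> bool:
        """A warning condition arrives; report it only if enabled."""
        if not self.enabled:
            return False             # disabled: the condition is not reported
        self.messages.append("HCP9105D Machine-check warning")
        self.enabled = False         # run disabled for further warnings
        return True

    def set_mode_record(self, condition_still_present: bool) -> bool:
        """Model the operator entering SET MODE RECORD."""
        self.enabled = True
        if condition_still_present:
            # The same message is issued again right away.
            return self.warning_condition()
        return False


if __name__ == "__main__":
    latch = WarningLatch()
    latch.warning_condition()                 # first warning is reported
    latch.warning_condition()                 # suppressed while disabled
    latch.set_mode_record(condition_still_present=True)
    print(len(latch.messages))
```

In this model, a repeated message after re-enabling tells the operator the condition persists, and silence suggests it has cleared.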

HCP9106I A machine-check degradation is present. There may be possible buffer or TLB damage.

Explanation: Message HCP9101I precedes this message and identifies a machine check.

Message HCP9106I indicates that there was a machine-check degradation condition present in that machine check.

System Action: The system may run more slowly because of the loss of some of the hardware.

Operator Response: Notify your system support personnel. Send a message to the users to notify them that performance may decrease.

HCP9107D Recording was disabled for “system recovery” machine check.

Explanation: The system has disabled itself from recording further system-recovery machine checks. It does this after 12 system-recovery type machine checks occur. This is done so that these machine checks will
