02.12.2012 Views

OpenVMS Cluster Systems - OpenVMS Systems - HP

OpenVMS Cluster Systems - OpenVMS Systems - HP

OpenVMS Cluster Systems - OpenVMS Systems - HP

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Troubleshooting the NISCA Protocol<br />

F.2 Addressing LAN Communication Problems<br />

This message is displayed when PEDRIVER recently had to perform an<br />

excessively high rate of packet retransmissions on the LAN path consisting<br />

of the local device, the intervening network, and the device on the remote node.<br />

The message indicates that the LAN path has degraded and is approaching,<br />

or has reached, the point where reliable communications with the remote node<br />

are no longer possible. It is likely that the virtual circuit to the remote node<br />

will close if the losses continue. Furthermore, continued operation with high<br />

LAN packet losses can result in significant loss in performance because of the<br />

communication delays resulting from the packet loss detection timeouts and<br />

packet retransmission.<br />

The corrective steps to take are:<br />

1. Check the local and remote LAN device error counts to see whether a problem<br />

exists on the devices. Issue the following commands on each node:<br />

$ SHOW DEVICE local-device-name<br />

$ MC SCACP<br />

SCACP> SHOW LAN device-name<br />

$ MC LANCP<br />

LANCP> SHOW DEVICE device-name/COUNT<br />

2. If device error counts on the local devices are within normal bounds, contact<br />

your network administrators to request that they diagnose the LAN path<br />

between the devices.<br />

F.2.4 Preliminary Network Diagnosis<br />

If the symptoms and preliminary diagnosis indicate that you might have a<br />

network problem, troubleshooting LAN communication failures should start<br />

with the step-by-step procedures described in Appendix C. Appendix C helps you<br />

diagnose and solve common Ethernet and FDDI LAN communication failures<br />

during the following stages of <strong>OpenVMS</strong> <strong>Cluster</strong> activity:<br />

• When a computer or a satellite fails to boot<br />

• When a computer fails to join the <strong>OpenVMS</strong> <strong>Cluster</strong><br />

• During run time when startup procedures fail to complete<br />

• When a <strong>OpenVMS</strong> <strong>Cluster</strong> hangs<br />

The procedures in Appendix C require that you verify a number of parameters<br />

during the diagnostic process. Because system parameter settings play a key role<br />

in effective <strong>OpenVMS</strong> <strong>Cluster</strong> communications, Section F.2.6 describes several<br />

system parameters that are especially important to the timing of LAN bridges,<br />

disk failover, and channel availability.<br />

F.2.5 Tracing Intermittent Errors<br />

Because PEDRIVER communication is based on channels, LAN network problems<br />

typically fall into these areas:<br />

• Channel formation and maintenance<br />

Channels are formed when HELLO datagram messages are received from a<br />

remote system. A failure can occur when the HELLO datagram messages are<br />

not received or when the channel control message contains the wrong data.<br />

F–6 Troubleshooting the NISCA Protocol

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!