30.01.2013 Views

TotalView Users Guide - CI Wiki



Figure 85: Ranks Tab

If you can’t successfully start TotalView on MPI programs, check the following:

• Can you successfully start MPICH programs without TotalView? The MPICH code contains some useful scripts that let you verify that you can start remote processes on all of the computers in your computers file. (See tstmachines in mpich/util.)

• You won’t get a message queue display if you get the following warning:

    The symbols and types in the MPICH library used by
    TotalView to extract the message queues are not as
    expected in the image . This is probably
    an MPICH version or configuration problem.

  You need to check that you are using MPICH Version 1.1.0 or later and that you have configured it with the -debug option. (You can check this by looking in the config.status file at the root of the MPICH directory tree.)

• Does the TotalView Server (tvdsvr) fail to start? tvdsvr must be in your PATH when you log in. Remember that TotalView uses rsh to start the server, and that this command doesn’t pass your current environment to remotely started processes.

• Make sure you have the correct MPI version and have applied all required patches. See the TotalView Release Notes for up-to-date information.

• Under some circumstances, MPICH kills TotalView with the SIGINT signal. You can see this behavior when you use the Group > Kill command as the first step in restarting an MPICH job.

  CLI: dfocus g ddelete

  If TotalView exits and terminates abnormally with a Killed message, try setting the TV::ignore_control_c variable to true.
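Two of the checks in the list above can be partly scripted. The following Python sketch is illustrative only and is not part of TotalView or MPICH. The function names server_on_path and mentions_debug_option are hypothetical. It also assumes that config.status records the configure options as plain text and that the option appears there as -debug or --debug; your MPICH version may record it differently.

```python
import os
import re
import shutil


def server_on_path(path_string, server_name="tvdsvr"):
    """Return the full path of server_name if it is found in path_string.

    path_string is a PATH-style list of directories. Returns None when
    no executable with that name is found.
    """
    return shutil.which(server_name, path=path_string)


def mentions_debug_option(config_status_text):
    """Return True if a standalone -debug or --debug token appears.

    Assumption: the configure options are stored as plain text in
    config.status. Tokens such as --enable-debug or -debugger do not match.
    """
    pattern = r"(?<![\w-])--?debug(?![\w-])"
    return re.search(pattern, config_status_text) is not None


if __name__ == "__main__":
    # Because rsh does not pass your current environment, run this on the
    # remote host from a fresh login so that PATH is the one tvdsvr sees.
    found = server_on_path(os.environ.get("PATH", ""))
    print("tvdsvr:", found if found else "not found in PATH")
```

To check the MPICH configuration, read config.status from the root of the MPICH directory tree and pass its contents to mentions_debug_option. A False result means only that the option was not found in the form assumed here, so confirm it by reading the file.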

MPI Rank Display

The Processes/Ranks Tab at the bottom of the Process Window contains a grid that displays the status of each rank. For example, in Figure 85, six ranks are at a breakpoint, one is running, and one is stopped.
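The grid amounts to one status per rank. As a rough model of that idea, the Python sketch below tallies a list of per-rank states. The state labels and the assignment of states to particular ranks are assumptions made for illustration. They are not taken from TotalView or read from Figure 85, which only gives the totals.

```python
from collections import Counter


def summarize_ranks(statuses):
    """Count how many ranks are in each state.

    statuses is a list indexed by rank number; each entry is a state label.
    """
    return Counter(statuses)


# Hypothetical layout for eight ranks: which rank is in which state is
# invented here; only the totals (six, one, one) come from the text.
example_statuses = [
    "breakpoint", "breakpoint", "running", "breakpoint",
    "breakpoint", "stopped", "breakpoint", "breakpoint",
]

if __name__ == "__main__":
    for state, count in sorted(summarize_ranks(example_statuses).items()):
        print(f"{state}: {count}")
```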

For more information, see “Using the Processes/Ranks Tab” on page 229.

106 Chapter 6: Setting Up MPI Debugging Sessions
