27.12.2014 Views

QLogic OFED+ Host Software User Guide, Rev. B

QLogic OFED+ Host Software User Guide, Rev. B

QLogic OFED+ Host Software User Guide, Rev. B

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

F–Troubleshooting<br />

Kernel and Initialization Issues<br />

InfiniPath ib_qib Initialization Failure<br />

There may be cases where ib_qib was not properly initialized. Symptoms of this<br />

may show up in error messages from an MPI job or another program. Here is a<br />

sample command and error message:<br />

$ mpirun -np 2 -m ~/tmp/mbu13 osu_latency<br />

:ipath_userinit: assign_port command failed: Network is<br />

down<br />

:can’t open /dev/ipath, network down<br />

This will be followed by messages of this type after 60 seconds:<br />

MPIRUN: 1 rank has not yet exited 60 seconds<br />

after rank 0 (node ) exited without reaching<br />

MPI_Finalize().<br />

MPIRUN:Waiting at most another 60 seconds for<br />

the remaining ranks to do a clean shutdown before terminating 1<br />

node processes.<br />

If this error appears, check to see if the InfiniPath driver is loaded by typing:<br />

$ lsmod | grep ib_qib<br />

If no output is displayed, the driver did not load for some reason. In this case, try<br />

the following commands (as root):<br />

# modprobe -v ib_qib<br />

# lsmod | grep ib_qib<br />

# dmesg | grep -i ib_qib | tail -25<br />

The output will indicate whether the driver has loaded. Printing out messages<br />

using dmesg may help to locate any problems with ib_qib.<br />

If the driver loaded, but MPI or other programs are not working, check to see if<br />

problems were detected during the driver and <strong>QLogic</strong> hardware initialization with<br />

the command:<br />

$ dmesg | grep -i ib_qib<br />

This command may generate more than one screen of output.<br />

Also, check the link status with the commands:<br />

$ cat /sys/class/infiniband/ipath*/device/status_str<br />

These commands are normally executed by the ipathbug-helper script, but<br />

running them separately may help locate the problem.<br />

See also “status_str” on page I-19 and “ipath_checkout” on page I-10.<br />

D000046-005 B F-5

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!