29.12.2014 Views

Magellan Final Report - Office of Science - U.S. Department of Energy

Magellan Final Report - Office of Science - U.S. Department of Energy

Magellan Final Report - Office of Science - U.S. Department of Energy

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

<strong>Magellan</strong> <strong>Final</strong> <strong>Report</strong><br />

Figure 9.11: MILC benchmark results on OpenStack run on 32 nodes (1 and 8 cores per node).<br />

9.2.3 DNS<br />

DNS3D, a computational fluid dynamics application, was chosen as our second real-world MPI application<br />

for testing purposes. It was considered both because <strong>of</strong> its nature as a primarily network-bound application<br />

and its widespread use among many scientific disciplines.<br />

Ten trials were run using 4-cores-per-node, 8-node clusters on both virtual machines and raw hardware.<br />

The average time-to-solution for the virtual and raw hardware clusters was 2.721 and 0.214 seconds respectively,<br />

giving a factor <strong>of</strong> ten performance penalty for running in a virtual environment. Like the previous<br />

test, this is again most likely a result <strong>of</strong> high network latency and contention for network bandwidth.<br />

9.2.4 Phloem<br />

For a general set <strong>of</strong> MPI benchmarks, the Phloem toolkit was chosen [68]. This suite <strong>of</strong> applications was<br />

designed for diagnosis <strong>of</strong> performance problems in an MPI cluster, and it was run with the expectation<br />

<strong>of</strong> determining which aspects <strong>of</strong> virtual networking in particular are detrimental to the performance <strong>of</strong><br />

parallel applications. The first application shown is SQMR, a general MPI point-to-point message bandwidth<br />

benchmark, run on a 16-rank 2-node cluster. The results shown below are averages <strong>of</strong> ten 1000-round runs.<br />

Figure 9.12: Phloem SQMR point-to-point average throughput (10K rounds).<br />

61

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!