Magellan Final Report - Office of Science - U.S. Department of Energy
Magellan Final Report - Office of Science - U.S. Department of Energy
Magellan Final Report - Office of Science - U.S. Department of Energy
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
<strong>Magellan</strong> <strong>Final</strong> <strong>Report</strong><br />
Figure 9.11: MILC benchmark results on OpenStack run on 32 nodes (1 and 8 cores per node).<br />
9.2.3 DNS<br />
DNS3D, a computational fluid dynamics application, was chosen as our second real-world MPI application<br />
for testing purposes. It was considered both because <strong>of</strong> its nature as a primarily network-bound application<br />
and its widespread use among many scientific disciplines.<br />
Ten trials were run using 4-cores-per-node, 8-node clusters on both virtual machines and raw hardware.<br />
The average time-to-solution for the virtual and raw hardware clusters was 2.721 and 0.214 seconds respectively,<br />
giving a factor <strong>of</strong> ten performance penalty for running in a virtual environment. Like the previous<br />
test, this is again most likely a result <strong>of</strong> high network latency and contention for network bandwidth.<br />
9.2.4 Phloem<br />
For a general set <strong>of</strong> MPI benchmarks, the Phloem toolkit was chosen [68]. This suite <strong>of</strong> applications was<br />
designed for diagnosis <strong>of</strong> performance problems in an MPI cluster, and it was run with the expectation<br />
<strong>of</strong> determining which aspects <strong>of</strong> virtual networking in particular are detrimental to the performance <strong>of</strong><br />
parallel applications. The first application shown is SQMR, a general MPI point-to-point message bandwidth<br />
benchmark, run on a 16-rank 2-node cluster. The results shown below are averages <strong>of</strong> ten 1000-round runs.<br />
Figure 9.12: Phloem SQMR point-to-point average throughput (10K rounds).<br />
61