Magellan Final Report - Office of Science - U.S. Department of Energy
Magellan Final Report - Office of Science - U.S. Department of Energy
Magellan Final Report - Office of Science - U.S. Department of Energy
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
<strong>Magellan</strong> <strong>Final</strong> <strong>Report</strong><br />
7.4 Summary<br />
User support models for technical computing users remains one <strong>of</strong> the open issues for cloud systems. Technical<br />
computing users have grown accustomed to a high level <strong>of</strong> support from HPC center personnel. In this setting,<br />
purpose built systems are constructed for a specific use case <strong>of</strong> large scale parallel applications. As a result,<br />
many users have not developed the skills that are needed to effectively work in the more flexible environment<br />
that clouds provide. Several possible solutions are discussed below.<br />
Users could be educated to become comfortable designing and deploying infrastructure. This education<br />
effort would be time-consuming and is unlikely to be sufficient, leaving some technical computing users unable<br />
to use large scale cloud computing resources. This approach would also require that scientists develop skills<br />
which are not directly related to their area <strong>of</strong> study, most likely reducing their scientific output.<br />
Another potential solution is for an analogous HPC-tuned environment to be made available on a private<br />
or public cloud. This solution is the most similar to current production computing centers, and would be<br />
likely comparable in cost to current HPC center support activities. In fact, this approach was explored<br />
at NERSC using a virtual version <strong>of</strong> an existing cluster (“Little <strong>Magellan</strong>” described in Chapter 3) which<br />
was used by users to easily measure the overhead <strong>of</strong> running in a virtualized cloud environment. ALCF<br />
also worked with the Argonne Laboratory Computing Resource Center (LCRC), which provides the large<br />
laboratory mid-range computing cluster, to provide a virtual version <strong>of</strong> the existing Fusion cluster within<br />
the <strong>Magellan</strong> public OpenStack cloud. In addition, the ALCF/LCRC team was able to set up the Fusion<br />
resource manager so that jobs could expand into the <strong>Magellan</strong> public cloud in a secure fashion with a virtual<br />
Fusion environment, providing dynamic expansion into the cloud when the Fusion resources were fully used.<br />
Another approach would involve building user communities around common domains and applications. In<br />
this model, sophisticated users would provide VM images which are optimized for a particular science domain<br />
and are used by a large number <strong>of</strong> users. Many <strong>of</strong> these communities would mirror virtual organizations<br />
that are already in place. This model would work well for large, established organizations, but could present<br />
challenges for smaller communities composed primarily <strong>of</strong> novice users.<br />
39