29.12.2014 Views

Magellan Final Report - Office of Science - U.S. Department of Energy

Magellan Final Report - Office of Science - U.S. Department of Energy

Magellan Final Report - Office of Science - U.S. Department of Energy

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

<strong>Magellan</strong> <strong>Final</strong> <strong>Report</strong><br />

7.4 Summary<br />

User support models for technical computing users remains one <strong>of</strong> the open issues for cloud systems. Technical<br />

computing users have grown accustomed to a high level <strong>of</strong> support from HPC center personnel. In this setting,<br />

purpose built systems are constructed for a specific use case <strong>of</strong> large scale parallel applications. As a result,<br />

many users have not developed the skills that are needed to effectively work in the more flexible environment<br />

that clouds provide. Several possible solutions are discussed below.<br />

Users could be educated to become comfortable designing and deploying infrastructure. This education<br />

effort would be time-consuming and is unlikely to be sufficient, leaving some technical computing users unable<br />

to use large scale cloud computing resources. This approach would also require that scientists develop skills<br />

which are not directly related to their area <strong>of</strong> study, most likely reducing their scientific output.<br />

Another potential solution is for an analogous HPC-tuned environment to be made available on a private<br />

or public cloud. This solution is the most similar to current production computing centers, and would be<br />

likely comparable in cost to current HPC center support activities. In fact, this approach was explored<br />

at NERSC using a virtual version <strong>of</strong> an existing cluster (“Little <strong>Magellan</strong>” described in Chapter 3) which<br />

was used by users to easily measure the overhead <strong>of</strong> running in a virtualized cloud environment. ALCF<br />

also worked with the Argonne Laboratory Computing Resource Center (LCRC), which provides the large<br />

laboratory mid-range computing cluster, to provide a virtual version <strong>of</strong> the existing Fusion cluster within<br />

the <strong>Magellan</strong> public OpenStack cloud. In addition, the ALCF/LCRC team was able to set up the Fusion<br />

resource manager so that jobs could expand into the <strong>Magellan</strong> public cloud in a secure fashion with a virtual<br />

Fusion environment, providing dynamic expansion into the cloud when the Fusion resources were fully used.<br />

Another approach would involve building user communities around common domains and applications. In<br />

this model, sophisticated users would provide VM images which are optimized for a particular science domain<br />

and are used by a large number <strong>of</strong> users. Many <strong>of</strong> these communities would mirror virtual organizations<br />

that are already in place. This model would work well for large, established organizations, but could present<br />

challenges for smaller communities composed primarily <strong>of</strong> novice users.<br />

39

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!