Magellan Final Report - Office of Science - U.S. Department of Energy
Magellan Final Report - Office of Science - U.S. Department of Energy
Magellan Final Report - Office of Science - U.S. Department of Energy
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
Executive Summary<br />
The goal <strong>of</strong> <strong>Magellan</strong>, a project funded through the U.S. <strong>Department</strong> <strong>of</strong> <strong>Energy</strong> (DOE) <strong>Office</strong> <strong>of</strong> Advanced<br />
Scientific Computing Research (ASCR), was to investigate the potential role <strong>of</strong> cloud computing in addressing<br />
the computing needs for the DOE <strong>Office</strong> <strong>of</strong> <strong>Science</strong> (SC), particularly related to serving the needs <strong>of</strong> midrange<br />
computing and future data-intensive computing workloads. A set <strong>of</strong> research questions was formed to<br />
probe various aspects <strong>of</strong> cloud computing from performance, usability, and cost. To address these questions,<br />
a distributed testbed infrastructure was deployed at the Argonne Leadership Computing Facility (ALCF)<br />
and the National <strong>Energy</strong> Research Scientific Computing Center (NERSC). The testbed was designed to be<br />
flexible and capable enough to explore a variety <strong>of</strong> computing models and hardware design points in order<br />
to understand the impact for various scientific applications. During the project, the testbed also served as<br />
a valuable resource to application scientists. Applications from a diverse set <strong>of</strong> projects such as MG-RAST<br />
(a metagenomics analysis server), the Joint Genome Institute, the STAR experiment at the Relativistic<br />
Heavy Ion Collider, and the Laser Interferometer Gravitational Wave Observatory (LIGO), were used by<br />
the <strong>Magellan</strong> project for benchmarking within the cloud, but the project teams were also able to accomplish<br />
important production science utilizing the <strong>Magellan</strong> cloud resources.<br />
Cloud computing has garnered significant attention from both industry and research scientists as it has<br />
emerged as a potential model to address a broad array <strong>of</strong> computing needs and requirements such as custom<br />
s<strong>of</strong>tware environments and increased utilization among others. Cloud services, both private and public, have<br />
demonstrated the ability to provide a scalable set <strong>of</strong> services that can be easily and cost-effectively utilized<br />
to tackle various enterprise and web workloads. These benefits are a direct result <strong>of</strong> the definition <strong>of</strong> cloud<br />
computing: on-demand self-service resources that are pooled, can be accessed via a network, and can be<br />
elastically adjusted by the user. The pooling <strong>of</strong> resources across a large user base enables economies <strong>of</strong> scale,<br />
while the ability to easily provision and elastically expand the resources provides flexible capabilities.<br />
Following the Executive Summary we summarize the key findings and recommendations <strong>of</strong> the project.<br />
Greater detail is provided in the body <strong>of</strong> the report. Here we briefly summarize some <strong>of</strong> the high-level<br />
findings from the study.<br />
• Cloud approaches provide many advantages, including customized environments that enable users to<br />
bring their own s<strong>of</strong>tware stack and try out new computing environments without significant administration<br />
overhead, the ability to quickly surge resources to address larger problems, and the advantages<br />
that come from increased economies <strong>of</strong> scale. Virtualization is the primary strategy <strong>of</strong> providing these<br />
capabilities. Our experience working with application scientists using the cloud demonstrated the<br />
power <strong>of</strong> virtualization to enable fully customized environments and flexible resource management,<br />
and their potential value to scientists.<br />
• Cloud computing can require significant initial effort and skills in order to port applications to these<br />
new models. This is also true for some <strong>of</strong> the emerging programming models used in cloud computing.<br />
Scientists should consider this upfront investment in any economic analysis when deciding whether to<br />
move to the cloud.<br />
• Significant gaps and challenges exist in the areas <strong>of</strong> managing virtual environments, workflows, data,<br />
cyber-security, and others. Further research and development is needed to ensure that scientists can<br />
i