29.12.2014 Views

Magellan Final Report - Office of Science - U.S. Department of Energy

Magellan Final Report - Office of Science - U.S. Department of Energy

Magellan Final Report - Office of Science - U.S. Department of Energy

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Executive Summary<br />

The goal <strong>of</strong> <strong>Magellan</strong>, a project funded through the U.S. <strong>Department</strong> <strong>of</strong> <strong>Energy</strong> (DOE) <strong>Office</strong> <strong>of</strong> Advanced<br />

Scientific Computing Research (ASCR), was to investigate the potential role <strong>of</strong> cloud computing in addressing<br />

the computing needs for the DOE <strong>Office</strong> <strong>of</strong> <strong>Science</strong> (SC), particularly related to serving the needs <strong>of</strong> midrange<br />

computing and future data-intensive computing workloads. A set <strong>of</strong> research questions was formed to<br />

probe various aspects <strong>of</strong> cloud computing from performance, usability, and cost. To address these questions,<br />

a distributed testbed infrastructure was deployed at the Argonne Leadership Computing Facility (ALCF)<br />

and the National <strong>Energy</strong> Research Scientific Computing Center (NERSC). The testbed was designed to be<br />

flexible and capable enough to explore a variety <strong>of</strong> computing models and hardware design points in order<br />

to understand the impact for various scientific applications. During the project, the testbed also served as<br />

a valuable resource to application scientists. Applications from a diverse set <strong>of</strong> projects such as MG-RAST<br />

(a metagenomics analysis server), the Joint Genome Institute, the STAR experiment at the Relativistic<br />

Heavy Ion Collider, and the Laser Interferometer Gravitational Wave Observatory (LIGO), were used by<br />

the <strong>Magellan</strong> project for benchmarking within the cloud, but the project teams were also able to accomplish<br />

important production science utilizing the <strong>Magellan</strong> cloud resources.<br />

Cloud computing has garnered significant attention from both industry and research scientists as it has<br />

emerged as a potential model to address a broad array <strong>of</strong> computing needs and requirements such as custom<br />

s<strong>of</strong>tware environments and increased utilization among others. Cloud services, both private and public, have<br />

demonstrated the ability to provide a scalable set <strong>of</strong> services that can be easily and cost-effectively utilized<br />

to tackle various enterprise and web workloads. These benefits are a direct result <strong>of</strong> the definition <strong>of</strong> cloud<br />

computing: on-demand self-service resources that are pooled, can be accessed via a network, and can be<br />

elastically adjusted by the user. The pooling <strong>of</strong> resources across a large user base enables economies <strong>of</strong> scale,<br />

while the ability to easily provision and elastically expand the resources provides flexible capabilities.<br />

Following the Executive Summary we summarize the key findings and recommendations <strong>of</strong> the project.<br />

Greater detail is provided in the body <strong>of</strong> the report. Here we briefly summarize some <strong>of</strong> the high-level<br />

findings from the study.<br />

• Cloud approaches provide many advantages, including customized environments that enable users to<br />

bring their own s<strong>of</strong>tware stack and try out new computing environments without significant administration<br />

overhead, the ability to quickly surge resources to address larger problems, and the advantages<br />

that come from increased economies <strong>of</strong> scale. Virtualization is the primary strategy <strong>of</strong> providing these<br />

capabilities. Our experience working with application scientists using the cloud demonstrated the<br />

power <strong>of</strong> virtualization to enable fully customized environments and flexible resource management,<br />

and their potential value to scientists.<br />

• Cloud computing can require significant initial effort and skills in order to port applications to these<br />

new models. This is also true for some <strong>of</strong> the emerging programming models used in cloud computing.<br />

Scientists should consider this upfront investment in any economic analysis when deciding whether to<br />

move to the cloud.<br />

• Significant gaps and challenges exist in the areas <strong>of</strong> managing virtual environments, workflows, data,<br />

cyber-security, and others. Further research and development is needed to ensure that scientists can<br />

i

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!