29.12.2014 Views

Magellan Final Report - Office of Science - U.S. Department of Energy

Magellan Final Report - Office of Science - U.S. Department of Energy

Magellan Final Report - Office of Science - U.S. Department of Energy

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Recommendations<br />

The <strong>Magellan</strong> project has evaluated cloud computing models and various associated technologies and explored<br />

their potential role in addressing the computing needs <strong>of</strong> scientists funded by the <strong>Department</strong> <strong>of</strong> <strong>Energy</strong>’s<br />

<strong>Office</strong> <strong>of</strong> <strong>Science</strong>. Our findings highlight both the potential value and the challenges <strong>of</strong> exploiting cloud<br />

computing for DOE SC applications. Here we summarize our recommendations for DOE SC, DOE resource<br />

providers, application scientists, and tool developers. A number <strong>of</strong> these recommendations do not fit within<br />

current scope and budgets, and additional funding beyond currently funded DOE projects might be required.<br />

DOE <strong>Office</strong> <strong>of</strong> <strong>Science</strong>. The findings <strong>of</strong> the <strong>Magellan</strong> project demonstrate some potential benefits <strong>of</strong><br />

cloud computing for meeting the computing needs <strong>of</strong> the DOE SC scientific community. Cloud features such<br />

as customized environments and the MapReduce programming model help in addressing the needs <strong>of</strong> some<br />

current users <strong>of</strong> DOE resources as well as the needs <strong>of</strong> some new scientific applications that need to scale<br />

up from current environments. The findings also show that existing DOE HPC centers are cost effective<br />

compared with commercial providers. Consequently, DOE should work with the DOE HPC centers and<br />

DOE resource providers to ensure that the expanding needs <strong>of</strong> the scientific community are being met. If<br />

cases emerge where cloud models are deemed necessary in order to fully address these needs, we recommend<br />

that DOE should first consider a private cloud computing strategy. This approach can provide many <strong>of</strong> the<br />

benefits <strong>of</strong> cloud environments while being more cost effective, allowing for more optimized <strong>of</strong>ferings, and<br />

better addressing the security, performance, and data management challenges.<br />

DOE Resource Providers. There is a rising class <strong>of</strong> applications that do not fit the current traditional<br />

model <strong>of</strong> large-scale tightly coupled applications that run in supercomputing centers, yet have increasing<br />

needs for computation and storage resources. In addition, various scientific communities need custom<br />

s<strong>of</strong>tware environments or shared virtual clusters to meet specific collaboration or workload requirements.<br />

Additionally, clouds provide mechanisms and tools for high-throughput and data-intensive applications that<br />

have large-scale resource requirements. The cloud model provides solutions to address some <strong>of</strong> these requirements,<br />

but these can be met by DOE resource providers as well. We make some specific recommendations<br />

to DOE resource providers for providing these features. Resource providers include the large ASCR facilities<br />

such as the Leadership Computing Facilities and NERSC; institutional resources operated by many <strong>of</strong> the<br />

labs; and the computing resources deployed in conjunction with many large user facilities funded by other<br />

science program <strong>of</strong>fices.<br />

1. DOE resource providers should investigate mechanisms to support more diverse workloads such as highthroughput<br />

and data-intensive workloads that increasingly require more compute and storage resources.<br />

Specifically, resource providers should consider more flexible queueing policies for these loosely coupled<br />

computational workloads.<br />

2. A number <strong>of</strong> user communities require a different resource provisioning model where they need exclusive<br />

access to a pre-determined amount <strong>of</strong> resources for a fixed period <strong>of</strong> time due to increased computational<br />

load. Our experience with the Material Genomes project, the Joint Genome Institute, and the analysis<br />

<strong>of</strong> the German E. coli strains demonstrates that on-demand resource provisioning can play a valuable<br />

vi

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!