Magellan Final Report - Office of Science - U.S. Department of Energy
Magellan Final Report - Office of Science - U.S. Department of Energy
Magellan Final Report - Office of Science - U.S. Department of Energy
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
<strong>Magellan</strong> <strong>Final</strong> <strong>Report</strong><br />
Table 12.2: Hourly Cost <strong>of</strong> a DOE HPC system<br />
Hopper<br />
Total Acquisition Cost $52,000,000<br />
Annual Acquisition Cost (divided over 4 years) $13,000,000<br />
Maintenance & Support $5,200,000<br />
Power & Cooling $2,600,000<br />
FTEs $300,000<br />
Total Annual Costs $21,100,000<br />
Cores 153,408<br />
Utilization 85%<br />
Annual Utilized Core Hours 1,142,275,968<br />
Cost per Utilized Core Hour (includes storage subsystem) $0.018<br />
HPC systems. Currently, there is no streamlined method to provide this type <strong>of</strong> capability in commercial<br />
IaaS cloud systems like Amazon. However, even using the best methods available in the existing IaaS system,<br />
creating a file system <strong>of</strong> this scale would add additional costs for storage and for additional instances to act<br />
as servers for the file system.<br />
12.2.3 Cost <strong>of</strong> a DOE Center in the Cloud<br />
Another cost analysis approach is to consider the total annual budget <strong>of</strong> a center and the resources it provides<br />
to the users and translate those into commercial cloud <strong>of</strong>ferings. In Table 12.3, we show a summary <strong>of</strong> the<br />
approximate cost to operate NERSC and ALCF in the Amazon cloud. The computational systems are<br />
calculated using Amazon’s Cluster Compute <strong>of</strong>fering which provides the closest performance to traditional<br />
HPC systems. Based on the performance studies described in Chapter 9, using less expensive, less capable<br />
<strong>of</strong>ferings would result in significantly lower performance, which would actually increase costs. In addition, the<br />
estimates use a 1-year reserved instance with the generous assumption that the cores are used for an entire<br />
year (see Section 12.2.1 for a more detailed description <strong>of</strong> the assumptions used in the pricing calculations).<br />
This results in effective core hour cost <strong>of</strong> $0.13 per core hour. The total core hours delivered by each system in<br />
a year is multiplied by this rate assuming 85% overall utilization (NERSC typically exceeds this value). The<br />
storage costs are calculated using $0.037 per GB Month and $0.05 per GB for transfers out <strong>of</strong> S3 (transfers<br />
into S3 are currently free). This is based on the marginal cost <strong>of</strong> single copy storage in Amazon S3. The file<br />
system costs are based on Amazon’s Elastic Block Storage cost <strong>of</strong> $0.10 per GB Month. The data amounts<br />
are based on the current volume <strong>of</strong> data stored on disk and tape, not the available capacity. They do not<br />
account for the observed growth in data storage. For example, the annual growth in HPSS (tape) is around<br />
70%. The calculated total annual cost to operate NERSC in the cloud is just over $200M, and to operate<br />
ALCF in the cloud is just under $180M. NERSC has a current annual budget <strong>of</strong> between $50M-$55M. ALCF<br />
is a relatively new facility and its budget has grown since it started. The budget shown in the table is an<br />
average over a 4 year period, using ALCF’s actual budgets for 2008 through 2011. These budgets include<br />
all system acquisition costs, support and maintenance, power and cooling cost, system administration and<br />
management, and s<strong>of</strong>tware licensing costs, as well as data analytics and visualization resources that are<br />
provided to the facility users.<br />
While facilities costs are not directly included in NERSC’s budget, and some are not directly included in<br />
ALCF’s budget (data center electricity is included), they are incorporated in overhead charges and burdens<br />
and therefore indirectly included in the budget. However, the facilities costs are negligible. For example,<br />
if we use an industry average cost <strong>of</strong> $1,200 per square foot for data center space for a system the size <strong>of</strong><br />
Hopper, we arrive at $3.6M. Assuming a 10 year life span for the data center, which is low, we arrive at $360k<br />
per year. This translates into approximately 3/100th <strong>of</strong> a cent per core hour or around a 2% increase in the<br />
121