03.01.2015 Views

prepublication copy - The Department of Astronomy & Astrophysics ...

prepublication copy - The Department of Astronomy & Astrophysics ...

prepublication copy - The Department of Astronomy & Astrophysics ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

would normally be for a five-year period and the entire program would be subject to a senior review after<br />

five years. <strong>The</strong>se networks fulfill a different role from the NASA Astrophysical <strong>The</strong>ory Program and the<br />

NSF AAG Program and should not be funded at their expense. For NSF’s AAG program the success rate<br />

for theory proposals is roughly 37%.<br />

DATA AND SOFTWARE<br />

<strong>The</strong> scientific richness and extent <strong>of</strong> astronomical datasets is increasing rapidly. <strong>The</strong> sizes <strong>of</strong><br />

modern databases have grown over the past decade in to the petabyte (one million gigabytes) range, with<br />

present growth <strong>of</strong> roughly 0.2 petabytes per year. Challenges for data archiving will increase dramatically<br />

in the future. <strong>The</strong> committee’s top-ranked ground project, the Large Synoptic Survey Telescope (LSST),<br />

expects its archive to grow by a petabyte per month. A complete SKA operated in the manner <strong>of</strong> the<br />

VLA or ALMA would operate in the thousand petaflop or exaflop (one thousand petaflops) scale<br />

compared to the petaflop <strong>of</strong> sustained power consumed by current astronomical computing. Proper<br />

maintenance and accessibility <strong>of</strong> these archives is essential to optimizing scientific return, especially for<br />

LSST studies <strong>of</strong> transient and time variable phenomena where rapid availability <strong>of</strong> validated data will be<br />

critical.<br />

As discussed in Chapter 3, the AAAC can play a key role in providing tactical advice to DOE,<br />

NASA, and NSF on the support <strong>of</strong> data archiving and dissemination, and data analysis s<strong>of</strong>tware funding<br />

across the three agencies relative to the agencies’ programmatic needs as identified by Astro2010. In<br />

particular, the optimal infrastructure for the curation <strong>of</strong> archival space and ground-based data from<br />

federally supported missions/facilities will need periodic attention.<br />

Data Archives<br />

Data archives are central to astronomy today, and their importance continues to grow. <strong>The</strong><br />

science impact <strong>of</strong> these archives is large and increasing rapidly. Papers based on archival data from the<br />

Hubble Space Telescope now outnumber those based on new observations in any year and include some<br />

<strong>of</strong> the highest-impact science from the HST, as shown in Figures 5-6 and 5-7. Data from the 2 Micron All<br />

Sky Survey (2MASS) and the Sloan Digital Sky Survey (SDSS), which were both designed as archival<br />

projects, led to more than 1400 and 2650 refereed papers in the past decade, respectively, with the<br />

scientific output continuing to increase well after the completion <strong>of</strong> these surveys.<br />

Publicly accessible data archives can multiply the scientific impact <strong>of</strong> a facility or mission⎯for a<br />

fraction <strong>of</strong> the capital and operating costs <strong>of</strong> those facilities or missions. <strong>The</strong> data explosion and the longterm<br />

need for the ability to cross-correlate enormous datasets require archival data preservation beyond<br />

the life <strong>of</strong> projects and the development <strong>of</strong> new analysis and data mining tools. <strong>The</strong> establishment over<br />

the past decade <strong>of</strong> a National Virtual Observatory, a top recommendation <strong>of</strong> the last decadal survey and<br />

part <strong>of</strong> an International Virtual Observatory initiative, has produced widely accepted standards for data<br />

formatting, curation, and the infrastructure <strong>of</strong> a common user interface. <strong>The</strong>se standards have the<br />

potential to substantially enhance the collective value <strong>of</strong> archival datasets.<br />

NASA has regarded data handling and archiving as an integral part <strong>of</strong> space missions. It has<br />

established a network <strong>of</strong> data centers to host data from their missions, and a National Academies report 9<br />

9 Portals to the universe: <strong>The</strong> NASA <strong>Astronomy</strong> Science Centers, National Academies Press, Washington DC,<br />

2007. This report emphasizes the role <strong>of</strong> NASA archives in allowing astronomers to examine data on a particular<br />

target set across a range <strong>of</strong> wavelengths: “Not only are the archives the keepers <strong>of</strong> the raw observations, but they<br />

also provide direct access to calibrated versions <strong>of</strong> their data products, with online documentation and searchable<br />

databases linked to the literature. This “shrink-wrapped” feature <strong>of</strong> modern archives makes it easier for astronomers<br />

to combine data across various subdisciplines, a task that would have been difficult even a few years ago when all<br />

astronomers had their own sets <strong>of</strong> tools and did most <strong>of</strong> the data reduction themselves.”<br />

PREPUBLICATION COPY—SUBJECT TO FURTHER EDITORIAL CORRECTION<br />

5-10

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!