prepublication copy - The Department of Astronomy & Astrophysics ...
prepublication copy - The Department of Astronomy & Astrophysics ...
prepublication copy - The Department of Astronomy & Astrophysics ...
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
would normally be for a five-year period and the entire program would be subject to a senior review after<br />
five years. <strong>The</strong>se networks fulfill a different role from the NASA Astrophysical <strong>The</strong>ory Program and the<br />
NSF AAG Program and should not be funded at their expense. For NSF’s AAG program the success rate<br />
for theory proposals is roughly 37%.<br />
DATA AND SOFTWARE<br />
<strong>The</strong> scientific richness and extent <strong>of</strong> astronomical datasets is increasing rapidly. <strong>The</strong> sizes <strong>of</strong><br />
modern databases have grown over the past decade in to the petabyte (one million gigabytes) range, with<br />
present growth <strong>of</strong> roughly 0.2 petabytes per year. Challenges for data archiving will increase dramatically<br />
in the future. <strong>The</strong> committee’s top-ranked ground project, the Large Synoptic Survey Telescope (LSST),<br />
expects its archive to grow by a petabyte per month. A complete SKA operated in the manner <strong>of</strong> the<br />
VLA or ALMA would operate in the thousand petaflop or exaflop (one thousand petaflops) scale<br />
compared to the petaflop <strong>of</strong> sustained power consumed by current astronomical computing. Proper<br />
maintenance and accessibility <strong>of</strong> these archives is essential to optimizing scientific return, especially for<br />
LSST studies <strong>of</strong> transient and time variable phenomena where rapid availability <strong>of</strong> validated data will be<br />
critical.<br />
As discussed in Chapter 3, the AAAC can play a key role in providing tactical advice to DOE,<br />
NASA, and NSF on the support <strong>of</strong> data archiving and dissemination, and data analysis s<strong>of</strong>tware funding<br />
across the three agencies relative to the agencies’ programmatic needs as identified by Astro2010. In<br />
particular, the optimal infrastructure for the curation <strong>of</strong> archival space and ground-based data from<br />
federally supported missions/facilities will need periodic attention.<br />
Data Archives<br />
Data archives are central to astronomy today, and their importance continues to grow. <strong>The</strong><br />
science impact <strong>of</strong> these archives is large and increasing rapidly. Papers based on archival data from the<br />
Hubble Space Telescope now outnumber those based on new observations in any year and include some<br />
<strong>of</strong> the highest-impact science from the HST, as shown in Figures 5-6 and 5-7. Data from the 2 Micron All<br />
Sky Survey (2MASS) and the Sloan Digital Sky Survey (SDSS), which were both designed as archival<br />
projects, led to more than 1400 and 2650 refereed papers in the past decade, respectively, with the<br />
scientific output continuing to increase well after the completion <strong>of</strong> these surveys.<br />
Publicly accessible data archives can multiply the scientific impact <strong>of</strong> a facility or mission⎯for a<br />
fraction <strong>of</strong> the capital and operating costs <strong>of</strong> those facilities or missions. <strong>The</strong> data explosion and the longterm<br />
need for the ability to cross-correlate enormous datasets require archival data preservation beyond<br />
the life <strong>of</strong> projects and the development <strong>of</strong> new analysis and data mining tools. <strong>The</strong> establishment over<br />
the past decade <strong>of</strong> a National Virtual Observatory, a top recommendation <strong>of</strong> the last decadal survey and<br />
part <strong>of</strong> an International Virtual Observatory initiative, has produced widely accepted standards for data<br />
formatting, curation, and the infrastructure <strong>of</strong> a common user interface. <strong>The</strong>se standards have the<br />
potential to substantially enhance the collective value <strong>of</strong> archival datasets.<br />
NASA has regarded data handling and archiving as an integral part <strong>of</strong> space missions. It has<br />
established a network <strong>of</strong> data centers to host data from their missions, and a National Academies report 9<br />
9 Portals to the universe: <strong>The</strong> NASA <strong>Astronomy</strong> Science Centers, National Academies Press, Washington DC,<br />
2007. This report emphasizes the role <strong>of</strong> NASA archives in allowing astronomers to examine data on a particular<br />
target set across a range <strong>of</strong> wavelengths: “Not only are the archives the keepers <strong>of</strong> the raw observations, but they<br />
also provide direct access to calibrated versions <strong>of</strong> their data products, with online documentation and searchable<br />
databases linked to the literature. This “shrink-wrapped” feature <strong>of</strong> modern archives makes it easier for astronomers<br />
to combine data across various subdisciplines, a task that would have been difficult even a few years ago when all<br />
astronomers had their own sets <strong>of</strong> tools and did most <strong>of</strong> the data reduction themselves.”<br />
PREPUBLICATION COPY—SUBJECT TO FURTHER EDITORIAL CORRECTION<br />
5-10