13.07.2015 Views

download as a pdf - Southern California Coastal Ocean Observing ...

download as a pdf - Southern California Coastal Ocean Observing ...

download as a pdf - Southern California Coastal Ocean Observing ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Part I. OverviewBiological Data ConsiderationsManagement and stewardship of biological data present special challenges, which historicallyhave often been neglected. Biological data management requires that special considerationbe given to metadata (see Part III, Appendix 7 for a more detailed discussion).For example, the b<strong>as</strong>ic units for biological data are species. New species are continuallybeing discovered and named, and names of recognized species are sometimes changed.The hierarchical tree of evolutionary relationships among species, and the <strong>as</strong>sociated hierarchicalnomenclature, must continually be revised to incorporate new information. Biologicaldata systems require name translators that provide currently recognized scientificnames from synonymous scientific names and common names. The taxonomic authorityfor each major group of organisms maintains the accepted list of species, with oversightfrom the Global Biodiversity Information Facility (GBIF), Catalogue of Life, and organizationssuch <strong>as</strong> the Integrated Taxonomic Information System (ITIS)/ Species 2000, andthe <strong>Ocean</strong> Biogeographic Information System (OBIS). Protocols for using DNA sequencedata <strong>as</strong> a “bar code of life” have been proposed <strong>as</strong> an aid to taxonomic identification andevolutionary relationships.grids, time series, vertical profiles, and species type and abundance. The protocol ensures that thestructure, numeric values, and metadata attributes of the data are preserved between server andclient. It does not, however, impose a particular geospatial data model. For example, OPeNDAPdoes not “understand” what a time series is, nor does it “know” the significance of “phytoplankton_abundance” <strong>as</strong> a variable name. When transmitting a simple time series, OPeNDAP merely knowsthat it is conveying a one-dimensional array of values with attributes such <strong>as</strong> units = “seconds” andtitle = “Phytoplankton Abundance” attached to it. Such an approach greatly lowers the barrier toinitial participation by data suppliers, since nearly all data holdings can e<strong>as</strong>ily be encapsulated inthis f<strong>as</strong>hion and sent over the Internet. It also ensures the level of generality needed to provide semanticallyaware data transport for the very broad range of ocean data cl<strong>as</strong>ses.To achieve the desired level of interoperability, the mature DMAC will require that all data are deliveredin a consistent geospatial data model (or family of models). In this ph<strong>as</strong>ed DMAC ImplementationPlan, the development of a rich and comprehensive data model occurs in parallel withthe pilot deployment of OPeNDAP data servers and clients. The comprehensive data model(s)must harmonize with ongoing work in several communities, such <strong>as</strong> GIS and forec<strong>as</strong>t and climatemodeling. It must standardize controlled vocabularies, include the encoding of ocean biological29

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!