26.12.2014 Views

Rome Wasn't Digitized in a Day - Council on Library and Information ...

Rome Wasn't Digitized in a Day - Council on Library and Information ...

Rome Wasn't Digitized in a Day - Council on Library and Information ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

263<br />

DARIAH has tested this <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure by creat<str<strong>on</strong>g>in</str<strong>on</strong>g>g an experiment that l<str<strong>on</strong>g>in</str<strong>on</strong>g>ked TextGrid, an iRODS, <strong>and</strong><br />

a Fedora test server <str<strong>on</strong>g>in</str<strong>on</strong>g>to a s<str<strong>on</strong>g>in</str<strong>on</strong>g>gle federati<strong>on</strong> <strong>and</strong> replicated digital objects across the different<br />

repositories <strong>and</strong> created an <str<strong>on</strong>g>in</str<strong>on</strong>g>dex of all the TEI/XML objects <str<strong>on</strong>g>in</str<strong>on</strong>g> the federati<strong>on</strong>. Aschenbrenner et al.<br />

(2010) c<strong>on</strong>cluded that the use of Atom will not <strong>on</strong>ly “ensure coherence am<strong>on</strong>g decentralised agents”<br />

but also, as a lightweight protocol that is “deeply embedded <str<strong>on</strong>g>in</str<strong>on</strong>g>to the web envir<strong>on</strong>ment of HTTP-based,<br />

ReSTful Services,” serve as a gateway to a number of exist<str<strong>on</strong>g>in</str<strong>on</strong>g>g tools <strong>and</strong> improve the scalability of<br />

DARIAH as a whole. Some rema<str<strong>on</strong>g>in</str<strong>on</strong>g><str<strong>on</strong>g>in</str<strong>on</strong>g>g challenges to be addressed by this <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure <str<strong>on</strong>g>in</str<strong>on</strong>g>clude user<br />

<strong>and</strong> rights management <strong>and</strong> the need for persistent identifiers for digital objects. 751<br />

One of the major projects of this first stage of DARIAH was to build two dem<strong>on</strong>strators that<br />

dem<strong>on</strong>strated the feasibility of their technical architecture. As the website, notes, however, they were<br />

also an opportunity for two “associated communities to positi<strong>on</strong> themselves with<str<strong>on</strong>g>in</str<strong>on</strong>g> the DARIAH<br />

<str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure,” 752 namely the digital archaeology <strong>and</strong> textual encod<str<strong>on</strong>g>in</str<strong>on</strong>g>g communities. The first<br />

“community dem<strong>on</strong>strator” ARENA 2, 753 migrated a “legacy applicati<strong>on</strong> of the European archaeology<br />

community <str<strong>on</strong>g>in</str<strong>on</strong>g>to a more susta<str<strong>on</strong>g>in</str<strong>on</strong>g>able service-oriented architecture (SOA).” The orig<str<strong>on</strong>g>in</str<strong>on</strong>g>al ARENA<br />

(Archaeological Records of Europe-Networked Access) project was f<str<strong>on</strong>g>in</str<strong>on</strong>g>ished <str<strong>on</strong>g>in</str<strong>on</strong>g> 2004 <strong>and</strong> had served as<br />

a “traditi<strong>on</strong>al metadata search portal service” based <strong>on</strong> Z39.50 754 <strong>and</strong> OAI Harvest<str<strong>on</strong>g>in</str<strong>on</strong>g>g. The newly<br />

released dem<strong>on</strong>strator makes use of DARIAH web services <strong>and</strong> exposed the various participat<str<strong>on</strong>g>in</str<strong>on</strong>g>g<br />

archaeological databases as aut<strong>on</strong>omous services. The sec<strong>on</strong>d dem<strong>on</strong>strator, the TEI dem<strong>on</strong>strator, was<br />

designed to “dem<strong>on</strong>strate the practical benefits of us<str<strong>on</strong>g>in</str<strong>on</strong>g>g TEI for the representati<strong>on</strong> of digital resources<br />

of all k<str<strong>on</strong>g>in</str<strong>on</strong>g>ds, but primarily of orig<str<strong>on</strong>g>in</str<strong>on</strong>g>al source collecti<strong>on</strong>s with<str<strong>on</strong>g>in</str<strong>on</strong>g> the arts <strong>and</strong> humanities.” The<br />

dem<strong>on</strong>strator can be used to upload <strong>and</strong> publish TEI documents <str<strong>on</strong>g>in</str<strong>on</strong>g>to a repository am<strong>on</strong>g other<br />

functi<strong>on</strong>alities <strong>and</strong> makes use of software platform called eSciDoc 755 that was developed by the Max<br />

Planck Digital <strong>Library</strong>. The end goal of this dem<strong>on</strong>strator was thus to make it easier for humanities<br />

researchers to both share their TEI texts with others <strong>and</strong> to compare pers<strong>on</strong>al encod<str<strong>on</strong>g>in</str<strong>on</strong>g>g practices with<br />

that of the larger TEI community. 756<br />

Another important factor c<strong>on</strong>sidered by DARIAH is the frequently distributed nature of humanities<br />

data; for example, <strong>on</strong>e digital archive may have transcripti<strong>on</strong>s of a manuscript while another has digital<br />

images of this manuscript. Thus, DARIAH plans to build a data architecture that will “cover the easy<br />

exchange of file type data, the ability to create relati<strong>on</strong>ships between files <str<strong>on</strong>g>in</str<strong>on</strong>g> remote locati<strong>on</strong>s <strong>and</strong><br />

flexible cach<str<strong>on</strong>g>in</str<strong>on</strong>g>g mechanism to deal with the exchange of large s<str<strong>on</strong>g>in</str<strong>on</strong>g>gle data items like digitizati<strong>on</strong><br />

images”(Blanke 2010). S<str<strong>on</strong>g>in</str<strong>on</strong>g>ce humanities data also need to be preserved for l<strong>on</strong>g periods of time to<br />

support archiv<str<strong>on</strong>g>in</str<strong>on</strong>g>g <strong>and</strong> reuse, DARIAH plans to <str<strong>on</strong>g>in</str<strong>on</strong>g>corporate exist<str<strong>on</strong>g>in</str<strong>on</strong>g>g archived research data. In his f<str<strong>on</strong>g>in</str<strong>on</strong>g>al<br />

overview of the project, Blanke proposed that:<br />

DARIAH is <strong>on</strong>e way to build a research <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure for the humanities. It uses grid<br />

technologies together with digital library technologies to deliver services to support the<br />

<str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong> needs of humanities researchers. It <str<strong>on</strong>g>in</str<strong>on</strong>g>tegrates many services useful for humanities<br />

research <strong>and</strong> will focus less <strong>on</strong> automati<strong>on</strong> of process<str<strong>on</strong>g>in</str<strong>on</strong>g>g but <strong>on</strong> provid<str<strong>on</strong>g>in</str<strong>on</strong>g>g an <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure to<br />

751 The discussi<strong>on</strong> over how to both design <strong>and</strong> implement persistent <strong>and</strong> unique identifiers has a vast body of literature. For some recent work, see T<strong>on</strong>k<str<strong>on</strong>g>in</str<strong>on</strong>g><br />

(2008), Campbell (2007), <strong>and</strong> Hilse <strong>and</strong> Kothe (2006).<br />

752 http://dariah.eu/<str<strong>on</strong>g>in</str<strong>on</strong>g>dex.phpopti<strong>on</strong>=com_c<strong>on</strong>tent&view=article&id=129&Itemid=113<br />

753 http://www.dariah.eu/<str<strong>on</strong>g>in</str<strong>on</strong>g>dex.phpopti<strong>on</strong>=com_c<strong>on</strong>tent&view=article&id=30&Itemid=34. The dem<strong>on</strong>strator can also be accessed at<br />

http://mun<str<strong>on</strong>g>in</str<strong>on</strong>g>n.york.ac.uk/arena2/<br />

754 Z39.50 is an ISO st<strong>and</strong>ard ma<str<strong>on</strong>g>in</str<strong>on</strong>g>ta<str<strong>on</strong>g>in</str<strong>on</strong>g>ed by the <strong>Library</strong> of C<strong>on</strong>gress that “specifies a client/server-based protocol for search<str<strong>on</strong>g>in</str<strong>on</strong>g>g <strong>and</strong> retriev<str<strong>on</strong>g>in</str<strong>on</strong>g>g <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong><br />

from remote databases” (http://www.loc.gov/z3950/agency/) <strong>and</strong> has been the predom<str<strong>on</strong>g>in</str<strong>on</strong>g>ant st<strong>and</strong>ard used <str<strong>on</strong>g>in</str<strong>on</strong>g> <str<strong>on</strong>g>in</str<strong>on</strong>g>tegrated library systems<br />

755 http://www.escidoc.org/<br />

756 The TEI dem<strong>on</strong>strator can be accessed at (http://vm20.mpdl.mpg.de:8080/tei_dem<strong>on</strong>strator/).

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!