Rome Wasn't Digitized in a Day - Council on Library and Information ...
DARIAH has tested this infrastructure by creating an experiment that linked TextGrid, an iRODS, and
a Fedora test server into a single federation and replicated digital objects across the different
repositories and created an index of all the TEI/XML objects in the federation. Aschenbrenner et al.
(2010) concluded that the use of Atom will not only “ensure coherence among decentralised agents”
but also, as a lightweight protocol that is “deeply embedded into the web environment of HTTP-based,
ReSTful Services,” serve as a gateway to a number of existing tools and improve the scalability of
DARIAH as a whole. Some remaining challenges to be addressed by this infrastructure include user
and rights management and the need for persistent identifiers for digital objects. 751
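The source does not describe the feed layout used in the experiment, so the following is only a minimal sketch of how an index of TEI/XML objects could be assembled from Atom feeds published by federated repositories. The repository names, identifiers, URLs, the function `index_tei_entries`, and the use of the media type `application/tei+xml` to mark TEI objects are all assumptions made for illustration, not details of the DARIAH implementation.

```python
import xml.etree.ElementTree as ET

ATOM_NS = {"atom": "http://www.w3.org/2005/Atom"}
# Assumption: TEI objects are advertised with this media type on their links.
TEI_MEDIA_TYPE = "application/tei+xml"


def index_tei_entries(feeds):
    """Build an index of TEI objects from Atom feeds.

    feeds maps a (hypothetical) repository name to the text of its Atom feed.
    The result maps each entry id to the list of places it can be found,
    so a replicated object appears once with several locations.
    """
    index = {}
    for repository, feed_xml in feeds.items():
        root = ET.fromstring(feed_xml)
        for entry in root.findall("atom:entry", ATOM_NS):
            entry_id = entry.findtext("atom:id", default="", namespaces=ATOM_NS)
            title = entry.findtext("atom:title", default="", namespaces=ATOM_NS)
            for link in entry.findall("atom:link", ATOM_NS):
                if link.get("type") == TEI_MEDIA_TYPE:
                    index.setdefault(entry_id, []).append(
                        {
                            "repository": repository,
                            "title": title,
                            "href": link.get("href"),
                        }
                    )
    return index


# Hypothetical feeds: object-1 is a TEI document replicated in both
# repositories; object-2 is an image and should not be indexed.
FEED_A = """<feed xmlns="http://www.w3.org/2005/Atom">
  <title>Repository A (hypothetical)</title>
  <entry>
    <id>urn:example:object-1</id>
    <title>Letter, encoded in TEI</title>
    <link rel="alternate" type="application/tei+xml"
          href="http://repo-a.example/object-1.xml"/>
  </entry>
  <entry>
    <id>urn:example:object-2</id>
    <title>Page scan</title>
    <link rel="alternate" type="image/tiff"
          href="http://repo-a.example/object-2.tif"/>
  </entry>
</feed>"""

FEED_B = """<feed xmlns="http://www.w3.org/2005/Atom">
  <title>Repository B (hypothetical)</title>
  <entry>
    <id>urn:example:object-1</id>
    <title>Letter, encoded in TEI</title>
    <link rel="alternate" type="application/tei+xml"
          href="http://repo-b.example/replica/object-1.xml"/>
  </entry>
</feed>"""

SAMPLE_FEEDS = {"repo-a": FEED_A, "repo-b": FEED_B}
index = index_tei_entries(SAMPLE_FEEDS)
```

Keying the index on the Atom entry id is one possible design choice: it lets replicas of the same object in different repositories collapse into a single index record, which is also where a persistent identifier scheme would eventually have to fit.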
One of the major projects of this first stage of DARIAH was to build two demonstrators that
showed the feasibility of its technical architecture. As the website notes, however, they were
also an opportunity for two “associated communities to position themselves within the DARIAH
infrastructure,” 752 namely the digital archaeology and textual encoding communities. The first
“community demonstrator,” ARENA 2, 753 migrated a “legacy application of the European archaeology
community into a more sustainable service-oriented architecture (SOA).” The original ARENA
(Archaeological Records of Europe-Networked Access) project was finished in 2004 and had served as
a “traditional metadata search portal service” based on Z39.50 754 and OAI Harvesting. The newly
released demonstrator makes use of DARIAH web services and exposes the various participating
archaeological databases as autonomous services. The second demonstrator, the TEI demonstrator, was
designed to “demonstrate the practical benefits of using TEI for the representation of digital resources
of all kinds, but primarily of original source collections within the arts and humanities.” The
demonstrator can be used to upload and publish TEI documents into a repository, among other
functionalities, and makes use of a software platform called eSciDoc 755 that was developed by the Max
Planck Digital Library. The end goal of this demonstrator was thus to make it easier for humanities
researchers both to share their TEI texts with others and to compare personal encoding practices with
those of the larger TEI community. 756
Another important factor considered by DARIAH is the frequently distributed nature of humanities
data; for example, one digital archive may have transcriptions of a manuscript while another has digital
images of this manuscript. Thus, DARIAH plans to build a data architecture that will “cover the easy
exchange of file type data, the ability to create relationships between files in remote locations and
flexible caching mechanism to deal with the exchange of large single data items like digitization
images” (Blanke 2010). Since humanities data also need to be preserved for long periods of time to
support archiving and reuse, DARIAH plans to incorporate existing archived research data. In his final
overview of the project, Blanke proposed that:
DARIAH is one way to build a research infrastructure for the humanities. It uses grid
technologies together with digital library technologies to deliver services to support the
information needs of humanities researchers. It integrates many services useful for humanities
research and will focus less on automation of processing but on providing an infrastructure to
751 The discussion over how to both design and implement persistent and unique identifiers has a vast body of literature. For some recent work, see Tonkin
(2008), Campbell (2007), and Hilse and Kothe (2006).
752 http://dariah.eu/index.phpoption=com_content&view=article&id=129&Itemid=113
753 http://www.dariah.eu/index.phpoption=com_content&view=article&id=30&Itemid=34. The demonstrator can also be accessed at
http://muninn.york.ac.uk/arena2/
754 Z39.50 is an ISO standard maintained by the Library of Congress that “specifies a client/server-based protocol for searching and retrieving information
from remote databases” (http://www.loc.gov/z3950/agency/) and has been the predominant standard used in integrated library systems
755 http://www.escidoc.org/
756 The TEI demonstrator can be accessed at (http://vm20.mpdl.mpg.de:8080/tei_demonstrator/).