26.12.2014 Views

Rome Wasn't Digitized in a Day - Council on Library and Information ...

Rome Wasn't Digitized in a Day - Council on Library and Information ...

Rome Wasn't Digitized in a Day - Council on Library and Information ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

162<br />

descripti<strong>on</strong>s that provides scholarly <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong> <strong>on</strong> “codicology, manuscript history <strong>and</strong> textual<br />

transmissi<strong>on</strong>”; (5) a flexible data model that can accommodate the <str<strong>on</strong>g>in</str<strong>on</strong>g>tegrati<strong>on</strong> of “manuscript<br />

descripti<strong>on</strong>s” of vary<str<strong>on</strong>g>in</str<strong>on</strong>g>g semantic depth <strong>and</strong> length; <strong>and</strong> (6) l<str<strong>on</strong>g>in</str<strong>on</strong>g>k<str<strong>on</strong>g>in</str<strong>on</strong>g>g to important exist<str<strong>on</strong>g>in</str<strong>on</strong>g>g <strong>on</strong>l<str<strong>on</strong>g>in</str<strong>on</strong>g>e<br />

resources such as library catalogs, specialist bibliographies, <strong>and</strong> digital texts. Deckers et al. (2009)<br />

reported that they particularly wanted to create a tool that provides scholars <str<strong>on</strong>g>in</str<strong>on</strong>g> the fields of Greek<br />

codicology <strong>and</strong> palaeography with the ability to publish digital research materials.<br />

The Teuchos platform is built off of a Fedora repository. Three types of users can <str<strong>on</strong>g>in</str<strong>on</strong>g>teract with this<br />

repository through a web applicati<strong>on</strong>: 528 systems adm<str<strong>on</strong>g>in</str<strong>on</strong>g>istrators; registered users, who may c<strong>on</strong>tribute<br />

resources; <strong>and</strong> public users, who can view <strong>on</strong>ly publicly released materials. The Teuchos Fedora<br />

repository <str<strong>on</strong>g>in</str<strong>on</strong>g>cludes several types of complicated digital objects, all of which have been designed to try<br />

<strong>and</strong> cover all potential categories of text transmissi<strong>on</strong>. Manuscript watermark trac<str<strong>on</strong>g>in</str<strong>on</strong>g>gs are stored as<br />

digital images <strong>and</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong> about them is stored <str<strong>on</strong>g>in</str<strong>on</strong>g> a custom XML format created by the project. A<br />

“textual transmissi<strong>on</strong>” group has two subgroups, each of which is then subdivided: the first group<br />

provides <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong> related to <str<strong>on</strong>g>in</str<strong>on</strong>g>dividual manuscripts <strong>and</strong> the sec<strong>on</strong>d <str<strong>on</strong>g>in</str<strong>on</strong>g>cludes <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong> related to<br />

<str<strong>on</strong>g>in</str<strong>on</strong>g>dividual works. With<str<strong>on</strong>g>in</str<strong>on</strong>g> the manuscript group, <str<strong>on</strong>g>in</str<strong>on</strong>g>dividual data objects <str<strong>on</strong>g>in</str<strong>on</strong>g>clude digital page images (of<br />

complete or partial manuscripts) that are aggregated for each manuscript, codicological descripti<strong>on</strong>s<br />

that reference page images when available, <strong>and</strong> vary<str<strong>on</strong>g>in</str<strong>on</strong>g>g levels of transcripti<strong>on</strong> data. In terms of works,<br />

this subgroup encompasses a wide range of materials referr<str<strong>on</strong>g>in</str<strong>on</strong>g>g “to a source text with its entire set of<br />

manuscripts rather than to <strong>on</strong>e particular witness” <strong>and</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g>cludes full critical editi<strong>on</strong>s, translati<strong>on</strong>s, <strong>and</strong><br />

commentaries (Deckers et al. 2009). The three other major categories of digital object that are created<br />

are biographical dicti<strong>on</strong>aries, bibliographical data, <strong>and</strong> published research papers. Because of the<br />

heterogeneous nature of these data, <strong>on</strong>ly the manuscript descripti<strong>on</strong>s <strong>and</strong> transcripti<strong>on</strong>s could be<br />

encoded accord<str<strong>on</strong>g>in</str<strong>on</strong>g>g to TEI P5 XML.<br />

Because the creators of Teuchos hope to provide scholars with advanced search<str<strong>on</strong>g>in</str<strong>on</strong>g>g <strong>and</strong> edit<str<strong>on</strong>g>in</str<strong>on</strong>g>g<br />

functi<strong>on</strong>ality, they have developed a data model for both the physical <strong>and</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g>tellectual c<strong>on</strong>tent of<br />

manuscripts <str<strong>on</strong>g>in</str<strong>on</strong>g> their platform. While not all of the descriptive material <str<strong>on</strong>g>in</str<strong>on</strong>g> Teuchos <str<strong>on</strong>g>in</str<strong>on</strong>g>cludes digital<br />

images of manuscripts, all the digital images that are <str<strong>on</strong>g>in</str<strong>on</strong>g>cluded have accompany<str<strong>on</strong>g>in</str<strong>on</strong>g>g descriptive <strong>and</strong><br />

authority metadata. All manuscripts with digital images also have a corresp<strong>on</strong>d<str<strong>on</strong>g>in</str<strong>on</strong>g>g reference document<br />

that makes use of the TEI element <strong>and</strong> a list of elements with unique identifiers<br />

<str<strong>on</strong>g>in</str<strong>on</strong>g> the form of xml:id attributes <strong>and</strong> unambiguous labels for pages us<str<strong>on</strong>g>in</str<strong>on</strong>g>g the “n” attribute. The<br />

elements are listed <str<strong>on</strong>g>in</str<strong>on</strong>g> the physical order of the manuscript, <strong>and</strong> miss<str<strong>on</strong>g>in</str<strong>on</strong>g>g pages are<br />

represented with empty elements.<br />

To facilitate user access to <str<strong>on</strong>g>in</str<strong>on</strong>g>dividual page images, Teuchos provides at least a m<str<strong>on</strong>g>in</str<strong>on</strong>g>imal transcripti<strong>on</strong><br />

for each manuscript (e.g., it may simply c<strong>on</strong>ta<str<strong>on</strong>g>in</str<strong>on</strong>g> page-break <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong> <strong>and</strong> no textual transcripti<strong>on</strong>)<br />

that c<strong>on</strong>ta<str<strong>on</strong>g>in</str<strong>on</strong>g>s structural <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong> that “can be used to offer alternate representati<strong>on</strong>s <strong>and</strong> improved<br />

navigati<strong>on</strong> for brows<str<strong>on</strong>g>in</str<strong>on</strong>g>g, <strong>and</strong> to give a clearer <str<strong>on</strong>g>in</str<strong>on</strong>g>dicati<strong>on</strong> of the part of the text to which an image<br />

viewed perta<str<strong>on</strong>g>in</str<strong>on</strong>g>s” (Deckers et al. 2009). These data are then encoded with<str<strong>on</strong>g>in</str<strong>on</strong>g> TEI elements.<br />

While elements with “corresp” attributes that po<str<strong>on</strong>g>in</str<strong>on</strong>g>t to unique page identifiers are used to<br />

reference digital images of <str<strong>on</strong>g>in</str<strong>on</strong>g>dividual manuscript pages, the element is used to separately encode<br />

foliati<strong>on</strong> or pag<str<strong>on</strong>g>in</str<strong>on</strong>g>ati<strong>on</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong>. This separate encod<str<strong>on</strong>g>in</str<strong>on</strong>g>g is important, Deckers et al. reported,<br />

because it “permits record<str<strong>on</strong>g>in</str<strong>on</strong>g>g whether numbers provided by the transcriber are actually present <strong>on</strong> the<br />

page or not” <strong>and</strong> also supports “record<str<strong>on</strong>g>in</str<strong>on</strong>g>g more than <strong>on</strong>e such reference system,” a particularly<br />

important issue, s<str<strong>on</strong>g>in</str<strong>on</strong>g>ce many manuscripts can have multiple foliati<strong>on</strong> systems.<br />

528 A beta versi<strong>on</strong> of this applicati<strong>on</strong> is available at http://beta.teuchos.uni-hamburg.de/TeuchosWebUI/teuchos-web-ui

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!