26.12.2014 Views

Rome Wasn't Digitized in a Day - Council on Library and Information ...

Rome Wasn't Digitized in a Day - Council on Library and Information ...

Rome Wasn't Digitized in a Day - Council on Library and Information ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

147<br />

comp<strong>on</strong>ent. While C<strong>on</strong>cordia <strong>and</strong> LaQuAT seek to <str<strong>on</strong>g>in</str<strong>on</strong>g>tegrate papyri collecti<strong>on</strong>s with other digital<br />

classical resources, such as epigraphical databases, <str<strong>on</strong>g>in</str<strong>on</strong>g>to larger “virtual” collecti<strong>on</strong>s that can be<br />

simultaneously searched, eAQUA <strong>and</strong> eSAD are develop<str<strong>on</strong>g>in</str<strong>on</strong>g>g technologies to assist papyrologists <str<strong>on</strong>g>in</str<strong>on</strong>g> the<br />

<str<strong>on</strong>g>in</str<strong>on</strong>g>terpretati<strong>on</strong> of their ancient texts.<br />

Focused exclusively <strong>on</strong> papyri collecti<strong>on</strong>s, the IDP project (Sos<str<strong>on</strong>g>in</str<strong>on</strong>g> et al. 2007, Sos<str<strong>on</strong>g>in</str<strong>on</strong>g> et al. 2008), 497<br />

which is a jo<str<strong>on</strong>g>in</str<strong>on</strong>g>t effort of the oldest digital resource <str<strong>on</strong>g>in</str<strong>on</strong>g> papyrology the DDbDP, the HGV, <strong>and</strong> the APIS,<br />

is work<str<strong>on</strong>g>in</str<strong>on</strong>g>g to create a s<str<strong>on</strong>g>in</str<strong>on</strong>g>gle <str<strong>on</strong>g>in</str<strong>on</strong>g>terface to these three collecti<strong>on</strong>s, a project that has largely been realized<br />

through the creati<strong>on</strong> of the Papyrological Navigator (PN). 498 Active research <strong>on</strong> improv<str<strong>on</strong>g>in</str<strong>on</strong>g>g the PN is<br />

<strong>on</strong>go<str<strong>on</strong>g>in</str<strong>on</strong>g>g, as illustrated by a recent blog post by Hugh Cayless (Cayless 2010c). One particular<br />

comp<strong>on</strong>ent of the PN that he has recently improved is a service that provides “lookup of identifiers” of<br />

papyri <str<strong>on</strong>g>in</str<strong>on</strong>g> <strong>on</strong>e collecti<strong>on</strong> <strong>and</strong> “correlates them with related records <str<strong>on</strong>g>in</str<strong>on</strong>g> other collecti<strong>on</strong>s.” While this<br />

service was orig<str<strong>on</strong>g>in</str<strong>on</strong>g>ally based <strong>on</strong> a Lucene-based numbers server, Cayless is work<str<strong>on</strong>g>in</str<strong>on</strong>g>g to replace it with a<br />

RDF triplestore. One particular challenge is that of data <str<strong>on</strong>g>in</str<strong>on</strong>g>tegrati<strong>on</strong> <strong>and</strong> the difficulties of model<str<strong>on</strong>g>in</str<strong>on</strong>g>g the<br />

relati<strong>on</strong>ships between the same items <str<strong>on</strong>g>in</str<strong>on</strong>g> different databases. The complicated nature of these<br />

relati<strong>on</strong>ships <str<strong>on</strong>g>in</str<strong>on</strong>g>cludes several dimensi<strong>on</strong>s, such as different levels of hierarchy <str<strong>on</strong>g>in</str<strong>on</strong>g> database structures<br />

<strong>and</strong> various FRBR type relati<strong>on</strong>ships (e.g., the ancient document is the work but then it has various<br />

expressi<strong>on</strong>s <str<strong>on</strong>g>in</str<strong>on</strong>g> different pr<str<strong>on</strong>g>in</str<strong>on</strong>g>ted editi<strong>on</strong>s (<str<strong>on</strong>g>in</str<strong>on</strong>g>clud<str<strong>on</strong>g>in</str<strong>on</strong>g>g translati<strong>on</strong>s), <strong>and</strong> each of those editi<strong>on</strong>s has various<br />

manifestati<strong>on</strong>s (HTML, EpiDoc transcripti<strong>on</strong>s, etc.). In additi<strong>on</strong>, while the papyrological items <strong>and</strong><br />

their metadata <str<strong>on</strong>g>in</str<strong>on</strong>g> different databases can sometimes have a 1:1 relati<strong>on</strong>ship (such as is usually the case<br />

between the DDbDP <strong>and</strong> the HGV) there can also be overlap (such as between the APIS <strong>and</strong> the other<br />

two databases). Each database also has complicated <str<strong>on</strong>g>in</str<strong>on</strong>g>ternal relati<strong>on</strong>ships; for example, although the<br />

HGV utilizes the idea of a “pr<str<strong>on</strong>g>in</str<strong>on</strong>g>cipal editi<strong>on</strong>” <strong>and</strong> chooses a s<str<strong>on</strong>g>in</str<strong>on</strong>g>gle can<strong>on</strong>ical publicati<strong>on</strong> of a papyrus,<br />

it also <str<strong>on</strong>g>in</str<strong>on</strong>g>cludes other earlier publicati<strong>on</strong>s of the same papyrus <str<strong>on</strong>g>in</str<strong>on</strong>g> its metadata. The DDbDP follows the<br />

same basic idea but creates a new record that l<str<strong>on</strong>g>in</str<strong>on</strong>g>ks to stub records for the older editi<strong>on</strong>s of each<br />

papyrus.<br />

To better represent the complexity of these relati<strong>on</strong>ships, Cayless graphed them <str<strong>on</strong>g>in</str<strong>on</strong>g> Mulgara 499 (a<br />

scalable RDF database that is based <strong>on</strong> Java), so that he could use SPARQL queries to fetch data <strong>and</strong><br />

then map these to easily retrievable <strong>and</strong> citable URLs that follow a st<strong>and</strong>ard pattern. Results from<br />

SPARQL queries will also be made available as Notati<strong>on</strong>3 500 <strong>and</strong> JSON formats to create both humanreadable<br />

<strong>and</strong> -usable mach<str<strong>on</strong>g>in</str<strong>on</strong>g>e <str<strong>on</strong>g>in</str<strong>on</strong>g>terfaces to the data available through the PN. Cayless also reported<br />

that he was look<str<strong>on</strong>g>in</str<strong>on</strong>g>g <str<strong>on</strong>g>in</str<strong>on</strong>g>to us<str<strong>on</strong>g>in</str<strong>on</strong>g>g the DC TERMS vocabulary as well as other relevant <strong>on</strong>tologies such as<br />

the FRBR vocabulary. 501 Ultimately, Cayless hoped to l<str<strong>on</strong>g>in</str<strong>on</strong>g>k the bibliography <str<strong>on</strong>g>in</str<strong>on</strong>g> <str<strong>on</strong>g>in</str<strong>on</strong>g>dividual papyrus<br />

records to Zotero 502 <strong>and</strong> to ancient places names <str<strong>on</strong>g>in</str<strong>on</strong>g> Pleiades. “It all works well with my design<br />

philosophy for papyri.<str<strong>on</strong>g>in</str<strong>on</strong>g>fo,” Cayless c<strong>on</strong>cluded, “which is that it should c<strong>on</strong>sist of data (<str<strong>on</strong>g>in</str<strong>on</strong>g> the form of<br />

EpiDoc source files <strong>and</strong> representati<strong>on</strong>s of those files), retrievable via sensible URLs, with modular<br />

services surround<str<strong>on</strong>g>in</str<strong>on</strong>g>g the data to make it discoverable <strong>and</strong> usable.”<br />

A recent article by Roger Bagnall has offered an <str<strong>on</strong>g>in</str<strong>on</strong>g>-depth discussi<strong>on</strong> of the IDP project. As he<br />

expla<str<strong>on</strong>g>in</str<strong>on</strong>g>ed, the goals of the IDP have changed s<str<strong>on</strong>g>in</str<strong>on</strong>g>ce it was first c<strong>on</strong>ceptualized <str<strong>on</strong>g>in</str<strong>on</strong>g> 1992 <str<strong>on</strong>g>in</str<strong>on</strong>g> two specific<br />

ways:<br />

497 http://idp.atlantides.org/trac/idp/wiki/<br />

498 http://www.papyri.<str<strong>on</strong>g>in</str<strong>on</strong>g>fo<br />

499 http://www.mulgara.org/<br />

500 Notati<strong>on</strong>3 or N3 is a “shorth<strong>and</strong> n<strong>on</strong>-XML serializati<strong>on</strong> of Resource Descripti<strong>on</strong> Framework models, designed with human-readability <str<strong>on</strong>g>in</str<strong>on</strong>g> m<str<strong>on</strong>g>in</str<strong>on</strong>g>d.”<br />

http://en.wikipedia.org/wiki/Notati<strong>on</strong>3<br />

501 http://vocab.org/frbr/core.html<br />

502 http://www.zotero.org/

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!