26.12.2014 Views

Rome Wasn't Digitized in a Day - Council on Library and Information ...

Rome Wasn't Digitized in a Day - Council on Library and Information ...

Rome Wasn't Digitized in a Day - Council on Library and Information ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

159<br />

was aimed at two types of users: general users of libraries who wished to exam<str<strong>on</strong>g>in</str<strong>on</strong>g>e manuscripts, <strong>and</strong><br />

“professi<strong>on</strong>al students of texts” or philologists, whom they def<str<strong>on</strong>g>in</str<strong>on</strong>g>ed as “critical editors of classical or<br />

medieval works that are h<strong>and</strong>-written <strong>on</strong> material supports of various types (paper, papyrus, st<strong>on</strong>e)”<br />

(Bozzi <strong>and</strong> Calabretto 1997). The authors thus developed a “philological workstati<strong>on</strong>” that <str<strong>on</strong>g>in</str<strong>on</strong>g>cluded<br />

four major features: (1) the ability to look up digital images <str<strong>on</strong>g>in</str<strong>on</strong>g> an archive; (2) the transcripti<strong>on</strong>,<br />

annotati<strong>on</strong>, <strong>and</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g>dex<str<strong>on</strong>g>in</str<strong>on</strong>g>g of images; (3) the view<str<strong>on</strong>g>in</str<strong>on</strong>g>g of transcribed versi<strong>on</strong>s of texts <strong>and</strong> creat<str<strong>on</strong>g>in</str<strong>on</strong>g>g an<br />

“Index Locorum”; <strong>and</strong> (4) the automatic match<str<strong>on</strong>g>in</str<strong>on</strong>g>g of words found <str<strong>on</strong>g>in</str<strong>on</strong>g> transcripti<strong>on</strong>s, the “Index<br />

Locorum,” <strong>and</strong> annotati<strong>on</strong>s with the relevant porti<strong>on</strong> of the source-document image that c<strong>on</strong>ta<str<strong>on</strong>g>in</str<strong>on</strong>g>s the<br />

word. This last feature, while desired by many other digital editi<strong>on</strong> <strong>and</strong> manuscript projects, is still an<br />

area of unresolved <strong>and</strong> active research (Cayless 2008, Cayless 2009, Porter et al. 2009).<br />

In an overview of their philological workstati<strong>on</strong>, Bozzi <strong>and</strong> Calabretto listed the functi<strong>on</strong>s that it<br />

supported. To beg<str<strong>on</strong>g>in</str<strong>on</strong>g> with, the workstati<strong>on</strong> allowed users to search manuscript collecti<strong>on</strong>s <strong>and</strong> to create<br />

transcripti<strong>on</strong>s of digital images of manuscripts <strong>and</strong> export them as RTF or SGML. One important<br />

feature was the <str<strong>on</strong>g>in</str<strong>on</strong>g>dex<str<strong>on</strong>g>in</str<strong>on</strong>g>g of transcripti<strong>on</strong>s that could be used by philologists to generate an “Index<br />

Verborum” <strong>and</strong> an “Index Locorum” for each script <str<strong>on</strong>g>in</str<strong>on</strong>g> the manuscript (e.g., Greek <strong>and</strong> Lat<str<strong>on</strong>g>in</str<strong>on</strong>g>). The<br />

“Index Verborum” c<strong>on</strong>ta<str<strong>on</strong>g>in</str<strong>on</strong>g>ed all the words appear<str<strong>on</strong>g>in</str<strong>on</strong>g>g <str<strong>on</strong>g>in</str<strong>on</strong>g> the transcripti<strong>on</strong> <strong>and</strong> the words that were<br />

corrected by the user (us<str<strong>on</strong>g>in</str<strong>on</strong>g>g the text variant functi<strong>on</strong>), while the “Index Locorum” displayed “the<br />

positi<strong>on</strong>s <str<strong>on</strong>g>in</str<strong>on</strong>g> which each word occurs <str<strong>on</strong>g>in</str<strong>on</strong>g> the manuscript.” In additi<strong>on</strong>, annotati<strong>on</strong>s could be created <strong>on</strong><br />

manuscript transcripti<strong>on</strong>s, <strong>and</strong> all annotati<strong>on</strong>s c<strong>on</strong>ta<str<strong>on</strong>g>in</str<strong>on</strong>g>ed two dist<str<strong>on</strong>g>in</str<strong>on</strong>g>ct fields, <strong>on</strong>e for free comments <strong>and</strong><br />

the critical apparatus, <strong>and</strong> <strong>on</strong>e for variants, syn<strong>on</strong>yms, <strong>and</strong> the correcti<strong>on</strong> of syntax. The BAMBI<br />

workstati<strong>on</strong> also supported automatic column <strong>and</strong> l<str<strong>on</strong>g>in</str<strong>on</strong>g>e recogniti<strong>on</strong> <strong>and</strong>, even more important, the<br />

automatic creati<strong>on</strong> of a word-image c<strong>on</strong>cordance (if a transcripti<strong>on</strong> for a manuscript was available) that<br />

matches each word of the text with the appropriate porti<strong>on</strong> of the image. The c<strong>on</strong>cordance was built<br />

automatically, <strong>and</strong> this module provided a simultaneous view of the transcripti<strong>on</strong> <strong>and</strong> the image so the<br />

user could check its accuracy. It also allowed the user to query the manuscript collecti<strong>on</strong> by select<str<strong>on</strong>g>in</str<strong>on</strong>g>g a<br />

word <str<strong>on</strong>g>in</str<strong>on</strong>g> either the transcripti<strong>on</strong> or <strong>on</strong> the image. The BAMBI prototype made use of HyTime (an<br />

extensi<strong>on</strong> of SGML) to model works <strong>on</strong> ancient manuscripts, <str<strong>on</strong>g>in</str<strong>on</strong>g> particular because it allowed<br />

“specificati<strong>on</strong> of l<str<strong>on</strong>g>in</str<strong>on</strong>g>ks between text <strong>and</strong> part of image (part of an object).”<br />

While the fuller technical details of this workstati<strong>on</strong> are somewhat outdated as of this writ<str<strong>on</strong>g>in</str<strong>on</strong>g>g, the<br />

unanswered issues identified by the BAMBI project are still largely relevant for digital philology.<br />

Bozzi <strong>and</strong> Calabretto noted that the follow<str<strong>on</strong>g>in</str<strong>on</strong>g>g requirements needed to be met: better st<strong>and</strong>ards-based<br />

tools for the descripti<strong>on</strong> of manuscripts; more sophisticated image-process<str<strong>on</strong>g>in</str<strong>on</strong>g>g rout<str<strong>on</strong>g>in</str<strong>on</strong>g>es (although they<br />

called for the enhancement of microfilm images rather than the images of manuscripts themselves); “a<br />

comprehensive soluti<strong>on</strong> for the management of text variants”; “tools based <strong>on</strong> image process<str<strong>on</strong>g>in</str<strong>on</strong>g>g<br />

facilities <strong>and</strong> l<str<strong>on</strong>g>in</str<strong>on</strong>g>guistic (statistical) facilities for the electr<strong>on</strong>ic restorati<strong>on</strong> of miss<str<strong>on</strong>g>in</str<strong>on</strong>g>g text elements”;<br />

new models for collaborative work (though work today has moved bey<strong>on</strong>d client-server models based<br />

<strong>on</strong> the web); <strong>and</strong> a survey of the technical <strong>and</strong> legal issues <str<strong>on</strong>g>in</str<strong>on</strong>g>volved <str<strong>on</strong>g>in</str<strong>on</strong>g> creat<str<strong>on</strong>g>in</str<strong>on</strong>g>g “widespread, multisource<br />

services offer<str<strong>on</strong>g>in</str<strong>on</strong>g>g digital versi<strong>on</strong>s of library materials <strong>and</strong> the tools for their use” (Bozzi <strong>and</strong><br />

Calabretto 1997). As has been seen <str<strong>on</strong>g>in</str<strong>on</strong>g> this review, the challenges of manuscript descripti<strong>on</strong>, advanced<br />

image process<str<strong>on</strong>g>in</str<strong>on</strong>g>g, the management of text variants, the creati<strong>on</strong> of sophisticated digital tools,<br />

collaborative workspaces, <strong>and</strong> comprehensive open-source digital libraries rema<str<strong>on</strong>g>in</str<strong>on</strong>g> topics of c<strong>on</strong>cern.<br />

Other research <str<strong>on</strong>g>in</str<strong>on</strong>g> digital philology has been c<strong>on</strong>ducted by the Aristarchus project, 519 <strong>and</strong> an article by<br />

Franco M<strong>on</strong>tanari (M<strong>on</strong>tanari 2004) has provided an overview of the electr<strong>on</strong>ic tools for classical<br />

519 http://www.aristarchus.unige.it/<str<strong>on</strong>g>in</str<strong>on</strong>g>dex_<str<strong>on</strong>g>in</str<strong>on</strong>g>glese.php

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!