Rome Wasn't Digitized in a Day - Council on Library and Information ...
Rome Wasn't Digitized in a Day - Council on Library and Information ...
Rome Wasn't Digitized in a Day - Council on Library and Information ...
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
159<br />
was aimed at two types of users: general users of libraries who wished to exam<str<strong>on</strong>g>in</str<strong>on</strong>g>e manuscripts, <strong>and</strong><br />
“professi<strong>on</strong>al students of texts” or philologists, whom they def<str<strong>on</strong>g>in</str<strong>on</strong>g>ed as “critical editors of classical or<br />
medieval works that are h<strong>and</strong>-written <strong>on</strong> material supports of various types (paper, papyrus, st<strong>on</strong>e)”<br />
(Bozzi <strong>and</strong> Calabretto 1997). The authors thus developed a “philological workstati<strong>on</strong>” that <str<strong>on</strong>g>in</str<strong>on</strong>g>cluded<br />
four major features: (1) the ability to look up digital images <str<strong>on</strong>g>in</str<strong>on</strong>g> an archive; (2) the transcripti<strong>on</strong>,<br />
annotati<strong>on</strong>, <strong>and</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g>dex<str<strong>on</strong>g>in</str<strong>on</strong>g>g of images; (3) the view<str<strong>on</strong>g>in</str<strong>on</strong>g>g of transcribed versi<strong>on</strong>s of texts <strong>and</strong> creat<str<strong>on</strong>g>in</str<strong>on</strong>g>g an<br />
“Index Locorum”; <strong>and</strong> (4) the automatic match<str<strong>on</strong>g>in</str<strong>on</strong>g>g of words found <str<strong>on</strong>g>in</str<strong>on</strong>g> transcripti<strong>on</strong>s, the “Index<br />
Locorum,” <strong>and</strong> annotati<strong>on</strong>s with the relevant porti<strong>on</strong> of the source-document image that c<strong>on</strong>ta<str<strong>on</strong>g>in</str<strong>on</strong>g>s the<br />
word. This last feature, while desired by many other digital editi<strong>on</strong> <strong>and</strong> manuscript projects, is still an<br />
area of unresolved <strong>and</strong> active research (Cayless 2008, Cayless 2009, Porter et al. 2009).<br />
In an overview of their philological workstati<strong>on</strong>, Bozzi <strong>and</strong> Calabretto listed the functi<strong>on</strong>s that it<br />
supported. To beg<str<strong>on</strong>g>in</str<strong>on</strong>g> with, the workstati<strong>on</strong> allowed users to search manuscript collecti<strong>on</strong>s <strong>and</strong> to create<br />
transcripti<strong>on</strong>s of digital images of manuscripts <strong>and</strong> export them as RTF or SGML. One important<br />
feature was the <str<strong>on</strong>g>in</str<strong>on</strong>g>dex<str<strong>on</strong>g>in</str<strong>on</strong>g>g of transcripti<strong>on</strong>s that could be used by philologists to generate an “Index<br />
Verborum” <strong>and</strong> an “Index Locorum” for each script <str<strong>on</strong>g>in</str<strong>on</strong>g> the manuscript (e.g., Greek <strong>and</strong> Lat<str<strong>on</strong>g>in</str<strong>on</strong>g>). The<br />
“Index Verborum” c<strong>on</strong>ta<str<strong>on</strong>g>in</str<strong>on</strong>g>ed all the words appear<str<strong>on</strong>g>in</str<strong>on</strong>g>g <str<strong>on</strong>g>in</str<strong>on</strong>g> the transcripti<strong>on</strong> <strong>and</strong> the words that were<br />
corrected by the user (us<str<strong>on</strong>g>in</str<strong>on</strong>g>g the text variant functi<strong>on</strong>), while the “Index Locorum” displayed “the<br />
positi<strong>on</strong>s <str<strong>on</strong>g>in</str<strong>on</strong>g> which each word occurs <str<strong>on</strong>g>in</str<strong>on</strong>g> the manuscript.” In additi<strong>on</strong>, annotati<strong>on</strong>s could be created <strong>on</strong><br />
manuscript transcripti<strong>on</strong>s, <strong>and</strong> all annotati<strong>on</strong>s c<strong>on</strong>ta<str<strong>on</strong>g>in</str<strong>on</strong>g>ed two dist<str<strong>on</strong>g>in</str<strong>on</strong>g>ct fields, <strong>on</strong>e for free comments <strong>and</strong><br />
the critical apparatus, <strong>and</strong> <strong>on</strong>e for variants, syn<strong>on</strong>yms, <strong>and</strong> the correcti<strong>on</strong> of syntax. The BAMBI<br />
workstati<strong>on</strong> also supported automatic column <strong>and</strong> l<str<strong>on</strong>g>in</str<strong>on</strong>g>e recogniti<strong>on</strong> <strong>and</strong>, even more important, the<br />
automatic creati<strong>on</strong> of a word-image c<strong>on</strong>cordance (if a transcripti<strong>on</strong> for a manuscript was available) that<br />
matches each word of the text with the appropriate porti<strong>on</strong> of the image. The c<strong>on</strong>cordance was built<br />
automatically, <strong>and</strong> this module provided a simultaneous view of the transcripti<strong>on</strong> <strong>and</strong> the image so the<br />
user could check its accuracy. It also allowed the user to query the manuscript collecti<strong>on</strong> by select<str<strong>on</strong>g>in</str<strong>on</strong>g>g a<br />
word <str<strong>on</strong>g>in</str<strong>on</strong>g> either the transcripti<strong>on</strong> or <strong>on</strong> the image. The BAMBI prototype made use of HyTime (an<br />
extensi<strong>on</strong> of SGML) to model works <strong>on</strong> ancient manuscripts, <str<strong>on</strong>g>in</str<strong>on</strong>g> particular because it allowed<br />
“specificati<strong>on</strong> of l<str<strong>on</strong>g>in</str<strong>on</strong>g>ks between text <strong>and</strong> part of image (part of an object).”<br />
While the fuller technical details of this workstati<strong>on</strong> are somewhat outdated as of this writ<str<strong>on</strong>g>in</str<strong>on</strong>g>g, the<br />
unanswered issues identified by the BAMBI project are still largely relevant for digital philology.<br />
Bozzi <strong>and</strong> Calabretto noted that the follow<str<strong>on</strong>g>in</str<strong>on</strong>g>g requirements needed to be met: better st<strong>and</strong>ards-based<br />
tools for the descripti<strong>on</strong> of manuscripts; more sophisticated image-process<str<strong>on</strong>g>in</str<strong>on</strong>g>g rout<str<strong>on</strong>g>in</str<strong>on</strong>g>es (although they<br />
called for the enhancement of microfilm images rather than the images of manuscripts themselves); “a<br />
comprehensive soluti<strong>on</strong> for the management of text variants”; “tools based <strong>on</strong> image process<str<strong>on</strong>g>in</str<strong>on</strong>g>g<br />
facilities <strong>and</strong> l<str<strong>on</strong>g>in</str<strong>on</strong>g>guistic (statistical) facilities for the electr<strong>on</strong>ic restorati<strong>on</strong> of miss<str<strong>on</strong>g>in</str<strong>on</strong>g>g text elements”;<br />
new models for collaborative work (though work today has moved bey<strong>on</strong>d client-server models based<br />
<strong>on</strong> the web); <strong>and</strong> a survey of the technical <strong>and</strong> legal issues <str<strong>on</strong>g>in</str<strong>on</strong>g>volved <str<strong>on</strong>g>in</str<strong>on</strong>g> creat<str<strong>on</strong>g>in</str<strong>on</strong>g>g “widespread, multisource<br />
services offer<str<strong>on</strong>g>in</str<strong>on</strong>g>g digital versi<strong>on</strong>s of library materials <strong>and</strong> the tools for their use” (Bozzi <strong>and</strong><br />
Calabretto 1997). As has been seen <str<strong>on</strong>g>in</str<strong>on</strong>g> this review, the challenges of manuscript descripti<strong>on</strong>, advanced<br />
image process<str<strong>on</strong>g>in</str<strong>on</strong>g>g, the management of text variants, the creati<strong>on</strong> of sophisticated digital tools,<br />
collaborative workspaces, <strong>and</strong> comprehensive open-source digital libraries rema<str<strong>on</strong>g>in</str<strong>on</strong>g> topics of c<strong>on</strong>cern.<br />
Other research <str<strong>on</strong>g>in</str<strong>on</strong>g> digital philology has been c<strong>on</strong>ducted by the Aristarchus project, 519 <strong>and</strong> an article by<br />
Franco M<strong>on</strong>tanari (M<strong>on</strong>tanari 2004) has provided an overview of the electr<strong>on</strong>ic tools for classical<br />
519 http://www.aristarchus.unige.it/<str<strong>on</strong>g>in</str<strong>on</strong>g>dex_<str<strong>on</strong>g>in</str<strong>on</strong>g>glese.php