26.12.2014 Views

Rome Wasn't Digitized in a Day - Council on Library and Information ...



Philologists use digital corpora but they must verify results on printed editions, in order to evaluate if the text retrieved is attested in every manuscript, only in the codex optimus, in an error prone family of manuscripts, in a scholium, in the indirect tradition or if it is conjectured by a modern scholar. In short, the text of the reference edition has no scientific value without the apparatus. … (Boschetti 2007).

He noted that two exceptions to this phenomenon are the Homer Multitext Project and Musisque Deoque,¹⁴⁰ both of which are seeking to enrich the corpora they create with variants and conjectures.

Boschetti described two basic methods for adding critical apparatuses to digital critical editions. The first method is based on the automatic collation of diplomatic editions, where digital diplomatic editions are defined as “complete transcriptions of single manuscripts” with encoded information about text layout and position (typically encoded in TEI-XML). In agreement with Monella (2008), Boschetti commented that one of the most useful features of markup such as TEI is that it makes it “possible to separate the actual text of the manuscript from its interpretations.” This method is particularly useful, Boschetti argued, for texts with a limited number of manuscripts. The second method involves the manual filling of forms by data entry operators, an approach utilized by Musisque Deoque that, according to Boschetti, is “useful if the aim is the acquisition of large amounts of apparatus’ information, on many texts of different authors.” Both approaches have shortcomings, Boschetti pointed out: the collation of diplomatic editions must be integrated with other techniques, and manual form filling is subject to human error.
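As a rough illustration of the first method, automatic collation can be approximated by aligning the words of two transcriptions and reporting where they diverge. The sketch below is only a minimal example built on Python's standard `difflib` module, not the procedure used by Boschetti or by any of the projects named here. The function name `collate` and the sample readings are invented for illustration, and a real collation would work on TEI-encoded transcriptions rather than plain strings.

```python
from difflib import SequenceMatcher


def collate(base: str, witness: str):
    """Align two transcriptions word by word and list their differences.

    Returns tuples (operation, position_in_base, base_words, witness_words),
    where operation is "replace", "delete", or "insert".
    """
    base_words = base.split()
    witness_words = witness.split()
    matcher = SequenceMatcher(a=base_words, b=witness_words, autojunk=False)
    differences = []
    for tag, i1, i2, j1, j2 in matcher.get_opcodes():
        if tag == "equal":
            continue  # both witnesses agree here; nothing to record
        differences.append((tag, i1, base_words[i1:i2], witness_words[j1:j2]))
    return differences


# Invented sample: a reference text and a hypothetical witness with two
# orthographic variants.
reference = "arma virumque cano troiae qui primus ab oris"
witness_m = "arma virumque cano troie qui primus ab horis"

for difference in collate(reference, witness_m):
    print(difference)
```

Word-level alignment of this kind reports substitutions, deletions, and insertions, but it does not recognize transpositions, which is one reason such a step would need to be combined with other techniques.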

Neither approach would be feasible for an author like Aeschylus, whose work is accompanied by an extensive body of secondary analysis and large numbers of conjectures registered in various commentaries and reviews. Boschetti thus proposed a third approach combining automatic collation and information extraction:

The automatic parsing of apparatuses and repertories, in addition to the automatic collation for a group of relevant diplomatic transcriptions, should be an acceptable trade-off. Subjective choices by operators in this case is limited to the correction phases. This third approach has a double goal: on one hand it aims to parse automatically existing critical apparatuses and repertories of conjectures of Aeschylus and on the other hand it aims to discover heuristics useful for any collection of variants and/or conjectures with a similar structure (Boschetti 2007).
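To make the idea of automatically parsing an apparatus more concrete, the following sketch parses one entry in a deliberately simplified, hypothetical notation: a verse number, the word in the reference text followed by `]`, and then one or more single-word readings separated by colons, each followed by its sources. The notation, the alias table, and the rule that single capital letters are manuscript sigla while longer names are scholars are all assumptions made for this example. Real apparatuses are far less regular, and this is not a reconstruction of Boschetti's parser.

```python
import re
from dataclasses import dataclass

# Hypothetical alias table: maps abbreviated scholar names to one canonical form.
ALIASES = {"Herm.": "Hermann", "Wil.": "Wilamowitz"}

# Assumed entry shape: "<verse> <lemma>] <reading> <sources> : <reading> <sources>"
ENTRY = re.compile(r"^(?P<verse>\d+)\s+(?P<lemma>[^\]]+)\]\s*(?P<rest>.+)$")


@dataclass
class Reading:
    verse: int
    lemma: str      # word in the reference edition
    text: str       # reading proposed in its place
    sources: list   # manuscript sigla or scholar names
    kind: str       # "variant" (manuscripts) or "conjecture" (scholars)


def parse_entry(line: str) -> list:
    """Parse one simplified apparatus entry into a list of Reading objects."""
    match = ENTRY.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized apparatus entry: {line!r}")
    verse = int(match["verse"])
    lemma = match["lemma"].strip()
    readings = []
    for chunk in match["rest"].split(":"):
        words = chunk.split()
        if len(words) < 2:
            continue  # a reading without a source is skipped in this sketch
        text = words[0]
        sources = [ALIASES.get(name, name) for name in words[1:]]
        # Assumption: single capital letters are manuscript sigla.
        is_variant = all(len(s) == 1 and s.isupper() for s in sources)
        kind = "variant" if is_variant else "conjecture"
        readings.append(Reading(verse, lemma, text, sources, kind))
    return readings


# Invented entry: a variant in manuscripts M and F, and a conjecture.
for reading in parse_entry("12 oris] horis M F : auris Herm."):
    print(reading)
```

Entries that do not fit the assumed pattern raise an error instead of being guessed at, which keeps operator intervention confined to a correction phase.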

Boschetti designed a complete methodology that began with the automatic collation of three reference editions for Aeschylus so that there would be a unified reference edition on which to map the apparatuses and repertories of conjectures. The second step was to conduct a manual survey of various apparatuses in order to identify typical structures (e.g., verse number, reading to substitute a word in text, manuscript, and scholar). After identifying references to verses, Boschetti developed a typology of readings and sources for the information in the apparatus. He noted that the most frequent case involved readings where an orthographic or a morphological variant would “substitute a single word in the reference edition.” The most common other operations were deletion, addition, and transposition of text. The sources were typically “one or more manuscripts for variants” and “one or more scholars for conjectures,” occasionally followed by accurate bibliographical information. One major difficulty, Boschetti noted, was that the same manuscript or author could be abbreviated differently in

140 Musisque Deoque is “a digital archive of Latin poetry, from its origins to the Italian Renaissance” that was established in 2005 and is creating a Latin poetry database that is “supplemented and updated with critical apparatus and exegetical equipments.” http://www.mqdq.it/mqdq/home.jsplingua=en.
