Rome Wasn't Digitized in a Day - Council on Library and Information ...
Rome Wasn't Digitized in a Day - Council on Library and Information ...
Rome Wasn't Digitized in a Day - Council on Library and Information ...
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
129<br />
As part of this phase, the VMR at the University of Birm<str<strong>on</strong>g>in</str<strong>on</strong>g>gham 410 also plans to jo<str<strong>on</strong>g>in</str<strong>on</strong>g> with a parallel<br />
VMR be<str<strong>on</strong>g>in</str<strong>on</strong>g>g built at the University of Münster <str<strong>on</strong>g>in</str<strong>on</strong>g> Germany <strong>and</strong> provide seamless access to both<br />
collecti<strong>on</strong>s. Four features dist<str<strong>on</strong>g>in</str<strong>on</strong>g>guish Interediti<strong>on</strong> from previous manuscript digitizati<strong>on</strong> projects: (1) it<br />
is designed around granular metadata, so <str<strong>on</strong>g>in</str<strong>on</strong>g>stead of simply present<str<strong>on</strong>g>in</str<strong>on</strong>g>g metadata records for whole<br />
manuscripts, records are provided for each page image, for the transcripti<strong>on</strong> of the text <strong>on</strong> that page,<br />
<strong>and</strong> for specify<str<strong>on</strong>g>in</str<strong>on</strong>g>g what text is <strong>on</strong> that page; (2) “the metadata states the exact resource type associated<br />
with the URL specified <str<strong>on</strong>g>in</str<strong>on</strong>g> each record” (e.g., if a text file is <str<strong>on</strong>g>in</str<strong>on</strong>g> XML <strong>and</strong> what schema has been used);<br />
(3) all VMR materials will be stored <str<strong>on</strong>g>in</str<strong>on</strong>g> Birm<str<strong>on</strong>g>in</str<strong>on</strong>g>gham’s <str<strong>on</strong>g>in</str<strong>on</strong>g>stituti<strong>on</strong>al repository <strong>and</strong> be accessible<br />
through the library <strong>on</strong>l<str<strong>on</strong>g>in</str<strong>on</strong>g>e public access catalog (OPAC), <strong>and</strong> (4) the VMR will support full reuse of its<br />
materials not just access to them.<br />
This fourth feature is perhaps most unique, for as seen by the survey of projects <str<strong>on</strong>g>in</str<strong>on</strong>g> this secti<strong>on</strong>, the<br />
focus of much manuscript-digitizati<strong>on</strong> work has often been <strong>on</strong> support<str<strong>on</strong>g>in</str<strong>on</strong>g>g the discovery of digital<br />
manuscripts for use <strong>on</strong>l<str<strong>on</strong>g>in</str<strong>on</strong>g>e rather than <strong>on</strong> the ability for scholars to get access to the raw digital<br />
materials <strong>and</strong> use them <str<strong>on</strong>g>in</str<strong>on</strong>g> their own projects. The VMR plans to provide access to all the metadata they<br />
create through a syndicated RSS feed so that users can create their own <str<strong>on</strong>g>in</str<strong>on</strong>g>terfaces to VMR data. In<br />
additi<strong>on</strong>, they plan to allow other users to add material to the VMR by creat<str<strong>on</strong>g>in</str<strong>on</strong>g>g a “metadata record for<br />
the resource follow<str<strong>on</strong>g>in</str<strong>on</strong>g>g VMR protocols” <strong>and</strong> then add it to the RSS feed of any VMR project. The<br />
importance of support<str<strong>on</strong>g>in</str<strong>on</strong>g>g new collaborati<strong>on</strong> models that allow many <str<strong>on</strong>g>in</str<strong>on</strong>g>dividuals to c<strong>on</strong>tribute related<br />
digital manuscript resources has also been discussed <str<strong>on</strong>g>in</str<strong>on</strong>g> Rob<str<strong>on</strong>g>in</str<strong>on</strong>g>s<strong>on</strong> (2009, 2010).<br />
While the amount of metadata about manuscripts, as well as digital images <strong>and</strong> transcripti<strong>on</strong>s of<br />
manuscripts, that have become available <strong>on</strong>l<str<strong>on</strong>g>in</str<strong>on</strong>g>e has <str<strong>on</strong>g>in</str<strong>on</strong>g>creased, there are still few easy ways to l<str<strong>on</strong>g>in</str<strong>on</strong>g>k<br />
between them if they exist <str<strong>on</strong>g>in</str<strong>on</strong>g> different collecti<strong>on</strong>s. A related problem is the limited ability to at least<br />
partially automate the l<str<strong>on</strong>g>in</str<strong>on</strong>g>k<str<strong>on</strong>g>in</str<strong>on</strong>g>g of manuscript images with their transcripti<strong>on</strong>s, even if both are known to<br />
exist. Arianna Ciula has argued that the work of palaeographers would greatly benefit from descriptive<br />
encod<str<strong>on</strong>g>in</str<strong>on</strong>g>g or technology that supported more sophisticated l<str<strong>on</strong>g>in</str<strong>on</strong>g>k<str<strong>on</strong>g>in</str<strong>on</strong>g>g between images <strong>and</strong> texts,<br />
particularly “the possibility to export the associati<strong>on</strong> between descripti<strong>on</strong>s of specific palaeographical<br />
properties <strong>and</strong> the coord<str<strong>on</strong>g>in</str<strong>on</strong>g>ates with<str<strong>on</strong>g>in</str<strong>on</strong>g> a manuscript image <str<strong>on</strong>g>in</str<strong>on</strong>g> a st<strong>and</strong>ard format such as the encod<str<strong>on</strong>g>in</str<strong>on</strong>g>g<br />
proposed by the TEI facsimile module or SVG” (Ciula 2009).<br />
Recently Hugh Cayless has developed a series of tools <strong>and</strong> techniques to assist <str<strong>on</strong>g>in</str<strong>on</strong>g> this process that have<br />
been grouped under the name img2XML 411 <strong>and</strong> have been described <str<strong>on</strong>g>in</str<strong>on</strong>g> detail <str<strong>on</strong>g>in</str<strong>on</strong>g> Cayless (2008, 2009).<br />
As has been previously discussed by M<strong>on</strong>ella (2008) <strong>and</strong> Boschetti (2009), Cayless noted that<br />
manuscript transcripti<strong>on</strong>s are typically published <str<strong>on</strong>g>in</str<strong>on</strong>g> <strong>on</strong>e of two formats, either as a critical editi<strong>on</strong><br />
where the editors’ comments are <str<strong>on</strong>g>in</str<strong>on</strong>g>cluded as an <str<strong>on</strong>g>in</str<strong>on</strong>g>tegral part of the text or as a diplomatic transcripti<strong>on</strong><br />
that tries to “faithfully reproduce the text” (Cayless 2009). While TEI allows the producti<strong>on</strong> of both<br />
types of transcripti<strong>on</strong>s from the same marked-up text, Cayless argued that the next important step is to<br />
automatically l<str<strong>on</strong>g>in</str<strong>on</strong>g>k such transcripti<strong>on</strong>s to their page images. While many systems l<str<strong>on</strong>g>in</str<strong>on</strong>g>k manuscript<br />
images <strong>and</strong> transcripti<strong>on</strong>s <strong>on</strong> the page level, 412 the work of Cayless sought to support more granular<br />
l<str<strong>on</strong>g>in</str<strong>on</strong>g>k<str<strong>on</strong>g>in</str<strong>on</strong>g>g, such as at the level of <str<strong>on</strong>g>in</str<strong>on</strong>g>dividual l<str<strong>on</strong>g>in</str<strong>on</strong>g>es or even words.<br />
410 The VMR at Birm<str<strong>on</strong>g>in</str<strong>on</strong>g>gham has been funded by JISC <strong>and</strong> is be<str<strong>on</strong>g>in</str<strong>on</strong>g>g created by the Institute of Textual Scholarship <strong>and</strong> Electr<strong>on</strong>ic Edit<str<strong>on</strong>g>in</str<strong>on</strong>g>g (ITSEE),<br />
http://www.itsee.bham.ac.uk/<br />
411 http://github.com/hcayless/img2xml<br />
412 One such system is EPPT (discussed earlier <str<strong>on</strong>g>in</str<strong>on</strong>g> this paper), <strong>and</strong> another tool listed by Cayless is the Image Markup Tool (IMT),<br />
http://www.tapor.uvic.ca/%7Emholmes/image_markup/<str<strong>on</strong>g>in</str<strong>on</strong>g>dex.php, which allows a user to annotate “rectangular secti<strong>on</strong>s of an image” by us<str<strong>on</strong>g>in</str<strong>on</strong>g>g a draw<str<strong>on</strong>g>in</str<strong>on</strong>g>g<br />
tool with which they can first “draw shape overlays <strong>on</strong> an image” <strong>and</strong> then these overlays can then be l<str<strong>on</strong>g>in</str<strong>on</strong>g>ked to “text annotati<strong>on</strong>s entered by the user.”<br />
(Cayless 2009).