Rome Wasn't Digitized in a Day - Council on Library and Information ...
Rome Wasn't Digitized in a Day - Council on Library and Information ...
Rome Wasn't Digitized in a Day - Council on Library and Information ...
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
92<br />
Part of the research of the Pleiades project, therefore, has been to determ<str<strong>on</strong>g>in</str<strong>on</strong>g>e how best to turn digital<br />
resources such as their gazetteer <str<strong>on</strong>g>in</str<strong>on</strong>g>to repurposeable knowledge bases. 301 Elliott <strong>and</strong> Gillies (2009b)<br />
predict that <str<strong>on</strong>g>in</str<strong>on</strong>g>creas<str<strong>on</strong>g>in</str<strong>on</strong>g>gly those who hold geographic data <strong>and</strong> wish to make it freely available <strong>on</strong>l<str<strong>on</strong>g>in</str<strong>on</strong>g>e<br />
will provide access to their data through a variety of web services.<br />
Despite their desire to make all of Pleiades c<strong>on</strong>tent available to be remixed <strong>and</strong> mashed up, these<br />
efforts have met with some obstacles:<br />
In our web services, we employ proxies for our c<strong>on</strong>tent (KML 302 <strong>and</strong> GeoRSS 303 -enhanced<br />
Atom 304 feeds) so that users can visualize <strong>and</strong> exploit it <str<strong>on</strong>g>in</str<strong>on</strong>g> a variety of automated ways. In this<br />
way, we provide a computati<strong>on</strong>ally acti<strong>on</strong>able bridge between a nuanced, scholarly publicati<strong>on</strong><br />
<strong>and</strong> the geographic discovery <strong>and</strong> exploitati<strong>on</strong> tools now emerg<str<strong>on</strong>g>in</str<strong>on</strong>g>g <strong>on</strong> the web. But for us, these<br />
formats are lossy: they cannot represent our data model <str<strong>on</strong>g>in</str<strong>on</strong>g> a structured way that preserves all<br />
nuance <strong>and</strong> detail <strong>and</strong> permits ready pars<str<strong>on</strong>g>in</str<strong>on</strong>g>g <strong>and</strong> exploitati<strong>on</strong> by software agents. Indeed, we<br />
have been unable to identify a st<strong>and</strong>ard XML-based data format that simply <strong>and</strong> losslessly<br />
supports the full expressi<strong>on</strong> of the Pleiades data model (Elliott <strong>and</strong> Gillies 2009b).<br />
To provide a lossless export, they plan to produce file sets composed of ESRI shape files with attribute<br />
tables <str<strong>on</strong>g>in</str<strong>on</strong>g> CSV, a soluti<strong>on</strong> that, despite the proprietary nature of the ShapeFile format, does allow them<br />
to download time-stamped files <str<strong>on</strong>g>in</str<strong>on</strong>g>to the <str<strong>on</strong>g>in</str<strong>on</strong>g>stituti<strong>on</strong>al repository at New York University. Although they<br />
would prefer to use <strong>on</strong>ly open formats, Elliott <strong>and</strong> Gillies argued that the ShapeFile format is used<br />
around the world <strong>and</strong> can be decoded by open-source software, a fact that gives it a “high likelihood of<br />
translati<strong>on</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g>to new formats <str<strong>on</strong>g>in</str<strong>on</strong>g> the c<strong>on</strong>text of l<strong>on</strong>g-term preservati<strong>on</strong> repositories.” The experience of<br />
Pleiades illustrates the challenges of want<str<strong>on</strong>g>in</str<strong>on</strong>g>g to create open-access resources while hav<str<strong>on</strong>g>in</str<strong>on</strong>g>g to deal with<br />
the limits of open formats <strong>and</strong> l<strong>on</strong>g-term preservati<strong>on</strong> needs.<br />
N<strong>on</strong>etheless, the open-access nature of Pleiades <strong>and</strong> the ability to l<str<strong>on</strong>g>in</str<strong>on</strong>g>k to <str<strong>on</strong>g>in</str<strong>on</strong>g>dividual places with<str<strong>on</strong>g>in</str<strong>on</strong>g> it<br />
makes it a natural source to <str<strong>on</strong>g>in</str<strong>on</strong>g>tegrate with other digital classics projects <str<strong>on</strong>g>in</str<strong>on</strong>g> numismatics, epigraphy, <strong>and</strong><br />
papyrology, or any digital resource that makes extensive use of historical place names with<str<strong>on</strong>g>in</str<strong>on</strong>g> the<br />
ancient world. Indeed, Pleiades is actively work<str<strong>on</strong>g>in</str<strong>on</strong>g>g with other projects through the C<strong>on</strong>cordia <str<strong>on</strong>g>in</str<strong>on</strong>g>itiative<br />
to <str<strong>on</strong>g>in</str<strong>on</strong>g>tegrate its c<strong>on</strong>tent with other digital projects <strong>and</strong> “develop st<strong>and</strong>ards-based mechanisms for crossproject<br />
geographic search.”<br />
The HESTIA Project<br />
While smaller <str<strong>on</strong>g>in</str<strong>on</strong>g> scale than Pleiades, the HESTIA (Herodotus Encoded Space-Text-Imag<str<strong>on</strong>g>in</str<strong>on</strong>g>g-<br />
Archive) 305 project provides an <str<strong>on</strong>g>in</str<strong>on</strong>g>terest<str<strong>on</strong>g>in</str<strong>on</strong>g>g look at how digital technology <strong>and</strong> spatial analysis 306 can<br />
301 The gazetteer data created by the Pleiades project has served as <strong>on</strong>e key comp<strong>on</strong>ent of the Digital Atlas of Roman <strong>and</strong> Medieval Civilizati<strong>on</strong><br />
(http://medievalmap.harvard.edu/icb/icb.dokeyword=k40248&pageid=icb.page188865), which “offers a series of maps <strong>and</strong> geodatabases bear<str<strong>on</strong>g>in</str<strong>on</strong>g>g <strong>on</strong><br />
multiple aspects of Roman <strong>and</strong> medieval civilizati<strong>on</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g> the broadest terms” <strong>and</strong> makes extensive use of the Barr<str<strong>on</strong>g>in</str<strong>on</strong>g>gt<strong>on</strong> Atlas of the Greek <strong>and</strong> Roman<br />
World. The recently announced “Google Ancient Places” project is also utiliz<str<strong>on</strong>g>in</str<strong>on</strong>g>g the data made available by Pleiades<br />
(http://googleancientplaces.wordpress.com/2010/12/08/fill<str<strong>on</strong>g>in</str<strong>on</strong>g>g-<str<strong>on</strong>g>in</str<strong>on</strong>g>-some-gaps-<str<strong>on</strong>g>in</str<strong>on</strong>g>-gap/).<br />
302 KML, formerly known as “keyhole markup language,” was created by Google <strong>and</strong> is ma<str<strong>on</strong>g>in</str<strong>on</strong>g>ta<str<strong>on</strong>g>in</str<strong>on</strong>g>ed as an open st<strong>and</strong>ard by the Open Geospatial<br />
C<strong>on</strong>sortium (http://www.opengeospatial.org/st<strong>and</strong>ards/kml/). KML is an “XML language focused <strong>on</strong> geographic visualizati<strong>on</strong>, <str<strong>on</strong>g>in</str<strong>on</strong>g>clud<str<strong>on</strong>g>in</str<strong>on</strong>g>g annotati<strong>on</strong> of<br />
maps <strong>and</strong> images” <strong>and</strong> is used by a number of open-source mapp<str<strong>on</strong>g>in</str<strong>on</strong>g>g projects. (http://code.google.com/apis/kml/documentati<strong>on</strong>/mapsSupport.html).<br />
303 GeoRSS (http://www.georss.org/Ma<str<strong>on</strong>g>in</str<strong>on</strong>g>_Page) is a “lightweight, community driven way to extend exist<str<strong>on</strong>g>in</str<strong>on</strong>g>g feeds with geographic <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong>.”<br />
304 Accord<str<strong>on</strong>g>in</str<strong>on</strong>g>g to Wikipedia (http://en.wikipedia.org/wiki/Atom_(st<strong>and</strong>ard)), the name Atom applies to two related st<strong>and</strong>ards, while the Atom Syndicati<strong>on</strong><br />
Format is a “XML language used for web feed” (http://www.ietf.org/rfc/rfc4287.txt), the Atom Publish<str<strong>on</strong>g>in</str<strong>on</strong>g>g Protocol is a “HTTP-based protocol for<br />
creat<str<strong>on</strong>g>in</str<strong>on</strong>g>g <strong>and</strong> updat<str<strong>on</strong>g>in</str<strong>on</strong>g>g web resources.”<br />
305 http://www.open.ac.uk/Arts/hestia/<str<strong>on</strong>g>in</str<strong>on</strong>g>dex.html<br />
306 The HESTIA project hosted a workshop <str<strong>on</strong>g>in</str<strong>on</strong>g> July 2010 entitled “New Worlds Out of Old Texts: Interrogat<str<strong>on</strong>g>in</str<strong>on</strong>g>g New Techniques for the Spatial Analysis of<br />
Ancient Narratives” that brought together numerous projects that are us<str<strong>on</strong>g>in</str<strong>on</strong>g>g spatial-analysis techniques <str<strong>on</strong>g>in</str<strong>on</strong>g> various classical discipl<str<strong>on</strong>g>in</str<strong>on</strong>g>es (http://www.artshumanities.net/event/new_worlds_out_old_texts_<str<strong>on</strong>g>in</str<strong>on</strong>g>terrogat<str<strong>on</strong>g>in</str<strong>on</strong>g>g_new_techniques_spatial_analysis_ancient_narratives).