26.12.2014 Views

Rome Wasn't Digitized in a Day - Council on Library and Information ...

Rome Wasn't Digitized in a Day - Council on Library and Information ...

Rome Wasn't Digitized in a Day - Council on Library and Information ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

255<br />

subsystems can beg<str<strong>on</strong>g>in</str<strong>on</strong>g> to merge <str<strong>on</strong>g>in</str<strong>on</strong>g>to <strong>on</strong>e larger eHumanities DE while still ma<str<strong>on</strong>g>in</str<strong>on</strong>g>ta<str<strong>on</strong>g>in</str<strong>on</strong>g><str<strong>on</strong>g>in</str<strong>on</strong>g>g their <str<strong>on</strong>g>in</str<strong>on</strong>g>dividual<br />

characters <strong>and</strong> strengths” (Aschenbrenner et al. 2009). Two prerequisites for such successful<br />

<str<strong>on</strong>g>in</str<strong>on</strong>g>teroperability, they argue, are “loosely coupled services” <strong>and</strong> the “visibility of resources.” While they<br />

proposed a reference <strong>on</strong>tology for both services <strong>and</strong> documents <str<strong>on</strong>g>in</str<strong>on</strong>g> eHumanities, they stressed that any<br />

<str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure design also must take user needs <str<strong>on</strong>g>in</str<strong>on</strong>g>to account <strong>and</strong> ideally have users <str<strong>on</strong>g>in</str<strong>on</strong>g>volved from the<br />

very beg<str<strong>on</strong>g>in</str<strong>on</strong>g>n<str<strong>on</strong>g>in</str<strong>on</strong>g>g. “Novel <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure that is imposed <strong>on</strong> the user will fail,” Aschenbrenner et al.<br />

predicted; “TextGrid has doma<str<strong>on</strong>g>in</str<strong>on</strong>g> experts as core partners <str<strong>on</strong>g>in</str<strong>on</strong>g> the team, <strong>and</strong> these experts are shap<str<strong>on</strong>g>in</str<strong>on</strong>g>g<br />

issues such as st<strong>and</strong>ards <strong>and</strong> community-build<str<strong>on</strong>g>in</str<strong>on</strong>g>g” (Aschenbrenner et al. 2009).<br />

TextGrid thus made use of both doma<str<strong>on</strong>g>in</str<strong>on</strong>g> experts <strong>and</strong> computer scientists <str<strong>on</strong>g>in</str<strong>on</strong>g> terms of def<str<strong>on</strong>g>in</str<strong>on</strong>g><str<strong>on</strong>g>in</str<strong>on</strong>g>g st<strong>and</strong>ards<br />

for their project, <strong>and</strong> Aschenbrenner et al. reported that TextGrid has used noth<str<strong>on</strong>g>in</str<strong>on</strong>g>g but open st<strong>and</strong>ards<br />

to promote the fullest amount of <str<strong>on</strong>g>in</str<strong>on</strong>g>teroperability. They also took <str<strong>on</strong>g>in</str<strong>on</strong>g>to account the three <str<strong>on</strong>g>in</str<strong>on</strong>g>teroperability<br />

layers identified by the European Informati<strong>on</strong> Framework: “technical, semantic, <strong>and</strong> organizati<strong>on</strong>al”<br />

(Aschenbrenner et al. 2009). Earlier research by the TextGrid group had highlighted the challenges of<br />

both syntactic <strong>and</strong> semantic differences <str<strong>on</strong>g>in</str<strong>on</strong>g> humanities data sets <str<strong>on</strong>g>in</str<strong>on</strong>g> terms of achiev<str<strong>on</strong>g>in</str<strong>on</strong>g>g mean<str<strong>on</strong>g>in</str<strong>on</strong>g>gful data<br />

<str<strong>on</strong>g>in</str<strong>on</strong>g>tegrati<strong>on</strong>. “In the humanities, the major obstacle to data <str<strong>on</strong>g>in</str<strong>on</strong>g>teroperability is syntactic <strong>and</strong> semantic<br />

heterogeneity,” Dimitriadis et al. stated, add<str<strong>on</strong>g>in</str<strong>on</strong>g>g that “roughly speak<str<strong>on</strong>g>in</str<strong>on</strong>g>g, it is the differences <str<strong>on</strong>g>in</str<strong>on</strong>g><br />

term<str<strong>on</strong>g>in</str<strong>on</strong>g>ology that make it so difficult to cross the boundaries <strong>and</strong> create a jo<str<strong>on</strong>g>in</str<strong>on</strong>g>t doma<str<strong>on</strong>g>in</str<strong>on</strong>g> of language<br />

resources that can be utilized seamlessly” (Dimitriadis et al. 2006). Similar research by Shen et al.<br />

(2008) had reported that two major types of <str<strong>on</strong>g>in</str<strong>on</strong>g>teroperability challenges for digital libraries were<br />

syntactic <strong>and</strong> semantic, with syntactic be<str<strong>on</strong>g>in</str<strong>on</strong>g>g at the level of applicati<strong>on</strong>s <strong>and</strong> semantic <str<strong>on</strong>g>in</str<strong>on</strong>g>teroperability<br />

as the “knowledge-level <str<strong>on</strong>g>in</str<strong>on</strong>g>teroperability” that allows digital libraries to be <str<strong>on</strong>g>in</str<strong>on</strong>g>tegrated <strong>and</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g>cludes “the<br />

ability to bridge semantic c<strong>on</strong>flicts aris<str<strong>on</strong>g>in</str<strong>on</strong>g>g from differences <str<strong>on</strong>g>in</str<strong>on</strong>g> implicit mean<str<strong>on</strong>g>in</str<strong>on</strong>g>gs, perspectives, <strong>and</strong><br />

assumpti<strong>on</strong>s, thus creat<str<strong>on</strong>g>in</str<strong>on</strong>g>g a semantically compatible <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong> envir<strong>on</strong>ment.”<br />

Although the development <strong>and</strong> use of st<strong>and</strong>ards to promote <str<strong>on</strong>g>in</str<strong>on</strong>g>teroperability were called for by many<br />

projects such as TextGrid, other research, <str<strong>on</strong>g>in</str<strong>on</strong>g>clud<str<strong>on</strong>g>in</str<strong>on</strong>g>g that by the LaQuAT project, has po<str<strong>on</strong>g>in</str<strong>on</strong>g>ted out that<br />

st<strong>and</strong>ards have their limits as well:<br />

While there are a variety of st<strong>and</strong>ardisati<strong>on</strong> activities with the aim of <str<strong>on</strong>g>in</str<strong>on</strong>g>creas<str<strong>on</strong>g>in</str<strong>on</strong>g>g <str<strong>on</strong>g>in</str<strong>on</strong>g>teroperability<br />

between digital resources <strong>and</strong> enabl<str<strong>on</strong>g>in</str<strong>on</strong>g>g them to be used <str<strong>on</strong>g>in</str<strong>on</strong>g> comb<str<strong>on</strong>g>in</str<strong>on</strong>g>ati<strong>on</strong>, st<strong>and</strong>ardisati<strong>on</strong> al<strong>on</strong>e is<br />

unlikely to solve all problems related to l<str<strong>on</strong>g>in</str<strong>on</strong>g>k<str<strong>on</strong>g>in</str<strong>on</strong>g>g up data. Humanists still have to deal with<br />

legacy data <str<strong>on</strong>g>in</str<strong>on</strong>g> diverse <strong>and</strong> often obsolete formats, <strong>and</strong> even when st<strong>and</strong>ards are used the sheer<br />

variety of data <strong>and</strong> research means that there is a great deal of flexibility <str<strong>on</strong>g>in</str<strong>on</strong>g> how the st<strong>and</strong>ards<br />

are applied. Moreover, st<strong>and</strong>ards are generally developed with<str<strong>on</strong>g>in</str<strong>on</strong>g> particular discipl<str<strong>on</strong>g>in</str<strong>on</strong>g>es or<br />

doma<str<strong>on</strong>g>in</str<strong>on</strong>g>s, whereas research is often <str<strong>on</strong>g>in</str<strong>on</strong>g>ter-discipl<str<strong>on</strong>g>in</str<strong>on</strong>g>ary, mak<str<strong>on</strong>g>in</str<strong>on</strong>g>g use of varied materials, <strong>and</strong><br />

<str<strong>on</strong>g>in</str<strong>on</strong>g>corporat<str<strong>on</strong>g>in</str<strong>on</strong>g>g data c<strong>on</strong>form<str<strong>on</strong>g>in</str<strong>on</strong>g>g to different st<strong>and</strong>ards. There will <str<strong>on</strong>g>in</str<strong>on</strong>g>evitably be diversity of<br />

representati<strong>on</strong> when <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong> is gathered together from different doma<str<strong>on</strong>g>in</str<strong>on</strong>g>s <strong>and</strong> for different<br />

purposes, <strong>and</strong> c<strong>on</strong>sequently there will always be a need to <str<strong>on</strong>g>in</str<strong>on</strong>g>tegrate this diversity (Hedges<br />

2009).<br />

Hedges argued that the realities of legacy data <str<strong>on</strong>g>in</str<strong>on</strong>g> the humanities, the differ<str<strong>on</strong>g>in</str<strong>on</strong>g>g applicati<strong>on</strong> of st<strong>and</strong>ards,<br />

<strong>and</strong> the doma<str<strong>on</strong>g>in</str<strong>on</strong>g> specificity of many st<strong>and</strong>ards necessitate the design of <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure soluti<strong>on</strong>s that can<br />

<str<strong>on</strong>g>in</str<strong>on</strong>g>tegrate diverse data. He suggested that research by the grid community <strong>on</strong> the “<str<strong>on</strong>g>in</str<strong>on</strong>g>tegrati<strong>on</strong> of<br />

structured <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong>” <strong>and</strong> turn<str<strong>on</strong>g>in</str<strong>on</strong>g>g data repositories <str<strong>on</strong>g>in</str<strong>on</strong>g>to “virtualized data resources” <strong>on</strong> a grid may<br />

allow digital repositories to hide the “heterogeneity of digital objects” from their users, rather than<br />

try<str<strong>on</strong>g>in</str<strong>on</strong>g>g to force all data <str<strong>on</strong>g>in</str<strong>on</strong>g>to <strong>on</strong>e st<strong>and</strong>ard. Whatever soluti<strong>on</strong>s are pursued, this secti<strong>on</strong> has <str<strong>on</strong>g>in</str<strong>on</strong>g>dicated

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!