Rome Wasn't Digitized in a Day - Council on Library and Information ...
Rome Wasn't Digitized in a Day - Council on Library and Information ...
Rome Wasn't Digitized in a Day - Council on Library and Information ...
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
255<br />
subsystems can beg<str<strong>on</strong>g>in</str<strong>on</strong>g> to merge <str<strong>on</strong>g>in</str<strong>on</strong>g>to <strong>on</strong>e larger eHumanities DE while still ma<str<strong>on</strong>g>in</str<strong>on</strong>g>ta<str<strong>on</strong>g>in</str<strong>on</strong>g><str<strong>on</strong>g>in</str<strong>on</strong>g>g their <str<strong>on</strong>g>in</str<strong>on</strong>g>dividual<br />
characters <strong>and</strong> strengths” (Aschenbrenner et al. 2009). Two prerequisites for such successful<br />
<str<strong>on</strong>g>in</str<strong>on</strong>g>teroperability, they argue, are “loosely coupled services” <strong>and</strong> the “visibility of resources.” While they<br />
proposed a reference <strong>on</strong>tology for both services <strong>and</strong> documents <str<strong>on</strong>g>in</str<strong>on</strong>g> eHumanities, they stressed that any<br />
<str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure design also must take user needs <str<strong>on</strong>g>in</str<strong>on</strong>g>to account <strong>and</strong> ideally have users <str<strong>on</strong>g>in</str<strong>on</strong>g>volved from the<br />
very beg<str<strong>on</strong>g>in</str<strong>on</strong>g>n<str<strong>on</strong>g>in</str<strong>on</strong>g>g. “Novel <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure that is imposed <strong>on</strong> the user will fail,” Aschenbrenner et al.<br />
predicted; “TextGrid has doma<str<strong>on</strong>g>in</str<strong>on</strong>g> experts as core partners <str<strong>on</strong>g>in</str<strong>on</strong>g> the team, <strong>and</strong> these experts are shap<str<strong>on</strong>g>in</str<strong>on</strong>g>g<br />
issues such as st<strong>and</strong>ards <strong>and</strong> community-build<str<strong>on</strong>g>in</str<strong>on</strong>g>g” (Aschenbrenner et al. 2009).<br />
TextGrid thus made use of both doma<str<strong>on</strong>g>in</str<strong>on</strong>g> experts <strong>and</strong> computer scientists <str<strong>on</strong>g>in</str<strong>on</strong>g> terms of def<str<strong>on</strong>g>in</str<strong>on</strong>g><str<strong>on</strong>g>in</str<strong>on</strong>g>g st<strong>and</strong>ards<br />
for their project, <strong>and</strong> Aschenbrenner et al. reported that TextGrid has used noth<str<strong>on</strong>g>in</str<strong>on</strong>g>g but open st<strong>and</strong>ards<br />
to promote the fullest amount of <str<strong>on</strong>g>in</str<strong>on</strong>g>teroperability. They also took <str<strong>on</strong>g>in</str<strong>on</strong>g>to account the three <str<strong>on</strong>g>in</str<strong>on</strong>g>teroperability<br />
layers identified by the European Informati<strong>on</strong> Framework: “technical, semantic, <strong>and</strong> organizati<strong>on</strong>al”<br />
(Aschenbrenner et al. 2009). Earlier research by the TextGrid group had highlighted the challenges of<br />
both syntactic <strong>and</strong> semantic differences <str<strong>on</strong>g>in</str<strong>on</strong>g> humanities data sets <str<strong>on</strong>g>in</str<strong>on</strong>g> terms of achiev<str<strong>on</strong>g>in</str<strong>on</strong>g>g mean<str<strong>on</strong>g>in</str<strong>on</strong>g>gful data<br />
<str<strong>on</strong>g>in</str<strong>on</strong>g>tegrati<strong>on</strong>. “In the humanities, the major obstacle to data <str<strong>on</strong>g>in</str<strong>on</strong>g>teroperability is syntactic <strong>and</strong> semantic<br />
heterogeneity,” Dimitriadis et al. stated, add<str<strong>on</strong>g>in</str<strong>on</strong>g>g that “roughly speak<str<strong>on</strong>g>in</str<strong>on</strong>g>g, it is the differences <str<strong>on</strong>g>in</str<strong>on</strong>g><br />
term<str<strong>on</strong>g>in</str<strong>on</strong>g>ology that make it so difficult to cross the boundaries <strong>and</strong> create a jo<str<strong>on</strong>g>in</str<strong>on</strong>g>t doma<str<strong>on</strong>g>in</str<strong>on</strong>g> of language<br />
resources that can be utilized seamlessly” (Dimitriadis et al. 2006). Similar research by Shen et al.<br />
(2008) had reported that two major types of <str<strong>on</strong>g>in</str<strong>on</strong>g>teroperability challenges for digital libraries were<br />
syntactic <strong>and</strong> semantic, with syntactic be<str<strong>on</strong>g>in</str<strong>on</strong>g>g at the level of applicati<strong>on</strong>s <strong>and</strong> semantic <str<strong>on</strong>g>in</str<strong>on</strong>g>teroperability<br />
as the “knowledge-level <str<strong>on</strong>g>in</str<strong>on</strong>g>teroperability” that allows digital libraries to be <str<strong>on</strong>g>in</str<strong>on</strong>g>tegrated <strong>and</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g>cludes “the<br />
ability to bridge semantic c<strong>on</strong>flicts aris<str<strong>on</strong>g>in</str<strong>on</strong>g>g from differences <str<strong>on</strong>g>in</str<strong>on</strong>g> implicit mean<str<strong>on</strong>g>in</str<strong>on</strong>g>gs, perspectives, <strong>and</strong><br />
assumpti<strong>on</strong>s, thus creat<str<strong>on</strong>g>in</str<strong>on</strong>g>g a semantically compatible <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong> envir<strong>on</strong>ment.”<br />
Although the development <strong>and</strong> use of st<strong>and</strong>ards to promote <str<strong>on</strong>g>in</str<strong>on</strong>g>teroperability were called for by many<br />
projects such as TextGrid, other research, <str<strong>on</strong>g>in</str<strong>on</strong>g>clud<str<strong>on</strong>g>in</str<strong>on</strong>g>g that by the LaQuAT project, has po<str<strong>on</strong>g>in</str<strong>on</strong>g>ted out that<br />
st<strong>and</strong>ards have their limits as well:<br />
While there are a variety of st<strong>and</strong>ardisati<strong>on</strong> activities with the aim of <str<strong>on</strong>g>in</str<strong>on</strong>g>creas<str<strong>on</strong>g>in</str<strong>on</strong>g>g <str<strong>on</strong>g>in</str<strong>on</strong>g>teroperability<br />
between digital resources <strong>and</strong> enabl<str<strong>on</strong>g>in</str<strong>on</strong>g>g them to be used <str<strong>on</strong>g>in</str<strong>on</strong>g> comb<str<strong>on</strong>g>in</str<strong>on</strong>g>ati<strong>on</strong>, st<strong>and</strong>ardisati<strong>on</strong> al<strong>on</strong>e is<br />
unlikely to solve all problems related to l<str<strong>on</strong>g>in</str<strong>on</strong>g>k<str<strong>on</strong>g>in</str<strong>on</strong>g>g up data. Humanists still have to deal with<br />
legacy data <str<strong>on</strong>g>in</str<strong>on</strong>g> diverse <strong>and</strong> often obsolete formats, <strong>and</strong> even when st<strong>and</strong>ards are used the sheer<br />
variety of data <strong>and</strong> research means that there is a great deal of flexibility <str<strong>on</strong>g>in</str<strong>on</strong>g> how the st<strong>and</strong>ards<br />
are applied. Moreover, st<strong>and</strong>ards are generally developed with<str<strong>on</strong>g>in</str<strong>on</strong>g> particular discipl<str<strong>on</strong>g>in</str<strong>on</strong>g>es or<br />
doma<str<strong>on</strong>g>in</str<strong>on</strong>g>s, whereas research is often <str<strong>on</strong>g>in</str<strong>on</strong>g>ter-discipl<str<strong>on</strong>g>in</str<strong>on</strong>g>ary, mak<str<strong>on</strong>g>in</str<strong>on</strong>g>g use of varied materials, <strong>and</strong><br />
<str<strong>on</strong>g>in</str<strong>on</strong>g>corporat<str<strong>on</strong>g>in</str<strong>on</strong>g>g data c<strong>on</strong>form<str<strong>on</strong>g>in</str<strong>on</strong>g>g to different st<strong>and</strong>ards. There will <str<strong>on</strong>g>in</str<strong>on</strong>g>evitably be diversity of<br />
representati<strong>on</strong> when <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong> is gathered together from different doma<str<strong>on</strong>g>in</str<strong>on</strong>g>s <strong>and</strong> for different<br />
purposes, <strong>and</strong> c<strong>on</strong>sequently there will always be a need to <str<strong>on</strong>g>in</str<strong>on</strong>g>tegrate this diversity (Hedges<br />
2009).<br />
Hedges argued that the realities of legacy data <str<strong>on</strong>g>in</str<strong>on</strong>g> the humanities, the differ<str<strong>on</strong>g>in</str<strong>on</strong>g>g applicati<strong>on</strong> of st<strong>and</strong>ards,<br />
<strong>and</strong> the doma<str<strong>on</strong>g>in</str<strong>on</strong>g> specificity of many st<strong>and</strong>ards necessitate the design of <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure soluti<strong>on</strong>s that can<br />
<str<strong>on</strong>g>in</str<strong>on</strong>g>tegrate diverse data. He suggested that research by the grid community <strong>on</strong> the “<str<strong>on</strong>g>in</str<strong>on</strong>g>tegrati<strong>on</strong> of<br />
structured <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong>” <strong>and</strong> turn<str<strong>on</strong>g>in</str<strong>on</strong>g>g data repositories <str<strong>on</strong>g>in</str<strong>on</strong>g>to “virtualized data resources” <strong>on</strong> a grid may<br />
allow digital repositories to hide the “heterogeneity of digital objects” from their users, rather than<br />
try<str<strong>on</strong>g>in</str<strong>on</strong>g>g to force all data <str<strong>on</strong>g>in</str<strong>on</strong>g>to <strong>on</strong>e st<strong>and</strong>ard. Whatever soluti<strong>on</strong>s are pursued, this secti<strong>on</strong> has <str<strong>on</strong>g>in</str<strong>on</strong>g>dicated