26.12.2014 Views

Rome Wasn't Digitized in a Day - Council on Library and Information ...

Rome Wasn't Digitized in a Day - Council on Library and Information ...

Rome Wasn't Digitized in a Day - Council on Library and Information ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

57<br />

While the FRBR hierarchy <str<strong>on</strong>g>in</str<strong>on</strong>g>cludes Works, Expressi<strong>on</strong>s, Manifestati<strong>on</strong>s, <strong>and</strong> Items, the CTS protocol<br />

uses the terms Work, Editi<strong>on</strong>, Translati<strong>on</strong>, <strong>and</strong> Exemplar. As communicated by Porter et al. (2006),<br />

CTS extends the FRBR hierarchy upward by “group<str<strong>on</strong>g>in</str<strong>on</strong>g>g Works under a noti<strong>on</strong>al entity called<br />

“TextGroup,’” an entity that can refer to authors for literary texts or to corpora such as <str<strong>on</strong>g>in</str<strong>on</strong>g>scripti<strong>on</strong>s<br />

(e.g., “Berl<str<strong>on</strong>g>in</str<strong>on</strong>g>” for a published corpus of papyri). It also extends the FRBR hierarchy downward to<br />

support the “identificati<strong>on</strong> <strong>and</strong> abstracti<strong>on</strong> of citable chunks of text (Homer, Iliad Book 1, L<str<strong>on</strong>g>in</str<strong>on</strong>g>e 123), or<br />

ranges of citable chunks (Hom.~ Il. 1.123-2.22).” This both downward <strong>and</strong> upward extensi<strong>on</strong> of FRBR<br />

is important, for as Romanello (2008) underscores, “it allows <strong>on</strong>e to reach a higher granularity when<br />

access<str<strong>on</strong>g>in</str<strong>on</strong>g>g documents hierarchically <strong>and</strong> supports the use of a citati<strong>on</strong> scheme referr<str<strong>on</strong>g>in</str<strong>on</strong>g>g to each level of<br />

the entire document hierarchical structure.” Romanello notes that another important feature of CTS is<br />

that it enables the differentiati<strong>on</strong> of “different exemplars” of the same text.<br />

As noted above, citati<strong>on</strong>s <str<strong>on</strong>g>in</str<strong>on</strong>g> CTS are expressed as CTS URNs. CTS URNs “provide the permanent<br />

can<strong>on</strong>ical references that Can<strong>on</strong>ical Text Services (CTS) rely <strong>on</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g> order to identify or retrieve<br />

passages of text. These references are a k<str<strong>on</strong>g>in</str<strong>on</strong>g>d of Uniform Resource Name (URN).” 169 URNs, accord<str<strong>on</strong>g>in</str<strong>on</strong>g>g<br />

to RFC2141 170 , “are <str<strong>on</strong>g>in</str<strong>on</strong>g>tended to serve as persistent, locati<strong>on</strong>-<str<strong>on</strong>g>in</str<strong>on</strong>g>dependent, resource identifiers.” Smith<br />

(2009) provides extensive <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong> <strong>on</strong> the syntax of CTS-URNs <strong>and</strong> their role <str<strong>on</strong>g>in</str<strong>on</strong>g> the CTS.<br />

This same article also addressed the importance of st<strong>and</strong>ards such as CTS (Smith 2009). “Source<br />

citati<strong>on</strong> is just <strong>on</strong>e part of scholarly publicati<strong>on</strong>, <strong>and</strong> c<strong>on</strong>venti<strong>on</strong>s for cit<str<strong>on</strong>g>in</str<strong>on</strong>g>g resources digitally must be<br />

viewed as part of a larger architectural design,” Smith expla<str<strong>on</strong>g>in</str<strong>on</strong>g>ed, “when the digital library is the global<br />

<str<strong>on</strong>g>in</str<strong>on</strong>g>ternet, the natural architecture for scholarly publicati<strong>on</strong>s is a hierarchy of service.” In additi<strong>on</strong> to the<br />

need for st<strong>and</strong>ards <strong>and</strong> c<strong>on</strong>venti<strong>on</strong>s for cit<str<strong>on</strong>g>in</str<strong>on</strong>g>g resources or parts of resources, the importance of<br />

st<strong>and</strong>ard c<strong>on</strong>venti<strong>on</strong>s or <strong>on</strong>tologies for add<str<strong>on</strong>g>in</str<strong>on</strong>g>g semantic encod<str<strong>on</strong>g>in</str<strong>on</strong>g>g for named entities (such as citati<strong>on</strong>s)<br />

to sec<strong>on</strong>dary literature <strong>and</strong> also to web pages <strong>and</strong> then l<str<strong>on</strong>g>in</str<strong>on</strong>g>k them to <strong>on</strong>l<str<strong>on</strong>g>in</str<strong>on</strong>g>e representati<strong>on</strong>s of primary<br />

<strong>and</strong> other sources has been the subject of a series of recent blog posts by Sebastian Heath of the<br />

American Numismatics Society (ANS). 171<br />

In a 2010 post entitled “RDFa Patterns for Ancient World References,” 172 Heath described his efforts<br />

to encode the year, different named entities such as Polem<strong>on</strong>, an imperial cult, <strong>and</strong> two text citati<strong>on</strong>s<br />

with<str<strong>on</strong>g>in</str<strong>on</strong>g> a chosen text. He wanted to embed this <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g>to a sample web page us<str<strong>on</strong>g>in</str<strong>on</strong>g>g st<strong>and</strong>ards<br />

such as RDFa 173 to make the data “automatically recognizable by third-parties.” To encode this<br />

<str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong> he utilized RDFa <strong>and</strong> a number of other <strong>on</strong>tologies with resolvable namespaces (e.g.,<br />

Dbpedia, CITO, FOAF, FRBR, OWL, RDFS, SKOS). Heath listed two references with<str<strong>on</strong>g>in</str<strong>on</strong>g> his sample<br />

text, the first to a published <str<strong>on</strong>g>in</str<strong>on</strong>g>scripti<strong>on</strong> <strong>and</strong> the sec<strong>on</strong>d to a recently published book. Interest<str<strong>on</strong>g>in</str<strong>on</strong>g>gly,<br />

while Heath was able to l<str<strong>on</strong>g>in</str<strong>on</strong>g>k to a bibliographic descripti<strong>on</strong> of the published book with<str<strong>on</strong>g>in</str<strong>on</strong>g> WorldCat,<br />

encod<str<strong>on</strong>g>in</str<strong>on</strong>g>g the reference to the published <str<strong>on</strong>g>in</str<strong>on</strong>g>scripti<strong>on</strong> with CITO (Citati<strong>on</strong> Typ<str<strong>on</strong>g>in</str<strong>on</strong>g>g Otology) was<br />

problematic because there was no value for “cites as a primary source” available with<str<strong>on</strong>g>in</str<strong>on</strong>g> this <strong>on</strong>tology.<br />

An additi<strong>on</strong>al complicati<strong>on</strong> was that Heath simply wanted to cite the <str<strong>on</strong>g>in</str<strong>on</strong>g>dividual <str<strong>on</strong>g>in</str<strong>on</strong>g>scripti<strong>on</strong>, <strong>and</strong> there<br />

was no way to cite just <strong>on</strong>e <str<strong>on</strong>g>in</str<strong>on</strong>g>scripti<strong>on</strong> or “work” with<str<strong>on</strong>g>in</str<strong>on</strong>g> the larger published volume of <str<strong>on</strong>g>in</str<strong>on</strong>g>scripti<strong>on</strong>s.<br />

“The c<strong>on</strong>cept of “Primary Source” <strong>and</strong> references thereto is important for the Humanities <strong>and</strong> we need<br />

a way of <str<strong>on</strong>g>in</str<strong>on</strong>g>dicat<str<strong>on</strong>g>in</str<strong>on</strong>g>g its usage,” Heath noted. “It's also important that I'm referr<str<strong>on</strong>g>in</str<strong>on</strong>g>g to the publicati<strong>on</strong> of<br />

169 http://chs75.chs.harvard.edu/projects/dig<str<strong>on</strong>g>in</str<strong>on</strong>g>c/techpub/cts-urn-overview<br />

170 RFC2141 is a “memo” released by the Internet Eng<str<strong>on</strong>g>in</str<strong>on</strong>g>eer<str<strong>on</strong>g>in</str<strong>on</strong>g>g Task Force (IETF) that specified the “can<strong>on</strong>ical syntax” for URNs<br />

(http://tools.ietf.org/html/rfc2141)<br />

171 http://numismatics.org<br />

172 http://mediterraneanceramics.blogspot.com/2010/01/rdfa-patterns-for-ancient-world.html<br />

173 “RDFa is a specificati<strong>on</strong> for attributes to express structured data <str<strong>on</strong>g>in</str<strong>on</strong>g> any markup language” (http://www.w3.org/TR/rdfa-syntax/). See also<br />

http://www.w3.org/TR/xhtml-rdfa-primer/

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!