18.07.2013 Views

The Corpus Thread - Det Danske Sprog- og Litteraturselskab

The Corpus Thread - Det Danske Sprog- og Litteraturselskab

The Corpus Thread - Det Danske Sprog- og Litteraturselskab

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

3.2. Header structure 35<br />

<br />

<br />

appDesc<br />

(may be left out)<br />

(optional)<br />

<br />

<br />

<strong>The</strong> element has the following attributes:<br />

xml:id unique XML identifier which is referenced by the corresponding annotation<br />

layer in the text.<br />

type specifies both the task (segmentation, annotation) and whether it was<br />

performed by an automatic application or a manual procedure (or a<br />

combination of both).<br />

subtype gives a further description of the applied tool taken from a fixed<br />

list of options.<br />

ident supplies a unique identifier for the application/procedure.<br />

version supplies a version number for the application/procedure. 19<br />

n gives supplementary info about the applied tag set or tokenization mode.<br />

when gives the date when the application was executed on the text.<br />

<strong>The</strong> element contains an element giving a freetext<br />

description of the application.<br />

<strong>The</strong> element within references that/those application/applications<br />

whose output has been used as input for the application<br />

in question as annotations can be added as layers on each other, cf.<br />

Chapter 4. This element is left out if an annotation refers to the base version<br />

of the text and not to another annotation layer.<br />

Finally, the optional element may reference certain resources a<br />

given tool has been using in cases where this is important.<br />

19 It may seem weird to apply version numbers to manual procedures. However, the version<br />

attribute is mandatory in TEI and also manual procedures may alter over time and<br />

should in any case be thoroughly documented – that is versioned.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!