The Corpus Thread - Det Danske Sprog- og Litteraturselskab
The Corpus Thread - Det Danske Sprog- og Litteraturselskab
The Corpus Thread - Det Danske Sprog- og Litteraturselskab
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
3.2. Header structure 26<br />
(identifying number) supplies an identifying code for a text.<br />
<br />
organizationName<br />
textId<br />
<br />
availDesc anonymisationDesc<br />
availDesc anonymisationDesc<br />
availDesc anonymisationDesc<br />
<br />
<br />
<strong>The</strong> element contains the name of the organization 9 responsible<br />
for the distribution of the electronic text sample. Usually there<br />
can only be one distributor for each text even though TEI allows to repeat<br />
this element as often as needed. <strong>The</strong> inventory of strings denoting distributors<br />
should be invariant, i.e. one name only per distributor.<br />
<strong>The</strong> obligatory CTB text id is given as contents of an<br />
element. Some dialects of TEI introduce an attribute id of the element<br />
which is illegal according to strict TEI. Other types of text, project-,<br />
or institution-internal identifications may be given in additional <br />
elements whose type attributes indicate the specific type of id.<br />
<strong>The</strong> text strings in (‘anonymous block’) 10 elements given under<br />
for both restricted (attribute status is set to “restricted”)<br />
and free (attribute status is set to “free”) give availability information for<br />
three fixed user categories: academic users, non-commercial users, and all<br />
types of users.<br />
Academic users are defined as users who are affiliated with the DK-<br />
CLARIN consortium.<br />
Non-commercial users are academic users not affiliated with the DK-<br />
CLARIN consortium, users from educational or governmental institutions.<br />
All users are any type of users including commercial users.<br />
<strong>The</strong> DK-CLARIN license committee has finally, i.e. at the end of the project,<br />
concluded that the types of licenses should be employed: public, academic<br />
and restricted and that licenses are to be managed outside text headers.<br />
However, WP 2.1 will stick to the categories and values described above.<br />
9 In DK-CLARIN this will typically be a member of the DK-CLARIN consortium.<br />
10 This type of elements is preferred to the alternative which is semantically misleading<br />
– these are no paragraphs but blocks of information.