The Corpus Thread - Det Danske Sprog- og Litteraturselskab
The Corpus Thread - Det Danske Sprog- og Litteraturselskab
The Corpus Thread - Det Danske Sprog- og Litteraturselskab
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
3.3. Filling in the header 83<br />
3.3.3 Additional value sets for text classification<br />
Text classification outside the scope of standard TEI header semantics<br />
is achieved by using a number of schemes inside the<br />
element. This special information is needed to enable older<br />
corpus material like the DDOC and KORPUS 2000 to be easily integrated in<br />
the new structure. <strong>The</strong> following types of information are inherited from<br />
these two corpora, the general structure for the element being<br />
<br />
where the schemes are in use can be seen under myClassification, see 3.3.2.1<br />
on page 61.<br />
In CTB, there is no scheme for genre information. Instead,<br />
the element under is used. DDOC and KOR-<br />
PUS 2000 genre values (as well as other obsolete values in an CTB context)<br />
should be mapped to the CTB header, see Chapter ??.