18.07.2013 Views

The Corpus Thread - Det Danske Sprog- og Litteraturselskab



3.3. Filling in the header 83

3.3.3 Additional value sets for text classification

Text classification outside the scope of standard TEI header semantics is achieved by using a number of schemes inside the element. This special information is needed so that older corpus material such as the DDOC and KORPUS 2000 can be easily integrated into the new structure. The following types of information are inherited from these two corpora, the general structure for the element being

where the schemes are in use can be seen under myClassification, see 3.3.2.1 on page 61.

In CTB, there is no scheme for genre information. Instead, the element under is used. DDOC and KORPUS 2000 genre values (as well as other values that are obsolete in a CTB context) should be mapped to the CTB header, see Chapter ??.
