The Corpus Thread - Det Danske Sprog- og Litteraturselskab
3.3. Filling in the header<br />
Legal values<br />
Value Description<br />
nil Info has not been determined yet<br />
empty Info is irrelevant, non-existent, or undeterminable<br />
file The source of the text is an electronic file. Default<br />
ocr-raw The text is OCR-scanned but not proof-read<br />
ocr-proof The text is OCR-scanned and proof-read<br />
keyed-raw The text is manually keyed but not proof-read<br />
keyed-proof The text is manually keyed and proof-read<br />
double-keyed The text is double-keyed, i.e. keyed in two versions by two individual typists; both versions are automatically compared and manually corrected<br />
pdf-converted-acrobat9 Converted from PDF by Acrobat 9<br />
pdf-converted-pdf2xml Converted from PDF by pdf2xml<br />
⊲ captureYear<br />
The year of data capture.<br />
Properties<br />
Value set type descriptive<br />
XML name n/a<br />
Legal values Four-digit years, which may be extended to full dates following the pattern yyyy-mm-dd.<br />
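As an illustration of the legal values for captureYear, the following is a minimal sketch of a check that accepts either a four-digit year or a full date in the form yyyy-mm-dd. The function name is_valid_capture_year is hypothetical and not part of any corpus tool described here. The sketch assumes that only these two forms are legal (no yyyy-mm) and does not handle special values such as nil or empty.

```python
import re
from datetime import date

# Hypothetical helper for illustration only; not part of the corpus software.
# Assumption: a value is either "yyyy" or "yyyy-mm-dd", nothing in between.
_CAPTURE_YEAR = re.compile(r"([0-9]{4})(?:-([0-9]{2})-([0-9]{2}))?")


def is_valid_capture_year(value: str) -> bool:
    """Return True if value is a four-digit year or a real yyyy-mm-dd date."""
    match = _CAPTURE_YEAR.fullmatch(value)
    if match is None:
        return False
    year, month, day = match.groups()
    if month is None:
        # Bare four-digit year.
        return True
    try:
        # Let the standard library reject impossible dates such as 2009-02-30.
        date(int(year), int(month), int(day))
    except ValueError:
        return False
    return True
```

Under these assumptions, a value such as 2009 or 2009-03-17 would pass, while 2009-3-17 or 2009-02-30 would be rejected.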
⊲ certainty<br />
The degree of certainty about how precise a piece of data, typically a date, is.<br />
Properties<br />
Value set type enumerated, closed<br />
XML name vs_certainty.xml