Automatic detection of new domain-specific words, using document ...
Automatic detection of new domain-specific words, using document ...
Automatic detection of new domain-specific words, using document ...
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
4.2. Problems<br />
• Highly frequent <strong>domain</strong>-<strong>specific</strong> types in a text only count 1<br />
−→ Largest intersection <strong>of</strong> D and the set <strong>of</strong> text tokens, W<br />
• Domains with large vocabularies are likely to get a better score<br />
−→ Size <strong>of</strong> a <strong>domain</strong>-<strong>specific</strong> vocabulary, |D|