Automatic detection of new domain-specific words, using document ...
Automatic detection of new domain-specific words, using document ...
Automatic detection of new domain-specific words, using document ...
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
• Classification procedure<br />
– Reflect properties <strong>of</strong> the material to be processed<br />
∗ Token overlap between a text and a <strong>domain</strong> vocabulary<br />
∗ Size <strong>of</strong> the <strong>domain</strong>-<strong>specific</strong> vocabulary<br />
∗ Uniqueness <strong>of</strong> a certain type for a particular <strong>domain</strong><br />
∗ Ratio between recognised and unrecognised tokens