Exploiting Corpora with Sketch Engine - NLP Centre - Masaryk ...
Size is not everything ...
Why are qualitative aspects so important? Well, this can't really be
a question, right?
the web is the most widely used data source for obtaining enough source
texts: "web as corpus"
web is garbage (by definition)<br />
garbage as corpus?<br />
building corpora from the web requires extensive post-processing
Miloš Jakubíček LCL UK, NLPC FI MU CZ
Exploiting Corpora with Sketch Engine