23.07.2013 Views

Exploiting Corpora with Sketch Engine - NLP Centre - Masaryk ...

Exploiting Corpora with Sketch Engine - NLP Centre - Masaryk ...

Exploiting Corpora with Sketch Engine - NLP Centre - Masaryk ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Contents Current Trends in Corpus Processing <strong>Sketch</strong> <strong>Engine</strong> Finding Collocations Using CQL<br />

Size is not everything . . .<br />

Why are qualitative aspects so important – well this can’t be really<br />

a question, right?<br />

web is the most used data source to obtain enough source<br />

texts – „web as corpus“<br />

web is garbage (by definition)<br />

garbage as corpus?<br />

building corpora from web requires extensive post-processing<br />

Miloš Jakubíček LCL UK, <strong>NLP</strong>C FI MU CZ<br />

<strong>Exploiting</strong> <strong>Corpora</strong> <strong>with</strong> <strong>Sketch</strong> <strong>Engine</strong>

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!