Exploiting Corpora with Sketch Engine - NLP Centre - Masaryk ...
Exploiting Corpora with Sketch Engine - NLP Centre - Masaryk ...
Exploiting Corpora with Sketch Engine - NLP Centre - Masaryk ...
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
Contents Current Trends in Corpus Processing <strong>Sketch</strong> <strong>Engine</strong> Finding Collocations Using CQL<br />
Zipf’s law II<br />
may be simplified to inductive definition:<br />
Zipf’s law (simplified)<br />
frequency of the n-th element fn ≈ 1<br />
n · f1<br />
⇒ frequency is inversely proportional to the rank according to<br />
frequency<br />
⇒ one needs really large corpora to capture all the variety of<br />
many language phenomena<br />
Miloš Jakubíček LCL UK, <strong>NLP</strong>C FI MU CZ<br />
<strong>Exploiting</strong> <strong>Corpora</strong> <strong>with</strong> <strong>Sketch</strong> <strong>Engine</strong>