Development of a Stemmer for the Greek Language - SAIS
Development of a Stemmer for the Greek Language - SAIS
Development of a Stemmer for the Greek Language - SAIS
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
Figure 4: The <strong>Greek</strong> alphabet<br />
The <strong>Greek</strong> alphabet contains 24 letters, 7 vowels (Α, Ε, Η, Ι, Ο, Υ and Ω) and 17<br />
consonants (Β, Γ, ∆, Ζ, Θ, Κ, Λ, Μ, Ν, Ξ, Π, Ρ, Σ, Τ, Φ, Χ and Ψ).<br />
The character “ς” called “teliko sigma” and it replaces <strong>the</strong> “σ” only when it is<br />
written at <strong>the</strong> end <strong>of</strong> a word and <strong>for</strong> <strong>the</strong> lower case letters.<br />
Every word with more than 2 syllables, takes a tone-mark called “tonos”, over one<br />
vowel and under specific rules. In some cases we can have tone-mark over onesyllable<br />
words or even words with two tone-marks. This tone-mark is used only<br />
<strong>for</strong> <strong>the</strong> lower case letters too.<br />
2.3 Definitions<br />
Be<strong>for</strong>e a brief explanation <strong>of</strong> <strong>the</strong> most important grammatical rules <strong>for</strong> <strong>the</strong> <strong>Greek</strong><br />
language it is useful to define some important terms, common used in this <strong>the</strong>sis.<br />
Syllable is <strong>the</strong> piece <strong>of</strong> word consisted <strong>of</strong> at least one single vowel or one vowel<br />
followed by one or more consonants.<br />
Stem or Theme is <strong>the</strong> static (unchangeable) part on <strong>the</strong> start <strong>of</strong> a word.<br />
Derivative is a word created by ano<strong>the</strong>r word if an affix added on its <strong>the</strong>mes.<br />
Derivational suffix is a suffix <strong>of</strong> a derivative word<br />
12