Development of a Stemmer for the Greek Language - SAIS
Development of a Stemmer for the Greek Language - SAIS
Development of a Stemmer for the Greek Language - SAIS
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
Rule-set 13<br />
if (word ends on ΗΘΗΚΑ|ΗΘΗΚΕΣ|ΗΘΗΚΕ){<br />
remove <strong>the</strong> suffix;<br />
}<br />
if (word ends on ΗΚΑ|ΗΚΕΣ|ΗΚΕ){<br />
remove <strong>the</strong> suffix;<br />
If (remaining part is ∆ΙΑΘ|Θ|ΣΥΝΘ…) ||<br />
(remaining part ends on ΣΦ|ΟΘ|ΠΙΘ…){<br />
add “ΗΚ”;<br />
}<br />
}<br />
The rule removes <strong>the</strong> suffix ΗΘΗΚΑ, ΗΘΗΚΕΣ and ΗΘΗΚΕ <strong>for</strong> all <strong>the</strong> words,<br />
and <strong>the</strong> suffixes ΗΚΑ, ΗΚΕΣ and ΗΚΕ <strong>for</strong> a group <strong>of</strong> words.<br />
Example:<br />
ΧΤΙΖΩ ΧΤΙΖ*<br />
ΧΤΙΣΤΗΚΕ ΧΤΙΣΤ*<br />
The rule doesn’t affect one group <strong>of</strong> words that by chance have similar suffixes.<br />
Example:<br />
∆ΙΑΘΗΚΗ ∆ΙΑΘΗΚ<br />
∆ΙΑΘΗΚΕΣ ∆ΙΑΘΗΚ<br />
*Notice <strong>the</strong> difference on <strong>the</strong> present and <strong>the</strong> past stem<br />
Rule-set 14<br />
if (word ends on ΟΥΣΑ|ΟΥΣΕΣ|ΟΥΣΕ){<br />
remove <strong>the</strong> suffix;<br />
if (remaining part is ΦΑΡΜΑΚ|ΧΑ∆|ΜΕ∆…) || (remaining<br />
part ends on ΠΟ∆ΑΡ|ΒΛΕΠ…) || (remaining part ends on<br />
vowel){<br />
add “ΟΥΣ”;<br />
}<br />
}<br />
The rule removes <strong>the</strong> suffix ΟΥΣΑ, ΟΥΣΕΣ and ΟΥΣΕ <strong>for</strong> a group <strong>of</strong> words.<br />
Example:<br />
ΧΤΥΠΩ ΧΤΥΠ<br />
ΧΤΥΠΟΥΣΕΣ ΧΤΥΠ<br />
26