A computational grammar and lexicon for Maltese
Web application
• The user interface for the web application was designed in an ad hoc manner and no real
study or research was performed into user interfaces for online lexica. The application
records no tracking of website usage, so there is no way of analysing usage
in order to improve the interface.
• The user facilities for reporting mistakes and adding new contributions to the lexicon
are completely anonymous and there is no protection against spam. Requiring users to
register and log in before providing feedback would increase the number of steps required
and likely reduce the amount of feedback received.
• A single bug in the morphological grammar can produce hundreds or thousands of incorrect
word forms. Firstly, this may be distracting to users who may try to flag each of
them individually (with no added benefit). Secondly, no method has been put in place
for regenerating incorrect word forms after making a change in the grammar.
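One possible way to address the last point is sketched below, purely as an illustration and not as part of the system described here. It assumes that each lexicon entry records which version of the grammar produced its word forms, so that stale entries can be found and regenerated after a fix. The names `Entry`, `grammar_version`, and `regenerate_stale`, as well as the `generate` callback standing in for the morphological grammar, are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class Entry:
    """A hypothetical lexicon entry with its generated word forms."""
    lemma: str
    forms: List[str] = field(default_factory=list)
    grammar_version: int = 0  # version of the grammar that produced `forms`


def regenerate_stale(
    entries: List[Entry],
    generate: Callable[[str], List[str]],
    current_version: int,
) -> int:
    """Regenerate forms for entries built with an older grammar version.

    `generate` is a stand-in for the morphological grammar: it maps a
    lemma to its list of word forms. Returns the number of entries updated.
    """
    updated = 0
    for entry in entries:
        if entry.grammar_version < current_version:
            entry.forms = generate(entry.lemma)
            entry.grammar_version = current_version
            updated += 1
    return updated
```

With a version stamp of this kind, a single grammar fix could be followed by one batch regeneration, rather than relying on users to flag each incorrect form individually.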
4.3 Related work
4.3.1 Computational linguistics resources for Maltese
Much of the background for Maltese computational linguistics, in particular the Maltilex
project, is discussed in section 1.2.3. This section mentions some more disparate works which
are of less direct relevance or use to this work. A useful background on some of the past work
in Maltese computational linguistics can be found in (Dalli, 2002a, section 2.5).
The earliest mentioned morphological analyser for Maltese seems to have been written by Galea
(1996) in an undergraduate project entitled “Morphological Analysis of Maltese Verbs”. Unfortunately
neither the source code nor the publication itself was available at the time of writing.
Another verb morphology generator for Maltese was written by Ramon Casha from the
Malta Linux User Group. The work is unpublished and was obtained via personal communication
with the author. While it is known to have been used to generate an aspell dictionary for
Maltese 1 , no metrics for its coverage could be found. A continuation of this work has recently
been undertaken by Mike Vella 2 .
A proprietary online English-Maltese dictionary is maintained by Ian Vella 3 , containing
some 12,219 English entries. The dictionary contains no other lexical information.
Methods for building a computational lexicon for Maltese based on corpus extraction are
covered at length in (Dalli, 2002b). The primary difficulty with this approach is that stemming
or lemmatisation is especially hard in a language with a rich, non-concatenative morphology.
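The difficulty can be illustrated with a small sketch. It is not taken from Dalli (2002b); the suffix list and function names are assumptions made for this example only. Four forms built on the root k-t-b are used: kiteb ("he wrote"), ktibt ("I wrote"), kitbu ("they wrote") and miktub ("written"). A naive suffix-stripping stemmer returns a different stem for each form, whereas what the forms actually share is a discontinuous consonantal root.

```python
# Toy suffix list, assumed for illustration only; not a real Maltese stemmer.
SUFFIXES = ("na", "tu", "et", "t", "u")

FORMS = ["kiteb", "ktibt", "kitbu", "miktub"]  # all built on the root k-t-b


def naive_stem(word: str) -> str:
    """Strip the longest matching suffix, keeping at least three characters."""
    for suffix in sorted(SUFFIXES, key=len, reverse=True):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def is_subsequence(root: str, word: str) -> bool:
    """True if the letters of `root` occur in `word` in order, with gaps allowed."""
    remaining = iter(word)
    return all(letter in remaining for letter in root)


# Suffix stripping yields a distinct stem for every form ...
stems = {form: naive_stem(form) for form in FORMS}

# ... while the root consonants appear, interleaved, in all of them.
shares_root = all(is_subsequence("ktb", form) for form in FORMS)
```

Because the vowel pattern changes inside the stem, grouping such forms under one lemma requires knowledge of roots and patterns rather than affix removal alone.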
Dalli et al. (2004) describe the use of web services for linguistic resources, focusing mainly
on system architectures and communication protocols. Web service technologies have matured
1 ftp://ftp.gnu.org/gnu/aspell/dict/mt, accessed 2013-09-04<br />
2 https://github.com/vellamike/maltese-spell-checker, accessed 2013-09-04<br />
3 http://www.englishmaltesedictionary.com/mt.htm, accessed 2013-07-18<br />