A computational grammar and lexicon for Maltese
A computational grammar and lexicon for Maltese
A computational grammar and lexicon for Maltese
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
tionary <strong>and</strong> making it available online. At the time of writing, the project is still in the process<br />
of updating the original dictionary to include modern entries <strong>and</strong> had not yet entered the digitisation<br />
phase (personal communication, July 2013).<br />
1.2.4 MLSS<br />
As part of the Metanet4U initiative, the <strong>Maltese</strong> Language Software Services (MLSS) 4 portal<br />
has recently made available a number of language-processing tools <strong>for</strong> <strong>Maltese</strong>. These tools<br />
cover the following tasks:<br />
• paragraph <strong>and</strong> sentence splitting<br />
• tokenisation (splitting running text into individual tokens)<br />
• part of speech tagging (categorising each token in running text with its part of speech)<br />
1.2.5 Other resources<br />
<strong>Maltese</strong> root-<strong>and</strong>-pattern verb database<br />
The online database of root-<strong>and</strong>-pattern verbs (Camilleri & Spagnol, 2013) is based on the exhaustive<br />
enumeration of 1,923 roots <strong>and</strong> over 4,142 verbs by Spagnol (2011). The original data<br />
was converted from Excel <strong>for</strong>mat into a relational database, with a web-based interface then<br />
built around it. This interface provides the ability to search <strong>for</strong> roots <strong>and</strong> verbs quickly using<br />
regular expression syntax. In addition to in<strong>for</strong>mation on roots, classes <strong>and</strong> derived <strong>for</strong>ms from<br />
the original source, fields <strong>for</strong> English gloss, transitivity in<strong>for</strong>mation, <strong>and</strong> verb frequency were<br />
also added. The database is hosted on the <strong>Maltese</strong> Language Resource Server (MLRS) 5 <strong>and</strong><br />
provides both a user interface <strong>and</strong> a JSON web service.<br />
Corpus of broken plurals<br />
Mayer et al. (2013) have put together a large corpus of broken plurals in <strong>Maltese</strong>, containing<br />
654 singular-broken plural pairs mainly based on a list of synchronically used or known<br />
broken plurals compiled by Schembri (2012). The study is based on a comprehensive inventory<br />
of broken plural <strong>for</strong>ms which an average educated speaker of <strong>Maltese</strong> would be expected to<br />
have. In cases of a broken plural having multiple variants, only the most common or frequent is<br />
given. The corpus also includes a small number of collective <strong>for</strong>ms, such as basal (‘onions’) <strong>and</strong><br />
ġebel (‘stones’), which share the same CV structure of broken plurals but which differ morphosyntactically.<br />
It is available in CSV <strong>for</strong>mat from the MLRS 6 website.<br />
4 http://metanet4u.research.um.edu.mt/, accessed 2013-05-03<br />
5 http://mlrs.research.um.edu.mt/resources/verbalroots/, accessed 2013-05-03<br />
6 http://mlrs.research.um.edu.mt/index.php?page=33, accessed 2013-08-14<br />
8