26.12.2013 Views

A computational grammar and lexicon for Maltese

A computational grammar and lexicon for Maltese

A computational grammar and lexicon for Maltese

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

The definite article is the proclitic il-, which becomes l- in front of vowels <strong>and</strong> where the l<br />

changes to match succeeding coronal consonants: il- + dar → id-dar (‘the house’). The article<br />

also joins with prepositions, ma + il- + <strong>for</strong>nar → mal-<strong>for</strong>nar (‘with the baker’). It is invariant<br />

<strong>for</strong> gender, number <strong>and</strong> used <strong>for</strong> both both nouns/adjectives. Absence of the article gives the<br />

indefinite <strong>for</strong>m.<br />

Except <strong>for</strong> the numeral wieħed (‘one’), all cardinal numerals precede the noun they quantify.<br />

This is true <strong>for</strong> all other quantifiers <strong>and</strong> demonstratives, which are located to the left of the<br />

noun which has to host the definite article if the demonstrative is used: dan + il- + ajruplan →<br />

dal-ajruplan (‘this aeroplane’). Proper nouns are exempt from this rule.<br />

The pronouns available in <strong>Maltese</strong> are similar to those as in English. Pronouns are often<br />

enclitic, indicating possession when joined with nouns (e.g. missier + tiegħi → missieri, ‘my<br />

father’), <strong>and</strong> as direct <strong>and</strong>/or indirect objects when attached to verbs (e.g. ħareġ + aħna → ħariġna,<br />

(‘he took us out’)). Pronouns are only used as topic or <strong>for</strong> emphasis; otherwise <strong>Maltese</strong> is<br />

generally a pro-drop language. <strong>Maltese</strong> has only a single relative pronoun li which is common<br />

<strong>for</strong> all persons, genders <strong>and</strong> number.<br />

Verbs have two moods: indicative <strong>and</strong> imperative (there is no infinitive in <strong>Maltese</strong>). The<br />

indicative consists of two aspects: perfective <strong>for</strong> actions that have been completed, <strong>and</strong> imperfective<br />

<strong>for</strong> those which are ongoing. Other tenses are <strong>for</strong>med by combining with the auxiliary<br />

verb kien (‘he was’) <strong>and</strong> other particles such as ma <strong>for</strong> the negative.<br />

(Rosner & Joachimsen, 2012; Stolz, 2011)<br />

Vocabulary<br />

Given the heterogeneous makeup of <strong>Maltese</strong>, it is no surprise that its vocabulary is also quite<br />

varied. A study of the etymological sources of Aquilina’s dictionary surprisingly found that the<br />

majority of lexical entries in <strong>Maltese</strong> are in fact Siculo-Italian (52%), followed by Semitic (32%)<br />

<strong>and</strong> English (6%). The remaining percentage can be accounted <strong>for</strong> by purely local <strong>for</strong>mations<br />

<strong>and</strong> lexical items with unknown etymologies.<br />

On the other h<strong>and</strong>, type vs. token frequency counts in journalistic texts have revealed that<br />

almost 73% of items are of Semitic origin. This can be explained by the fact that prepositions,<br />

pronouns, <strong>and</strong> other grammatical words are mostly derived from Arabic. (Spagnol, 2011, p. 12)<br />

1.1.3 <strong>Maltese</strong> <strong>and</strong> the digital age<br />

A recent META-NET report on the language (Rosner & Joachimsen, 2012) indicates that while a<br />

number of linguistic resources do exist <strong>for</strong> the languages, <strong>Maltese</strong> is in general under-represented<br />

in digital terms when compared to other European languages. As shown in figure 1.1, the<br />

study finds that <strong>Maltese</strong> is best resourced with speech synthesis <strong>and</strong> text corpora, but very<br />

poorly represented in other areas of <strong>computational</strong> linguistics. Specifically, to date there are no<br />

<strong>computational</strong> <strong>grammar</strong>s, morphological analysers, or text generation systems <strong>for</strong> <strong>Maltese</strong>.<br />

A Eurobarometer report (European Commission, 2011) found that in Malta, 89.5% of the<br />

respondents claimed that <strong>Maltese</strong> was their mother tongue. However when it comes to Internet<br />

4

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!