26.12.2013 Views

A computational grammar and lexicon for Maltese

A computational grammar and lexicon for Maltese

A computational grammar and lexicon for Maltese

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

include the derived <strong>for</strong>m number <strong>and</strong> root radicals where relevant. Thus rikeb (‘he rode’)<br />

becomes rikeb_RKB_1_9451_V.<br />

2. A function is defined in the abstract syntax module <strong>for</strong> this identifier, with a category to<br />

match its POS.<br />

3. A linearisation <strong>for</strong> this function is defined in the concrete syntax module, which uses a<br />

smart paradigm <strong>for</strong> the particular POS <strong>and</strong> supplying (minimally) a lemma <strong>for</strong>m. In the<br />

case of <strong>Maltese</strong> verbs, the root is also supplied since this in<strong>for</strong>mation is available in the<br />

<strong>lexicon</strong>.<br />

At the end of this process one has a pair of modules which together constitute a monolingual<br />

dictionary in GF. As with the rest of a resource <strong>grammar</strong>, such modules are typically not used<br />

directly but imported in other application <strong>grammar</strong>s as required.<br />

Quality<br />

It is common to require some post-editing of the generated dictionary <strong>grammar</strong>s since invariably<br />

in some cases the wrong paradigm will be chosen. As dictionaries are often quite large<br />

however, this might turn into a somewhat time-consuming task.<br />

While GF smart paradigms are generally designed to work with minimal in<strong>for</strong>mation, all<br />

relevant data that exists in the source word list can <strong>and</strong> should be be used in order to improve<br />

the accuracy of the smart paradigm. Specifically, valency in<strong>for</strong>mation would help in order to<br />

choose between the correct <strong>for</strong>ms of each lexical category, such as V, V2 etc. Un<strong>for</strong>tunately only<br />

minimal amounts of valency in<strong>for</strong>mation is available in the given sources <strong>and</strong> thus can only be<br />

made use of to a small extent.<br />

3.3.2 Dictionary modules<br />

A monolingual dictionary in GF has function identifiers which are not shared with other languages.<br />

For this reason, the dictionary must consist of both an abstract <strong>and</strong> concrete module.<br />

The following is an example from each:<br />

abstract DictMltAbs = Cat ** {<br />

...<br />

fun rikeb_RKB_1_9451_V : V ;<br />

...<br />

}<br />

concrete DictMlt of DictMltAbs = CatMlt ** open ParadigmsMlt in {<br />

...<br />

lin rikeb_RKB_1_9451_V = mkV "rikeb" (mkRoot "r-k-b") ;<br />

...<br />

}<br />

57

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!