26.12.2013 Views

A computational grammar and lexicon for Maltese

A computational grammar and lexicon for Maltese

A computational grammar and lexicon for Maltese

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

2.4.4 Coverage<br />

Abstract function implementation<br />

The following functions from the NumeralTransfer module were not implemented in the <strong>Maltese</strong><br />

<strong>grammar</strong>:<br />

active2passive dconcat digits2num digits2numeral dn dn10 dn100 dn1000 dn1000000a dn1000000b<br />

dn1000000c nd nd10 nd100 nd1000 nd1000000 num2digits<br />

A bug in the GF compiler means that no warning is generated when these functions have no<br />

linearisation rules. However, these functions are in fact not implemented in any of the language<br />

implementations in the RGL <strong>and</strong> thus were not deemed essential.<br />

Regression test results<br />

Table 2.20 shows the results of running the treebank regression tests against the version of the<br />

<strong>Maltese</strong> resource <strong>grammar</strong> at the time of publication. As can be seen, a considerable number<br />

of linearisations (15%) do not match their gold st<strong>and</strong>ards. These failures can be generally<br />

attributed to one of the following:<br />

• Irregular vowel changes in verbs (require more <strong>for</strong>ms in the <strong>lexicon</strong>).<br />

• Incorrect selection of stem <strong>for</strong>m <strong>and</strong>/or suffix <strong>for</strong>m when joining enclitic pronouns.<br />

• Insufficient work on relative clauses.<br />

• Errors in the use of the copula kien in some tense combinations.<br />

• Bugs in the numerals module.<br />

It is worth pointing out that the totals in the table give a measure to how well the <strong>grammar</strong><br />

covers the treebank set itself; There is no measure however of how completely or evenly the<br />

treebanks cover the <strong>Maltese</strong> language in general. Note also that all the testing carried out is<br />

only evaluating linearisation; no testing is per<strong>for</strong>med in the parsing direction.<br />

Table 2.20: Treebank list <strong>and</strong> testing results<br />

Name Description Trees Passed Rate (%)<br />

ap Adjectival phrases 2 1 50.0<br />

articles Articles 5 5 100.0<br />

exx-resource<br />

Wide-coverage treebank from testsuite/libraries/<br />

in the GF repository<br />

186 111 59.7<br />

n-clitics Enclitic pronouns on nouns 49 35 71.4<br />

np Noun phrases 1 1 100.0<br />

Continued on next page<br />

42

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!