25.08.2013 Views

PDF (Online Text) - EURAC

PDF (Online Text) - EURAC

PDF (Online Text) - EURAC

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Another very useful feature of xfst is the ability to create scripts with several<br />

commands in a sequence. The later commands can operate on the output of earlier<br />

commands, and can thus create a cascade of finite-state transducers. Xfst also provides<br />

convenient ways of outputting all the words recognised by a given transducer, which<br />

proved very useful in the creation of the online reference (see section 5). An updated<br />

version of xfst (Beesley & Karttunen forthcoming 2006) also includes support for utf-<br />

8.<br />

While finite-state technology is very good at generating and recognising regular<br />

expressions, it has a harder time capturing other features of natural language such<br />

as non-concatenative morphological structure. The next section describes some<br />

adaptations that allow FST to handle many of the non-concatenative patterns in<br />

Georgian.<br />

In addition, FST is not designed to represent a dynamic, living mental lexicon of<br />

an actual speaker. It does not provide any mechanisms for probabilistic decisions, or<br />

for recognition and generation of novel inflectional forms. The concluding section<br />

discusses some possible future developments in this area.<br />

4. Computational Model of the Georgian Verb<br />

4.1 General Idea<br />

As argued above, Georgian verb morphology can be described as a series of patterns<br />

at various levels of regularity. Most of the patterns specify particular morphosyntactic<br />

or semantic properties of verb forms and the corresponding combinations of elements<br />

in the morphological templates. In the model proposed here, screeve formation is<br />

viewed as lexical or semi-regular, and pronominal agreement is viewed as completely<br />

regular.<br />

Screeve formation for different conjugation classes (transitive, unergative,<br />

unaccusative, and inverse) is fairly different in Georgian, and so each conjugation class<br />

is implemented as a separate network. Nevertheless, the principles for composing<br />

each network are the same.<br />

The model is implemented as a cascade of finite-state transducers, that is, as<br />

several levels of FST networks such that the result of composing a lower-level network<br />

serves as input to a higher-level network. The levels correspond to the division of<br />

templatic patterns into completely lexical (Level 1) and semi-regular (Level 2). Level<br />

3 contains completely regular patterns that apply to the results of both Level 1 and<br />

Level 2. The result of compiling Level 3 patterns is the full set of conjugations for the<br />

235

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!