A generic framework for Arabic to English machine ... - Acsu Buffalo
the system capability to deal with a large number of structure variations, it becomes significantly more powerful since each structure represents all possible sentences of that structure regardless of the specific words included. This is a significant point, since vocabulary is easy to develop, but structure requires much more effort.
The major challenge we faced was to use RRG within a machine translation system. UniArab is the first MT system that uses RRG. In the Arabic linguistic tradition there is not a clear-cut, well-defined analysis of the inventory of parts of speech in Arabic. We found that the existing classifications were not suitable, and so we had to create a classification that made sense for RRG-based translation. We were able to extract logical structures that made sense from natural Arabic, and we were also able to generate English translations from these logical structures.
Some specific challenges included dealing with the absence of the copula verb, ‘to be’, in Arabic. To solve this, we had to look at Arabic sentences that do not contain verbs and correctly deduce how to extract the copula. Free word order was another challenge due to its widespread presence in Arabic, and this was solved by detailed analysis of the source language and by incorporating this analysis in the logical structure.
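The copula handling described above can be illustrated with a minimal sketch. It is not UniArab's implementation: the toy lexicon, the transliterations, the `LogicalStructure` class, and the `analyse` and `generate` functions are all hypothetical names introduced for illustration. The sketch assumes a two-word verbless sentence in subject-predicate order, represents it with an RRG-style `be'` predicate, and assumes present tense when generating English.

```python
from dataclasses import dataclass

# Hypothetical toy lexicon (an assumption, not UniArab's lexicon):
# transliterated Arabic word -> (English gloss, part of speech).
LEXICON = {
    "khalid": ("Khalid", "proper_noun"),
    "tabib": ("doctor", "noun"),
    "al-walad": ("the boy", "noun"),
    "tawil": ("tall", "adjective"),
}


@dataclass
class LogicalStructure:
    """Simplified stand-in for an RRG logical structure such as be'(x, [pred'])."""
    predicate: str
    subject: str
    complement: str
    complement_pos: str


def analyse(tokens):
    """Build a be' logical structure for a two-word verbless sentence."""
    tagged = [LEXICON[token] for token in tokens]
    has_verb = any(pos == "verb" for _, pos in tagged)
    if has_verb or len(tagged) != 2:
        raise NotImplementedError("only two-word verbless sentences are covered")
    (subject, _), (complement, complement_pos) = tagged
    # No verb was found, so the copula is supplied at the logical-structure level.
    return LogicalStructure("be'", subject, complement, complement_pos)


def generate(structure):
    """Render the logical structure as an English sentence (present tense assumed)."""
    complement = structure.complement
    if structure.complement_pos == "noun":
        article = "an" if complement[0] in "aeiou" else "a"
        complement = f"{article} {complement}"
    subject = structure.subject[0].upper() + structure.subject[1:]
    return f"{subject} is {complement}"


print(generate(analyse(["khalid", "tabib"])))
print(generate(analyse(["al-walad", "tawil"])))
```

Because the copula is introduced in the logical structure and not in the word string, the generation step does not need to know that the source sentence had no verb.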
We have discovered that RRG is a realistic basis for machine translation systems. The use of a sentence’s logical structure to create translations is robust and gives high-quality translations, which can deal with some of the challenges of languages like Arabic.
Our work has contributed the first machine translation system based on RRG, which we have used to demonstrate its effectiveness for MT. This was a major challenge as we had little prior work to refer to. We have also advanced work on Arabic language classification,