A generic framework for Arabic to English machine ... - Acsu Buffalo
A generic framework for Arabic to English machine ... - Acsu Buffalo
A generic framework for Arabic to English machine ... - Acsu Buffalo
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
1.3. TECHNOLOGIES<br />
ditransitive). The <strong>Arabic</strong> language is written from right <strong>to</strong> left and has a unique letter<br />
shape. Words are written in horizontal lines from right <strong>to</strong> left. The letter shape depends<br />
on its position in the word; initial (prefix), medial (infix), final (suffix) or (Isolated).<br />
In technical linguistic terms, <strong>Arabic</strong> is a ‘pro–drop’ or ‘pronoun–drop’ language. It can<br />
define who takes the action by using conjugations. The pro–drop parameter is an aspect of<br />
grammar that allows subjects <strong>to</strong> be optional in some languages. That is, every inflection<br />
in a verb paradigm is specified uniquely and does not need <strong>to</strong> use independent pronouns<br />
<strong>to</strong> differentiate the person, number, and gender of the verb. The system should cover and<br />
solve the “pro–drop” challenge in <strong>Arabic</strong>.<br />
1.3 Technologies<br />
We introduce the main technologies used <strong>to</strong> support the development of the research pre-<br />
sented in this thesis. These technologies are mainly the XML language and Java. The<br />
most recent recommendation of the XML language has been presented by Bray et al.<br />
(2008). XML has become the default standard <strong>for</strong> data exchange among heterogeneous<br />
data sources (Arciniegas 2000). The UniArab system allows data <strong>to</strong> be s<strong>to</strong>red in XML<br />
<strong>for</strong>mat. This data can then be queried, exported and serialized in<strong>to</strong> any <strong>for</strong>mat the devel-<br />
oper wishes. The Java programming language is used <strong>to</strong> implement the logical structures.<br />
The primary advantage being that Java is plat<strong>for</strong>m-independent and thus highly suitable<br />
<strong>for</strong> MT.<br />
Advantages of XML<br />
XML is a generalized way <strong>to</strong> s<strong>to</strong>re data, which is not married <strong>to</strong> any particular technology.<br />
This makes it easy <strong>to</strong> s<strong>to</strong>re something, and then come back and grab it later with some<br />
other technology <strong>for</strong> processing. Using XML <strong>to</strong> exchange in<strong>for</strong>mation offers a number<br />
of advantages, including the following:<br />
5