22.08.2013 Views

A generic framework for Arabic to English machine ... - Acsu Buffalo

A generic framework for Arabic to English machine ... - Acsu Buffalo

A generic framework for Arabic to English machine ... - Acsu Buffalo

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

1.3. TECHNOLOGIES<br />

ditransitive). The <strong>Arabic</strong> language is written from right <strong>to</strong> left and has a unique letter<br />

shape. Words are written in horizontal lines from right <strong>to</strong> left. The letter shape depends<br />

on its position in the word; initial (prefix), medial (infix), final (suffix) or (Isolated).<br />

In technical linguistic terms, <strong>Arabic</strong> is a ‘pro–drop’ or ‘pronoun–drop’ language. It can<br />

define who takes the action by using conjugations. The pro–drop parameter is an aspect of<br />

grammar that allows subjects <strong>to</strong> be optional in some languages. That is, every inflection<br />

in a verb paradigm is specified uniquely and does not need <strong>to</strong> use independent pronouns<br />

<strong>to</strong> differentiate the person, number, and gender of the verb. The system should cover and<br />

solve the “pro–drop” challenge in <strong>Arabic</strong>.<br />

1.3 Technologies<br />

We introduce the main technologies used <strong>to</strong> support the development of the research pre-<br />

sented in this thesis. These technologies are mainly the XML language and Java. The<br />

most recent recommendation of the XML language has been presented by Bray et al.<br />

(2008). XML has become the default standard <strong>for</strong> data exchange among heterogeneous<br />

data sources (Arciniegas 2000). The UniArab system allows data <strong>to</strong> be s<strong>to</strong>red in XML<br />

<strong>for</strong>mat. This data can then be queried, exported and serialized in<strong>to</strong> any <strong>for</strong>mat the devel-<br />

oper wishes. The Java programming language is used <strong>to</strong> implement the logical structures.<br />

The primary advantage being that Java is plat<strong>for</strong>m-independent and thus highly suitable<br />

<strong>for</strong> MT.<br />

Advantages of XML<br />

XML is a generalized way <strong>to</strong> s<strong>to</strong>re data, which is not married <strong>to</strong> any particular technology.<br />

This makes it easy <strong>to</strong> s<strong>to</strong>re something, and then come back and grab it later with some<br />

other technology <strong>for</strong> processing. Using XML <strong>to</strong> exchange in<strong>for</strong>mation offers a number<br />

of advantages, including the following:<br />

5

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!