11.07.2015 Views

Encyclopedia of Computer Science and Technology

Encyclopedia of Computer Science and Technology

Encyclopedia of Computer Science and Technology

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

LLAN See local area network.language translation s<strong>of</strong>twareAnyone who has learned a new language has also gainedan appreciation for how difficult it is to translate from onelanguage to another while preserving the intent, meaning,<strong>and</strong> context <strong>of</strong> the original. Not surprisingly, developings<strong>of</strong>tware to perform this task, <strong>of</strong>ten called “machinetranslation” (MT), has also proven to be difficult. (For amore general discussion <strong>of</strong> how languages can be representedor studied using a computer, see linguistics <strong>and</strong>computing.)Rules-Based ApproachesThere are several approaches that can be taken to automaticlanguage translation. A rules-based system parses the originaltext to construct an intermediate representation. Theprogram then “transfers” the represented structure to anequivalent structure in the target language, drawing uponextensive lexicons (dictionaries) containing such thingsas phrase structures, word structures (morphology), <strong>and</strong>semantics (meanings). Developing this extensive knowledgebase <strong>and</strong> the rules for manipulating it is the mostchallenging part <strong>of</strong> developing rules-based language translationsystems. (For more on the general process <strong>of</strong> computer“underst<strong>and</strong>ing” <strong>of</strong> language, see natural languageprocessing.)Generally a translation produced by a rules-based systemwill be intelligible to a speaker <strong>of</strong> the target language,who will be able to underst<strong>and</strong> the broad meaning <strong>of</strong> theoriginal text. However, it is likely to sound “awkward” <strong>and</strong>miss certain nuances.A simplified approach is based on a dictionary <strong>of</strong> wordsor phrases <strong>and</strong> their meanings. Each source word or phraseis simply looked up <strong>and</strong> converted to its equivalent in thetarget language. Because it does not deal with grammaticalstructure or context, this method is not very satisfactoryexcept perhaps for translating simple lists or catalogues.Statistical ApproachesThe other main approach to automatic translation relieson statistical analysis <strong>of</strong> a large body <strong>of</strong> text (corpus) thatis already translated into two languages. For example, theBayes theorem (see Bayesian analysis) can be used to estimatethe probability that string A in French (for example,“c’est un chien”) will occur in the English version as stringA’ “it’s a dog.”). Depending on the application, the sameapproach can be applied word for word, phrase for phrase,or sentence for sentence. Statistical approaches have hadgood success (particularly if the corpus is both representative<strong>and</strong> sufficiently extensive). However, since it is basedon probability, there is always a chance that a segment <strong>of</strong>text will be given the most likely translation rather than themeaning intended by the writer.Evaluation <strong>and</strong> ApplicationsThere are a number <strong>of</strong> features in real human languages<strong>and</strong> usage that are challenging for translation s<strong>of</strong>tware todeal with. Words can be ambiguous due to multiple mean-270

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!