
untangling_the_web


DOCID: 4046925

UNCLASSIFIED//FOR OFFICIAL USE ONLY

[Screenshot: the Fagan Finder Translation Wizard's international keyboard set to Hebrew, with an "Enter text or a URL" box, "change text direction" and "hide international keyboard" links, and the note "Choose another 'from' language to use its script."]

Fagan Finder Translation Wizard

http://www.faganfinder.com/translate/

Then copy and paste the Hebrew term or phrase into a search engine that handles multiple character encodings, such as Google or Yahoo (or one of many international search engines), and you can search on the term without having to load a Hebrew keyboard.
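As a rough illustration of what happens when a pasted Hebrew term reaches a search engine, the sketch below builds a query URL in which the term is percent-encoded as UTF-8. The endpoint and the `q` parameter name are assumptions made for the example, as is the helper name `build_search_url`; other search engines may expect a different URL layout or encoding.

```python
from urllib.parse import urlencode

# Assumed search endpoint, used only for illustration.
SEARCH_ENDPOINT = "https://www.google.com/search"


def build_search_url(term, endpoint=SEARCH_ENDPOINT):
    """Return a search URL with the term percent-encoded as UTF-8."""
    return endpoint + "?" + urlencode({"q": term}, encoding="utf-8")


# The Hebrew word "shalom", written with escapes so the example does not
# depend on how an editor renders right-to-left text.
hebrew_term = "\u05e9\u05dc\u05d5\u05dd"
print(build_search_url(hebrew_term))
```

Each Hebrew letter becomes two percent-encoded bytes, which is why the term can travel in a plain ASCII URL without a Hebrew keyboard or font on the sending side.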

Here are some other useful features of the Fagan Finder translation wizard. The wizard gives users the option to list all translation matches, to choose multiple translators (when available), to select dialects, and, when several translation tools are available, to list the matches so users can pick one or more. In addition to HTML, the PDF and SWF (Flash) file formats are supported for webpage translations.

Another useful tool that is part of the Fagan Finder translation wizard is the language identification interface. Here users can enter either text or a URL, and Fagan Finder will query a set of language identification tools to try to determine the language and encoding of the text. However, keep in mind that it is often easy to determine the language/encoding of a webpage by looking at either the Page Info (in Mozilla only; look at charset) or the Page Source in both Mozilla and Internet Explorer. Look for charset, which indicates a specific character set that you can then use to identify the language. For example, Windows-1256 and ISO 8859-6 are both Arabic-language encodings (ISO 8859-1, by contrast, is the Western European Latin-1 character set).
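The manual check described above (find charset in the page source, then map it to a language) can be sketched in a few lines. This is a minimal illustration, not a robust parser: the function name `charset_hint` and the small lookup table are assumptions for the example, and the table covers only a handful of charset labels.

```python
import re

# Illustrative, deliberately incomplete mapping from a charset label to the
# script or language family it was designed for (an assumption of this sketch).
CHARSET_HINTS = {
    "windows-1256": "Arabic",
    "iso-8859-6": "Arabic",
    "windows-1255": "Hebrew",
    "iso-8859-8": "Hebrew",
    "iso-8859-1": "Western European (Latin-1)",
    "utf-8": "Unicode (any language)",
}

# Matches charset=... with or without quotes, in either meta-tag style.
CHARSET_RE = re.compile(r"charset\s*=\s*[\"']?([A-Za-z0-9_.:-]+)", re.IGNORECASE)


def charset_hint(page_source):
    """Return (charset, hint) for the first charset declaration found."""
    match = CHARSET_RE.search(page_source)
    if match is None:
        return None, None  # no declaration in the page source
    charset = match.group(1).lower()
    return charset, CHARSET_HINTS.get(charset, "unknown")


sample = '<meta http-equiv="Content-Type" content="text/html; charset=windows-1256">'
print(charset_hint(sample))
```

A Unicode charset such as UTF-8 says nothing about the language of the page, so in that case the text itself has to be examined.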

However, you cannot always find this information using Page Source/Info and, for individual words or phrases, Fagan Finder's language identification interface is especially valuable. My tests of the language identification tool indicate it usually does a good job of identifying either the language alone or the language and the encoding. Fagan Finder's language identification is not a single tool but a single interface to many language identification tools. Users have the option to choose among 10 language identification tools. Given the complexity and difficulty of language identification, it's probably a good idea to compare the results of several different tools and not rely on one.
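One simple way to compare the results of several tools is a majority vote with an agreement score. The sketch below assumes you have already collected each tool's guess by hand; the tool names and the function `consensus_language` are hypothetical, and nothing here calls Fagan Finder or any real identification service.

```python
from collections import Counter


def consensus_language(guesses):
    """Return (language, agreement) from a mapping of tool name -> guess.

    Tools that returned no answer (None or an empty string) are ignored.
    Agreement is the share of answering tools that chose the winning language.
    """
    votes = Counter(guess for guess in guesses.values() if guess)
    if not votes:
        return None, 0.0
    language, count = votes.most_common(1)[0]
    return language, count / sum(votes.values())


# Hypothetical results copied from four identification tools.
results = {
    "tool_a": "Hebrew",
    "tool_b": "Hebrew",
    "tool_c": "Yiddish",
    "tool_d": None,  # this tool gave no answer
}
language, agreement = consensus_language(results)
print(language, round(agreement, 2))
```

A low agreement score is a signal to look more closely, for example by testing a longer sample of the text. In a tie, the language that was seen first wins, so a tie should be treated as no consensus at all.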

