23.11.2013 Aufrufe

tekom-Jahrestagung 2012 - ActiveDoc

tekom-Jahrestagung 2012 - ActiveDoc

tekom-Jahrestagung 2012 - ActiveDoc

MEHR ANZEIGEN
WENIGER ANZEIGEN

Erfolgreiche ePaper selbst erstellen

Machen Sie aus Ihren PDF Publikationen ein blätterbares Flipbook mit unserer einzigartigen Google optimierten e-Paper Software.

Sprachtechnologie / Language Technology<br />

LT 13<br />

Presentation<br />

A (terminology) service layer for Moses SMT<br />

Nathalie De Sutter, Joachim Van den Bogaert, CrossLang, Gent, Belgium<br />

Introduction<br />

Moses SMT is finding its way to localization providers thanks to its<br />

promise of cheap (Open Source) and fast Machine Translation (MT).<br />

With the numerous internet sources that walk novice users through<br />

their first steps, setting up and training Moses SMT, has become increasingly<br />

straightforward. An active community of researchers, developers<br />

and professional users provides additional support on Moses SMT’s<br />

mailing list. However, when it comes to deploying Moses in a live production<br />

environment, there is very little documentation to be found.<br />

Very likely, this is due to the proprietary nature of professional thirdparty<br />

Moses SMT-based solutions: people prefer valuable insights to<br />

remain unpublished. There are of course some Open Source one-man<br />

efforts, but the usability of these kinds of solutions is restricted to a<br />

prototyping environment, or at best a human-operated batch system. A<br />

more trivial reason for the lack of information on live Moses SMT systems,<br />

may be that publications on setting up Moses properly tend to get<br />

technical, boring, and without any scientific value: there is no honour<br />

in making a great system behave nicely. A final reason for the lack of<br />

deployment information is that re-engineering Moses SMT is not a hot<br />

topic anymore. A couple of years ago it might have been, but currently,<br />

there are companies out there that sell pre-installed machines ready for<br />

use, so why would users invest in well-engineered solutions, let alone<br />

report on all the well-known technologies integrated in such a system<br />

(load balancing, scheduling, multi-threading, service-oriented architectures,<br />

upscaling, outscaling, decoupling, …)?<br />

Project requirements<br />

Still, we believe that a production and processing framework around<br />

Moses SMT adds value, for two reasons:<br />

−−<br />

Changing market demands: we see that customers are not satisfied<br />

anymore with a solution that actually runs, has a nice front-end or<br />

allows them to execute data training with a user-friendly GUI. Nowadays,<br />

users want difficult languages via hybrid solutions, integrated<br />

terminology management and post-editor friendly output. In the near<br />

future we expect systems combination with real-time quality assessment<br />

to be in demand. Putting all this on top of Moses SMT requires<br />

a well crafted environment.<br />

−−<br />

Flexibility and maintenance: a customized Moses SMT solution needs<br />

to be adaptable to a customer’s demands. If improved translations are<br />

required by adding extra processing or linguistic modules, MT technology<br />

providers need to be able to offer such kind of solutions. At the<br />

same time, changes to a system should be portable to other systems<br />

and remain manageable in terms of maintenance.<br />

To be able to deal with these requirements, we developed a framework<br />

around Moses SMT which facilitates production and the development<br />

of extra processing tasks. In what follows, we will give a brief account of<br />

the design decisions we took to implement this framework.<br />

260<br />

<strong>tekom</strong>-<strong>Jahrestagung</strong> <strong>2012</strong>

Hurra! Ihre Datei wurde hochgeladen und ist bereit für die Veröffentlichung.

Erfolgreich gespeichert!

Leider ist etwas schief gelaufen!