Designing a Machine Translation System for Canadian Weather ...
Designing a Machine Translation System for Canadian Weather ...
Designing a Machine Translation System for Canadian Weather ...
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
<strong>Designing</strong> a <strong>Weather</strong> Warning <strong>Translation</strong> <strong>System</strong> 24<br />
Table 8. Examples of mismatches between Watt’s output and the reference<br />
translation, illustrating different types of errors. Grammatical errors are<br />
underlined.<br />
Watt Reference Note<br />
SEVERE THUNDERSTORMS<br />
MAY DEVELOP THIS<br />
AFTERNOON INTO THE<br />
EARLY EVENING .<br />
IT MAKING CONDITIONS<br />
VERY UNCOMFORTABLE .<br />
THERE HAVE BEEN AS<br />
NUM MM TO ST. JOHN<br />
S IN PART DUE TO<br />
DRIER AIR WHICH HAS<br />
ENGULFED THE CENTRE<br />
OF HURRICANE AS IT<br />
ITS CHANGEOVER , POST<br />
TROPICAL STORM .<br />
THEY SHOULD PASS<br />
THROUGH THE<br />
MINERVE-RIVIERE<br />
ROUNGE AREA IN THE<br />
NEXT HOUR .<br />
THE POSSIBILITY OF<br />
DEVELOPMENT OF SEVERE<br />
THUNDERSTORMS EXISTS<br />
FOR THIS AFTERNOON<br />
INTO EARLY EVENING .<br />
THIS SYSTEM WILL<br />
MAKE THE WEATHER<br />
UNCOMFORTABLE .<br />
ONLY NUM MM FELL<br />
AT ST. JOHN S DUE<br />
PARTLY TO THE DRY<br />
AIR THAT MOVED INTO<br />
THE CENTRE OF THE<br />
HURRICANE AS IT WAS<br />
UNDERGOING TRANSITION<br />
TO POST-TROPICAL .<br />
THEY SHOULD<br />
PASS THROUGH THE<br />
MINERVE-RIVIERE ROUGE<br />
AREA IN THE NEXT HOUR<br />
.<br />
Different wording used by<br />
Watt, without any difference<br />
in meaning<br />
A grammatical error<br />
(missing IS) and an<br />
unresolved anaphora<br />
obfuscate the meaning of<br />
the translation.<br />
Near-gibberish, due to<br />
the rarity of the wordings<br />
used in the source text.<br />
The location RIVIERE<br />
ROUGE is misspelled because<br />
of an error in the<br />
source sentence. See the<br />
discussion.<br />
This can prove useful when using such a subjective evaluation protocol. Further<br />
evaluations, employing more rigorous methods, are described in Section 8. At this<br />
stage, the errors we detected show no discernible recurrent pattern, and are within<br />
the bounds of the limitations expected from typical SMT output. When producing<br />
French, we did observe however some rare but systematic errors in agreement<br />
between distant words. We are still working on possible solutions.<br />
Ultimately, with a sentence error rate of 16% when translating from French and<br />
24% from English (at least in our random sample), the translation of a complete<br />
bulletin containing on average 5.8 sentences (Section 4.5) will contain on average<br />
about one error. This precludes the use of Watt as a fully automated translation