Programme booklet (pdf)

More documents

Recommendations

Info

50 CLIN 21 – CONFERENCE PROGRAMME Measuring the Impact of Controlled Language on Machine Translation Text via Readability and Comprehensibility Abstract Doherty, Stephen Centre for Next Generation Localisation, Dublin This paper describes a recent study of the readability and comprehensibility of English software documentation, which has been translated into French by Matrex, a state-ofthe-art (phrase-based statistical) machine translation system. The primary aim of the study is to examine what, if any, effects there are on the readability and comprehensibility of the machine translation output following the application of controlled language (CL) rules on the source language texts. Our hypothesis is that the application of CL rules would result in an observable increase in readability and comprehensibility of the target language text. Our approach consisted of a three-pronged evaluation of the texts by means of (i) readability indices in both the source and target languages: (ii) an eye tracking measurement of readability: and (iii) a post-task qualitative measurement of comprehensibility, using recall and likert-scale human evaluations. We also looked at correlations between automatic machine translation evaluation metrics (e.g. BLEU, GTM etc.) and the evaluation results mentioned above in an attempt to bridge the gap between human and automatic approaches to evaluation. The paper will first describe some background and context in the relevant research areas, followed by a presentation of the methods employed with a particular focus on the measurement of readability via eye tracking and tentative results in this regard. Corresponding author: stephen.doherty2@mail.dcu.ie
PRESENTATION ABSTRACTS Abstract Memory-based text completion van den Bosch, Antal Tilburg University The commonly accepted technology for fast and efficient word completion is the prefix tree, or trie. As a word is keyed in, the trie can be queried for unicity points and best guesses. We present three improvements over the normal prefix trie in experiments in which we measure the percentage of keypresses saved on both in-domain and out-ofdomain test text, emulating a perfectly alert user who would select correct suggestions promptly. First, we train a suffix trie that tests backwards from the most recent keypresses. Conditioned on first letters, the suffix trie model yields about 10% more saved keypresses than the baseline character saving percentage on in-domain test data. Second, the suffix trie model can be straightforwardly extended to testing on characters of previous words. Adding this context yields another 10% increase in character savings. Third, when we train the context-rich suffix trie model to complete the current word and predict the next one in one go, character savings go up another 4%. In a learning experiment on Dutch texts we observe character savings of up to 44% on in-domain test data where the baseline prefix tree savings percentage is 19%. On out-of-domain twitter data, the prefix trie baseline of 19% is only mildly surpassed by the suffix tree variants to 24% character savings. We develop an explanation for the spectacular success of the suffix tree approach on in-domain data, and review the applicability of the approach in real-world text entry contexts. Corresponding author: Antal.vdnBosch@uvt.nl 51
Page 3 and 4: Ghent, February 11 th 2011 21 st me
Page 5: Welcome! For the first time in its
Page 8 and 9: 6 CLIN 21 - CONFERENCE PROGRAMME Co
Page 10 and 11: 8 CLIN 21 - CONFERENCE PROGRAMME CL
Page 12 and 13: CLIN 21 - CONFERENCE PROGRAMME 09:0
Page 14 and 15: 11:30 - 12:30 12:30 - 12:50 12 CLIN
Page 16 and 17: 14 CLIN 21 - CONFERENCE PROGRAMME R
Page 19: Rethinking anaphora Abstract Massim
Page 22 and 23: 20 CLIN 21 - CONFERENCE PROGRAMME A
Page 34 and 35: Abstract 32 CLIN 21 - CONFERENCE PR
Page 36 and 37: Abstract 34 Nauze, Fabrice Q-go Clu
Page 38 and 39: 36 CLIN 21 - CONFERENCE PROGRAMME C
Page 44 and 45: 42 CLIN 21 - CONFERENCE PROGRAMME D
Page 46 and 47: 44 CLIN 21 - CONFERENCE PROGRAMME E
Page 50 and 51: 48 CLIN 21 - CONFERENCE PROGRAMME L
Page 56 and 57: 54 CLIN 21 - CONFERENCE PROGRAMME P
Page 64 and 65: 62 CLIN 21 - CONFERENCE PROGRAMME S
Page 68 and 69: 66 CLIN 21 - CONFERENCE PROGRAMME T
Page 70 and 71: 68 CLIN 21 - CONFERENCE PROGRAMME F
Page 76 and 77: 74 CLIN 21 - CONFERENCE PROGRAMME U
Page 78 and 79: 76 CLIN 21 - CONFERENCE PROGRAMME W
Page 84 and 85: Abstract 82 Authorship Verification
Page 86 and 87: 84 CLIN 21 - CONFERENCE PROGRAMME D
Page 90 and 91: 88 CLIN 21 - CONFERENCE PROGRAMME O
Page 96 and 97: 94 CLIN 21 - CONFERENCE PROGRAMME T
Page 99: List of Participants 97
Page 102 and 103:
100 CLIN 21 - CONFERENCE PROGRAMME
Page 104 and 105:
102 CLIN 21 - CONFERENCE PROGRAMME
show all

Programme booklet (pdf)

Create successful ePaper yourself

Delete template?

Save as template?