Chapter 2. Interactive reference grammars 35Example (14) shows a German nom<strong>in</strong>al phrase where the nom<strong>in</strong>al head is deleted.In the English orig<strong>in</strong>al a substitution is used to express the same mean<strong>in</strong>g. S<strong>in</strong>cesubstitution does not work <strong>for</strong> German (see above), deletion seems to be anotherstrategy to translate this structure. In example (15), we f<strong>in</strong>d two Germanverbs (danken (thank) andtun (do)), the second subject is, however, deleted. Inthe English orig<strong>in</strong>al the subject we is repeated <strong>for</strong> the second verb. The samephenomenon can be observed <strong>in</strong> example (16): The English orig<strong>in</strong>al repeats thewords the way we, which are deleted <strong>in</strong> the German translation. In both examples,German is more elliptical express<strong>in</strong>g the cohesive l<strong>in</strong>ks implicitly, whereasEnglish uses repetitions express<strong>in</strong>g the lexical cohesion more explicitly. These concordancesshow that the English-German corpus can also be used <strong>in</strong>versely help<strong>in</strong>gtranslators to f<strong>in</strong>d compensations <strong>for</strong> German constructions which do not exist<strong>in</strong> English.4. Conclud<strong>in</strong>g remarks and outlookThe need <strong>for</strong> l<strong>in</strong>guistically annotated corpora is observed across all branches ofl<strong>in</strong>guistics, and the translation branch is no exception. There are certa<strong>in</strong>ly researchquestions which do not require such a detailed l<strong>in</strong>guistic analysis, but which canbe resolved us<strong>in</strong>g unannotated data or automatic annotation without the burdenof construct<strong>in</strong>g a big and complex language resource <strong>in</strong> advance. However, translationproblems result<strong>in</strong>g from the specificity of a language or a register need amore detailed l<strong>in</strong>guistic analysis <strong>in</strong> order to be answered.In this chapter, we have suggested that monol<strong>in</strong>gual and multil<strong>in</strong>gual treebankscan assume the role of grammatical reference works <strong>for</strong> translation tra<strong>in</strong><strong>in</strong>gand practice. In order <strong>for</strong> corpora to serve this purpose, they need to be enrichedwith l<strong>in</strong>guistic <strong>in</strong><strong>for</strong>mation (Section 2). We have discussed the notions ofmonol<strong>in</strong>gual and parallel treebank<strong>in</strong>g and <strong>in</strong>troduced some of the most importanttreebanks. In Section 3, we have shown that l<strong>in</strong>guistic annotation can makea corpus a valuable resource <strong>for</strong> deal<strong>in</strong>g with some typical translation problems.Offer<strong>in</strong>g the possibility to search <strong>for</strong> grammatical constructions (such as word ordervariation, cleft<strong>in</strong>g, rais<strong>in</strong>g constructions as well as substitutions and deletions)these treebanks are a much more powerful resource compared to parallel corporaof raw texts. And contrast<strong>in</strong>g them to usual (pr<strong>in</strong>ted) reference grammars, theconcordances generated on the basis of the treebanks are far more comprehensiveand exhaustive. Thus, the CroCo Corpus as well as the Penn and TiGer Treebanksprove to provide a wealth of <strong>in</strong><strong>for</strong>mation <strong>for</strong> the translation of typologicallanguage problems.Besides typologically motivated translation solutions, also examples <strong>for</strong> registerspecificlanguage use and their appropriate translations (i.e., register-specific
36 Silvia Hansen-Schirratranslation behaviour) can be found <strong>in</strong> the treebanks. This means, however, thatthe corpus design should be representative <strong>in</strong> terms of size and relevant <strong>in</strong> termsof registers.For deal<strong>in</strong>g with more complex k<strong>in</strong>ds of translation problems, a translationcorpus should be annotated with more abstract l<strong>in</strong>guistic <strong>in</strong><strong>for</strong>mation, e.g., semanticand discourse <strong>in</strong><strong>for</strong>mation. This requires more comprehensive annotationmethods and more sophisticated query facilities – both of which are current researchissues <strong>in</strong> computational l<strong>in</strong>guistics. Future work also <strong>in</strong>volves the applicabilityof treebanks as grammatical reference works to other language pairs.ReferencesAbeillé, A., Clément, L., & K<strong>in</strong>yon, A. (2000) Build<strong>in</strong>g a treebank <strong>for</strong> French. In Proceed<strong>in</strong>gs ofthe LREC 2000, Athens, Greece, 87–94. Paris: ELRA/ELDA.Bosco, C., Lombardo, V., Vassallo, D., & Lesmo, L. (2000). Build<strong>in</strong>g a treebank <strong>for</strong> Italian: Adatadriven annotation schema. In Proceed<strong>in</strong>gs of the LREC 2000, Athens, Greece, 87–94.Paris: ELRA/ELDA.BrantsS.,S.Dipper,P.Eisenberg,S.Hansen-Schirra,E.König,W.Lezius,C.Rohrer,G.Smith&H. Uszkoreit (2003) TIGER: L<strong>in</strong>guistic Interpretation of a German Corpus. In E. H<strong>in</strong>richs&K.Simov(eds.)Journal of <strong>Language</strong> and Computation (JLAC), Special Issue.Brants, T. (1999) Tagg<strong>in</strong>g and pars<strong>in</strong>g with Cascaded Markov Models – automation of corpusannotation. Saarbrücken, Germany: German Research Centre <strong>for</strong> Artificial Intelligence& Saarland University: Saarbrücken Dissertations <strong>in</strong> Computational L<strong>in</strong>guistics and<strong>Language</strong> Technology (vol. 6).Brants, T. (2000) TnT – A Statistical Part-of-Speech Tagger. In Proceed<strong>in</strong>gs of the Sixth Conferenceon Applied Natural <strong>Language</strong> Process<strong>in</strong>g ANLP-2000. Seattle, WA.Brants, T. & O. Plaehn (2000) Interactive Corpus Annotation. In Proceed<strong>in</strong>gs of the LREC 2000,Athens, Greece, 87–94. Paris: ELRA/ELDA.Christ, O. (1994) A modular and flexible architecture <strong>for</strong> an <strong>in</strong>tegrated corpus query system.In Proceed<strong>in</strong>gs of COMPLEX 94, 3rd Conference on Computational Lexicography and Textresearch. Budapest: 23–32.Cuřín, J., M. Čmejrek, J. Havelka & V. Kuboň (2004) Build<strong>in</strong>g Parallel Bil<strong>in</strong>gual SyntacticallyAnnotated Corpus. In Proceed<strong>in</strong>gs of The First International Jo<strong>in</strong>t Conference on Natural<strong>Language</strong> Process<strong>in</strong>g, Ha<strong>in</strong>an Island, Ch<strong>in</strong>a, 141–146.Greenbaum, S. (ed.), (1996) Compar<strong>in</strong>g English worldwide: The International Corpus of English.Ox<strong>for</strong>d, UK: Clarendon Press.Hajic, J. (1999) Build<strong>in</strong>g a syntactically annotated corpus: The Prague Dependency Treebank.In E. Hajicova (ed.) Issues of valency and mean<strong>in</strong>g. Studies <strong>in</strong> honour of Jarmila Panevova.Prague, Czech Republic: Charles University Press.Hansen-Schirra, S., S. Neumann & M. Vela (2006) Multi-dimensional Annotation andAlignment <strong>in</strong> an English-German <strong>Translation</strong> Corpus. In Proceed<strong>in</strong>gs of the 5th Workshopon NLP and XML (NLPXML-2006) at EACL 2006, Trento, 4th April 2006, 35–42.Hawk<strong>in</strong>s, J. A. (1986) A comparative typology of English and German. London: Croom Helm.
- Page 3 and 4: Benjamins Translation Library (BTL)
- Page 5 and 6: 8 TMThe paper used in this publicat
- Page 7 and 8: VITopics in Language Resources for
- Page 9 and 10: VIII Topics in Language Resources f
- Page 11 and 12: XTopics in Language Resources for T
- Page 13 and 14: XIITopics in Language Resources for
- Page 15 and 16: 2 Lynne Bowker and Michael Barlowde
- Page 17 and 18: 4 Lynne Bowker and Michael BarlowFi
- Page 19 and 20: 6 Lynne Bowker and Michael Barlow2.
- Page 21 and 22: 8 Lynne Bowker and Michael BarlowOn
- Page 23 and 24: 10 Lynne Bowker and Michael Barlow4
- Page 25 and 26: 12 Lynne Bowker and Michael Barlowb
- Page 27 and 28: 14 Lynne Bowker and Michael Barlows
- Page 29 and 30: 16 Lynne Bowker and Michael Barlowp
- Page 31 and 32: 18 Lynne Bowker and Michael Barlowt
- Page 33 and 34: 20 Lynne Bowker and Michael Barlowg
- Page 35 and 36: 22 Lynne Bowker and Michael BarlowM
- Page 37 and 38: 24 Silvia Hansen-Schirraphenomenon;
- Page 39 and 40: 26 Silvia Hansen-Schirratreebank pr
- Page 41 and 42: 28 Silvia Hansen-Schirracurrently a
- Page 43 and 44: 30 Silvia Hansen-Schirrawhichaltern
- Page 45 and 46: 32 Silvia Hansen-Schirra(1) We cont
- Page 47: 34 Silvia Hansen-Schirrarealisation
- Page 52 and 53: chapter 3Corpora for translator edu
- Page 54 and 55: Chapter 3. Corpora for translator e
- Page 56 and 57: Chapter 3. Corpora for translator e
- Page 58 and 59: Chapter 3. Corpora for translator e
- Page 60 and 61: Chapter 3. Corpora for translator e
- Page 62 and 63: Chapter 3. Corpora for translator e
- Page 64 and 65: Chapter 3. Corpora for translator e
- Page 66 and 67: Chapter 3. Corpora for translator e
- Page 68: Chapter 3. Corpora for translator e
- Page 71 and 72: 58 Belinda Maiahindsight, one can n
- Page 73 and 74: 60 Belinda Maiabeen translated by m
- Page 75 and 76: 62 Belinda Maiastudy reformulations
- Page 77 and 78: 64 Belinda MaiaSearchablecorporaenc
- Page 79 and 80: 66 Belinda Maia- Find definition ca
- Page 81 and 82: 68 Belinda MaialishaEuropeanMaster
- Page 83 and 84: 70 Belinda MaiaMaia, B. and L. Sarm
- Page 85 and 86: 72 Carme Colominas and Toni Badiadi
- Page 87 and 88: 74 Carme Colominas and Toni Badiala
- Page 89 and 90: 76 Carme Colominas and Toni Badiath
- Page 91 and 92: 78 Carme Colominas and Toni Badiath
- Page 93 and 94: 80 Carme Colominas and Toni BadiaTa
- Page 95 and 96: 82 Carme Colominas and Toni BadiaAs
- Page 97 and 98: 84 Carme Colominas and Toni BadiaFi
- Page 99 and 100:
86 Carme Colominas and Toni Badiaco
- Page 101 and 102:
88 Carme Colominas and Toni BadiaVa
- Page 103 and 104:
90 Rachélle GautonIzwaini (2003:17
- Page 105 and 106:
92 Rachélle Gautonneeded by the Ba
- Page 107 and 108:
94 Rachélle GautonThese electronic
- Page 109 and 110:
96 Rachélle Gautonthat of Bantu la
- Page 111 and 112:
98 Rachélle GautonSeeagainFig.1for
- Page 113 and 114:
100 Rachélle GautonLocke, translat
- Page 115 and 116:
102 Rachélle GautonHaving to work
- Page 117 and 118:
104 Rachélle Gautonmore, after suc
- Page 119 and 120:
106 Rachélle GautonMcEnery, A. and
- Page 121 and 122:
108 Marie-Josée de Saint Roberta c
- Page 123 and 124:
110 Marie-Josée de Saint Robertpre
- Page 125 and 126:
112 Marie-Josée de Saint Robertlef
- Page 127 and 128:
114 Marie-Josée de Saint Robertcap
- Page 129 and 130:
116 Marie-Josée de Saint Robertinf
- Page 131 and 132:
118 Marie-Josée de Saint Roberttra
- Page 134 and 135:
chapter 8Global content managementC
- Page 136 and 137:
Chapter 8. Global content managemen
- Page 138 and 139:
Chapter 8. Global content managemen
- Page 140 and 141:
Chapter 8. Global content managemen
- Page 142 and 143:
Chapter 8. Global content managemen
- Page 144 and 145:
Chapter 8. Global content managemen
- Page 146 and 147:
Chapter 8. Global content managemen
- Page 148 and 149:
chapter 9BEYTransA Wiki-based envir
- Page 150 and 151:
Chapter 9. BEYTrans 137Trans system
- Page 152 and 153:
Chapter 9. BEYTrans 1391. Facilitat
- Page 154 and 155:
Chapter 9. BEYTrans 141lators to ch
- Page 156 and 157:
Chapter 9. BEYTrans 1435.2 Translat
- Page 158 and 159:
Chapter 9. BEYTrans 1456. BEYTrans:
- Page 160 and 161:
Chapter 9. BEYTrans 1476.2.2 Multil
- Page 162 and 163:
Chapter 9. BEYTrans 149Bey, Y., C.
- Page 164 and 165:
chapter 10Standardising the managem
- Page 166 and 167:
Chapter 10. Standardising multiling
- Page 168 and 169:
Chapter 10. Standardising multiling
- Page 170 and 171:
Chapter 10. Standardising multiling
- Page 172 and 173:
Chapter 10. Standardising multiling
- Page 174 and 175:
Chapter 10. Standardising multiling
- Page 176 and 177:
Chapter 10. Standardising multiling
- Page 178 and 179:
Chapter 10. Standardising multiling
- Page 180 and 181:
Chapter 10. Standardising multiling
- Page 182 and 183:
Chapter 10. Standardising multiling
- Page 184 and 185:
Chapter 10. Standardising multiling
- Page 186 and 187:
chapter 11Tagging and tracing Progr
- Page 188 and 189:
Chapter 11. Tagging and tracing Pro
- Page 190 and 191:
Chapter 11. Tagging and tracing Pro
- Page 192 and 193:
Chapter 11. Tagging and tracing Pro
- Page 194 and 195:
Chapter 11. Tagging and tracing Pro
- Page 196 and 197:
Chapter 11. Tagging and tracing Pro
- Page 198 and 199:
Chapter 11. Tagging and tracing Pro
- Page 200 and 201:
Chapter 11. Tagging and tracing Pro
- Page 202 and 203:
Chapter 11. Tagging and tracing Pro
- Page 204 and 205:
Chapter 11. Tagging and tracing Pro
- Page 206 and 207:
Chapter 11. Tagging and tracing Pro
- Page 208 and 209:
chapter 12Linguistic resources and
- Page 210 and 211:
Chapter 12. Linguistic resources an
- Page 212 and 213:
Chapter 12. Linguistic resources an
- Page 214 and 215:
Chapter 12. Linguistic resources an
- Page 216 and 217:
Chapter 12. Linguistic resources an
- Page 218 and 219:
Chapter 12. Linguistic resources an
- Page 220 and 221:
Chapter 12. Linguistic resources an
- Page 222 and 223:
Chapter 12. Linguistic resources an
- Page 224 and 225:
Chapter 12. Linguistic resources an
- Page 226 and 227:
Chapter 12. Linguistic resources an
- Page 228 and 229:
IndexAAfrican language translatorX,
- Page 230 and 231:
Index 217Expert Advisory Group onLa
- Page 232 and 233:
Index 219open standards 206, 208,21
- Page 234 and 235:
Benjamins Translation LibraryA comp
- Page 236:
27 Beylard-Ozeroff, Ann, Jana Král