Translation Universals.pdf - ymerleksi - home
Translation Universals.pdf - ymerleksi - home
Translation Universals.pdf - ymerleksi - home
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
54 Silvia Bernardini and Federico Zanettin<br />
things, the choice of texts to be translated (Even-Zohar 2000 [1978/1990];<br />
Vanderauwera 1985):<br />
Dutch fiction is chosen for translation either in the function of assumed target<br />
taste or in that of the status the work has acquired at the source pole, often as<br />
a combination of the two. (Vanderauwera 1985:132)<br />
In our experience of corpus construction, this issue has been of central<br />
importance in assessing the comparability of different corpora.<br />
3. Issues in translation corpus design and construction:<br />
the CEXI example<br />
CEXI is an English-Italian <strong>Translation</strong>al Corpus being developed at the School<br />
for Interpreters and Translators, University of Bologna at Forlì. The aim of the<br />
project is to arrive at a bi-directional and parallel corpus of approximately four<br />
million words of contemporary texts in two languages, XML-tagged following<br />
the TEI guidelines (Sperberg-McQueen & Burnard 2001), aligned and accessible<br />
online (see Zanettin 2000, 2002). Following projects like the ENPC for Norwegian,<br />
Compara for Portuguese, and the Chemnitz corpus for German, CEXI<br />
is restricted to what is probably the most prototypical (and most easily samplable)<br />
form of translation, i.e. printed books. Altogether 624 titles (half translated<br />
from English into Italian, the other half translated from Italian into English)<br />
were randomly selected from the Unesco Index <strong>Translation</strong>um database<br />
(1998), half of them fiction (roughly corresponding to the Universal Decimal<br />
Classification category of Literature/Children’s Literature as assigned within<br />
the Index <strong>Translation</strong>um) and half non-fiction (divided into nine subcategories<br />
following the Index/UDC criteria). After removing titles which were repeated,<br />
outside our time frame, impossible to locate etc., requests for permission were<br />
made to the copyright owners.<br />
3.1 Preliminary norms 1: non-fiction<br />
While trying to set up a sampling frame for the non-fiction component of the<br />
corpus, it became clear that, in the Index <strong>Translation</strong>um at least, a different<br />
“weight” is associated with the different genres in each direction (on this point<br />
see also Mauranen 2001). The numbers of texts translated from Italian into<br />
English and from English into Italian in each of the UDC subdomains are not<br />
comparable (Zanettin 2002). The following table summarizes these differences: