You are on page 1of 8

148 No 148

2012 11 CAFLE Nov 2012

* *


1 2

1 100089 2 710128








H319 3 A 1001-5795201206-0003-0008

20 90 -
1998 3 unconstrained
1
70

20 2 6 70
1987

Roukos et
al 1995 McEnery Wilson 2001 168
IBM
J IBM T J Watson Re-
search Center Bell
Bell Communications Research Inc / Bellcore 90
9
Kenny 2001 114
-
1
alignment algorithm Church et al
1 1 - 1993 faux amis
- the Canadi- Klavans Tzoukermann 1995
an Hansard Corpus
90



2012-04-20
10ZD127

normalised
false starts

McEnery Wilson 2001 168 1 3
- German-English
Parallel Corpus of Literary TranslationGEPCOLT

lexical normalisa-
1 2 - tion lexical creativity
- Chemnitz E-G Transla-
tion Corpus 1993


150 20 80 90
2001 50 Kenny 2001 114 2001
2 970270 1
021 Kenny 2001 114
055
TEC
header files
Olohan
2004 58-59

Kenny 2000 GEPCOLT
creative compounds and
Chemnitz collocations
Internet Grammar

Kenny 2001 GEPCOLT
- the London-
Lund corpus ---
The Lancaster-Oslo-Bergen Corpus 2001 187
210
-
Olohan Baker 2000 Kenny
2005 that
say
that
Schmied 199419981999 Schmied Schaffler da
1996 Schaffler 1997 Smitterberg et al 2000 The da
English / German Translation Corpus The project http / /
www tu-chemnitz de / phil / english / chairs / linguist / real / that Kenny
independent / transcorpus / index htm 2005 161
GEPCOLT


4

/ p
GEPCOLT / p s / s
2
presentative constructions
1 4 -
- English-Norweigian Par-
allel CorpusENPC
University of Oslo
Stig veras 1998
Johansson Bergen - Blum-Kulka
Norwegian Computing Centre for the Humanities 1986
Knut Hofland Jarle Ebel-
ing Signe Oksefjell

- ENPC


1 1 5
XML
100 100
260 1
Knut Hofland
Jarle Ebeling 1 5 -
Translation Corpus Explorer - The English-Swed-
ish Parallel CorpusESPC 1993
Lund University
1 ENPC Johansson 1999 /2002 3
Swedish Council for Research in Human-

ities and Social Sciences 1997

Gteborg University
30 30 30 30

20 20 20 20 Bengt AltenbergMats JohanssonMikael


50 50 50 50 Svensson Karin Aijmer
671 700 629 900 699 400 661 500 - ENPC
Corpus of Translated Finnish
ENPC


ENPC 1994 1997 1997 2001 2 8 64
2001 72
ENPC 10 000
Guidelines for Electronic Text Enco- 15 000
ding and Interchange TEI Text Encoding Ini-
tiative XML Sperberg-McQueen Bur-
nard1994
5

lish-Italian translationCEXI 1990


University of Bologna Forl


ENPC

ESPC
2

2 ESPC Altenberg et al 2001
CEXI - ENPC


25 25
1 5

7 6

17 14
prose
39 47

3 3

2 2 1976 2000
5 6 80 4
3 0 92 40
7 9 52 368 1 1 5
1 3
460
3 4
Zanettin 2002
15 20
Bernardini Zanettin 2004 -
64 72
CEXI

8


ESPC

CEXI - ENPC
Aijmer
2001 Axelsson 2006 Fredriksson 2006 Gustawsson
2002 Ruin 2001
2006 Johansson 2001 Zanettin 2000
ESPC


5%

2

1 6 - -
- Corpus of Eng-
6

1 7 - English-Por- header
tugese Parallel Corpus Compara
--
ENPC EPIC
POS
Treetagger Freeling


University of Stuttgart IMS 2010 101-102
IMS Corpus Workbench system
2

Computational processing
of Portugese project DISPARA
Frankenberg-Gacia 2004 -
Compara
2 1
Frankenberg-Gacia
-
1993 150
2001 50 Kenny 2001 114
- GEPCOLT 2001
1 8 2 -
The European Parliament In- 260 - 2000
terpreting CorpusEPIC 460

mode genre
2004 time span




POS
false starts
140 4
Org normalised McEnery Wilson 2001 168
Int
EPIC




paralinguistic
features paralinguistic features
7

2010 101




-
-
GEPCOLT
- 2 4
-


prose

1976 2000 -
-
1976 2000


2 2





2000

-
3


2005
Kenny 2001






2011 917-918
2 3

1 AijmerK Probably in Swedish translationsa case of
translationese A In S Alln S Berg S -G
MalmgrenK Norn B Ralph eds Gller stamsuf-

fix och ordC Gteborg Meijerbergs institut fr svensk et- 11 JohanssonS J Ebeling S Oksefjell English-Norwe-
2001 1 13
ymologisk forskning gian Parallel Corpus ManualZ Department of British
2 AltenbergB K Aijmer M Svensson The English- and American StudiesUniversity of Oslo
1999 /2002
Swedish Parallel Corpus ESPC Manual of enlarged version 12 KennyD Translators at play exploitations of collocation-
OL http / / www sol lu se / engelska / corpus / corpus / es- al norms in German-English translationA In B Dodd
pc html acessed on 15 /03 /2012
2001 ed Working with German CorporaC Birmingham
3 AxelssonK Tag questions in English translations from University of Birmingham Press
1999 143 160
Swedish and Norwegianare there differencesA In B 13 KennyD Lexis and Creativity in Translation A corpus-
Englund-Dimitrova H Landqvist eds Svenska som based studyM Manchester St Jerome
2001
kllsprk och mlsprk C Gteborg 14 KennyD Parallel corpora and translation studies old
versttarutbildningen vid Humanistiska fakulteten2006 4 questionsnew perspectives Reporting that in Gepcolt a
21 case studyA In G BarnbrookP Danielsson M
4 BernardiniS F Zanettin When is a Universal not a U- Mahlberg eds Meaningful Texts The Extraction of Se-
niversal Some limits of current corpus-based methodologies mantic Information from Monolingual and Multilingual cor-
for the investigation of translation universalsA In A poraC London Continuum
2005 154 165
Mauranen and P Kujamki eds Translation Universals 15 KlavansJ E Tzoukermann Combining corpus and
Do They ExistC Amsterdam John Benjamins2004 machine-readable dictionary data for building bilingual lexi-
51 62 consJ Machine Translation 10 3 185 218
1995
5 Blum-KulkaS Shifts of cohesion and coherence in transla- 16 McEneryT A Wilson Corpus linguistics An intro-
tionA In J House S Blum-Kulka eds Interlin- duction 2nd edition M Edinburgh Edinburgh Univer-
gual and Intercultural Communication Discourse and Cogni- sity Press
2001
tion in Translation and Second Language Acquisition Studies 17 OlohanM M Baker Reporting that in Translated
C Tbingen Gunter Narr Verlag
1986 17 35 English Evidence for Subconscious Processes of Explicita-
6 Frankenberg-GaciaA Are translations longer than source tionJ Across Languages and Cultures20001 2 141
texts A corpus-based study of explicitationA Paper pres- 158
ented at the Third International CULT Corpus Use and 18 OlohanM Introducing Corpora in Translation Studies
Learning to Translate ConferenceBarcelona22 24 Jan- M London Routledge
2004
uary 2004OL http / / www linguateca pt / Repositorio / 19 versL In search of the third code An investigation of
Frankenberg-Garcia2004 doc accessed 04 /07 /2006 norms in literary translationJ Meta 43 4 571
1998
2004 588
7 FredrikssonA On passives and translation strategies in 20 RoukosS D Graff D Melamed Hansard French /
parallel textsA In B Englund-Dimitrova H Landqvist EnglishZ Philadelphia Linguistic Data Consortium
eds Svenska som kllsprk och mlsprkC Gteborg http / / www ldc upenn edu / Catalog / CatalogEntry jsp
versttarutbildningen vid Humanistiska fakulteten
2006 75 catalogId = LDC95T20 accessed on 24 /03 /2012
1995
91 21 RuinI Nonfinite versus finite constructionsa problem
8 GustawssonE Translation of English verbal idioms into in the translation of Swedish literary texts into EnglishA
SwedishA In B Englund-Dimitrova H Landqvist In W Vagle K Wikberg eds New directions in Nor-
eds Svenska som kllsprk och mlsprkC Gteborg dic text linguistics and discourse analysis Methodological is-
versttarutbildningen vid Humanistiska fakulteten
2006 92 suesC Oslo Novus Forlag
2001 243 253
108 22 SchmiedJ H Schaffler Explicitness as a universal
9 JohanssonM Clefts in contrast a contrastive study of it feature of translationA In M Ljung ed Corpus-
clefts and wh clefts in English and Swedish texts and transla- based studies in English Papers from the seventeenth Inter-
tionsJ Linguistics
2001
39 547 582 national Conference on English Language Research on Com-
10 JohanssonM Clefts in English and Swedish A contrastive puterized Corpora ICAME 17 C Amsterdam Rodopi
study of it-clefts and wh-clefts in original texts and transla- 1996 21 36
tionsM unpublished PhD dissertation Department of 23 SchmiedJ Translation and cognitive structures Hermes
EnglishLund University
2002 J Journal of Linguistics
1994 13 169 181

24 SchmiedJ Differences and similarities of close cognates am Rodopi


1999 21 30
English with and German mitA In S Johansson S 26 ZanettinF CEXI Designing and English-Italian Transla-
Oksefjell eds Corpora and cross-linguistic research tional CorpusA In B Ketteman G Marko eds
Theorymethod and case studiesC Amsterdam Rodopi Teaching and Learning by Doing Corpus AnalysisC Am-
1998 255 176 sterdam Rodopi
2002 329 343
25 SchmiedJ Applying contrastive corpora in modern cont- 27 J
rastive grammars the Chemnitz Internet Grammar of Eng- 43 6 911 923
2011
lishA In H Hasselgard S Oksefjell eds Out of 28 J
corpora Studies in honour of Stig JohanssonC Amster- 7 5 100 105
2010

Construction and Application of Parallel Corpora Issues and Comments


WANG Ke-fei1 HUANG Li-bo2
1 Beijing Foreign Studies UniversityBeijing 100089
2 Xi'an Foreign Studies UniversityXi'anShanxi710128China
Abstract The paper makes a comprehensive review on the construction and application of 8 major bilingual parallel
corpora in the world Some pitfalls in the corpus-based researches are revealed despite their rich experiences in
technology and research method The issues are manifested as follow the size of the corpora is not big enough to be
representative research topics are not as variedespecially in respects of language and translation research from a
historical perspective corpora for special purposes are less developed researches in application and technology
based on a parallel corpus still wait to be done The questions above turned to be the topics to discussthe authors
thinkand are conducive to researches on the construction of a large-scale and multipurpose corpus
Key words Parallel Corpus Construction and Application

10