You are on page 1of 5

118 No.

118
2007 12 CAFL
E De
c. 2007


, ,
(
, 400044)

: ,
, ,

: ;

:
H319.
3 :A :
1001-5795(2007)12-0021-0005

: (l en
gth
-ba
sed)(Br
own
eta
l, 1991;Ga
le Ch
urc
h, 1991a
);
, (l
exi
cal
-ba
sed
)(Ka
y Ro
sch
eis
en, 1993);
, , (c
omb
ina
ti
on)(T
an Na
gao
, 1995;Wu, 1994)
Br own Ga le ,
,
, ,
, Bro
wn ,
, Ga
le Chur
ch
, ,
, , 96 97%
, ( 21 22
, )

, , ,
, ,
Ka
y Ro
she
ise
n


1
, ;

1.1 Chen(1993)
Ka
y Ro
she
ise
n
, ,
,

:
:
:

:
:

:
:

:2006-01-05

21
, :

Ha
nsa
rd 3
1.
Br
own Ga
le , (mu
lti
-wo
rdu
nita
li
gnme
nt
, MWU)

, (r ob
ust
) ,
(c
ogn
ate)
(Ch
urc
h, 1993)
, ,
,
, ,
;

, , , , n-g
ram
, Da -
, ga
n Chu
rch Te
rmi
ght
S
mad
jae
tal
. Ch
am-
(Ta
n Na
gao
, 1995;Wu
, 1994;;Co
ll
ie
r, 1998; p
oll
ion Mc
Ene
rye
tal
.(1997)

Vr
oni
s, 1999;
Mel
ame
d, 2000) , AS
MT
,
1.2 AS
MT
, 4
1.
(
)
, ,

, Ga
le
, , Ch
urc
h(1991a
) ,
(Br o
wn
eta
l., 1993, , 2003 )
EM(e
xpe
cta
tio
n-ma
ximi
zat
io
n) ,
,
, EM , ,
(Ga le Churc
h, Kite
tal
.(2004)

;Z
1991b h
ang Yi
nge
tal
., 2001 ) , Ga
le
Ch
urc
h , , ,
94.6%
, (Cont
in-
2
ge
ncyTabl
e) ,
EM , ,
, (
)
,
( )
,
,
, , ,
Ke
r(1997)
, ,
, ,
Hu
ang
(2000)
, ,
,
(Ne
cip
, 2006)
,

22
, :

, , Wu(1995)
Br
own EM
, 91.2% 95.
1%,
Wu(1994 ) Ga
le Fu
nga
ndChu
rch
(1994)

Ch
urc
h , K-v
ec
,
, , 1:
1 , (2004)

90% S
un(1999)
,
,
1:0 0:1
(
)
, ,
, (2002)

, , 85%
, , ,
, , 93% (
, 1997;
,
(2000)
, , 2001;Pi
1999; ao, 2001) Pi
ao(2001)

,
,
, 80.63% (1999)

, (2000:61),
2:2
Ch
uange
tal
.(2005)
,
- S
MC(Ch
ine
se- ,
Engl
is
hSi
nor
amaMa
gaz
ineCo
rpu
s) , ,
,
, 93%
, ,
(2005)
,
, , (2006)


, ,
, (2005: 93%
36) ,
(2006)
(2002)

, ,
,
17 (2003)
N-g
ram ,
20.3%85.2% ,
92.
5% (2003)

- ,
, , ,
, , (2006)
,
, ,

23
, :

, [ 2] Br
own
, P.F., La
i, H.C.a
ndMe
rce
r, R.L.Al
ign
ing

(2006)
s
ent
enc
esi
npa
ral
le
lco
rpo
ra[ C] .P
roc
eed
ing
soft
he29t
h
An
nua
lMe
eti
ngo
ftheAs
soc
iat
ionf
orCo
mpu
tat
iona
lLi
ngui
s-
, ,
t
ics
, 1991.
82.
76%
[ 3] Che
n, S
.Al
igni
ngs
ent
enc
esi
nbi
li
ngu
alc
orp
orau
sin
gle
xi-
Ga
le
c
ali
nfo
rma
tio
n[ C] .Pr
oce
edi
ngso
fthe31t
hAn
nua
lMe
eti
ng
, o
fth
eAs
soc
iat
ionf
orCo
mpu
tat
ion
alLi
ngu
ist
ic
s, 1993.
+ , [ 4] Chu
ang
.C.Ta
ndKe
vinC.Al
ign
ingPa
ral
lelBi
li
ngua
lCo
r-
p
oraS
tat
ist
ic
all
ywi
th Pu
nct
uat
ionCr
it
eri
a[ J
] .Co
mpu
ter
, S
cie
ncea
ndI
nfo
rma
tio
nEn
gin
eer
ing, 2005, 10:1.
, [ 5] Chu
rch
, L.W.Ch
ara
lig
n:p
rog
ramf
ora
lig
nin
gpa
ral
lel
, , t
ext
satt
hec
har
act
erl
eve
l[ C] .P
roc
eed
ing
soft
he31 t
hAn
-

n
ualMe
eti
ngo
fth
eAs
soc
iat
ionf
orCo
mpu
tat
ion
alLi
ngui
s-
t
ics
, 1993.
,
[ 6] Co
lli
er, N.
, On
o, K.
, a
ndHi
raka
wa, H.AnEx
per
ime
nt
(1999:
47) ,
i
nHy
bri
dDi
cti
ona
rya
ndSt
at
is
ti
cals
ent
enc
eal
ig
nme
nt[ C] .
Ki
teta
l.(2004)
Pr
oce
edi
ngso
fthe36t
hAnn
ualMe
eti
ngo
ftheAs
soc
iat
ion
(2003)
f
orCo
mput
ati
ona
lLi
ngu
ist
icsa
ndt
he17t
hIn
ter
nat
iona
lCo
n-
, f
ere
nceo
nCo
mpu
tat
ion
alLi
ngu
ist
ic
s, 1998.
:
[ 7] Fu
ng, P.a
ndChu
rch
, K.W.K-v
ec:An
ewa
ppr
oac
hfo
r
, , a
lig
nin
gpa
ral
lelt
ext
s[ C] .Pr
oce
edi
ngso
fth
e15t
hIn
ter
na-
/ , / t
ion
alCo
nfe
renc
eonCo
mpu
tat
ion
alLi
ngu
ist
ics1994.
, / ; [ 8] Ga
le, W.a
ndCh
urc
h, K.Ap
rog
ramf
ora
lig
nin
gse
nte
nce
s

/ ;
/ i
nbi
li
ngu
alc
orp
ora[ C] .P
roc
eed
ing
soft
he29t
hAnn
ual
Me
eti
ngo
fth
eAs
soc
iat
ionf
orCo
mpu
tat
iona
lLi
ngu
ist
ics
.

1991(a
)
,
[ 9] Ga
le, W.
, Ch
urc
h, K.I
den
tif
yin
gwo
rdc
orr
esp
ond
enc
esi
n

p
ara
lle
lte
xts
[ C] .P
roc
eed
ing
soft
he4t
hDARPA S
pee
ch
a
ndNa
tur
alLa
ngu
ageWo
rks
hop1991(b
).
[ 10] Hu
angJ
.X.a
ndCh
oiK.S.Chi
nes
e-Ko
rea
nwo
rda
lig
n-
3
me
ntb
ase
donl
i
ngu
ist
icc
omp
ari
son[ C] .Pr
oce
edi
ngso
f
t
he38t
hAn
nua
lMe
eti
ngo
fth
eAs
soc
iat
ionf
orCo
mpu
ta-
, t
ion
alLi
ngui
st
ic
s.2000.
- [ 11] Ka
y, M.a
ndRo
sch
eis
en, M.Te
xt-t
ran
sla
tio
nal
i
gnme
nt

, [J
] .Co
mpu
tat
io
nalLi
ngu
ist
ics
.1993(19:1).
[ 12] Ke
r, S
.J.a
ndCh
angJ
.S.Ac
las
s-ba
seda
ppr
oac
htowo
rd
,
a
li
gnme
nt[J
] .Co
mput
at
ion
alLi
ngui
st
ics
, 1997(23:2).
,
[ 13] Ki
t, C.Y.
, We
bst
er, J
.J.
, S
in, K.k
., Pa
n, H.
H.a
nd
,
Li
, H .Cl
aus
eAl
ign
mentf
orBi
li
ngua
lHo
ngKo
ngLe
gal
Te
xtswi
thAv
ail
abl
eLe
xic
alRe
sou
rce
s[ Z] .h
ttp:/ /pe
r-
s
ona
l.c
it
yu.e
du.h
k/ c
tck
it/pa
per
s/i
ccp
ol2003-c
lau
se-a
-
l
ign.
pdf2004.

[ 14] Mc
Ene
ryT., J
ean
-Ma
rcL.
, Mi
cha
elO., a
ndJ
eanV.
[ 1] Br
own
, P.F., De
llaPi
etr
a, S.A.
, De
ll
aPi
etr
a, V.J
. Thee
xpl
oi
tat
io
nofmul
ti
li
ngu
ala
nno
tat
edc
orp
oraf
ort
erm
a
ndMe
rce
r, R.L.Th
eMa
the
mat
i
cso
fSt
ati
st
ica
lMa
chi
ne e
xtr
act
ion
[ M] .i
nRo
gerGa
rsi
de, Ge
off
reyLe
echa
ndAn
-
Tr
ans
lat
ion
:Pa
rame
terEs
ti
mat
ion[ J
] .Co
mpu
tat
ion
alLi
n- t
hon
yMc
Ene
ry(e
ds.), Co
rpu
sAn
not
ati
on Li
ngu
ist
ic
g
uis
ti
cs, 1993, 19:2. I
nfo
rma
tio
nfr
omCo
mpu
terTe
xtCo
rpo
ra:Lo
ngma
n, 1997.

24
, :

[ 15] Me
lame
d, I
.D.Mo
del
sofTr
ans
nat
ion
alEq
uiv
ale
ncea
- [ 25] , , .

mo
ngWo
rds
[J] .Co
mput
at
ion
alLi
ngu
ist
ics
, 2000, 26:2. [ J
]. , 2002.11.
[ 16] Ne
cipF.A., a
ndBo
nni
eJ.D.A ma
ximum e
ntr
opya
p- [ 26] , , , .

p
roa
cht
oco
mbi
nin
gwo
rda
lig
nme
nts
[ C] .Pr
oce
edi
ngso
f [ J
]. , 2006.5.
Hu
manLa
ngu
ageTe
chno
log
yCo
nfe
ren
ceo
fth
eNo
rthA- [ 27] , , .

me
ric
anCh
apt
ero
fth
eACL2006. [ J
]. , 2003.5.
[ 17] Pi
ao, S
.S.Pa
ral
lelc
orp
oraa
nda
li
gnme
nt:Wh
ati
sit
? [ 28] .
Wh
atdowea
lig
n? [ Z] .h
ttp:/ / www.
lan
cs.
ac/uk/s
taf
f/ [ D] .
.2006.
p
iao
s/r
ese
arc
h/a
lig
nme
nt/a
lig
nme
nt.h
tm, 2001.1. [ 29] , , , .

[ 18] S
unL., DuL.
, Su
n, Y.F.a
ndJ
inY.B.S
ent
enc
ea- [ J
]. , 1997.1.
l
i
gnme
nto
fEng
li
sh-Ch
ine
sec
omp
lexbi
li
ngua
lco
rpo
ra[ Z] . [ 30] , , , .

h
ttp:/ / www.k
ort
erm.k
ais
t.a
c.k
r/n
lpr
s99/ma
l99-p
ape
ra/ [ J
]. , 2003.1.
ma
l-109.
pdf.1999. [ 31] , , .
[ J
].
[ 19] Ta
n, C.L.a
ndNa
gao
, M.Au
toma
ti
cal
ign
mento
fJa
pa- , 2004.8.
n
ese
-Ch
ine
seb
ili
ngu
alt
ext
s[ J] .I
EICE Tr
ans
act
ion
son [ 32] , , .

I
nfo
rma
ti
ona
ndS
yst
ems
, 1995.1. [ A] .
[ M] .

[ 20] Vr
oni
s, J
.Fr
omt
her
ose
tt
ast
onet
othei
nfo
rma
ti
ons
oci
e- :
, 2001.
t
y:as
urv
eyo
fpa
ral
le
lte
xtpr
oce
ssi
ng[ Z] .ht
t
p:/ /www. [ 33] , , .

u
p.u
niv
-mr
s.f
r/ v
ero
nis
/pdf
/2000-PTP-c
hap
ter d
1.pf [ J
]. , 2003.5.
1999. [ 34] , , , .

[ 21] Wu, D.Al
i
gni
ngap
ara
lle
lEng
lis
h-Ch
ine
sec
orp
uss
tat
i
sti
- [ J
]. , 2000.12.
c
all
ywi
thl
exi
calc
rit
eri
a[ C] .Pr
oce
edi
ngso
fth
e32 t
hAn- [ 35] .
[ D] .
.
n
ualMe
eti
ngo
fth
eAs
soc
iat
ionf
orCo
mpu
tat
io
nalLi
ngui
s- 1999.
t
i
cs, 1994. [ 36] , , .

[ 22] Wu
, D.La
rge
-sc
alea
uto
mat
ice
xtr
act
iono
fanEng
lis
h-Ch
i- [ A] .
, .

n
eset
ran
sla
tio
nle
xic
on[J
] .Ma
chi
net
ran
sla
tio
n, 1995. [ M] .
:
, 2003.
[ 23] Zh
angY., Br
own
, R.D.a
ndRo
ber
tE.F.Ada
pti
nga
nd [ 37] , , .

e
xamp
le-b
ase
dtr
ans
lat
ions
yst
emt
oCh
ine
se[ C] .Pr
oce
ed- [J
]. , 2006, 9.
i
ngso
fHLT:F
irs
tInt
ern
ati
ona
lCo
nfe
renc
eonHu
manL
an- [ 38] , , .

g
uag
eTe
chn
olo
gyRe
sea
rch, 2001. [J
]. , 2006, 2.
[ 24] .
[ 39] , .

[ J
]. , 2002. [ J
]. , 2005, 5.

AnOv
erv
iewo
ftheAl
ignme
nto
fBi
li
ngu
alPa
ral
le
lCor
por
a
HUA
NGJ
un-h
ong
, FANYu
n, HUA
NGPi
ng
(Co
ll
egeo
fFo
rei
gnLa
ngu
age
s, Ch
ong
qin
gUn
ive
rsi
ty, Ch
ong
qin
g400044, Chi
na)
Ab s
tra
ct:Base
do nwi
del
i
ter
atur
erevi
ewonpar
all
elcor
por
a, t
hisa
rti
cl
efir
stc
arr
ieso
utth
ere
sea
rchondi
ffe
ren
t
a
lgo
rit
hmsofali
gnmentont
hel
ing
uist
icl
eve
lofpa
ragr
aph, s
ent
ence
, cl
auseaswel
laswordsi
npar
all
elc
orpo
raand
t
hena
nal
yse
sth
eira
dva
nta
gesa
ndd
isa
dva
nta
ges
;fi
nal
lyi
tdi
scu
sse
sth
eal
gor
it
hmso
fal
ig
nme
ntwh
icha
rer
est
ri
ct
edi
n
a
li
gni
ngChin
ese-Engl
is
hpara
ll
elc
orp
us.Ita
imst
opr
ovi
deat
ech
nic
alb
asi
sfo
rco
nst
ruc
ti
ngs
mal
l-s
cal
eCh
ine
se-En
gli
sh
p
ara
lle
lco
rpor
af oro
urres
ear
chin
sti
t
ute
.
Ke
ywo
rds
:Pa
ral
lelCo
rpo
ra;Al
ig
nme
nt

25

You might also like