Student: Trng Ngc Sn.
TABLE OF CONTENTS
ACKNOWLEDGEMENTS
TABLE OF CONTENTS
INTRODUCTION
CHAPTER 1:
1.2.2 Some basic discrete-time signals
1.2.3
1.2.4
1.2.5
1.2.6
CHAPTER 2:
Sound
3.1.2
3.2.2 Speech processing units
3.2.3
3.2.4
CHAPTER 4:
INTRODUCTION
Information technology is developing rapidly, and audio processing technology along with it. Audio processing for Vietnamese speech recognition is particularly significant: it yields many practical applications for society and brings revolutionary changes to fields such as radio, television, and telecommunications. Over the past few decades, recognition has attracted scientists from many different fields, including mathematics, control engineering, electronics, and biology. With the strong growth of information technology, recognition problems are receiving even more attention as a way to improve human-machine communication.
Worldwide, widely spoken languages such as English and French already have many highly effective recognition programs. In Vietnam there have been many research projects on speech recognition based on the theory of artificial intelligence systems. Many results have become commercial products such as ViaVoice and Dragon, security systems based on speech recognition, and voice-dialing telephone systems. Carrying this research forward and putting it into practical use is very meaningful work, especially in our country's current period of industrialization and modernization.
The goal of this project is to study and build a program that recognizes Vietnamese speech in a noisy environment. The input is a limited vocabulary of Vietnamese words, which is compared against stored templates to produce the result. Apart from the introduction and conclusion, the thesis consists of 4 chapters:
Chapter 1: Signals - Fundamentals of Signal Processing
Chapter 2: Characteristics of Vietnamese
Chapter 3: The Speech Recognition Problem
Chapter 4: Demo Program
CHAPTER 1:
SIGNALS - FUNDAMENTALS OF SIGNAL PROCESSING
Signals
Mathematically, a signal is a function that represents the physical state of ...
1.2.
Figure 1.3. Graphical representation of a discrete-time signal x(n).
... other fairly convenient methods are used to represent discrete-time signals (or sequences).
a. Functional representation
Example:
\[ x(n) = \begin{cases} 1, & n = 1, 3 \\ 4, & n = 2 \\ 0, & \text{otherwise} \end{cases} \]
... is used to mark the time origin (n = 0).
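1.2.2 Some basic discrete-time signals
a. Unit sample sequence
This signal is also called the unit impulse sequence and is defined as follows:
\[ \delta(n) = \begin{cases} 1, & n = 0 \\ 0, & n \neq 0 \end{cases} \]
Figure 1.4. Graphical representation of the unit sample signal.
b. Unit step sequence
\[ u(n) = \begin{cases} 1, & n \geq 0 \\ 0, & n < 0 \end{cases} \]
The unit step can be written in terms of the unit sample:
\[ u(n) = \sum_{k=0}^{\infty} \delta(n - k) \]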
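The energy of a signal x(n) is
\[ E = \sum_{n=-\infty}^{\infty} |x(n)|^2 \]
and its average power is
\[ P = \lim_{N \to \infty} \frac{1}{2N+1} \sum_{n=-N}^{N} |x(n)|^2 \]
With the energy over the finite interval -N <= n <= N defined as
\[ E_N = \sum_{n=-N}^{N} |x(n)|^2 \]
these become
\[ E = \lim_{N \to \infty} E_N, \qquad P = \lim_{N \to \infty} \frac{1}{2N+1} E_N \]
... is infinite. For a periodic signal with period N, the average power is
\[ P = \frac{1}{N} \sum_{n=0}^{N-1} |x(n)|^2 \]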
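The even part of a signal can be expressed by the formula:
\[ x_e(n) = \frac{1}{2}[x(n) + x(-n)] \]
and the odd part by:
\[ x_o(n) = \frac{1}{2}[x(n) - x(-n)] \]
so that
\[ x(n) = x_e(n) + x_o(n) \]
Thus any signal can be expressed as the sum of an even signal and an odd signal.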
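The even/odd decomposition can be checked numerically. The following is a minimal sketch, assuming a finite signal stored as a dictionary from time index n to sample value, with missing indices treated as zero; the function name `even_odd_parts` and the sample values are illustrative only.

```python
def even_odd_parts(x):
    """Split a finite sequence into its even and odd parts.

    x maps an integer time index n to a sample value; indices that are
    not present are treated as 0 (an assumption of this sketch).
    """
    indices = set(x) | {-n for n in x}
    xe = {n: 0.5 * (x.get(n, 0.0) + x.get(-n, 0.0)) for n in indices}
    xo = {n: 0.5 * (x.get(n, 0.0) - x.get(-n, 0.0)) for n in indices}
    return xe, xo


# Illustrative signal: x(0) = 2, x(1) = 1, x(2) = 4, zero elsewhere.
x = {0: 2.0, 1: 1.0, 2: 4.0}
xe, xo = even_odd_parts(x)

# Adding the two parts gives back the original signal at every index.
reconstructed = {n: xe[n] + xo[n] for n in xe}
```

The even part is symmetric about n = 0 and the odd part is antisymmetric, which is why the odd part always vanishes at n = 0.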
1.2.4 Simple operations on discrete-time signals
This section examines a few simple operations on the independent variable and on the amplitude of a signal.
a. Shifting the independent variable
A signal x(n) can be shifted in time by replacing the independent variable n with n - k, where k is an integer. If k is positive, the shift delays the signal by k time units. If k is negative, the shift advances the signal by |k| time units.
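A short sketch of the time shift y(n) = x(n - k), again assuming a dictionary from time index to sample value; the helper name `shift` is hypothetical.

```python
def shift(x, k):
    """Return y with y(n) = x(n - k).

    A sample stored at index m in x therefore appears at index m + k in y:
    k > 0 delays the signal, k < 0 advances it.
    """
    return {n + k: value for n, value in x.items()}


x = {0: 1.0, 1: 2.0}
delayed = shift(x, 2)    # samples move to n = 2 and n = 3
advanced = shift(x, -1)  # samples move to n = -1 and n = 0
```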
b. Multiplication, addition, and scaling
The amplitude of a discrete-time signal can be modified through addition, multiplication, and scaling.
Scaling, also called multiplication of a sequence by a constant, multiplies the value of every sample by that constant. Denoting the constant by A, we can write:
y(n) = Ax(n), -∞ < n < ∞
[Figures: block-diagram elements]
- Constant multiplier: y(n) = ax(n)
- Signal multiplier: y(n) = x1(n)x2(n)
- Unit delay (z^-1): y(n) = x(n-1)
- Unit advance: y(n) = x(n+1)
... 0), the system is said to have ...; ... the system is said to have infinite memory.
b. Time-invariant and time-variant systems
A system is called time-invariant if its input/output characteristics do not change over time.
Theorem. A relaxed system T is time-invariant if and only if
x(n) --T--> y(n)
implies
x(n-k) --T--> y(n-k)
for every input signal x(n) and every time shift k.
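The condition in the theorem can be probed numerically by comparing T[x(n-k)] with y(n-k). The sketch below is illustrative only: the two example systems (`differencer`, `ramp_gain`) and all helper names are assumptions, and a single test input can only disprove time invariance, not prove it.

```python
def shift(x, k):
    """y(n) = x(n - k) for a signal stored as {n: value}."""
    return {n + k: v for n, v in x.items()}


def same(a, b, tol=1e-12):
    """Compare two signals, treating missing indices as zero."""
    return all(abs(a.get(n, 0.0) - b.get(n, 0.0)) <= tol for n in set(a) | set(b))


def differencer(x):
    """Example system: y(n) = x(n) - x(n - 1)."""
    indices = set(x) | {n + 1 for n in x}
    return {n: x.get(n, 0.0) - x.get(n - 1, 0.0) for n in indices}


def ramp_gain(x):
    """Example system: y(n) = n * x(n); its gain depends on time."""
    return {n: n * v for n, v in x.items()}


def passes_shift_test(system, x, k):
    """True if shifting the input by k shifts the output by k for this x."""
    return same(system(shift(x, k)), shift(system(x), k))


x = {0: 1.0, 1: 2.0}
differencer_ok = passes_shift_test(differencer, x, 2)
ramp_ok = passes_shift_test(ramp_gain, x, 2)
```

The differencer passes the test, while the time-dependent gain fails it, so the second system is not time-invariant.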
[Figure: block diagrams of a system T with inputs x1(n) and x2(n), scaling constants a1 and a2, an adder, and output y(n).]
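CHAPTER 2:
2.1.
Characteristics of Vietnamese
Speech usually appears in many forms, which we call ...
Phonetic characteristics
Vietnamese has a special kind of unit called the "tiếng" (syllable). Phonetically, ...
Lexical characteristics
Each tiếng is, in general, a meaningful element. The tiếng is the basic unit of ...
Grammatical characteristics
Vietnamese words do not change their form. This characteristic governs ...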
2.5.
hin i hi in
nh ay nhy i
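[Table: structure of a Vietnamese syllable: an initial sound (âm đầu) and a rhyme (vần), where the rhyme consists of a medial sound (âm đệm), a main sound (âm chính), and a final sound (âm cuối).]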
CHAPTER 3:
3.1.1
Sound
+ Sound waves and the sensation of sound
- When an object vibrates toward one side, the layer of air immediately in front of it ...
+ Sound intensity
- This is the energy that the sound wave carries per unit time through a unit area placed perpendicular to the direction of propagation (W/m2). For the human ear, the intensity I matters less than the ratio I/I0, where I0 is the reference intensity. The loudness level L of a sound is defined by:
L = lg(I/I0)
The unit of L is the bel (symbol: B). Thus L = 1, 2, 3 means that the intensity I is 10, 10^2, 10^3 times the reference intensity I0. Since 1 B = 10 dB, the same level expressed in decibels is L = 10 lg(I/I0).
Some typical sound levels:
- Noise in a room: about 30 dB
- Noisy street: about 90 dB
- Threshold of pain: about 130 dB
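As a worked example of the level formula, the sketch below converts between intensity ratios and levels in bels and decibels. The function names are illustrative, and the intensities used are arbitrary example values rather than measurements from the thesis.

```python
import math


def level_bel(intensity, reference):
    """Loudness level L = lg(I / I0), in bels."""
    return math.log10(intensity / reference)


def level_db(intensity, reference):
    """The same level in decibels (1 B = 10 dB)."""
    return 10.0 * math.log10(intensity / reference)


def ratio_from_db(level):
    """Invert the decibel formula: I / I0 = 10 ** (L / 10)."""
    return 10.0 ** (level / 10.0)


# An intensity 1000 times the reference corresponds to 3 B, i.e. 30 dB.
room_level_db = level_db(1000.0, 1.0)
# Going from 30 dB (room) to 90 dB (street) multiplies the intensity by 10^6.
street_vs_room = ratio_from_db(90.0) / ratio_from_db(30.0)
```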
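+ Loudness
The loudness of a sound as perceived by the human ear does not coincide with its intensity. The ear is most sensitive to sounds in the 1000-5000 Hz range and hears high-pitched sounds more readily than low-pitched ones.
+ Timbre
Timbre is a physiological characteristic of sound, formed from its physical characteristics of frequency and amplitude. Experiments show that when a vibrating source emits a sound of frequency f0, it simultaneously emits sounds of frequencies f1 = 2f0, f2 = 3f0, ...
The sound of frequency f0 is called the fundamental or first harmonic; the sounds of higher frequency are called the 2nd harmonic, the 3rd harmonic, and so on. The fundamental is always the strongest, while the harmonics determine the timbre of the fundamental. The harmonics differ from person to person, depending on the structure of each person's oral cavity, throat, and nasal cavity.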
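... (3.1.1)
\[ M_n = \sum_{m=-\infty}^{\infty} |x(m)| \, w(n - m) \] (3.1.2)
\[ Z_n = \sum_{m=-\infty}^{\infty} \left| \mathrm{sgn}[x(m)] - \mathrm{sgn}[x(m-1)] \right| \, w(n - m) \] (3.1.3)
where sgn[x(n)] = 1 when x(n) >= 0 and sgn[x(n)] = -1 when x(n) < 0.
\[ \mathrm{AMDF}(P) = \frac{1}{N} \sum_{i=k_0+1}^{k_0+N} |y_i - y_{i-P}| \] (3.1.4)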
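The short-time measures named here (average magnitude, zero crossings, AMDF) can be sketched for a single frame as follows. This is a simplified illustration that assumes a rectangular window, so each frame is just a list of samples; it counts sign changes directly instead of applying the window weighting, and all function names are hypothetical.

```python
def sgn(value):
    """Sign function with sgn(0) = 1."""
    return 1 if value >= 0 else -1


def average_magnitude(frame):
    """Mean absolute sample value of one frame."""
    return sum(abs(v) for v in frame) / len(frame)


def zero_crossings(frame):
    """Number of sign changes between consecutive samples."""
    return sum(1 for a, b in zip(frame, frame[1:]) if sgn(a) != sgn(b))


def amdf(frame, lag):
    """Average magnitude difference between the frame and a copy delayed by lag."""
    count = len(frame) - lag
    return sum(abs(frame[i] - frame[i - lag]) for i in range(lag, len(frame))) / count


# A square-like wave with a period of 2 samples.
frame = [1.0, -1.0, 1.0, -1.0, 1.0, -1.0, 1.0, -1.0]
crossings = zero_crossings(frame)
dip_at_period = amdf(frame, 2)
```

The AMDF drops to zero when the lag equals the period of the frame, which is why its minima are used to estimate pitch.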
3.2.
3.2.1
... speech, including:
- The acoustic-phonetic approach.
- The statistical pattern recognition approach.
- The artificial intelligence approach.
- The neural network approach.
3.2.1.1 The acoustic-phonetic approach to speech recognition
The acoustic-phonetic approach is based on phonetic theory. This theory holds that speech contains distinct, well-defined phonetic units, and that each phonetic unit is characterized by a set of properties of the speech signal. The acoustic properties of phonetic units vary greatly, both with the speaker's voice and under the influence of neighboring phonetic units (the phenomenon known as coarticulation), but rules exist that make it possible to handle these problems.
The first step in the acoustic-phonetic approach is segmentation and labeling. This step divides the speech signal into segments whose acoustic properties characterize one (or several) phonetic units (or classes), and assigns each segment one or more suitable phonetic labels.
The second step determines a valid word (or string of words) from the sequence of phonetic labels produced by the first step, subject to a number of constraints (that is, the words chosen from a given dictionary must follow grammatical rules and be meaningful).
The block diagram of a speech recognition system following the acoustic-phonetic approach is shown in Figure 1.1
Speech processing units
3.2.2.1 Sampling frequency
Sampling produces a discrete-time or digital signal from a ... signal
... 11 samples.
(0, 2^8 - 1)
(-2^15, 2^15 - 1)
... denoted s_n. Thus:
[Equations: sums over the samples s_n for n = 0, ..., N.]
... noise.
3.2.2.4 Background noise parameter
How it is determined: the program asks the user to stop speaking for 3 seconds and records the signal during that time to capture the ambient noise of the surroundings. The silence threshold is set to the highest energy among the recorded frames. In addition, the Fourier transform can be used to compute the noise frequencies for use in noise filtering.
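The silence threshold described above can be sketched as the maximum frame energy of the noise-only recording. The frame length, the use of non-overlapping frames, and the function names are assumptions made for illustration; the thesis does not state them here.

```python
def frame_energies(samples, frame_len):
    """Energy (sum of squares) of consecutive non-overlapping frames."""
    last_start = len(samples) - frame_len
    return [
        sum(v * v for v in samples[start:start + frame_len])
        for start in range(0, last_start + 1, frame_len)
    ]


def silence_threshold(noise_samples, frame_len):
    """Highest frame energy observed while the user is silent."""
    return max(frame_energies(noise_samples, frame_len))


def is_speech_frame(frame, threshold):
    """A frame counts as speech only if its energy exceeds the threshold."""
    return sum(v * v for v in frame) > threshold


# Toy noise recording: two frames of 4 samples each.
noise = [0.1, -0.1, 0.2, -0.2, 0.0, 0.1, 0.0, -0.1]
threshold = silence_threshold(noise, 4)
```

In a real recording the 3-second noise capture would contain many frames, and the same threshold would then be applied to every frame of the utterance.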
3.2.2.5 Noise filtering
At present the system filters noise with the classical method, the Fourier transform computed with the FFT algorithm. The forward Fourier transform identifies the frequencies present in the signal. All frequencies outside the speech range are then removed by setting the corresponding coefficients to zero, and the inverse transform is applied. (The more precisely the frequency range of the user's voice is known, the better the filtering result.)
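A minimal sketch of this filtering procedure follows: forward FFT, zero the coefficients outside a chosen band, inverse FFT. It uses a small pure-Python radix-2 FFT so that it is self-contained. The 300-3400 Hz band, the 8000 Hz sampling rate, and the test tones are illustrative assumptions, not values taken from the thesis.

```python
import cmath
import math


def fft(x):
    """Recursive radix-2 FFT; len(x) must be a power of two."""
    n = len(x)
    if n == 1:
        return list(x)
    even = fft(x[0::2])
    odd = fft(x[1::2])
    out = [0j] * n
    for k in range(n // 2):
        twiddle = cmath.exp(-2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + twiddle
        out[k + n // 2] = even[k] - twiddle
    return out


def ifft(spectrum):
    """Inverse FFT computed through the forward FFT of the conjugate."""
    n = len(spectrum)
    return [v.conjugate() / n for v in fft([s.conjugate() for s in spectrum])]


def band_limit(samples, sample_rate, low_hz, high_hz):
    """Zero every FFT coefficient whose frequency lies outside [low_hz, high_hz]."""
    n = len(samples)
    spectrum = fft([complex(v) for v in samples])
    for k in range(n):
        # Bins k and n - k carry the same physical frequency.
        freq = min(k, n - k) * sample_rate / n
        if not (low_hz <= freq <= high_hz):
            spectrum[k] = 0j
    return [v.real for v in ifft(spectrum)]


sample_rate = 8000
n = 64
tone = [math.sin(2 * math.pi * 1000 * i / sample_rate) for i in range(n)]
hum = [0.5 * math.sin(2 * math.pi * 125 * i / sample_rate) for i in range(n)]
noisy = [t + h for t, h in zip(tone, hum)]
filtered = band_limit(noisy, sample_rate, 300.0, 3400.0)
```

Zeroing coefficients outright is simple but acts as a very sharp filter; it works cleanly here because both test tones fall exactly on FFT bins.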
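3.2.3
Some noise reduction techniques
1. The CMS (cepstral mean subtraction) technique
This is a commonly used noise reduction technique in recognition systems ...
\[ \bar{O} = \frac{1}{T} \sum_{t=1}^{T} O(t) \] (3.2.1)
\[ H(z) = \frac{\ldots}{1 - 0.94 z^{-1}} \] (3.2.2)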
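Cepstral mean subtraction can be sketched in a few lines: compute the mean feature vector over the T frames of an utterance and subtract it from every frame. The sketch assumes each frame is a list of cepstral coefficients of equal length; the function name and the toy values are illustrative.

```python
def cepstral_mean_subtraction(frames):
    """Subtract the per-coefficient mean over all T frames of an utterance.

    frames is a list of T feature vectors (lists of equal length).
    """
    total = len(frames)
    dim = len(frames[0])
    mean = [sum(frame[d] for frame in frames) / total for d in range(dim)]
    return [[frame[d] - mean[d] for d in range(dim)] for frame in frames]


# Two frames with two coefficients each; the mean vector is [2.0, 4.0].
frames = [[1.0, 2.0], [3.0, 6.0]]
normalized = cepstral_mean_subtraction(frames)
```

After the subtraction every coefficient has zero mean over the utterance, which removes a constant offset such as the one introduced by a fixed channel or microphone.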
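[Figure: pattern-matching system. Audio input -> Feature extraction -> Recognition decision -> Recognition result output.]
Pattern-matching system
This is only the most general description of a pattern-matching system for a speech recognition application. Below is the block diagram of a system that recognizes speech one isolated word at a time.
[Figure: isolated-word recognition system. Audio input -> Feature extraction -> Word recognition decision (using the reference word templates) -> Recognition result output.]
[Figure: recognition system with a phoneme recognition stage. Audio input -> Feature extraction -> Phoneme recognition -> Recognition (each stage using reference templates) -> Recognition result output.]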
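3.2.4.2.1 Definition
The similarity between templates is determined by a mathematical formula, which lets us assert how alike two templates are. Suppose we have two n-dimensional input template vectors Ai and Aj. The similarity measure L(Ai, Aj) must satisfy the following conditions:
- The similarity measure must be non-negative: L(Ai, Aj) >= 0;
- It must be symmetric: L(Ai, Aj) = L(Aj, Ai)
\[ d_1(A_i, A_j) = \left\{ \sum_{k=1}^{n} (a_{ik} - a_{jk})^2 \right\}^{1/2} \] (3.2.3)
\[ d_2(A_i, A_j) = \sum_{k=1}^{n} |a_{ik} - a_{jk}| \] (3.2.4)
d3(Ai, Aj) = ... (3.2.5)
\[ d_4(A_i, A_j) = \left\{ \sum_{k=1}^{n} |a_{ik} - a_{jk}|^{\lambda} \right\}^{1/\lambda} \] (3.2.6)
\[ d_5(A_i, A_j) = \frac{\sum_{k=1}^{n} (a_{ik} - \bar{a}_i)(a_{jk} - \bar{a}_j)}{\left[ \sum_{k=1}^{n} (a_{ik} - \bar{a}_i)^2 \sum_{k=1}^{n} (a_{jk} - \bar{a}_j)^2 \right]^{1/2}} \] (3.2.7)
where:
\[ \bar{a}_i = \frac{1}{n} \sum_{k=1}^{n} a_{ik}, \qquad \bar{a}_j = \frac{1}{n} \sum_{k=1}^{n} a_{jk} \]
d6(Ai, Aj) = ... (3.2.8)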
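To make the template comparison concrete, the sketch below implements three common distance measures (Euclidean, city-block, and Minkowski with exponent `lam`) and a nearest-template decision. The function names, the template labels "mot" and "hai", and the feature values are all hypothetical.

```python
import math


def euclidean(a, b):
    """Square root of the summed squared coordinate differences."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def city_block(a, b):
    """Sum of absolute coordinate differences."""
    return sum(abs(x - y) for x, y in zip(a, b))


def minkowski(a, b, lam):
    """Generalized distance; lam = 1 is city-block, lam = 2 is Euclidean."""
    return sum(abs(x - y) ** lam for x, y in zip(a, b)) ** (1.0 / lam)


def nearest_template(features, templates, distance=euclidean):
    """Return the label of the stored template closest to the input features."""
    return min(templates, key=lambda label: distance(features, templates[label]))


# Hypothetical two-word vocabulary with 2-dimensional feature vectors.
templates = {"mot": [1.0, 1.2], "hai": [5.0, 5.0]}
decision = nearest_template([1.0, 1.0], templates)
```

A smaller distance means greater similarity, so the recognizer picks the template with the minimum distance to the input.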
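\[ y_n = \sum_{i=1}^{p} \alpha_i y_{n-i} + G \varepsilon_n \]
where:
\alpha_i (i = 1..p) are the prediction coefficients.
The prediction error is G\varepsilon_n = y_n - \sum_{i=1}^{p} \alpha_i y_{n-i}. Here we take the partial derivative of its mean-square value with respect to each coefficient (i = 1..p) and compute the values of the prediction coefficients for which this error is minimized.
\[ E[(G\varepsilon_n)^2] = E\left[ \left( y_n - \sum_{i=1}^{p} \alpha_i y_{n-i} \right)^2 \right] \]
\[ \Rightarrow \; -2 E\left[ \left( y_n - \sum_{i=1}^{p} \alpha_i y_{n-i} \right) y_{n-j} \right] = 0 \]
\[ \Rightarrow \; \sum_{i=1}^{p} \alpha_i E[y_{n-i} y_{n-j}] = E[y_n y_{n-j}], \quad j = 1..p \]
To solve the system above we need to compute the values E[y_{n-i} y_{n-j}]. There are two methods for computing the E[y_{n-i} ...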
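\[ R_{yy}(k) = \sum_{n=k+1}^{N} y_n y_{n-k} \]
\[ \begin{bmatrix} R_{yy}(0) & R_{yy}(1) & \cdots & R_{yy}(P-1) \\ R_{yy}(1) & R_{yy}(0) & \cdots & R_{yy}(P-2) \\ \vdots & \vdots & \ddots & \vdots \\ R_{yy}(P-1) & R_{yy}(P-2) & \cdots & R_{yy}(0) \end{bmatrix} \begin{bmatrix} \alpha_1 \\ \alpha_2 \\ \vdots \\ \alpha_P \end{bmatrix} = \begin{bmatrix} R_{yy}(1) \\ R_{yy}(2) \\ \vdots \\ R_{yy}(P) \end{bmatrix} \]
1. Set E_0 = R_{yy}(0), i = 0
2. i = i + 1
3. \[ k_i = \left( \sum_{j=1}^{i-1} \alpha_j^{(i-1)} R_{yy}(i-j) - R_{yy}(i) \right) / E_{i-1} \]
4. \[ \alpha_i^{(i)} = -k_i \]
5. \[ \alpha_j^{(i)} = \alpha_j^{(i-1)} + k_i \, \alpha_{i-j}^{(i-1)} \] (for j = 1..i-1)
6. \[ E_i = (1 - k_i^2) E_{i-1} \]
7. If i < P, go back to step 2.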
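The recursion can be sketched in Python as below. It assumes the predictor convention y_n ≈ sum_i a[i] * y_{n-i} and defines the intermediate quantity k as (sum_j a[j] * R(i-j) - R(i)) / E, so that the new coefficient is a[i] = -k. This sign convention is an assumption of the sketch, since textbooks differ on it, and the function names are illustrative.

```python
def autocorrelation(y, max_lag):
    """R(k) = sum over n of y[n] * y[n - k], for k = 0..max_lag."""
    n = len(y)
    return [sum(y[t] * y[t - k] for t in range(k, n)) for k in range(max_lag + 1)]


def levinson_durbin(r, order):
    """Solve the Toeplitz normal equations for the prediction coefficients.

    r[k] is the autocorrelation at lag k. Returns (coefficients, error),
    where coefficients[i - 1] multiplies y[n - i] in the predictor.
    """
    a = [0.0] * (order + 1)  # a[0] is unused
    error = r[0]
    for i in range(1, order + 1):
        acc = sum(a[j] * r[i - j] for j in range(1, i))
        k = (acc - r[i]) / error
        prev = a[:]
        a[i] = -k
        for j in range(1, i):
            a[j] = prev[j] + k * prev[i - j]
        error *= 1.0 - k * k
    return a[1:], error


# Autocorrelation values chosen by hand for a small check.
coeffs, err = levinson_durbin([2.0, 1.0, 0.2], 2)
```

For the values above the result satisfies both normal equations: coeffs[0] * 2.0 + coeffs[1] * 1.0 equals R(1) = 1.0, and coeffs[0] * 1.0 + coeffs[1] * 2.0 equals R(2) = 0.2. For LPC-10, `order` would be 10 and `r` would come from `autocorrelation(frame, 10)`.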
CHAPTER 4:
4.1.
... words in Vietnamese. First of all, the main purpose of the program is to simulate the recognition of single words in Vietnamese. This is the basis for recognizing larger units of Vietnamese such as compound words, phrases, and sentences. The program has the following main functions:
+ Training the system: this function creates and updates the most basic features of each word in the database. In other words, the machine learns to extract the features of a word from many speakers, so that words can be recognized for many different people. Each person teaches the machine a number of Vietnamese words, then records the same words in later utterances and passes them to the system for recognition.
+ Recognizing single words (words consisting of one syllable) of Vietnamese from a source file: a word can be recognized only after the machine has been trained on it. This function recognizes words from audio files. If no file exists yet, the word to be recognized must be recorded with the SoundRecorder program built into Windows and saved to Wave files, which are then recognized with the program's recognition function. Alternatively, to test the program's recognition ability, we can use the words already recorded in the Data-forNhanDang folder, since these words already have a set of trained templates whose features are stored in the database.
+ Direct recognition through the microphone: this works on the basis of words that have already been learned. The system waits continuously for the speaker to talk into the microphone and displays the recognized information as text on the screen. At the same time, the sound wave is displayed visually in a picture box.
+ In addition there are other functions such as:
- Recording: recording template words for training and words for recognition.
- Displaying information about the Wave file being read.
The Vietnamese word recognition model is based on the LPC-10 linear prediction method. Each word sample is divided into small frames. The LPC-10 coefficients are then computed for each frame, and the resulting parameter set for all frames is stored in the database.
The process of creating data from one sample can be pictured as follows:
+ The audio file is trimmed to keep only the part of the audio data that contains speech.
+ Time normalization
+ Normalization of amplitude variation
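The preparation steps listed above can be sketched as follows. How the thesis program actually performs each step is not stated here, so everything in this sketch is an assumption: silence is trimmed with a simple magnitude threshold, time is normalized by always cutting the word into the same number of frames, and amplitude is normalized by scaling the peak to 1.

```python
def trim_silence(samples, threshold):
    """Keep the span from the first to the last sample louder than threshold."""
    loud = [i for i, v in enumerate(samples) if abs(v) > threshold]
    return samples[loud[0]:loud[-1] + 1] if loud else []


def normalize_amplitude(samples):
    """Scale the samples so that the largest magnitude becomes 1."""
    peak = max(abs(v) for v in samples)
    return [v / peak for v in samples] if peak > 0 else list(samples)


def split_into_frames(samples, num_frames):
    """Assumed time normalization: always produce num_frames equal-length frames."""
    frame_len = len(samples) // num_frames
    return [samples[i * frame_len:(i + 1) * frame_len] for i in range(num_frames)]


recording = [0.0, 0.01, 0.5, -1.0, 2.0, 0.25, 0.0, 0.0]
speech = trim_silence(recording, 0.05)
scaled = normalize_amplitude(speech)
frames = split_into_frames(scaled, 2)
```

Each resulting frame would then be passed to the linear prediction analysis, giving one coefficient vector per frame for storage in the database.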
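EVALUATION OF RESULTS AND CONCLUSION
After the period spent on this graduation project, by applying the basic knowledge I have learned, together with my own effort and the dedicated guidance of my supervisor, MSc. V Vn Tng, the assigned project "Recognition of a limited Vietnamese vocabulary in a noisy environment" has been completed. The program basically meets the stated requirements.
Results achieved:
+ The program meets the requirement of recognizing Vietnamese words.
+ When the number of training samples is large, the recognition results are of good quality.
+ In testing, the system gives good recognition results when the vocabulary is not large (a little over 20 words).
+ The system recognizes well for the same speaker and for the people who took part in training the templates.
+ When the speaker did not take part in training the templates, the recognition results are not yet satisfactory.
Requirements for improving system quality
+ The training samples must be standard samples with little noise.
+ Increase the number of training samples.
+ Check and listen to each sample before adding it to the database.
Directions for further development of the project
+ Serve as the basis for designing a system that recognizes phrases and sentences.
+ Develop a program that communicates with the computer directly through the microphone to carry out a number of basic commands.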