Professional Documents
Culture Documents
Objective
Speech Recognition is the process of recognizing the word (predefined) spoken by the
speaker on the basis of information included in speech waves. GMM or Gaussian Mixture
model algorithm compares the cepstral coefficients generated by speech samples in the
training and testing phase. Furthermore this technique makes it possible to use the
speaker’s voice to verify their identity. This project is implemented in ADSP 2181
processor.
Project Description
In testing phase, the input speech is matched with stored references models (s)
and recognition decision is made on the basis of Mel Frequency Cepstrum Coefficients
(MFCC) , Gaussian Mixture model(GMM).
Block Diagram
Speech
Input Windowi Mel-
Framing
F ng |FFT|
r
F
F Filtering
F
r r F
a r
m a a r
a a
i m m
m m
n i i
Recognizg i i
n Static
n
n
ed O/P GMMg g n
F coefficient
g DCT g
Classifier F
r F s
a F r
r a
m a r
i a m
m i
n i m
g i n
n g
g n
g
Implementation