Speech is the medium through
which human beings can communicate with each
other efficiently. Speech synthesis and recognition
are two phases of speech. In this paper, focus is
given on speech recognition. Speech recognition is
the conversion of spoken words into text with the
help of some electronic device like computer. A
number of methods are available for recognition of
speech in different languages using various units
like vowels/consonants, words, phonemes, or
syllables. No much work has been done in Punjabi
language. So, in this Discrete Wavelet Transform
method is described for recognition of consonants
in isolated Punjabi words.
Original Title
Recognition of Consonants in Isolated Punjabi Words using DWT
Speech is the medium through
which human beings can communicate with each
other efficiently. Speech synthesis and recognition
are two phases of speech. In this paper, focus is
given on speech recognition. Speech recognition is
the conversion of spoken words into text with the
help of some electronic device like computer. A
number of methods are available for recognition of
speech in different languages using various units
like vowels/consonants, words, phonemes, or
syllables. No much work has been done in Punjabi
language. So, in this Discrete Wavelet Transform
method is described for recognition of consonants
in isolated Punjabi words.
Speech is the medium through
which human beings can communicate with each
other efficiently. Speech synthesis and recognition
are two phases of speech. In this paper, focus is
given on speech recognition. Speech recognition is
the conversion of spoken words into text with the
help of some electronic device like computer. A
number of methods are available for recognition of
speech in different languages using various units
like vowels/consonants, words, phonemes, or
syllables. No much work has been done in Punjabi
language. So, in this Discrete Wavelet Transform
method is described for recognition of consonants
in isolated Punjabi words.
Recognition of Consonants in Isolated Punjabi Words using DWT Manpreet Kaur 1 , Amanpreet Kaur 2 , Jasmeen Gill 3
1 Department of Computer Science, RIMT, Mandi Gobindgarh, Punjab, India
2 Department of Computer Science, BBSBEC, Fatehgarh Sahib, Punjab, India
3 Department of Computer Science, RIMT, Mandi Gobindgarh, Punjab, India
ABSTRACT: Speech is the medium through which human beings can communicate with each other efficiently. Speech synthesis and recognition are two phases of speech. In this paper, focus is given on speech recognition. Speech recognition is the conversion of spoken words into text with the help of some electronic device like computer. A number of methods are available for recognition of speech in different languages using various units like vowels/consonants, words, phonemes, or syllables. No much work has been done in Punjabi language. So, in this Discrete Wavelet Transform method is described for recognition of consonants in isolated Punjabi words. Keywords: Speech recognition, Recognition types, DWT. 1. INTRODUCTION Speech recognition is the process of translation of spoken words into text form. The recognition of speech is done with the help of a computer to which speech is given as input. Speech recognition is important because speech is the natural medium of communication between human beings. Speech recognition can also be called Automatic Speech Recognition, Computer Speech Recognition or Speech to Text recognition. Speech recognition techniques help the human beings to understand speech through machines. There are many types of speech recognition depending on the various types of modes of recognition. On the basis of speech mode, the speech recognition can be of isolated words, connected words, continuous speech. For speaker basis, it can be speaker dependent and speaker independent. Speech can also be recognized on the basis of vocabulary. It can be small sized, medium sized and large sized vocabulary recognition. Speech recognition is also based on the speaking style of speech which can be dictation type and spontaneous speech [2]. There are various methods available for recognition of speech for different units in various languages. Vowels can be recognized in continuous speech using formants which involves vocal tract resonant frequencies [3]. But vowels suffer the problemof overlapping. Consonant/Vowel units in a continuous speech can be recognized using Auto associative Neural Networks and Support Vector machines. In this, vowel onset points are predicted [4]. But there is drawback for unequal durations. Acoustic Modelling technique can be used for speech recognition using phoneme like units but in phonemes there is large variability [5]. To reduce these defects, it is required to choose suitable unit and method. Moreover, Punjabi language requires such kind of research. In this paper, Discrete Wavelet Transform method is described for recognition of consonants in isolated Punjabi words. 2. DISCRETE WAVELET TRANSFORMS Discrete Wavelet Transformmethod is a technique in which speech can be discretely sampled. The main advantage of Discrete Wavelet Transform is International Journal of Computer Trends and Technology (IJCTT) volume 9 number 2 Mar 2014 ISSN: 2231-2803 http://www.ijcttjournal.org Page92
that it can use time and frequency of speech simultaneously. It emphasizes on signal interval efficiently. As speech is non-stationary and Discrete Wavelet Transformtechnique can be used for non-stationary signals. In this technique, frequency components can be resolved into parts, approximation coefficients and detailed coefficients. Approximation coefficients contain low frequency components and detailed coefficients contain high frequency components [1]. There are many types of Discrete Wavelet Transform (DWT). Haar, Coiflet and Daubechies are three types of DWT that are discussed here and used in this work. 2.1 Haar Wavelets Haar is the simplest formof wavelets. It collects input values, store difference and then gives the sum. This process is repeated recursively and collects sums to give next scale. Haar Wavelets are related to mathematical operation that is Haar Transform. These Wavelets act as prototype for other wavelets and helps in studying the other wavelet transforms. In this calculations can be easily handled and discrete signals are involved [6]. Discrete signal is defined as a function of time with values at discrete instants. It can be expressed as f =(f 1 , f 2 , f 3 ................. f n ) Where n is positive integer. Haar Wavelets also decomposes the signal into two sub signals in half its length. One sub signal is running average or trend and first trend is denoted as a 1 =(a 1 , a 2 , a 3 ..................a n/2 ) Other sub signal is running difference or fluctuation. The first fluctuation is denoted as d 1 =(d 1, d 2, d 3........................ d n/2 ) Next, the first level Haar Transformis given as the mapping H 1 defined by f (a 1 d 1 ) fromdiscrete signal f to its first trend a 1 and first fluctuation d 1 . 2.2 Daubechies Wavelets Daubechies Wavelets are also computed similarly as Haar Wavelets by using running averages and running differences via scalar products with scaling signals and wavelets. In this type of Wavelets, the scaling signals and wavelets produce averages and differences using more values from the signals. This improves the functionality of these wavelets and provides better tools for signal processing. There are many types of Daubechies Wavelets but they are very similar with each other. The simplest one is Daud4 Wavelet Transform. The first level of Daub4 Transform is given by mapping D 1 defined as f ( a 1 d 1 ) froma discrete signal f to first trend a 1 and first fluctuation d 1 [7]. 2.3 Coiflet Wavelets Coiflet Wavelets maintain the close match between trend values and original signal values. The simplest formof these wavelets is Coif6 Wavelets. The Coif6 scaling numbers satisfy the following identity
1 2 + 2 2 + 3 2 + 4 2 + 5 2 + 6 2 =1 where is the scaling number which means each scaling signal has energy equals to 1. Further, the wavelet numbers satisfy the following
1 + 2 + 3 + 4 + 5 + 6 =0 International Journal of Computer Trends and Technology (IJCTT) volume 9 number 2 Mar 2014 ISSN: 2231-2803 http://www.ijcttjournal.org Page93
which means zero fluctuation when signal is linear in support. The Coif6 and Daub4 are different from each other in scaling number characteristics. The Coif6 wavelets produce better matching of trend sub signal with the original signal than Daub4 and also improve accuracy [8]. 3. IMPLEMENTATION All this work is done in mat lab using signal processing tool. The following are the steps involved in the recognition of consonants in Punjabi words. (i) First all the signal samples are loaded in the database. (ii) Then, input speech signal is loaded. (iii) The signal is converted into single row matrix. (iv) Correlation is found between input signal and database signals and show waveform. (v) Discrete Wavelet Transforms (HAAR, COIFLET and DAUBECHIES) are applied on input signal. (vi) Discrete Wavelet Transforms (HAAR, COIFLET and DAUBECHIES) are applied on database signal. (vii) Frequency matching is done of each letter of Punjabi word with letters present in database. (viii) Result of recognized Punjabi letter having maximummatched rate is found. (ix) All matched results are shown as output. 4. RESULTS AND DISCUSSION In this paper, the consonants are recognized in isolated Punjabi words using Discrete Wavelet Transform. First, there is waveform of Punjabi words and . There are four consonants in first word and three consonants in second word. Then, the recognized consonants from these Punjabi words are shown. Next, the results are shown with the help of graphs and bar graphs.
Figure 1: Waveformof Punjabi word
Figure 2: First recognized consonant
Figure 3: Second recognized consonant
0 0.5 1 1.5 2 2.5 3 3.5 4 x 10 4 -0.3 -0.2 -0.1 0 0.1 0.2 0.3 0.4 International Journal of Computer Trends and Technology (IJCTT) volume 9 number 2 Mar 2014 ISSN: 2231-2803 http://www.ijcttjournal.org Page94
Figure 4: Third recognized consonant
Figure 5: Fourth recognized consonant
Figure 6: PSNR and MSE of Punjabi speech signals
Figure 7: Waveformof Punjabi word
Figure 8: First recognized consonant
Figure 9: Second recognized consonant
0 0.5 1 1.5 2 2.5 3 3.5 4 x 10 4 -0.3 -0.2 -0.1 0 0.1 0.2 0.3 0.4 International Journal of Computer Trends and Technology (IJCTT) volume 9 number 2 Mar 2014 ISSN: 2231-2803 http://www.ijcttjournal.org Page95
Figure 10: Third recognized consonant
Figure 11: PSNR and MSE of Punjabi speech signals 5. CONCLUSION In the given paper, the Punjabi consonants are recognized in isolated Punjabi words using Discrete Wavelet Transform. Punjabi language lacks research in this field. Therefore, Punjabi language is chosen to recognize consonants in Punjabi words using Discrete Wavelet Transform because this technique includes both time and frequency simultaneously. Moreover, this method can also be used for non stationary signals. This method also gives better accuracy rates. 6. FUTURE SCOPE This work can also be extended for recognition of isolated words, connected words and continuous speech. Further, in this work consonants are recognized in a Punjabi word in order of consonant series but in future work they can be recognized in order of their place in the given word. REFERENCES [1] M. F. Tolba, T. Nazmy, A. A. Abdelhamid and M. E. Gadallah A Novel Method for Arabic Consonant/Vowel Segmentation using Wavelet Transform in IJ ICIS, Vol. 5, No. 1, J uly 2005 [2] M. A. Anusuya and S. K. Katti Speech Recognition by Machine: A Review in IJ CSIS, Vol. 6, No. 3, 2009. [3] Biljana Prica and Sinisa Ilic Recognition of Vowels in Continuous Speech by Using Formants Facta Universitatis, SER.: ELEC. ENERG. vol. 23, no. 3, December 2010. [4] Suryakanth V. Caigashetty, C. Chandra Sekhar, and B. Yegnanarapna Spotting Consonant-Vowel Units in Continuous Speech using Auto Associative Neural Networks and Support Vector Machines IEEE Workshop on Machine Learning for Signal Processing, 2004. [5] C. H. Lee, E. Giachin, L. R. Rabiner, R. Pieraccini and A. E. Rosenberg Improved Acoustic Modelling for Continuous Speech Recognition in Speech Research Department AT and T Bell Laboratories, Murray Hill. [6] C. S. Burrus, R. H. Gopinath, and H. Guo Introduction to Wavelets and Wavelet Transforms, A Primer Prentice Hall, Englewood Cliffs, NJ , 1998. [7] C. K. Chui Wavelets: A Mathematical Tool for Signal Analysis SIAM, Philadelphia, PA, 1997. [8] H. L. Resnikoff and R.O. Wells Wavelet Analysis: The Scalable Structure of Information Springer, New York, NY, 1998. AT&T Bell Laboratories Murray Hill E. Giachin L. R. Rabiner, R. Pieraccini and A. E. Rosenberg