Professional Documents
Culture Documents
Overview
How is speech produced by humans? Vocal organs
Lungs and trachea (windpipe) Larynx (voicebox) Vocal tract
Speech generation
Excitation and modulation
Articulatory phonetics
Phonetic alphabets Consonants: manner and place of articulation Vowels
Vocal Folds (within the Larynx) vibrate to give pitch Trachea Lungs
Provide energy for speech production
Larynx
The larynx is a continuation of the trachea but contains a highly specialised cartilage structure and associated muscles. The most important parts of the larynx are: 1. The vocal folds (cords) 2. The arytenoid cartilage The arytenoid cartilages are the main controllers of the vocal folds. Breathing: Swallowing: vocal folds held open vocal folds closed.
The vocal folds stretch across the larynx and when closed, separate the larynx from the trachea. The opening made by the vocal folds is known as the glottis.
6
Pitch period, T
The frequency of the pitch signal above is f = 1/T Speech sounds produced in this way are known as voiced. When the tension of the vocal chords is less, they become quite loose. When air is forced from the lungs the effect is now to create a turbulent airflow rather than pulses. This is known as unvoiced speech.
8
Unvoiced Sounds
Whispering occurs when the vocal chords are moved close together but with a small opening remaining Air passing through this from the lungs creates turbulent airflow with the resulting excitation signal resembling wideband noise
Excitation signal There are no regular pulses of air and so the excitation has no specific pitch period Speech sounds produced by this form of excitation are termed unvoiced. Examples are /s/ as in step and /f/ as in from
9
Laryngograph
A laryngograph is an instrument for recording the movements of the vocal folds during speech A small high frequency signal (3 MHz) is transmitted across the larynx. As the vocal folds move, the conductance of this signal changes, and this change is recovered by the laryngograph.
10
Vocal Tract
The vocal tract is normally considered to be everything past the vocal folds. Vocal tract can be divided into following regions: 1. Oral cavity 2. Nasal cavity 3. Pharynx and contains a number of articulators: a.Tongue b.Soft palate (velum) c.Hard palate d.Teeth e.Alveolar ridge f.Lips
11
Vocal Tract
The vocal tract can be considered to be a series of connected cavities, each with their own resonant frequencies. The movement of the articulators (tongue, velum) changes the physical shape of these cavities and hence the resonant properties of the vocal tract. As the air from the larynx flows through the vocal tract, the resonant frequencies of these cavities modulates different effects onto the air flow and creates different speech sounds. The use of different components of the vocal tract has different effects on the produced speech signal. Moving the tongue changes the shape of the oral cavity and hence its resonant properties. The nasal cavity can also be closed off by the velum.
12
Speech
13
Modulation
Modulation is the process of imposing information on the excitation, or glottal, waveform Considering the effect of modulation from a physiological point of view is termed articulatory phonetics While considering modulation in terms of its acoustic properties is known as acoustic phonetics Articulatory phonetics considers how changes in the vocal organs change the resulting speech sounds. These changes are mainly effected by the tongue, but also other components such as the velum, teeth and lips Acoustic phonetics considers the modulation as a filtering operation applied to the excitation waveform. The vocal tract can be thought of as an acoustic tube which has various resonant frequencies These resonant frequencies can be adjusted by the articulators in the 14 vocal tract and are known as formants
Formants
F2 F3 F4
Excitation signal
Speech signal
15
Excitation (glottis)
Excitation consists of wideband noise - no pitch
Speech
Excitation signal
Speech signal
19
20
21
Consonants
Consonants are relatively easy to define in anatomical terms. They are principally distinguished by: 1. Place of articulation 2. Manner of articulation 3. Voicing or phonation
22
Place of articulation
This is the place in the vocal tract where the major source of constriction occurs It is defined in terms of both the active and the passive articulators
Description Between lips Lower lip to upper teeth Front of tongue between teeth Front of tongue to alveolar ridge Front of tongue between alveolar ridge hard palate Middle of tongue to hard palate Root of tongue to rear of mouth Tongue to uvular Tongue to pharynx Constriction at glottis
Example pea fee thigh see she you key French r (none in English) Arabic (none in English) 23 sor of (Cockney)
Manner of articulation
Manner of articulation is characterised by the degree of constriction and the manner of its release into the following sound.
24
Voicing
Indicates presence or absence of phonation Previous tables have shown that consonants can have voiced or unvoiced forms, depending on phonation Examples is voiced zzz and unvoiced sss
25
Vowels
Vowels are less well defined than consonants. The tongue rarely touches another organ in the vocal tract so gives no specific points of articulation Vowels can be roughly described in terms of their position in the mouth by four variables:
1. Tongue high or low 2. Tongue front or back 3. Lips rounded or unrounded 4. Nasalised or unnasalised
Considering only the position of the tongue, a vowel diagram can be used to show the location of different vowels.
Front High Middle Low Back Front High Middle Low
heed hid hayed ahead bud hawed 26 hoed
Back
whod
i e
u o
head had
Vowels
The vowels shown in the diagram are most common in English, but the diagram is not entirely consistent. Front vowels are pronounced with the lips unrounded (heed). Back vowels are pronounced with lips rounded (whod). This is customary in English, but not in all languages. Leads to two vowel diagrams to accommodate the differences - one for rounded vowels and the other for unrounded vowels. The neutral vowel [] is one of the most commonly used and is given the name schwa. Examples of schwa are start of above and end of soda. Velum determines whether a vowel is nasalised or unnasalised. When the velum is open this opens up the nasal cavity which can impart more structure onto the vowel sound. In the English language, no distinction is made between nasal and unnasalised vowels. However in some languages (for example French) the difference is important.
27
Summary
Examined vocal organs used in speech generation - lungs/trachea, larynx/vocal cords, vocal tract Considered speech generation in terms of an excitation signal and its modulation by vocal tract Looked at differences between voiced and unvoiced sounds Articulatory phonetics - IPA alphabet of speech sounds Vowels and consonants - position of articulators to produce the different speech sounds Continuous speaking effects and coarticulation
29