Professional Documents
Culture Documents
www.seipub.org/aiaas
kking29@uthsc.edu; 2mrastatter@ecu.edu
*1
Abstract
When addressing acoustic parameters of speech, much of the
previous research has focused on fundamental frequency
(F0), duration, and intensity as the primary acoustic
characteristics within a prosodic structure. The findings
from these studies are limited in relation to natural speech as
they tend to focus on structured, controlled tasks with best
productions. This study addresses questions related to the
effect of task on the acoustic correlates of stress in sentences,
in addition differences in productions were investigated
when following an instruction versus following a question.
Fifteen speakers of general American English were recorded
in two speaking tasks. Tasks used were: (1) a detailed
instruction as to what should be produced and (2) a question,
without any instruction, to elicit responses. Measurements of
fundamental frequency (F0), amplitude, duration, and total
duration were taken in all contexts. Findings revealed
significant differences between tasks for all acoustic
parameters measured: F0, amplitude, and duration. The
pattern for the magnitude of the change also varied between
the tasks. These findings suggested that the natural variation
observed in spoken language is reduced with directed tasks
and that an individuals production of stress changes
dependent on the task. Implications for past and future
research have been discussed.
Keywords
Acoustic; Prosody; Intonation; Methodology; Speech; Fundamental
Frequency; Amplitude; Duration
Introduction
Prosody plays a central role in human communication.
During speech acts, the intended message is not
conveyed just by words but by varying the
intonational or suprasegmental attributes of the
message, such as prosody. Much research has been
conducted to identify and quantify the contributing
acoustic parameters to intonation. Prosody and
intonational patterns effect how speech is perceived by
others. Defining prosody involves identifying the
suprasegmental features inherent in a message, which
generally includes pitch, rate, intensity, and duration
(Cutler, Dahan, and van Donselaar, 1997). Much
111
www.seipub.org/aiaas
www.seipub.org/aiaas
www.seipub.org/aiaas
TABLE 1 MEAN, STANDARD DEVIATION, AND COEFFICIENT OF VARIATION FOR PEAK FREQUENCY, PEAK AMPLITUDE, DURATION FOR EACH STRESSED
WORD AND TASK
BEV
INSTRUCTION
Peak Frequency
Peak Amplitude
Duration
QUESTION
Peak Frequency
Peak Amplitude
Duration
Mean
SD
163.55
71.19
369.96
33.28
3.66
89.24
20.3
5.1
24.1
170.2
71.09
381.46
31.74
3.12
82.92
158.90
68.01
256.41
37.17
2.26
37.28
23.4
3.3
14.5
148.35
66.10
359.64
38.52
5.57
91.56
Mean
Peak Amplitude
The analysis indicated statistically significant main
effects for word (F2, 111= 5.027, p = .008) and task
(F1,111=19.34, p < .001) but no interaction, indicating
that peak amplitude was significantly higher for Bev
when stressed as compared to the other words. Peak
amplitude also was significantly higher when the
manner of elicitation was a given instruction versus
the natural context of question.
Statistically significant main effects for word (F2, 111=
28.25, p <.001) and task (F1,111=37.96, p < .001) and a
marginally significant interaction (F2, 111= 2.764, p
=.06) were indicated when measuring peak amplitude
and loves was stressed. These results revealed that
peak amplitude was significantly higher for loves
when stressed as compared to the other words. Peak
amplitude also was significantly higher when the
manner of elicitation was a given instruction versus
the natural context of a question. An interaction of
word by task also was marginally significant.
Analysis of Bob, when targeted, indicating a
statistically significant main effect for word (F 2, 111=
14.959, p <.001) and for task (F 2, 111= 19.25, p <.001)
but no interaction. The peak amplitude for Bob was
significantly higher as compared to other words and
was higher when produced following an instruction
versus the natural context of answering a question.
Duration
The analysis indicated statistically significant main
effects for word (F2, 111= 20.19, p <.001) and task
(F1,111=23.84, p < .001) and an interaction for word by
task (F2, 111= 14.15, p <.001), indicating that duration
was significantly longer for Bev when stressed as
compared to the other words and was dependent on
task (target word elicited by instruction or question).
Duration of Bev was significantly longer following
an instruction to stress the word than when occurring
in a natural context.
114
LOVES
Coefficient of
SD
Variation
Coefficient
of Variation
BOB
Mean
SD
Coefficient of
Variation
18.6
4.4
21.7
132.65
66.12
370.99
32.47
3.23
31.37
24.5
4.9
8.5
26.0
8.4
27.1
135.55
63.29
364.04
32.691
5.10
34.4
24.1
8.1
9.4
www.seipub.org/aiaas
Discussion
This study suggested that the acoustic parameters in
correlation with stress are dependent upon the
method used to elicit the production. A significant
difference was found between the speech productions
following elicitation of a production when given an
www.seipub.org/aiaas
116
www.seipub.org/aiaas
Y,
Pihko
E,
and
Pulvermuller
F.
(2005).
1000: 288-292.
Muerer, E.M., Wender, M.C.O., von Eye Corleta, H., and
Capp, E. (2004). Female suprasegmental
parameters
in
reproductive
age
and
speech
menopause.
28.
117