Professional Documents
Culture Documents
1. Introduction
Rhythmic scansions have a specific tempo, that is, they develop with a given num-
ber of stressed (prominent) syllables per unit of time. The shorter the time inter-
val between two stressed syllables, the faster the tempo. The ratio of stressed to
unstressed syllables the density can create a more or less emphasised speech
style. Finally, we pay attention to the variations in speech rate, i.e. the number of
syllables per unit of time (most often, per second).
It is commonly recalled that rhythmic scansions or speech rate do not have
fixed signalling value as grammar does although they have at their base an
iconic value (Auer et al. 1999:153). Our main research questions are the following:
. Some authors employ the term of sound symbolism (see for example Hinton et al. 1994).
. Fischer (1999:131133) uses the term (phonological) iconicity instead of (sound) sym-
bolism. He wants to avoid any confusion with Peirces distinction between symbol (arbitrary
and conventional) and icon (motivated, like: image, diagram or metaphor).
Direct association - Indirect association
Sound symbolism
Imitative Corporeal Synesthetic Conventional
Hinton et al. (1994)
Iconicity
Fischer (1999) Auditory Articulatory Associative I Associative II
Crystal defines iconicity by saying that individual sounds are thought to reflect
properties of the world, and thus have meaning (Crystal 1987:174). As soon as
we consider the iconic value of rhythm (and not of individual vowels or conso-
nants), this definition has to be adapted. Furthermore, we consider how iconicity
functions in interactive speech as opposed to its meaning in the language system
(see for example the phonesthemes illustrated in Section1).
In the line of interactional research on interpretation, we consider that indi-
viduals engaged in a verbal encounter do not just rely on literal or denotational
meaning to interpret what they hear (Gumperz 2001:126). More often than not,
they build inferences on their expectations about what is to follow, on cultur-
ally specific background knowledge, and on particular cues like intonation and
r hythmic patterns.
We think that rhythm in speech contributes to the construction of mean-
ing in interaction and we try to explain how, by looking over three proposals,
namely the dual encoding model (Fnagy 1993, 1999), the experiential approach
to discourse interpretation (Auchlin 2008) and the contextualisation process
(Gumperz 1992, 2001).
Fnagy assumes a twofold encoding of any utterance, first grammatical and
then iconic. Any utterance generated by the Grammar goes through the so-called
Modulateur (modulator or distorter), which grafts secondary messages on the
utterance (Figure 1). Iconicity belongs to that second coding.
Modulateur
Rcepteur
Emetteur
e
air
m
Message Message
m
ra
primaire complete
G
Figure 1. Model of double coding by Fnagy (1983:229)
In other words, vocal and prosodic variations (of intonation, register or rhythm)
have a significant influence on how one will experience a speech event. Prosody
is interpreted according to its temporal synchronisation (or desynchronisation)
with other communicative aspects (such as turn-taking, utterance segmenta-
tion, emphasis, etc.). Prosody is iconic in that it evokes the degree of effort the
speaker puts in its speech production and the way he converges (or not) with its
co-interactant (see Auer et al. 1990; Auchlin & Simon 2004).
Although Gumperz notion of contextualisation has more to do with indexi-
cality than with iconicity, its contribution to the understanding of iconic meaning
is essential.
. Prosodies make available and sensitive, numerous pieces of information related to the
speaker, to his motivational state concerning the fact that he speaks (to X), or the subject
he evokes (in front of X) [] By this, prosodies invite to reconsider the linguistic meaning,
taking into account the temporal experience, i.e. the time needed for the meaning elaboration.
The vocal-prosodic efforts seem to be essentially dedicated to cause the emergence of shared
meaning experience. (our translation).
From what precedes, we retain the following key-ideas that connect rhythm to
iconic meaning:
4. The span of rhythmic phenomena may vary from a very local focus (on a
single word) to a global one (when foregrounding an utterance or a sequence).
In the next sections, we discuss the three following types of rhythmic iconicity:
iconicity created by global (Section 4) or local (Section 5) rhythmic phenom-
ena and rhythm interpreted as a contextualisation cue, with an iconic meaning
potential (Section6).
. That is, a high ratio of stressed to unstressed syllables, stressed syllables being indicated by
a primary accent () within the transcription. In this example, density ranges from 1/1 (ceux)
to 2/5 (que les lecteurs), with a mean ratio of 1/2.5.
. Nonproportionalfont is used for analyzing rhythmically regular passages. We
adopt the convention from Auer et al. (1999:ixxi), as it has been adapted for French by Simon
and Grobet (2005:4). Temporal intervals are delimited by a slash (/). The closer they are, the
shorter the time interval from one stressed syllable to the other.
. Experiments reported by Couper-Kuhlen (1993:24) and Auer et al. (1999:5154) suggest
that a difference in duration of 20% between two temporal intervals can be tolerated without
disturbing the perception of isochrony.
. Resulting in a perception of 6 well-defined groups: (il arrive) (que les lecteurs) (prfrent)
(ceux) (qui les laissent) (dormir).
i la riv k le ze lk t pre fr
il arrive 521 ms que les lecteurs 524ms prfrent 369m
TVB (578.38581.58s) Prosogram v2.4e
s ki le ls dr mi m
369m ceux 310ms qui les laissent 160ms dormir mais
TVB (578.58584.78s) Prosogram v2.4e
. Prosogram does not provide information about voice quality variation because the vocal
folds vibration is not regular enough to produce F0.
. Prosogram (Mertens 2004) is a stylised transcription of prosody based on the processing
of time, intensity and F0. Prosogram displays the F0 contour calculated by Praat software
(Boersma & Weeninck 2009) as discrete sequences of stylised pitch (thick lines). The styl-
isation is an estimation of pitch contour by human listeners based on perception studies.
The evolution of speech over time is represented on the top of Prosogram: from one vertical
mark to another, there is a 100 milliseconds (ms) span. For instance, in Prosogram 1, the first
pause duration is 521ms and the duration of preceding syllable [iv] is approximately 400 ms.
Prosogram displays that the syllable [iv] has falling pitch contour.
What is iconic in this passage? On the one hand, one can notice the rhythmic
and intonative imitation of falling asleep, with a marked intonation of finality
(falling movement). On the other hand, one can simultaneously perceive the voice
quality signalling speakers irritation about the fact that voters are actually put to
sleep. Therefore, we simultaneously have the iconic mimic of a situation (voters
falling asleep) and information about the attitude of the speaker (who is critical of
this state of affairs).
Example (2) illustrates iconicity conveyed by means of variation in speech rate.
The speech rate of three consecutive discourse segments displaying great variation
and contrast was measured (number of syllables by second) and compared.
(2) draper12 (to lose control)
a.
porter le fer dans la plaie et investiguer
toutcrin [4.9syl/sec]
turning the knife in the wound and to
investigate
unduly
b.
quitte draper quelques fois [6.8syl/sec]
even if he lost control from time to time
c.
est devenu oui limperator le matre absolu
dujournal [4syl/sec]
he became yes the emperor the supreme master of
the newspaper
In this example, the journalist is commenting on the attitude of the editor in chief
of the French newspaper Le Monde, who is quitting his position. Segment (b) is
characterised by an increase of 40% in speech rate when compared to segment (a).
It stands out by an acceleration of the speech rate and by a lower intensity [65dB]
with respect to the average intensity (72dB) of the two surrounding segments. We
attribute an iconic value to this accelerated speech rate: the verb draper implies
a loss of control and is semantically related to speed: if someone loses control
(drape) it is often because of a high speed.
From the syntactical viewpoint, the fast segment is an interpolated clause. In
our corpus, interpolated clauses are usually realised with a lower register and a
faster speech rate,13 its purpose being to provide the quotation source. This is not
the case here: the interpolated clause is actually part of the quotation and functions
to characterise the quoted person as someone rapid (thereby, likely to do mistakes).
. The French verb draper is most often used in a moving machine context, for unexpected
(out of control, side) motion.
. See Gachet and Avanzi (2008) for the review of prosodic features of interpolated clause
in French.
The third example of iconic rhythm at a global level implies rhythmic slowing
down. The speaker mimics the discourse of an aged man in his seventies, namely
the French President Jacques Chirac.14
Fast speech rate (6syl/sec) in segment (a) is followed by a significant slowing down
(3.9syl/sec) in segment (b). The contrast between the two segments is responsible
for perceiving the second one as very slow.
A closer look at vowel duration and quality explains their contribution to the
global perception of an alteration in rhythm. Apart from the salient variation of
the speech rate, the semantic and phonetic contents of 4 words vitesse (speed),
vitalit (vitality), admirable (admirable) and art (art) lead to the interpre-
tation of the first and second segments respectively as vivacious/dynamic/rapid
versus passive/old/slow. The first two items contain a front close vowel [i] that is
associated (Ohala 1984; Fnagy 1983) with smallness, shrillness and rapid move-
ments. The first [i] of vitalit is 6ms long and the [i] of vitesse is 8ms, which is much
shorter than the duration of the [a] vowel in art or admirable. The second [a] of
admirable lasts for 20ms and the one of art takes up 30ms. The openness of the
vowel [a] is associated with largeness, gentleness, and with slow and heavy move-
ments. The voice quality of the vowels is modified as well: the first [i] of vitalit
is voiceless but nevertheless invested with a large amount of articulatory energy
while the [a] of art is pronounced with a creaky voice that signals a decrease in
breath and energy.
. The corpus contains data from 2004: at that time, Jacques Chirac was the French President
and Nicolas Sarkozy his Minister of Domestic Affairs.
Together, these elements speech rate, vowel duration and quality iconically
represent the contrast between two opposite persons: Nicolas Sarkozy as some-
one full of energy and Jacques Chirac as someone who runs at half speed, who is
exhausted.
This illustrates how rhythmic phenomena (acceleration or deceleration of
speech rate or reduction resulting from joint temporal and intonative phenomena)
can receive an iconic interpretation when the temporal movement mimics a part
of the lexical content of a word or an utterance.
The two following examples illustrate rhythmic iconicity located on a single word
or a phrase whose sound feature imitates an object from everyday life. We will see
that rhythm can be obtained in a different way, not only by speech rate as in the
previous examples, but also by repetition of the same melodic pattern.
The syntagm norme vague (huge wave) in Example (4) is what Fnagy
(1999:14) calls self-representation of linguistic units: it is a description and
mimesis of its proper content at the same time. The rhythmic pronunciation of
the items norme vague (huge wave) and gonfle (growing bigger) iconically
represents threatening.
(4) norme vague (huge wave)
que lnorme vague qui gonfle en Asie
that the huge wave that is growing in Asia
k l e n rm v a g k i g f l n a z i
que l norme vague qui gonfle en Asie
PONT (453,32456,02s) Prosogram v2.4e
Figure 3. Prosogram of Example (4). Grey lines highlight the iconic melody (shaped like a
wave) on the phrases norme vague and gonfle en Asie
a
w
l de m kat ke tj ma i k l ba ni
le dmocrate chrtien Jean Marie Colombani
TVB (76.2279.22s) Prosogram v2.4e
b
l dwi pl nl pi l m da so
ulturel Edwy Plenel 364ms ont pris le monde d assaut 884ms
TVB (82.2285.22s) Prosogram v2.4e
c
il ze t ta li je sil di v
884 ms ils taient allis 579ms s ils divorcent
TVB (85.2288.22s) Prosogram v2.4e
Figure 4. Prosogram of Example (5). Letters (a, b, c) mark the beginnings of the three
segments. Rectangles indicate the F0 declination, semicircles indicate the bridge accent
andvertical arrows indicate final accents
. See the duration between successive beats, as indicated in the transcription (in ms).
Contextualisation cues do not have a clearly identified symbolism, but they bring
the listener to draw inferences about what is said. Those cues are related simulta-
neously to the signalling of reported speech and to the signalling of the speakers
attitude.
In Example (6) soixante-deux pour cent (sixty two percent) the variation of
speech rate and accentuation is the cue for understanding the relationship between
the radio journalist and the written text that he is quoting. Sixty two percent is
the result of an opinion poll in which French citizens were asked whether they
would approve the European Constitution. The percentage is repeated three times,
each occurrence having its own prosodic and rhythmic colour.
Segment (a) has a speech rate of 4.2syll/sec; the rhythm is regular with a progres-
sive decrease in interval duration. The regularity of the rhythm is also ensured by
a repetition of soixante-deux pour cent, with a short but unusual extension of the
schwa vowel [] in soixante [swast] and by the insistence accent on deux. In seg-
ment (b), speech rate accelerates by 47% [6.2syll/sec] and immediately decelerates
by 10% [5.5syll/sec] on the last segment.
a
s wa s t d p ur s s wa s t d p urs
soixante deux pour cent soixante deux pour cent
TVB (596.9.3599.93s) Prosogram v2.4e
s d e s c s j a l i s t s p a t i z
cent 461ms des socialistes 350ms sympathisants 191msles
TVB (599.93602.93s) Prosogram v2.4e
b
le s p a t i z s c s j a l i s t p a d e m i l i t k iv v c t e l p rm
es sympathisants socialistes pas des militants 268ms quivont voter leremier
TVB (602.93605.93s) Prosogram v2.4e
mjed e s b s w a s t d p ur s s f a v c r a
premier dcembre 283ms soixante deux pour cent sont favorables
TVB (605.93608.93s) Prosogram v2.4e
Figure 5. Prosogram of Example (6). Vertical arrows indicate accented syllables responsible
for a regular rhythm perceived on the segment (a)
The fastest segment (b) is an interpolated clause (like in the example draper)
integrated in the quotation. The interpolated clause is a reformulation of the
quotations content and is realised by juxtaposition: les sympathisants socialistes
pas des militants. This reformulation makes clear who the quotation is talking
about. The syntactic parallelism of reformulation (repetition and juxtaposition)
is superimposed by the voice and prosody that indicates that the speaker is agi-
tated about the result of opinion poll. The syntactic parallelism accompanied by
the vocal comment creates what we call prosodic reformulation.16 It adds a
further meaning to the utterance. In other words, reformulation is enhanced by
prosody.
Example (7) illustrates a switch from the introductory part of the reported
speech to the quoted direct speech. The speech rate on the introductory part
is fast [7.6syll/sec] and slows down on the reported direct speech [4.5syll/sec].
The quotation refers to a legal text about the reduction of working time from 40
to 35 hours per week in France that raised an animated discussion in the media
in 2004.
a
e l fi ga rod si te pa tri k lje ki pre zi d y m pe
et le Figaro de citer euh Patrick Ollier quiest prsident UMP
notretxt (02.70s) Prosogram v2.4e
b
pe dla k mi sj de za f re k n mik d la s ble ki di
UMP de la commission des affaires conomiques de l Assemble qui dit 572ms
notretxt (2.705.40s) Prosogram v2.4e
m n tr tk st n tr tk st n ra o k
72 mais notre texte notre texte n' aura aucun
notretxt (5.408.10s) Prosogram v2.4e
o k ka rak tr k tr
aucun aucun caractre contraignant
notretxt (8.1010.80s) Prosogram v2.4e
Figure 6. Prosogram of Example (7). Rectangles indicate the vowel lengthening and dynamic
tones on aucun
mean F0 value is 90Hz and the F0 register ranges from 60 to 224Hz; the mean
intensity is 69dB. Segment (b) is 40% slower [4.5syll/sec] with a much higher
F0 mean (207Hz) and an expansion of the F0 register (66368Hz); the intensity
increases slightly (5dB).
The repetition (notre texte notre texte aucun aucun) is associated with
insistence, with ample and dynamic tones on aucun [ok] and extra-lengthening
of the syllable [k] (447ms and 550 ms). The vocal-prosodic posture the lower
extended vowel [] refers iconically to largeness, meaning big absence
because of the semantic content of aucun (not any). Moreover, the utterance
starts with an argumentative mais (but) and the prosody of the repeated seg-
ment aucun aucun supplements and completes the argumentation. The varia-
tion in voice quality indicates that the speaker takes over the voice of the quoted
person, whether he imitates his physical features or not. For these reasons, the
vocal and prosodic representation of the quoted person can be a cue for irony.
As a matter of fact, the imitator often seeks for parody and caricature, and this
reveals his viewpoint.
7. C
onclusion
References
Auchlin, A. 2008. Du phonostyle lethos, les prosodies comme interfaces entre sens et corps.
Paper presented at the 3rd International Symposium on Discourse Analysis: Emotions,
Ethos and Argumentation, April 14, in Belo Horizonte, Brazil.
Auchlin, A. & Simon, A.C. 2004. Gabarits prosodiques, empathie(s) et attitudes. Cahiers de
lInstitut de Linguistique de Louvain 30(13): 181206.
Auer, P., Couper-Kuhlen E. & Di Luzio A. 1990. Isochrony and uncomfortable moments in
conversation. In Learning, Keeping and Using Language, Vol. 2, M.A.K. Halliday, J.Gibbons
& H. Nicholas (eds), 269281. Amsterdam: John Benjamins.
Auer, P., Couper-Kuhlen E. & Mller F. 1999. Language in Time. The Rhythm and Tempo of
Spoken Interaction. Oxford: OUP.
Boersma, P. & Weeninck D. 2009. Praat. Doing Phonetics by Computer (Version 5.1.20). http://
www.praat.org/ (31 October 2009).
Bolinger, D. 1965. The atomination of meaning. Language 41: 555573.
Couper-Kuhlen, E. 1993. English Speech Rhythm: Form and Function in Everyday Verbal Interac-
tion [Pragmatics & Beyond New Series 25]. Amsterdam: John Benjamins.
Couper-Kuhlen, E. 2003.Intonation and discourse: Current views from within. In The Hand-
book of Discourse Analysis, D. Schiffrin, D. Tannen & H. Ehernberger Hamilton (eds),
1334. Oxford: Blackwell.
Crystal, D. 1987. Sound symbolism. In The Cambridge Encyclopedia of Language, 174175.
Cambridge: CUP.
Fischer, A. 1999. What, if anything, is phonological iconicity? In Form Miming Meaning
[Iconicity in Language and Literature 1], M. Nnny & O. Fischer (eds), 123134. Amsterdam:
John Benjamins.
Fnagy, I. 1980. Laccent franais: Accent probabilitaire. In Laccent en franais contemporain,
I. Fnagy & P.R. Lon (eds). Studia Phonetica 15: 123233.
Fnagy, I. 1983. La vive voix. Essais de psycho-phontique. Paris: Payot.
Fnagy, I. 1999. Why iconicity? In From Miming Meaning [Iconicity in Language and
Literature 1], M. Nnny & O. Fischer (eds), 336. Amsterdam: John Benjamins.
Gachet, F. & Avanzi M. 2008. La prosodie des parenthses en franais spontan. Verbum 31(1):
5384.
Gumperz, J.J. 1992. Contextualization and understanding. In Rethinking Context: Language as
an Interactive Phenomenon, A. Duranti & C. Goodwin (eds), 229252. Cambridge: CUP.
Gumperz, J.J. 2001. Inference. In Key Terms in Language and Culture, A. Duranti (ed.), 126128.
Oxford: Blackwell.
Gussenhoven, C. 2002. Intonation and interpretation: Phonetics and phonology. In Speech
Prosody 2002: Proceedings of the First International Conference on Speech Prosody,
4757. A ix-en-Provence: ProSig and Universit de Provence Laboratoire Parole et
Langage.
Hinton, L., Nichols J. & Ohala J.J. (eds). 1994. Sound Symbolism. Cambridge: CUP.
Mertens, P. 2004. The prosogram: Semi-automatic transcription of prosody based on a tonal
perception model. In Proceedings of Speech Prosody 2004, B. Bel & I. Marlien (eds). http://
bach.arts.kuleuven.be/pmertens/prosogram/