Professional Documents
Culture Documents
high
nominal
low
Fig. 2.1. Sound as a pressure wave. The peaks represent times when air molecules
are clustered, causing higher pressure. The valleys represent times when the air
density (and hence the pressure) is lower than nominal. The wave pushes against
the eardrum in times of high pressure, and pulls (like a slight vacuum) during times
of low pressure, causing the drum to vibrate. These vibrations are perceived as
sound.
Sound waves can be pictured as graphs such as in Fig. 2.1, where high-
pressure regions are shown above the horizontal line, and low-pressure regions
are shown below. This particular waveshape, called a sine wave, can be char-
acterized by three mathematical quantities: frequency, amplitude, and phase.
The frequency of the wave is the number of complete oscillations that occur
in one second. Thus, a sine wave with a frequency of 100 Hz (short for Hertz,
after the German physicist Heinrich Rudolph Hertz) oscillates 100 times each
second. In the corresponding sound wave, the air molecules bounce back and
forth 100 times each second.
2
The ear actually responds to sound pressure, which is usually measured in deci-
bels.
2.2 What Is a Spectrum? 13
The human auditory system (the ear, for short) perceives the frequency of a
sine wave as its pitch, with higher frequencies corresponding to higher pitches.
The amplitude of the wave is given by the dierence between the highest and
lowest pressures attained. As the ear reacts to variations in pressure, waves
with higher amplitudes are generally perceived as louder, whereas waves with
lower amplitudes are heard as softer. The phase of the sine wave essentially
species when the wave starts, with respect to some arbitrarily given starting
time. In most circumstances, the ear cannot determine the phase of a sine
wave just by listening.
Thus, a sine wave is characterized by three measurable quantities, two of
which are readily perceptible. This does not, however, answer the question of
what a sine wave sounds like. Indeed, no amount of talk will do. Sine waves
have been variously described as pure, tonal, clean, simple, clear, like a tuning
fork, like a theremin, electronic, and ute-like. To refresh your memory, the
rst few seconds of sound example [S: 8] are purely sinusoidal.
As sound (in the physical sense) is a wave, it has many properties that are
analogous to the wave properties of light. Think of a prism, which bends
each color through a dierent angle and so decomposes sunlight into a family
of colored beams. Each beam contains a pure color, a wave of a single
frequency, amplitude, and phase.3 Similarly, complex sound waves can be
decomposed into a family of simple sine waves, each of which is characterized
by its frequency, amplitude, and phase. These are called the partials, or the
overtones of the sound, and the collection of all the partials is called the
spectrum. Figure 2.2 depicts the Fourier transform in its role as a sound
prism.
This prism eect for sound waves is achieved by performing a spectral
analysis, which is most commonly implemented in a computer by running a
program called the Discrete Fourier Transform (DFT) or the more ecient
Fast Fourier Transform (FFT). Standard versions of the DFT and/or the FFT
are readily available in audio processing software and in numerical packages
(such as Matlab and Mathematica) that can manipulate sound data les.
3
For light, frequency corresponds to color, and amplitude to intensity. Like the
ear, the eye is predominantly blind to the phase.