Professional Documents
Culture Documents
40 (2007) 991–1001
Strobovideolaryngoscopy
and Laboratory Voice Evaluation
Scott M. Kaszuba, MD, C. Gaelyn Garrett, MD*
Department of Otolaryngology, Vanderbilt Voice Center, Vanderbilt University,
7302 Medical Center East, South Tower, 1215 21st Avenue South, Nashville,
TN 37232-8783, USA
Strobovideolaryngoscopy
True vocal fold vibration is a complicated physiologic function, the ob-
servation of which far outreaches the visual capabilities of the human eye
with a normal light source. The human adducted vocal folds cyclically
* Corresponding author.
E-mail address: gaelyn.garrett@mcmail.vanderbilt.edu (C.G. Garrett).
0030-6665/07/$ - see front matter Ó 2007 Elsevier Inc. All rights reserved.
doi:10.1016/j.otc.2007.05.006 oto.theclinics.com
992 KASZUBA & GARRETT
open and close between 60 to 1500 times per second, depending on the pho-
natory pitch. Stroboscopic light visually makes the vocal fold vibrations ap-
pear to slow down so that the impression of vocal fold vibrations can be
observed and processed. Stroboscopy capitalizes on the inherent optic prop-
erties of our visual organ and exploits the limitations of observation of the
unaided eye. According to Talbot’s law, the human eye can perceive no
more than five distinct images per second. Each image therefore lingers
on the retina for approximately 0.2 seconds after exposure. Stroboscopic
flashes make the vocal folds appear to slow down by advancing the light
pulse through successive glottal cycles in percentage increments. Individual
still images are recorded at selected points from sequential vibratory cycles
and the human eye automatically fills in the missing pieces by fusing the im-
ages into what it sees as motion. This apparent motion is attributable to
a phenomenon called persistence of vision. Additional instrumentation
added to the stroboscopic light can facilitate the recording and documenta-
tion of the perceived vocal fold vibratory properties [2–4]. Strobovideolar-
yngoscopy as a whole allows the physician to observe important vocal
fold activities, which allows appropriate diagnostic decision making.
A brief historical overview allows full appreciation of the evolution of stro-
bovideolaryngoscopy. Indirect laryngoscopy was first described, but not yet
popularized, by Bozzini in 1806 when he constructed an angled speculum
with a mirror insert that was meant to examine various body cavities, includ-
ing the human larynx. It was not until 1854 that indirect laryngoscopy gained
wider acceptance when Manuel Garcia, a Spanish-born voice teacher who had
a limited gag reflex, first visualized his larynx with a small dental mirror using
sunlight as a light source. In 1895, Oertel followed suit and was credited with
creating the first laryngostroboscope. His device consisted of a variable-speed
perforated disc that was interspersed between a light source and the practi-
tioner’s head mirror [5,6]. Since that time, strobovideolaryngoscopy has
evolved into finely controlled, high-intensity light sources with fiberoptic endo-
scopes or distal camera scopes coupled with analog or digital recording
devices.
Strobovideolaryngoscopy in current clinical practice relies on a combi-
nation of several instruments: a stroboscopic light source, an endoscope,
a microphone, a video camera, a recording device, and a video monitor.
Stroboscopy is best performed in conjunction with video recording and ar-
chiving for complete clinical review and documentation. The examination
may be performed by transnasal flexible laryngoscopy with distal chip tech-
nology or perorally with a rigid angled telescope. Video cameras are now
available in single-chip and three-chip versions. A single-chip camera uses
a single array of light-sensing elements known as charge couple devices
(CCDs). Three-chip cameras use a dichroic prism, which divides the incom-
ing images into the three primary colors and offers more accurate color and
higher resolution. Analog or digital recording technologies are then used for
image capture, documentation, and reproduction [3,7].
STROBOVIDEOLARYNGOSCOPY AND LABORATORY EVALUATION 993
The illusion of apparent slow motion of the vibrating vocal folds during
strobovideolaryngoscopy evolves from the collection of several sequential
still images of the folds at selected time intervals during repeated glottal cy-
cles at a given vibratory frequency. This illusion is called the stroboscopic
glottal cycle and can be of any desirable duration. In addition, the strobo-
scopic flashes can be emitted either at the same frequency as phonation,
known as synchronization, or at a slight variation of the frequency, known
as asynchronization. This feature of stroboscopy is producible through tech-
nological communication between the microphone and the strobe light
source. By synchronizing the stroboscopic flashes to the fundamental fre-
quency of the vibrating vocal folds, a perceptual stopped image or standstill
of the vocal folds is produced. An asynchronized mode is generated by cal-
ibrating the stroboscopic flashes at a consistent frequency slightly different
than the produced phonatory fundamental frequency. This variation allows
successive light impulses to strike at different phases of the vibratory cycle
and produce a video image of one apparent cycle of vibration actually ob-
tained from different portions of several cycles. Another option, which al-
lows the examiner to manipulate the apparent glottal cycle by operation
of a rocking foot pedal, furthers the stop-action capability of the strobovideo-
laryngoscopy system. This feature is particularly useful when the exact loca-
tion of the vocal fold lesion is being determined in relation to movement of
the upper and lower lips during an approximation phase of the cycle [2,3,7].
The strobovideolaryngoscopic examination is most clinically useful to the
practitioner when a standard protocol is used for the acquisition of the data
and its interpretation. Phonatory tasks during the examination should be
performed at low, normal, and high pitches and in the range of the speaking
or singing problem area, if known. Once recorded, a standardized approach
to the interpretation of the examination allows consistency in diagnosing
and comparing laryngeal pathology. Once the initial examination is com-
pleted and recorded, additional repeat testing at predetermined time inter-
vals allows for evaluation of response to treatment. Although there is
arguably no one gold standard for the interpretation of a strobovideolar-
yngoscopic examination, several aspects of the examination are often rated.
The specific features of the vibratory pattern of the true vocal folds often
addressed include symmetry, periodicity, mucosal wave ratings, amplitude
of vibration, shape and contour of the glottal margin, and glottic closure.
Particular attention is also given to any adynamic segments and the presence
or absence of vertical phase difference [2,6,7]. Vocal fold symmetry remains
intact in the absence of abnormalities along the glottal margin. Periodicity
refers to the regularity of the vibratory cycles with the idea that normal vo-
cal folds should vibrate in mirror image to each other and vibrate the same
with successive cycles. Aperiodic vibrations may prohibit the synchroniza-
tion of the strobe light. The mucosal wave is generally described as the trav-
eling wave across the vocal fold superior surface from medial to lateral.
Abnormalities of the mucosal cover, including the epithelial layer or
994 KASZUBA & GARRETT
superficial lamina propria, are the most common causes of mucosal wave re-
duction. The mucosal wave should be differentiated from the vertical phase
difference, which is created normally by the presence of an upper lip and
lower lip at the medial vibratory vertical closing surface. Amplitude of vi-
bration is a relative feature of the mucosal wave judged by the trained ob-
server as reduced, normal, or excessive. Normal variations in amplitude
occur with changes in vocal intensity. Glottal closure is described as com-
plete; incomplete with anterior, mid, or posterior glottal chinks; and hour-
glass, usually secondary to mid-vocal fold lesions.
From a clinical standpoint, strobovideolaryngoscopy has proved to be
a valuable tool for the diagnosis of laryngeal pathology given the detailed
physical examination it provides of the vocal tract and the vibratory margin
of the vocal fold. Stroboscopic features of nodules, for example, often in-
clude symmetric but reduced amplitude of vibration, maintenance of period-
icity, intact mucosal waves, and hourglass closure. Vocal fold polyps, which
are frequently unilateral, have asymmetric vibration and variable periodicity
depending on the size and shape of the polyp. Mucosal wave can be absent
because of mass effect with large polyps or intact with broader-based polyps.
The wave is generally intact on the contralateral side. Glottic closure is un-
derstandably asymmetric. Cysts within the vocal fold lamina propria can
have the greatest adverse effect of the nonneoplastic lesions on the vibratory
characteristics. Mucosal wave is frequently absent and aperiodic if present.
A change in diagnosis and altered assessment of vocal pathology based on
the strobovideolaryngoscopic findings can occur in 10% to 30% of cases
[4,8]. Furthermore, abnormal findings have been reported in up to 58% of
healthy, asymptomatic professional singers stressing the importance of
screening examinations for certain populations of patients [9].
Strobovideolaryngoscopy is not a test to be done in the absence of other
clinical data. It is only a valuable complement to a thorough vocal history
and physical examination. The technique inherently suffers from the limita-
tion of being a composite recording made from several glottal cycles, in con-
trast to high-speed photography or high-speed digital video, which records
an entire vibratory cycle and provides detailed cycle-to-cycle variations.
Even with this limitation, strobovideolaryngoscopy remains an invaluable
tool in the diagnostic armamentarium of the voice specialist.
Glottography
Glottography is a general technique that monitors the vibration of the
vocal folds by the transmission of a probe signal from one side of the larynx
to the other. The probe signal itself can be directed in either a vertical plane
or horizontal plane. Current probing signals most commonly used in glot-
tography include electrical current flow, light transmission, and ultrasonic
waves. The time variation of the glottis combined with laryngeal tissues
STROBOVIDEOLARYNGOSCOPY AND LABORATORY EVALUATION 995
Electroglottography
Electroglottography (EGG) is a technique based on the principle that hu-
man tissue can conduct an electrical current with laryngeal tissues being
a moderately good conductor of electricity. It is performed by placing two
electrodes above the thyroid laminae on the external neck and measuring
the impedance between them with a high-frequency, low-current signal.
Ohm’s law states that a current must flow through a system if its resistance
is to be measured. Based on this law, when the vocal folds are touching
a greater current flows through them compared with when they are open.
The electroglottographic signal represents the contact area between the
two vocal folds and can be used to determine when the vocal folds are closed
and how fast they are closing [10–13]. This characteristic contrasts with pho-
toglottography (PGG), which gives information about the separation of the
vocal folds and little information about the nature of vocal fold contact.
Various manufacturers provide instrumentation that produces, records,
and displays the electroglottographic signal. Several authors over the past
two decades have commented on the shape of the EGG waveform as it relates
to the underlying physiology of vocal fold vibration. Interpretation of EGG
waveforms remains controversial, however, especially as it relates to analyzing
vocal fold pathology. When used in conjunction with other laboratory tech-
niques, the interpretation of the EGG display becomes more reliable. For ex-
ample, synchronized strobovideolaryngoscopy and EGG have been shown to
be an effective tool for verifying information from the EGG waveform with
stroboscopic images [12,14]. Also, recent research is moving toward standard-
ization of normal EGG measurements with the goal of allowing this test to
serve as a reference for the diagnosis and follow-up of dysphonic patients [15].
There are limitations of EGG. The most obvious one for the voice specialist
is that it cannot be used with all dysphonic subjects. Patients who have a uni-
lateral vocal fold paralysis have a considerably diminished or absent signal
because of lack of good contact of the vocal folds. Obese or thick necks
may impede proper placement of the electrodes or hinder the electrical current
resulting in a poor EGG tracing. Finally, severe hoarseness may render
996 KASZUBA & GARRETT
laryngeal tissue irritable and passing an electrical current through this envi-
ronment may produce an undesirable physiologic response [10,13].
Photoglottography
Photoglottography is a technique that estimates glottal area during pho-
nation. The principle of PGG is based on the concept that the glottis may
act like a shutter through which light can pass in proportion to the degree
of opening of the vocal folds. Light is usually directed transnasally from
above the glottis and is detected by an optoelectronic device over the skin
of the trachea immediately beneath the vocal folds. The external photosen-
sor then converts the light intensity absorbed into electric voltage, which can
be recorded and converted into a graphic display. The direction of the light
path during the study has no impact on the ability to record the PGG signal;
therefore the light source may be placed above or below the glottis or on the
external neck with the photosensor in the opposite complementary position.
Typically, for the best functional examination with the additional advantage
of simultaneous laryngeal observation, a transnasal flexible laryngoscope is
used as the light source with the photosensor placed externally on the neck.
PGG gives some clinical data during the open phase of phonation with
two common measurements routinely obtained. The speed quotient mea-
sures the symmetry of the opening and closing parts of the open phase,
and the open quotient is the time of the open phase of the vocal folds di-
vided by the total period of vibration. Some problems may exist with the
validity of the quantitative information obtained from this technique. These
are most often believed to be attributable to several extrinsic factors, includ-
ing inability to standardize the amount of light projected on the larynx,
changes in light-transmission characteristics of the glottis because of its ver-
tical movement during phonation, and volume changes of the hypopharynx
and supraglottis during different vowel productions. Overall, the PGG
waveform is considered complementary to the EGG signal [10,11,13].
Ultrasound glottography
Ultrasound glottography (UGG) is a technique in which ultrasonic waves
are constantly applied across the laryngeal area of the neck during phonation.
The border between the vocal fold surface and the glottal air is determined by
the difference in acoustic impedance between two media (air and soft tissue).
Like all ultrasound studies, it is based on the frequency shift produced when
a continuous ultrasonic beam is reflected back from or transmitted through
a tissue medium. In UGG, a narrow-beam ultrasound transducer is placed ex-
ternally on one side of the neck near the larynx with a receiver on the other
side. The ultrasonic signal is aimed at the air tissue interface of the glottis.
STROBOVIDEOLARYNGOSCOPY AND LABORATORY EVALUATION 997
Videokymography
Videokymography is a laboratory technique that was developed as
a means of using television technology to visualize real-time vibratory acti-
vities of a small area of the glottis. This visualization is accomplished by us-
ing a line scan camera that is capable of limiting its entire field of view and
scanning of the endoscopic image to a rapid repetition of a single line. Each
new scan of the same line is stacked on top of the others from superior to
inferior so that a screen image is built up with time represented in the
vertical direction. The line scan camera therefore records a small area of
the vocal fold in a real-time fashion while it vibrates and allows for subtle
aperiodic irregularities or phase asymmetries to be observed and doc-
umented. A major shortcoming of this technique is that any movement
of either the larynx or endoscope during signal acquisition changes the locus
being observed. Also, the line image produced is not a complete image of the
larynx. Some training is also required for interpretation of the examination
results. Although still regarded as mainly an experimental technique for
laboratory voice testing, new generation digital videokymographic systems
are currently being developed in hopes of becoming an important tool for
routine clinical laryngeal examination [10,16,17].
Summary
Laboratory and strobovideolaryngoscopy voice evaluation are important
parts of the clinical work-up of the dysphonic patient. When selected appro-
priately with appreciation of their limitations, the techniques discussed
afford the voice specialist the opportunity to make informed diagnostic
decisions and improve the overall quality of care delivered.
References
[1] Rosen CA, Murry T. Diagnostic laryngeal endoscopy. Otolaryngol Clin North Am 2000;
33(4):751–8.
[2] Bless DM, Swift E. Stroboscopy: new diagnostic techniques and applied physiology. In:
Fried MP, editor. The larynx: a multidisciplinary approach. 2nd edition. St. Louis (MO):
Mosby; 1996. p. 81–100.
[3] Cleveland TF. Principles of stroboscopy. In: Ossoff RH, Shapshay SM, Woodson GE, et al,
editors. The larynx. 1st edition. Philadelphia: Lippincott Williams and Wilkins; 2003.
p. 71–6.
STROBOVIDEOLARYNGOSCOPY AND LABORATORY EVALUATION 1001
[4] Sataloff RT, Speigel JR, Hawkshaw MJ. Strobovideolaryngoscopy: results and clinical
value. Ann Otol Rhinol Laryngol 1991;100:724–7.
[5] Karmody CS. The history of laryngology. In: Fried MP, editor. The larynx: a multidisciplin-
ary approach. 2nd edition. St. Louis (MO): Mosby; 1996. p. 3–11.
[6] Bless DM, Hirano M, Feder RJ. Videostroboscopic evaluation of the larynx. Ear Nose
Throat J 1987;66:289–96.
[7] Boehme G, Gross M. Stroboscopy. In: Boehme G, editor. Stroboscopy. 1st edition. London:
Whurr Publishers; 2005. p. 18–106.
[8] Casiano RR, Zaveri V, Lundy DS. Efficacy of videostroboscopy in the diagnosis of voice
disorders. Otolaryngol Head Neck Surg 1992;107(1):95–100.
[9] Elias ME, Sataloff RT, Rosen DC, et al. Normal strobovideolaryngoscopy: variability in
healthy singers. J Voice 1997;11(1):104–7.
[10] Baken RJ, Orlikoff RF. Laryngeal function. In: Boehme G, editor. Clinical measurement of
speech and voice. 2nd edition. San Diego (CA): Singular Publishing Group; 2000.
p. 393–440.
[11] Bless DM, Hirano M. Vocal fold vibration. In: Boehme G, editor. Videostroboscopic
examination of the larynx. San Diego (CA): Singular Publishing Group; 1993. p. 23–35.
[12] Karnell MP. Supplementary techniques. In: Boehme G, editor. Videoendoscopy from
velopharynx to larynx. San Diego (CA): Singular Publishing Group; 1994. p. 127–43.
[13] Colton RH, Casper JK. The voice history, examination, and testing. In: Boehme G, editor.
Understanding voice problems: a physiological perspective for diagnosis and treatment. 2nd
edition. Philadelphia: Lippincott Williams and Wilkins; 1996. p. 186–241.
[14] Anastaplo S, Karnell MP. Synchronized videostroboscopic and electroglottographic
examination of glottal opening. J Acoust Soc Am 1988;83:1883–90.
[15] Kania RE, Hartl DM, Hans S, et al. Fundamental frequency histograms measured by
electroglottography during speech: a pilot study for standardization. J Voice 2006;20(1):
18–24.
[16] Schutte HK, Svec JG, Sram F. First results of clinical application of videokymography.
Laryngoscope 1998;108(8):1206–10.
[17] Qiu Q, Schutte HK. A new generation videokymography for routine clinical vocal fold
examination. Laryngoscope 2006;116(10):1824–8.
[18] Kendall KA, Browning MM, Skovlund SM. Introduction to high-speed imaging of the
larynx. Curr Opin Otolaryngol Head Neck Surg 2005;13(3):135–7.
[19] Hertegard S. What have we learned about laryngeal physiology from high-speed digital
videoendoscopy? Curr Opin Otolaryngol Head Neck Surg 2005;13(3):152–6.
[20] Smits I, Ceuppens P, De Bodt MS. A comparative study of acoustic voice measurements by
means of Dr. Speech and computerized speech lab. J Voice 2005;19(2):187–96.
[21] Hillman RE, Heaton JT, Masaki A, et al. Ambulatory monitoring of disordered voices. Ann
Otol Rhinol Laryngol 2006;115(11):795–801.
[22] Heman-Ackah YD, Heuer RJ, Michael DD, et al. Cepstral peak prominence: a more reliable
measure of dysphonia. Ann Otol Rhinol Laryngol 2003;112(4):324–33.
[23] Wolfe V, Martin D. Acoustic correlates of dysphonia: type and severity. J Commun Disord.
1997;30(5):403–15.
[24] Eadie TL, Baylor CR. The effect of perceptual training on inexperienced listeners’ judgments
of dysphonic voice. J Voice 2006;20(4):527–44.
[25] Jacobson BH, Johnson A, Grywalsky C, et al. The voice handicap index (VHI): development
and validation. Am J Speech Lang Pathol 1997;6:66–70.
[26] Hogikyan ND, Sethuraman G. Validation of an instrument to measure voice-related quality
of life (V-RQOL). J Voice 1999;13(4):557–69.
[27] Murry T, Medrado R, Hogikyan ND, et al. The relationship between ratings of voice quality
and quality of life measures. J Voice 2004;8(2):183–92.
[28] Rosen CA, Lee AS, Osborne J, et al. Development and validation of the voice handicap
index-10. Laryngoscope 2004;114(9):1549–56.