Statistics and Partitioning of Species Diversity, and Similarity among Multiple Communities - Russell Lande

Nordic Society Oikos
Statistics and Partitioning of Species Diversity, and Similarity among Multiple Communities
Author(s): Russell Lande
Source: Oikos, Vol. 76, No. 1 (May 1996), pp. 5-13
Published by: Blackwell Publishing on behalf of Nordic Society Oikos
Stable URL: http://www.jstor.org/stable/3545743
OIKOS 76: 5-13. Copenhagen 1996
MINIREVIEW
Minireviews provide an opportunity to summarize existing knowledge of selected ecological areas, with special emphasis on current topics where rapid and significant advances are occurring. Reviews should be concise and not too wide-ranging. All key references should be cited. A summary is required.
Statistics and partitioning of species diversity, and similarity among multiple communities
Russell Lande
Lande, R. 1996. Statistics and partitioning of species diversity, and similarity among multiple communities. - Oikos 76: 5-13.
Species richness, Shannon information, and Simpson diversity are the three most commonly used nonparametric measures of species diversity. The sampling bias and variance of these measures differ greatly. Species richness may be seriously underestimated for even very large samples from a speciose community. The bias in species richness and Shannon information depends on unknown parameters of the species abundance distribution. An unbiased estimator exists only for Simpson diversity. Each of these measures is concave, so that the total diversity in a pooled set of communities exceeds (or equals) the average diversity within communities. The total diversity in a set of communities can therefore be partitioned into positive, additive components within and among communities, corresponding to α- and β-diversity. Partitioning Simpson diversity corresponds to an analysis of variance. The proportion of the total diversity found within communities provides a natural measure of similarity among multiple communities. The expected similarity among multiple random samples from the same community depends on the number of samples and on the underlying measure of diversity.
R. Lande, Dept of Biology, Univ. of Oregon, Eugene, OR 97403-1210, USA.
Measures of species diversity play a central role in
ecology and conservation biology (Whittaker 1960,
1972, Williams 1964, MacArthur 1965, Peet 1974,
Pielou 1975, Grassle et al. 1979, Magurran 1988, Noss
and Cooperrider 1994). The most commonly employed
measures of species diversity are species richness (number of species present in a community), and those based
on species frequencies involving Shannon information,
H, and Simpson concentration, λ.
Whittaker (1960, 1972) defined the important concepts of species diversity within and among communities (α- and β-diversity), and the total species diversity
in a set of communities (γ-diversity). Various measures
of species diversity among communities have been proposed, especially for patterns of species richness along
environmental gradients (Whittaker 1960, 1972,
MacArthur 1965, 1972, Pielou 1975, Allan 1975, Routledge 1977, Wilson and Mohler 1983, Wilson and
Shmida 1984, Magurran 1988).
Accepted 25 September 1995
Copyright © OIKOS 1996
ISSN 0030-1299
Printed in Ireland - all rights reserved
OIKOS 76:1 (1996)
A measure of species diversity should ideally be nonparametric and statistically accurate. It should be applicable to any community independent of species abundance distribution, and should have small bias and sampling variance in samples of moderate size. An important property for a diversity measure, first discussed by Lewontin (1972) for genetic diversity, is strict
concavity. This means that the total diversity in a pooled set of communities equals or exceeds the average diversity within communities, with equality only for identical communities. Lewontin partitioned the total diversity into a sum of the average diversity within communities and the diversity among communities. As we will see below, such an additive partition of diversity leads naturally to a measure of similarity among multiple communities.
Although considerable effort has been devoted to analyzing basic statistics of diversity measures, less attention has been given to theoretical or empirical partitioning of total species diversity within and among communities, and some of the most useful results are scattered through the genetic, statistical and ecological literature. Here I collect and extend results on the three most popular measures of species diversity, and critically evaluate their relative merits with respect to the above properties. Finally I discuss the relative merits of these measures for assessing similarity among communities and for extrapolating total species diversity in a region from samples within and among communities.
Nonparametric measures of species diversity
The most popular measures of species diversity are nonparametric and do not depend on any particular species abundance distribution, such as the log series (Fisher et al. 1943), broken stick (MacArthur 1957), or lognormal (Preston 1948) models, from which real communities will deviate to some extent.
Species richness
This is simply the number of species in a community (or sample) based on presence, rather than relative abundance.
Shannon information
Let p_i be the frequency of species i in a community. The average information per individual is
\[ H = -\sum_{i=1}^{S} p_i \ln p_i \qquad (1) \]
(Shannon and Weaver 1962). For a given number of species, S, the information reaches its maximum value, ln S, when all species are equally frequent in the community. Many authors use the exponential of Shannon information, e^H, which for a given number of species has a maximum equal to S.
Simpson concentration
The probability that two randomly chosen individuals from a given community are the same species, called the "concentration" by Simpson (1949), is
\[ \lambda = \sum_{i=1}^{S} p_i^2. \qquad (2) \]
1 − λ is the probability that two randomly chosen individuals from a given community are different species, also called the Gini coefficient, which can be used as a measure of diversity (Pielou 1969). The inverse of Simpson concentration, 1/λ, is often employed to measure species diversity, and for a given number of species, S, in a community it has a maximum value equal to S when all species are equally frequent.
Statistics of species diversity
If two or more samples are known to be different, either a priori because they come from different communities or after rejection of the null hypothesis of homogeneity among samples, it may then be appropriate to test the hypothesis that one community is more diverse than another, using some measure of species diversity. Hutcheson (1970) described a t-test for the significance of differences in the Shannon information measure of diversity in large samples. More generally, a resampling scheme such as the jackknife or bootstrap is suitable (Efron 1982, Magurran 1988). To assess the accuracy of commonly used nonparametric measures of species diversity, I here collect and derive approximate formulas for their sampling bias and variance.
Species richness
Let S be the actual number of species in a community composed of a very large (effectively infinite) number of individuals. The frequency of the ith species is p_i. Using a caret to denote a sample value, the number of species in a sample of size N is Ŝ, with mean
\[ E[\hat{S}] = S - \sum_{i=1}^{S} (1 - p_i)^N \qquad (3a) \]
(Grassle and Smith 1976). The rarefaction formula of Hurlbert (1971) gives the analog of (3a) for samples from a finite community. Species i is likely to be present in a sample of size N only if p_i N > 1. Hence in a highly diverse community the observed number of species, Ŝ, may greatly underestimate the actual number of species, S, because rare species frequently will be absent from even very large samples. Several methods have been developed for estimating the actual number of species by extrapolation from samples of various sizes (reviewed in Colwell and Coddington 1994). All extrapolation methods for estimating the actual number of species in a community make implicit or explicit assumptions about the form of the abundance distribution of rare species, and hence may be in error to an unknown extent.
The sampling variance of species richness is exactly (corrected from Strömgren et al. 1973)
\[ \mathrm{Var}[\hat{S}] = \sum_{i=1}^{S} (1 - p_i)^N \left[ 1 - (1 - p_i)^N \right] + 2 \sum_{i>j} \left[ (1 - p_i - p_j)^N - (1 - p_i)^N (1 - p_j)^N \right]. \qquad (3b) \]
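Formulas (3a) and (3b) can be evaluated directly once a set of species frequencies is assumed. The following is a minimal sketch, not part of the original analysis; the function names and the example frequencies are hypothetical, and the community is treated as effectively infinite, as in the text.

```python
def expected_richness(p, n):
    """Mean number of species in a random sample of n individuals, eq. (3a)."""
    return len(p) - sum((1.0 - pi) ** n for pi in p)


def richness_variance(p, n):
    """Exact sampling variance of species richness, eq. (3b)."""
    # Probability that each species is absent from the sample.
    absent = [(1.0 - pi) ** n for pi in p]
    # Single summation: variances of presence/absence of each species.
    var = sum(a * (1.0 - a) for a in absent)
    # Double summation over pairs i > j: covariances of presence/absence.
    for i in range(len(p)):
        for j in range(i):
            joint_absent = (1.0 - p[i] - p[j]) ** n
            var += 2.0 * (joint_absent - absent[i] * absent[j])
    return var


# Hypothetical four-species community (frequencies sum to 1).
frequencies = [0.5, 0.3, 0.15, 0.05]
print(expected_richness(frequencies, 20), richness_variance(frequencies, 20))
```

With two equally frequent species and a sample of two individuals, the sample contains one or two species with equal probability, so the mean is 1.5 and the variance is 0.25; the functions reproduce both values.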
The first (single) summation contains the variances of presence (1) versus absence (0) of each species. The second (double) summation contains the covariances of presence versus absence for pairs of species, which are negative and become negligible in comparison to the variances for sufficiently large sample sizes.
Shannon information
In a random sample of N individuals from a community, the diversity calculated using the estimated species frequencies p̂_i is denoted as Ĥ. For large N this has approximate mean and variance
\[ E[\hat{H}] \approx H - \frac{S - 1}{2N} \qquad (4a) \]
\[ \mathrm{Var}[\hat{H}] \approx \frac{1}{N} \left[ \sum_{i=1}^{S} p_i (\ln p_i)^2 - H^2 \right] \qquad (4b) \]
(Pielou 1966, Hutcheson 1970, Bowman et al. 1971). Note that the bias in Ĥ depends on the actual number of species, S, which is generally unknown. Hence an unbiased estimator of Shannon information does not exist. Accurate estimation of H requires sampling large numbers of individuals, with 2N much greater than the actual number of species in the community.
Simpson diversity
Pielou (1975) and Patil and Taillie (1982) draw an analogy between diversity, which measures the variety of categorical (species) identities, and variance, which measures the dispersion in quantitative measurements. It does not appear to have been previously noticed that the Simpson measure of species diversity within a community, 1 − λ, can be expressed precisely as a variance. If individual k of species i is denoted as a point in S-dimensional space, with coordinates (x_1k, ..., x_Sk) where x_ik = 1 and all other coordinates are 0, then the total variance per individual in species identity within the community is
\[ \sum_{i} E[(x_{ik} - p_i)^2] = \sum_{i} p_i (1 - p_i) = 1 - \lambda. \qquad (5a) \]
Because this is a variance it follows directly that in a random sample of N individuals from a community, estimates of the Simpson diversity, 1 − λ̂, calculated using the estimated species frequencies p̂_i, have mean exactly
\[ E[1 - \hat{\lambda}] = \left( \frac{N - 1}{N} \right) (1 - \lambda). \qquad (5b) \]
Thus an unbiased estimator of Simpson diversity is
\[ 1 - \tilde{\lambda} = \left( \frac{N}{N - 1} \right) (1 - \hat{\lambda}). \qquad (5c) \]
For large samples the approximate variance of Simpson diversity is
\[ \mathrm{Var}[1 - \tilde{\lambda}] \approx \frac{4}{N} \left[ \sum_{i} p_i^3 - \left( \sum_{i} p_i^2 \right)^2 \right] \qquad (5d) \]
which is the same as that for Simpson's (1949) unbiased estimator of concentration, λ̃.
Fig. 1 compares the sampling properties of these three diversity measures for communities of low or high diversity. It can be seen that the estimated species richness can greatly underestimate the actual number of species even for very large samples in a highly diverse community. The standard deviation of species richness also is rather large. Shannon information has a substantial bias for small samples, but the bias is small when 2N >> S. The standard deviation of Shannon diversity is much smaller than that for species richness. Simpson diversity, 1 − λ̃, is not only unbiased, but also has the smallest standard deviation.
Fig. 1. Estimated species richness, Ŝ, Shannon diversity, Ĥ, and Simpson diversity, 1 − λ̃, as a function of sample size, N, for hypothetical communities in which the species abundance distribution is lognormal with the standard deviation of the natural logarithm of species frequencies σ = 2. Solid lines give the mean diversity, and dashed lines show approximate 95% confidence intervals (plus and minus two standard deviations). In communities with a total of S = 30 or 300 species the frequency of the most abundant species is 42.6% or 17.4%, respectively. [Panels: S = 30 and S = 300; horizontal axis: sample size N.]
Concavity of species diversity measures
A desirable property of a measure of species diversity is that the total diversity in a set of communities should be greater than or equal to the (weighted) average diversity within the communities (Lewontin 1972). Let D_T be the total species diversity in a set of communities (or samples), computed using the weighted average species frequencies among communities. Denote the diversity within community j as D_j, and let the proportional "weight" of this community be q_j such that Σ_j q_j = 1. The weights associated with each community may reflect the relative abundance of the communities, the sizes of samples from the communities, or equal weights. A diversity measure is strictly concave when
\[ D_T \geq \sum_{j} q_j D_j \quad \text{with equality only for identical communities.} \qquad (6) \]
Species richness obviously is strictly concave.
A continuous measure of diversity based on species frequencies is strictly concave if and only if its matrix of second derivatives is negative definite (Marcus and Minc 1964).
The Shannon information measure of species diversity, H, is strictly concave (Aczel and Daroczy 1975: 35).
The Simpson concentration, λ, is strictly convex. Hence the diversity measure 1 − λ is strictly concave. However, the more commonly used inverse Simpson measure of diversity, 1/λ, is in general not concave (Patil and Taillie 1982). Thus in some cases using the inverse Simpson measure, the total diversity in a set of communities may be less than the average diversity within communities, which implies the possibility of a negative diversity among communities. The simplest case is that of communities with only two species in which the frequency of the most common species exceeds (1 + 1/√3)/2 ≈ 0.7887. The same result can occur also in more diverse communities with substantial unevenness in species frequencies (see Table 1).
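The violation of concavity by 1/λ can be checked numerically. The sketch below uses the two-species frequencies of Example 1 in Table 1, (0.8, 0.2) and (1.0, 0.0), pooled with equal weights; the function names are hypothetical and the code is only an illustration of inequality (6), not a general test of concavity.

```python
def inverse_simpson(p):
    """Inverse Simpson measure 1/lambda for a list of species frequencies."""
    return 1.0 / sum(x * x for x in p)


def simpson_diversity(p):
    """Simpson diversity 1 - lambda."""
    return 1.0 - sum(x * x for x in p)


def pool(communities, weights):
    """Weighted average species frequencies in the pooled set of communities."""
    n_species = len(communities[0])
    return [sum(q * c[i] for q, c in zip(weights, communities))
            for i in range(n_species)]


def among_component(measure, communities, weights):
    """Total diversity minus weighted average within-community diversity."""
    total = measure(pool(communities, weights))
    within = sum(q * measure(c) for q, c in zip(weights, communities))
    return total - within


communities = [[0.8, 0.2], [1.0, 0.0]]
weights = [0.5, 0.5]

inverse_among = among_component(inverse_simpson, communities, weights)
simpson_among = among_component(simpson_diversity, communities, weights)
print(inverse_among, simpson_among)
```

For these frequencies the among-community component is negative for 1/λ (1.2195 pooled versus an average of 1.2353 within communities) but positive for 1 − λ (0.18 pooled versus 0.16 within).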
Additive partition of diversity
The total species diversity in a pooled set of communities can be partitioned into additive components within and among communities, so that total diversity and its components have the same units and can be compared directly. An additive partition of diversity therefore seems more natural than the multiplicative partitions described by Whittaker (1960, 1972) and Routledge (1979). A partition would be most easily interpreted if the different components of diversity could be expressed using the same general formula.
Consider a set of communities in which the frequency of species i in community j is p_ij, such that Σ_i p_ij = 1. Let q_j be the proportional weight associated with community j, based on its sample size or importance. The total species diversity, D_T, is defined in terms of the weighted average frequency of species i in the pooled set of communities, p̄_i = Σ_j q_j p_ij. For a measure of diversity that is concave, the total diversity in a set of communities is always greater than or equal to the weighted average diversity within communities (eq. 6), and can therefore be additively partitioned into non-negative components within and among communities,
\[ D_T = D_{\mathrm{among}} + D_{\mathrm{within}} \qquad (7) \]
where D_within = Σ_j q_j D_j.
Table 1. Examples in which the inverse Simpson measure of species diversity, 1/λ, violates concavity, so that the diversity of a pooled set of communities is less than the average diversity within communities. Pooled communities have equal weights.
Example 1
         Species frequencies in community
         1         2         pooled
p_1      0.8       1.0       0.9
p_2      0.2       0.0       0.1
1/λ      1.4706    1.0000    1.2195
(1/λ_1 + 1/λ_2)/2 = 1.2353
Example 2
         Species frequencies in community
         1         2         pooled
p_1      0.50      0.3529    0.42645
p_2      0.26      0.2022    0.23110
p_3      0.09      0.0955    0.09275
p_4      0.05      0.0703    0.06015
p_5      0.04      0.0641    0.05205
p_6      0.03      0.0578    0.04390
p_7      0.02      0.0515    0.03575
p_8      0.01      0.1057    0.05785
1/λ      3.019     4.981     3.895
(1/λ_1 + 1/λ_2)/2 = 4.000
Species richness
In a pooled set of communities the total species richness is S_T and the species richness in community j is S_j. The among-community component of total species richness is
\[ S_{\mathrm{among}} = S_T - S_{\mathrm{within}} = \sum_{j} q_j (S_T - S_j). \qquad (8) \]
S_T − S_j is the discrepancy between total species richness and that in the jth community, which is non-negative. Formula (8) can be compared with Whittaker's multiplicative partition of total diversity in which β-diversity is measured by the ratio β = S_T/S_within.
Shannon information
The information diversity among communities can be expressed as
\[ H_{\mathrm{among}} = -\sum_{i} \bar{p}_i \ln \bar{p}_i - \sum_{j} q_j H_j \qquad (9a) \]
\[ = \sum_{j} q_j H(j, T). \qquad (9b) \]
\[ H(j, T) = \sum_{i} p_{ij} \ln (p_{ij} / \bar{p}_i) \qquad (9c) \]
is the "discrimination information" between community j and the pooled set of communities, which is non-negative (Kullback 1959, Rényi 1961, Aczel and Daroczy 1975). MacArthur (1965) and MacArthur and Wilson (1967) employed (9a) with equal weights to measure the component of bird species diversity between two communities. Lewontin (1972) used (9a) to partition genetic diversity in human populations, and his method was elaborated for species diversity by Allan (1975). In the special case when no species is present in more than a single community the diversity among communities is maximized and takes the usual form for information
\[ \max H_{\mathrm{among}} = -\sum_{j} q_j \ln q_j. \]
This occurs, for example, in the "hierarchical model" of Pielou (1969, 1975) where different communities are composed of distinct taxonomic groups.
Simpson diversity
For the measure of species diversity, D = 1 − λ, based on Simpson's measure of concentration, the diversity among communities can be expressed in terms of the variance in species frequencies among communities. Weighting the jth community by its overall frequency or importance, q_j, the variance among communities is
\[ d^2 = \sum_{j} q_j \sum_{i} (p_{ij} - \bar{p}_i)^2 = \sum_{j} q_j \lambda_j - \sum_{i} \bar{p}_i^2. \qquad (10a) \]
Therefore
\[ D_T = 1 - \sum_{i} \bar{p}_i^2 = d^2 + D_{\mathrm{within}}. \qquad (10b) \]
d² is thus a natural measure of species diversity among communities. Σ_i (p_ij − p̄_i)² is the squared distance in species frequencies between community j and the pooled set of communities, which is non-negative. Essentially the same approach to partitioning genetic diversity within and among populations was developed by Nei (1973, 1987). Patil and Taillie (1982) noted its applicability to partitioning species diversity at two or more levels, e.g. sets of communities from different biotic provinces. As already shown in formula (5a), the Simpson diversity within a community is also a variance. Thus, using multivariate analysis of variance, the total species diversity in a set of communities can be partitioned into additive components of the same functional form. This approach facilitates statistical analysis of components of species diversity. Statistical tests on components of variance based on categorical variables can be performed using methods in Searle et al. (1992) or Efron (1982).
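The additive partition in formula (7) can be sketched for the Shannon and Simpson measures as follows. This is an illustrative sketch under stated assumptions: the function names are hypothetical, frequencies are taken as known rather than estimated, and the example uses two communities with no species in common, for which the among-community Shannon component should reach its maximum, −Σ_j q_j ln q_j.

```python
import math


def shannon(p):
    """Shannon information H; species with zero frequency contribute nothing."""
    return -sum(x * math.log(x) for x in p if x > 0.0)


def simpson_diversity(p):
    """Simpson diversity 1 - lambda."""
    return 1.0 - sum(x * x for x in p)


def pooled_frequencies(communities, weights):
    """Weighted average frequency of each species in the pooled communities."""
    n_species = len(communities[0])
    return [sum(q * c[i] for q, c in zip(weights, communities))
            for i in range(n_species)]


def additive_partition(measure, communities, weights):
    """Return (total, within, among) so that total = among + within, eq. (7)."""
    total = measure(pooled_frequencies(communities, weights))
    within = sum(q * measure(c) for q, c in zip(weights, communities))
    return total, within, total - within


def variance_among(communities, weights):
    """Weighted variance in species frequencies among communities, eq. (10a)."""
    mean = pooled_frequencies(communities, weights)
    return sum(q * sum((x - m) ** 2 for x, m in zip(c, mean))
               for q, c in zip(weights, communities))


# Two hypothetical communities with no shared species, equal weights.
communities = [[1.0, 0.0], [0.0, 1.0]]
weights = [0.5, 0.5]

h_total, h_within, h_among = additive_partition(shannon, communities, weights)
d_total, d_within, d_among = additive_partition(simpson_diversity, communities, weights)
print(h_among, d_among, variance_among(communities, weights))
```

In this example the among-community Shannon component equals ln 2, and the among-community Simpson component equals the variance d² computed directly from formula (10a).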
Similarity among multiple communities
Employing a partition of total species diversity into additive components within and among multiple communities, a natural measure of community similarity is
\[ \psi_D = D_{\mathrm{within}} / D_T = 1 - D_{\mathrm{among}} / D_T \qquad (11) \]
which ranges between 0 and 1.
For species richness, this reveals that Whittaker's measure of β-diversity, S_T/S_within, is not actually a diversity, but rather the inverse of community similarity in species composition.
Based on the Simpson measure of genetic diversity (heterozygosity) Nei (1973, 1987) developed an analogous measure of genetic similarity among populations (G_ST) which has been widely used to describe the genetic structure of populations. To measure community similarity based on Simpson diversity we must use 1 − λ̂ to estimate diversity within samples in order that the similarity measure not exceed unity, which it could when using the unbiased estimator 1 − λ̃ for a finite number of samples (as for the Morisita index of similarity between two communities based on the unbiased Simpson concentration [Morisita 1959, Horn 1966, Wolda 1981]).
Standardized measures of community similarity, ranging from 0 to 1, generally are biased downward, so that the true similarity between communities, estimated from random samples, tends to be underestimated. This can be seen most clearly in the expected similarity among random samples from the same community.
Consider first the simplest situation in which a very large (effectively infinite) number of samples of the same size, N, are taken from the same (infinitely large) community. Then the total diversity of the pooled samples is the same as that in the community, D_T, and the similarity among communities is
\[ \psi_D = E[\hat{D}] / D_T \qquad (12) \]
in which E[D̂] is the mean value of any one of the three diversity measures in formulas (3a), (4a) and (5b).
Fig. 2 plots these expected similarities among multiple samples from the same community. The amount of bias in the similarity measures parallels that for the corresponding diversity measures. The similarity in species richness among replicate samples has a large bias, because different samples contain different sets of the rarest species. The bias in similarity of Shannon information among replicate samples depends on the ratio of total number of species in the community to 2N times the diversity in the community. The bias in similarity based on Simpson concentration depends only on the sample size and is small for moderately large samples.
We also can examine the ratio of the expected diversity within random samples of size N from the same community to the expected total diversity in n random samples of size N from the same community, E[D̂(N)]/E[D̂(nN)]. For an infinite number of samples, n = ∞, this equals the similarity coefficient in formula (12). For a finite number of samples, n < ∞, this ratio indicates the approximate behavior of the general formula (11) applied to the similarity among multiple samples from the same community (see Fig. 2).
Fig. 2. Expected similarity among n samples of N individuals from the same community, based on partition of three measures of the total species diversity in the samples. The number and abundance distribution of species in the community are as in Fig. 1. Solid lines give the exact proportion of the total species diversity in the community expected within samples. Dashed lines are approximations, based on the ratio of the expected diversity within samples to the expected total diversity in n = 2 or 10 samples. [Panels: S = 30 and S = 300; horizontal axis: sample size N.]
Discussion
Species richness is the most widely used measure of diversity, because of its simplicity in data acquisition and analysis. However, it has a well-known statistical weakness of a potentially large sampling bias, in that rare species often will be absent even in large samples or exhaustive surveys.
The Shannon information measure of species diversity, though popular, has a rather tenuous foundation in ecological theory, as noted by Pielou (1966, 1969), Hurlbert (1971), and May (1975). In samples from speciose communities, the Shannon diversity may have a substantial bias (Hutcheson 1970, Bowman et al. 1971). For both species richness and Shannon information measures of diversity, the bias depends on the actual number of species in a community, which generally is unknown, so that an unbiased estimator of species richness or Shannon information does not exist.
May (1975) preferred diversity measures based on the Simpson index to those based on information, because of the apparent relationship of Simpson's index to variances. However, the commonly used inverse Simpson diversity 1/λ is not concave. When partitioning human genetic diversity, Lewontin (1972) noted that a diversity measure should have the property that the total diversity in a pooled set of communities is greater than or equal to the average diversity within communities. Violation of this property of concavity can produce the uninterpretable result of a negative diversity among communities.
In contrast with the inverse Simpson diversity, the Simpson diversity measure 1 − λ, which is the probability that two randomly chosen individuals are different species, has a number of advantages. 1 − λ is precisely the variance of species identity within a community, for which an unbiased estimator exists. Using this measure the total diversity in a set of communities can be partitioned into the average diversity within the communities plus the diversity among communities, with the latter equal to the variance in species frequencies among communities. This measure of species diversity thus facilitates statistical partitioning of species diversity within and among communities using analysis of variance.
Partition of total species diversity in a set of communities into additive components within and among communities provides a unifying framework with which to
measure diversity at different levels of organization
using the same general formula, so that α-, β-, and
γ-diversity are measured in the same way. Additive
partition of diversity also suggests a natural measure of
similarity among multiple communities, the proportion
of the total diversity found within communities.
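The proportion of total diversity found within communities can be computed in a few lines. The sketch below is an illustration only: the function names and the example frequencies are hypothetical, and species frequencies are treated as known. It also shows that, for species richness, Whittaker's ratio S_T/S_within is the reciprocal of this similarity.

```python
def richness(p):
    """Number of species present (frequency greater than zero)."""
    return sum(1 for x in p if x > 0.0)


def simpson_diversity(p):
    """Simpson diversity 1 - lambda."""
    return 1.0 - sum(x * x for x in p)


def similarity(measure, communities, weights):
    """Within-community diversity as a proportion of total diversity.

    Undefined (division by zero) when the total diversity is zero, e.g.
    for Simpson diversity when only one species occurs in the pooled set.
    """
    n_species = len(communities[0])
    pooled = [sum(q * c[i] for q, c in zip(weights, communities))
              for i in range(n_species)]
    total = measure(pooled)
    within = sum(q * measure(c) for q, c in zip(weights, communities))
    return within / total


# Hypothetical example: two communities, two species, equal weights.
communities = [[0.8, 0.2], [1.0, 0.0]]
weights = [0.5, 0.5]

psi_richness = similarity(richness, communities, weights)
psi_simpson = similarity(simpson_diversity, communities, weights)
whittaker_beta = 1.0 / psi_richness  # equals S_T / S_within
print(psi_richness, psi_simpson, whittaker_beta)
```

Here the pooled set holds two species while the communities hold 1.5 on average, so the similarity in species richness is 0.75 and Whittaker's ratio is 4/3.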
Coefficients of community similarity inherit the
statistical sampling properties of the diversity measures
on which they are based. The similarity among a set of

communities can be expected to decrease with increasing number of communities in the set because the total
diversity increases as more distinct communities are
included. Sampling effects also contribute to decreasing
the expected similarity among communities, because
random samples deviate to some extent from the actual
communities.
The expected similarity among random samples from
the same community decreases with increasing number
of samples because more samples contain a greater
total diversity, more closely approximating that in the
actual community. For species richness, the expected
similarity among random samples from the same community can be substantially less than unity, even with
very large sample sizes. For Shannon information this
downward bias in community similarity becomes small
in samples of moderate size. For Simpson diversity
1 − λ the expected similarity among random samples

from the same community closely approaches unity
for modest sample sizes.
Measures of community similarity should not be
employed to test the null hypothesis that different
samples are drawn from the same community. When
random samples are available, analysis of contingency
tables (containing counts of individuals per species in
different samples) using Chi-squared or likelihood (G)
statistics provides a more general and powerful
method for detecting heterogeneity in species abundance distributions among communities (Gokhale and
Kullback 1978, Fienberg 1981, Sokal and Rohlf
1995).
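A likelihood (G) statistic for a samples-by-species contingency table can be sketched with the standard formula G = 2 Σ O ln(O/E), where the expected counts E come from the row and column totals. This is a minimal illustration with hypothetical names and counts; it returns only the statistic and its degrees of freedom, and the comparison with a chi-squared distribution is left to a statistics library or table.

```python
import math


def g_statistic(table):
    """G statistic and degrees of freedom for a table of counts.

    Rows are samples and columns are species; cells with zero observed
    count contribute nothing to the sum.
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    grand_total = sum(row_totals)
    g = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            if observed > 0:
                expected = row_totals[i] * col_totals[j] / grand_total
                g += 2.0 * observed * math.log(observed / expected)
    degrees_of_freedom = (len(table) - 1) * (len(table[0]) - 1)
    return g, degrees_of_freedom


# Hypothetical counts of individuals per species in two samples.
homogeneous = [[10, 20], [20, 40]]   # identical proportions in both samples
disjoint = [[10, 0], [0, 10]]        # no species shared between samples
print(g_statistic(homogeneous), g_statistic(disjoint))
```

Samples with identical species proportions give G = 0, while samples sharing no species give a large G relative to the degrees of freedom.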
The measures of similarity among multiple communities proposed here provide a natural description of
community structure based on additive partition of
species diversity within and among communities,
which can be readily extended to both higher and
lower levels of organization, e.g. multiple samples
within communities, and multiple communities within
geographic provinces or biomes.
Estimating the total species diversity for some taxonomic group in a region generally requires two kinds
of extrapolations: (1) using a sample from a community to estimate species diversity within the community, and (2) using a sample of communities to
estimate total species diversity within a region. Estimating species richness from even a large sample of a
speciose community may require (explicit or implicit)
assumption of a particular form of species abundance
distribution to extrapolate the number of rare species
(see Colwell and Coddington 1994). Accurate estimates of Shannon information diversity can be
achieved for moderate sample sizes by assuming a
form of species abundance distribution, or more generally by using sample sizes much larger than the
actual number of species in the community. For the
Simpson diversity the unbiased estimator (eq. 5c) accurately extrapolates the species diversity in a community even from a sample of modest size.
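The unbiasedness of the estimator in eq. (5c) can be verified exactly for a small case by averaging over every possible sample. The sketch below assumes a hypothetical two-species community with frequencies (p, 1 − p), so that sample counts are binomial; the function names are illustrative and the sample size must be at least 2.

```python
from math import comb


def simpson_plugin(counts):
    """Simpson diversity from sample frequencies, 1 - sum((n_i / N)^2)."""
    n = sum(counts)
    return 1.0 - sum((c / n) ** 2 for c in counts)


def simpson_unbiased(counts):
    """Unbiased estimator of Simpson diversity, eq. (5c); needs N >= 2."""
    n = sum(counts)
    return n / (n - 1) * simpson_plugin(counts)


def exact_mean(estimator, p, n):
    """Exact expectation over all samples of size n from species (p, 1 - p)."""
    return sum(comb(n, k) * p ** k * (1.0 - p) ** (n - k) * estimator([k, n - k])
               for k in range(n + 1))


p, n = 0.3, 5
true_diversity = 2.0 * p * (1.0 - p)            # 1 - lambda for two species
mean_plugin = exact_mean(simpson_plugin, p, n)  # biased by the factor (N - 1)/N
mean_unbiased = exact_mean(simpson_unbiased, p, n)
print(true_diversity, mean_plugin, mean_unbiased)
```

The plug-in mean is smaller than the true value by the factor (N − 1)/N, in agreement with eq. (5b), while the corrected estimator recovers the true value even at N = 5.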
Estimation of total species diversity in a region,
based on randomly chosen communities, can be accomplished by partitioning observed species diversity
within and among communities, and extrapolating to
the actual number of communities in the region. Sampling effects will cause a fraction of the average diversity within communities to appear as diversity among
communities. Adjustments for this can be made by
having multiple samples from each community, or by
using the expected similarity among different samples
from the same community (Fig. 2). Accurate estimation of total species diversity in a region based on a
hierarchical random sampling scheme is most easily
accomplished using an analysis of variance for Simpson diversity.
Acknowledgements - This work was supported by NSF grant DEB-9225127.
References
Aczel, J. and Daroczy,Z. 1975.On measuresof information
- AcademicPress,New York.
and theircharacterizations.
Allan, J. D. 1975.Componentsof diversity.- Oecologia 18:
359-367.
Bowman,K. O., Hutcheson,K., Odum,E. P. and Shenton,L.

R. 1971.Commentson the distributionof indicesof diversity. - In: Patil, G. P., Pielou, E. C. and Walters,W. E.
(eds), Statisticalecology. Volume3. Many speciespopulations, ecosystems,and systemsanalysis.PennsylvaniaState
Univ. Press,UniversityPark, PA, pp. 315-366.
Colwell,R. K. and Coddington,J. A. 1994.Estimatingterrestrial biodiversitythroughextrapolation.- Philos. Trans.
R. Soc. Lond. B. 345: 101-118.
Efron,B. 1982.Thejackknife,the bootstrapand otherresampling plans. - Society for Industrialand AppliedMathematics,Philadelphia,PA.
categoriFienberg,S. E. 1981.The analysisof cross-classified
cal data. 2nd edn. - MIT Press,Cambridge,MA.
Fisher, R. A., Corbet, A. S. and Williams,C. B. 1943. The
relationbetweenthe numberof speciesand the numberof
individualsin a randomsampleof an animalpopulation.J. Anim. Ecol. 12:42-58.
Gokhale, D. V. and Kullback,S. 1978. The informationin
contingencytables. - MarcelDekker,New York.
Grassle, J. F. and Smith, W. 1976. A similaritymeasure
sensitiveto the contributionof rarespeciesand its use in
investigationof variationin marinebenthiccommunities.
- Oecologia25: 13-22.
- , Patil, G. P., Smith,W. and Taillie, C. 1979. Ecological
diversityin theoryand practice.- InternationalCo-operative PublishingHouse, Fairland,MD.
Horn, H. S. 1966. Measurementof "overlap"in comparative
ecologicalstudies.- Am. Nat. 100:419-424.
Hurlbert,S. H. 1971.The nonconceptof speciesdiversity:a
critiqueand alternativeparameters.- Ecology 52: 577586.
Hutcheson,K. 1970.A test for comparingdiversitiesbasedon
the Shannonformula.- J. Theor. Biol. 29: 151-154.
Kullback,S. 1959.Informationtheoryand statistics.- Wiley,
New York.
Lewontin,R. C. 1972.The apportionmentof humandiversity.
- Evol. Biol. 6: 381-398.
MacArthur,R. H. 1957. On the relativeabundanceof bird
species.- Proc. Natl. Acad. Sci. USA 43: 293-295.
- 1965. Patternsof speciesdiversity.- Biol. Rev. 40: 510533.
- 1972.Geographicalecology. - Harper& Row, New York.
- and Wilson, E. 0. 1967. The theory of island biogeography. - PrincetonUniv. Press,Princeton,NJ.
Magurran,A. E. 1988. Ecologicaldiversityand its measurement. - PrincetonUniv. Press,Princeton,NJ.
Marcus, M. and Minc, H. 1964. A survey of matrix theory and matrix inequalities. - Dover, New York.
May, R. M. 1975. Patterns of species abundance and diversity. - In: Cody, M. L. and Diamond, J. M. (eds), Ecology and evolution of communities. Harvard Univ. Press, Cambridge, MA, pp. 81-120.

Morisita, M. 1959. Measuring of interspecific association and similarity between communities. - Mem. Fac. Sci. Kyushu Univ. Ser. E Biol. 3: 65-80.
Nei, M. 1973. Analysis of gene diversity in subdivided populations. - Proc. Natl. Acad. Sci. USA 70: 3321-3323.
- 1987. Molecular evolutionary genetics. - Columbia Univ. Press, New York.
Noss, R. F. and Cooperrider, A. Y. 1994. Saving nature's legacy, protecting and restoring biodiversity. - Island Press, Washington.
Patil, G. P. and Taillie, C. 1982. Diversity as a concept and its measurement. - J. Am. Stat. Assoc. 77: 548-561.
Peet, R. K. 1974. The measurement of species diversity. - Annu. Rev. Ecol. Syst. 5: 285-307.
Pielou, E. C. 1966. Shannon's formula as a measure of specific diversity: its use and misuse. - Am. Nat. 100: 463-465.
- 1969. An introduction to mathematical ecology. - Wiley, New York.
- 1975. Ecological diversity. - Wiley, New York.
Preston, F. W. 1948. The commonness and rarity of species. - Ecology 29: 254-283.
Rényi, A. 1961. On measures of entropy and information. - In: Neyman, J. (ed.), Proceedings of the 4th Berkeley Symposium on Mathematical Statistics and Probability, Vol. 1. Univ. of California Press, Berkeley, CA, pp. 547-561.
Routledge, R. D. 1977. On Whittaker's components of diversity. - Ecology 58: 1120-1127.
- 1979. Diversity indices: which ones are admissible. - J. Theor. Biol. 76: 503-515.
Searle, S. R., Casella, G. and McCulloch, C. E. 1992. Variance components. - Wiley, New York.
Shannon, C. E. and Weaver, W. 1962. The mathematical theory of communication. - Univ. of Illinois Press, Urbana, IL.
Simpson, E. H. 1949. Measurement of diversity. - Nature 163: 688.
Sokal, R. R. and Rohlf, F. J. 1995. Biometry. The principles and practice of statistics in biological research, 3rd ed. - Freeman, San Francisco.
Strömgren, T., Lande, R. and Engen, S. 1973. Intertidal distribution of the fauna on muddy beaches in the Borgenfjord area. - Sarsia 53: 49-70.
Whittaker, R. H. 1960. Vegetation of the Siskiyou Mountains, Oregon and California. - Ecol. Monogr. 30: 279-338.
- 1972. Evolution and measurement of species diversity. - Taxon 21: 213-251.
Williams, C. B. 1964. Patterns in the balance of nature and related problems in quantitative ecology. - Academic Press, London.
Wilson, M. V. and Mohler, C. L. 1983. Measuring compositional change along gradients. - Vegetatio 54: 129-141.
- and Shmida, A. 1984. Measuring beta diversity with presence-absence data. - J. Ecol. 72: 1055-1064.
Wolda, H. 1981. Similarity indices, sample size and diversity. - Oecologia 50: 296-302.