On Convergence Rates in The Central Limit Theorems For Combinatorial Structures

On convergence rates in the central limit theorems
for combinatorial structures

Hsien-Kuei Hwang
Institute of Statistical Science

Academia Sinica
11529 Taipei
Taiwan
August 15, 1998
Abstract
Flajolet and Soria established several central limit theorems for the parameter number of
components in a wide class of combinatorial structures. In this paper, we shall prove a simple
theorem which applies to characterize the convergence rates in their central limit theorems. This
theorem is also applicable to arithmetical functions. Moreover, asymptotic expressions are derived
for moments of integral order. Many examples from dierent applications are discussed.
Key words. Central limit theorem, convergence rate, moments, combinatorial construction, arithmeti-
cal function.
1 Introduction
This paper examines the occurrence of Gaussian laws in large random combinatorial structures. We are
mainly concerned with the rate of normal approximation to discrete random distributions. Examples
that we discuss include such diverse problems as cycles, inversions and involutions in permutations,
the construction of heaps in the eld of data structures, set partitions, and the problem of factorisatio
numerorum in number theory.
Problems in combinatorial enumeration and in number theory often lead to the decomposition of
the underlying structure into more elementary ones by suitable algebraic operations. For instance,
a permutation can be decomposed into a set of cycles, a binary tree can be dened to be either
empty or the union of a root-node and two binary trees, and a factorization of a natural number n
is a decomposition of n into a product of prime factors. It is well-known that operations like union,
sequence, cycle, set and multiset on the structural side correspond to explicitly expressible forms on
This work was partially supported by the ESPRIT Basic Research Action No. 7141 (ALCOM II) while the author
was at LIX, Ecole Polytechnique.
1
the generating function side. Hence it is useful to establish general theorems from which one can
conclude certain asymptotic results (especially, statistical properties) for parameters in the underlying
structure by verifying some basic analytic properties of the generating function. Bender initiated this
line of investigation; see [1, 2, 3, 7, 13, 14, 15, 19].
Applying singularity and probabilistic analysis on bivariate generating functions, Flajolet and
Soria [13, 14] proved a series of central limit theorems for the parameter number of components in
a wide class of combinatorial structures issuing principally from combinatorial constructions. Briey,
they obtain their results by showing that the characteristic function
n
(t) of the normalized random
variable
n
in question (to be explained below) tends, as n tends to innity, to e
1
2
t
2
; and this
establishes the weak convergence of the distribution function F
n
(x) of
n
to the standard normal law
by Levys continuity theorem. With the aid of the Berry-Esseen inequality, we shall make explicit the
convergence rates in their central limit theorems.
In the next section, we propose a simple theorem on convergence rate which turns out to have wide
applications. In particular, it is applicable to the central limit theorems of Flajolet and Soria [13, 14]
which we shall discuss in section 3. Moreover, as a direct consequence of our general assumptions on the
moment generating function, we derive an asymptotic expression for moments of integral order. Then,
we state an eective version of the central limit theorem of Haigh [16] (by establishing the convergence
rate). This theorem is borrowed from [21] where we applied it to derive the asymptotic normality
of the cost of constructing a random heap. Examples from combinatorial enumerations, computer
algorithms, probabilistic number theory, arithmetical semigroups and orthogonal polynomials will be
discussed in the nal section.
Throughout this paper, we denote by (x) the standard normal distribution:
(x) =
1
2
_
x
1
2
t
2
dt (x R).
All limits (including O, o and ), whenever unspecied, will be taken as n . All generating
functions (ordinary or exponential) will denote functions analytic at 0 with non-negative coecients.
The symbol [z
n
]f(z) represents the coecient of z
n
in the Taylor expansion of f(z).
2 Main results
Let
n
n1
be a sequence of random variables. Dene
n
= E(
n
),
2
n
= Var(
n
) and F
n
(x) =
Pr
n
<
n
+x
n
, x R. If the distribution of
n
is asymptotically normal, then F
n
(x) satises
sup
x
[F
n
(x) (x)[ 0, (1)
the pointwise convergence being uniform with respect to x in any nite interval of R. The rst
possible renement to (1) is to determine the convergence rate, which, in most cases, is of order
1
n
.
2
The most general method for establishing the convergence rate (not restricted to normal limiting
distribution) is the Berry-Esseen inequality (see (2 below) which relates the estimate of the dierence
of two distribution functions to that of corresponding characteristic functions, the latter being usually
more manageable, especially when F
n
(x) is a step-function.
Since in probability theory, this subject is almost thoroughly studied when
n
can be decomposed
as a sum of (independent or dependent) random variables, we content here with presenting two simple
theorems which, for most of our applications, turn out to be sucient. Our object is not to search
for results of the most general kind but for those which are easy to apply for combinatorial and
number-theoretic problems, especially, when probability generating function or characteristic function
are available.
The following theorem is motivated by the observation that many asymptotically normal distribu-
tions have mean and variance of the same orders.
Let
n
n1
be a sequence of integral random variables.
Suppose that the moment generating function satises the asymptotic expression:
M
n
(s) := E(e
n
s
) =

m0
Pr
n
= me
ms
= e
H
n
(s)
_
1 +O
_
1
n
__
,
the O-term being uniform for [s[ , s C, > 0, where
(i) H
n
(s) = u(s)(n)+v(s), with u(s) and v(s) analytic for [s[ and independent of n; u
(0) ,= 0;
(ii) (n) ;
(iii)
n
.
Theorem 1 Under these assumptions, the distribution of
n
is asymptotically Gaussian:
Pr
n
u
(0)(n)
_
u
(0)(n)
< x = (x) +O
_
1
n
+
1
_
(n)
_
,
uniformly with respect to x, x R.
Proof. Let
n
= u
(0)(n) and
2
n
= u
(0)(n). Dene the random variable
n
= (
n
n
)/
n
with
distribution function (characteristic function) F
n
(x) (
n
(t)) respectively.
To obtain an upper bound for the dierence [F
n
(x) (x)[, we shall estimate the ratio [(
n
(t)
e
t
2
/2
)/t[ and apply the following Berry-Essen inequality [29, p. 109]:
Let F(x) be a non-decreasing function, G(x) a dierentiable function of bounded variation on the
real line, (t) and (t) the corresponding Fourier-Stieltjes transforms:
(t) =
_

e
itx
dF(x), (t) =
_

e
itx
dG(x).
Suppose that F() = G(), F() = G(), T is an arbitrary positive number, and [G
(x)[ A.
Then for every b > 1/(2) we have
sup
<x<
[F(x) G(x)[ b
_
T
T
(t) (t)
t
dt +r(b)
A
T
, (2)
3
where r(b) is a positive constant depending only on b.
Take G(x) = (x) (so that A = 1/
2) and T = T
n
= c
n
, where 0 < c < is a suciently small
constant, we shall show that
J
n
=
_
T
n
T
n
n
(t) e
1
2
t
2
t
dt = O
_
1
n
+
1
n
_
,
so that Theorem 1 follows from the inequality (2).
Let us write M
n
(s) = e
u(s)(n)+v(s)
(1 +E
n
(s)), so that E
n
(0) = 0 and E
n
(s) = O
_
1
n
_
for [s[ .
By taking a small circle around the origin we easily deduce that E
n
(s) = O
_
[s[
1
n
_
for [s[ c. Since
[
n
(t)[ 1, we have
n
(t) = exp
_
n
it +u(
it
n
)(n) +v(
it
n
)
__
1 +O
_
[t[
n
__
= exp
_
t
2
2
+O
_
[t[ +[t[
3
n
__
+O
_
[t[
n
_
,
for [t[ T
n
. Using the inequality [e
w
1[ [w[e
|w|
for all complex w, we obtain
n
(t) e
1
2
t
2
t
= O
__
1 +t
2
n
_
exp
_
t
2
2
+O
_
[t[ +[t[
3
n
__
+
1
n
_
= O
__
1 +t
2
n
_
e
1
4
t
2
+
1
n
_
([t[ T
n
),
for c suciently small. Hence
J
n
=
_
T
n
T
n
n
(t) e
1
2
t
2
t
dt
= O
_
1
n
_
T
n
T
n
(1 +t
2
)e
1
4
t
2
dt +
1
n
_
= O
_
1
n
+
1
n
_
.
This completes the proof of Theorem 1. 2
The assumption that the moment generating function M
n
(s) exists in the neighbourhood of 0
is not a necessary condition for the validity of Theorem 1. As is evident from the proof, a weaker
sucient condition is that [u
(it)[ and [v
(it)[ are bounded for s close to 0. But the following theorem

does require the analyticity of M
n
(s).
Theorem 2 The k
th
moment of
n
satises
1
k!
m0
m
k
Pr
n
= m =
k
((n)) +O
_
(n)
k
n
_
(k = 1, 2, 3, . . .),
where
k
(X) is a polynomial in X of degree k dene by
k
(X) = [s
k
]e
u(s)X+v(s)
(k = 1, 2, 3, . . .). (3)
4
In particular, the mean and the variance of
n
satisfy
E(
n
) = u
(0)(n) +v
(0) +O
_
1
n
_
, (4)
Var(
n
) = u
(0)(n) +v
(0) +O
_
1
n
_
. (5)
Proof. Since M
n
(s) is analytic for [s[ , by Cauchys formula, we obtain
m0
Pr
n
= m
m
k
k!
=
k
((n)) +R
k
(n),
where the remainder term R
n
(n) satises
R
k
(n) = O
_
1
n
_
|s|=r
[s[
k1
e
u(s)(n)
[ds[
_
(0 < r ).
Clearly, the degree of the polynomial
k
(X) is k. Taking r = (n)
1
, we obtain
R
k
(n) = O
_
(n)
k
n
_
,
for k = 1, 2, 3, . . .
For the more rened results (4) and (5), let us consider the cumulant generating function of
n
:
n
(s) := log M
n
(s) = u(s)(n) +v(s) +Y
n
(s),
where Y
n
(s) = O
_
1
n
_
. By the analyticity of M
n
and the relation M
n
(0) = 1, the function
n
(s) is
well-dened in a neighbourhood of the origin and is analytic there. Similar arguments as above yield
the required results (4) and (5) since the mean and the variance of
n
are nothing but
n
(0) and
n
(0), respectively. This completes the proof. 2
Writing u
j
:= u
(j)
(0) and v
j
:= v
(j)
(0), j = 1, 2, 3, . . ., we have
1
(X) = u
1
X +v
1
,
2
(X) =
u
2
1
2
X
2
+
u
2
+ 2u
1
v
1
2
X +
v
2
+v
2
1
2
.
In general, some of the terms in
k
((n)) may be absorbed by the remainder term O
_
(n)
k
1
n
_
.
Corollary 1 We have
m0
m
k
Pr
n
= m = u
(0)
k
(n)
k
_
1 +O
k
_
1
(n)
+
1
n
__
,
for each xed k = 1, 2, 3, . . .
3 Combinatorial schemes of Flajolet and Soria
We now apply Theorem 1 to establish eective versions of the central limit theorems considered in
[13, 14]. The developments here are made possible since the singularity analysis provides uniform
bounds on the coecients of analytic functions.
5
3.1 Exp-log class
The exp-log class covers such problems as cycles in permutations, cycles in random mappings and
random mapping patterns, irreducibles in polynomials over nite elds (and, more generally, over an
additive arithmetic semigroup under Knopfmachers axiom A
#
), proles of increasing trees, etc. See
[13, 23, 19].
Recall that a generating function C(z) analytic at 0 is called logarithmic [13] if there exists a
constant a > 0, such that for z , > 0 being the radius of convergence of C,
C(z) = a log
1
1 z/
+H(z),
where the function H(z) is analytic in the region
:= z : [z[ + and [ arg(z )[ ( > 0, 0 < <

2
),
(with possibly a nite number of other branch cuts starting from the circle [z[ = ), and as z in
,
H(z) = K +O
_
log
1
2
1
1 z/
_
,
uniformly in z, where K is some constant. For brevity, we shall refer to as saying that C(z) is
logarithmic with parameters (, a, K).
Remark. The condition that we imposed here on H(z) is weaker than that required in [14]. They need
H(z) = K +o
_
log
1
1
1 z/
_
(z , z ),
to ensure that both the mean and the variance are a log n + O(1). But by our Theorem 2, only the
weaker condition H(z) = K + o(1) for z , z , suces to guarantee the later. This was the
original condition used in [13]. 2
We rst consider generating functions of the form
P(w, z) =

n,m0
p
nm
w
m
z
n
= exp (wC(z) +S(w, z)) (C(0) = 0),
where C(z) is logarithmic with parameters (, a, K) and the function S(w, z) is analytic for [z[ < +
and [w[ < 1 +
, ,
> 0. By singularity analysis [14, 12] (see [19] for details), we obtain
M
n
(s) =

m0
Pr
n
= me
ms
:=
[z
n
]P(e
s
, z)
[z
n
]P(1, z)
= exp
_
(e
s
1)a log n +K(e
s
1) +S(e
s
, ) S(1, ) + log
(a)
(ae
s
)
_
_
1 +O
_
1
log n
__
, (6)
uniformly for s in some complex neighbourhood of the origin.
6
Theorem 3 Let F
n
(x) = Pr
n
< a log n + x
a log n, where
n
and a are dened by (6). Then
F
n
(x) satises, uniformly with respect to x,
F
n
(x) = (x) +O
_
1
log n
_
.
Proof. The result follows from Theorem 1, since 1/(ae
s
) ,= 0 for ['(s)[ < [ log a[. 2
Theorem 4 The k
th
moment of
n
, k N
+
, satises
1
k!
m0
m
k
Pr
n
= m =
(a log n)
k
k!
_
1 +O
_
(log n)
1/2
__
,
In particular, the mean and the variance of
n
satisfy
E(
n
) = a log n a(a) +K +S
1
+O
_
(log n)
1
2
_
Var(
n
) = a log n a(a) a
2
(a) +K +S
2
+S
1
+O
_
(log n)
1
2
_
where denotes the logarithmic derivative of the Gamma function and the two constants S
1
and S
2
are dened by
S
1
=

w
S(w, )
w = 1
, S
2
=

2
w
2
S(w, )
w = 1
.
Proof. This follows from (6) and Theorem 2. 2
The above theorems apply to the following four classes of combinatorial structures when C(z) is
logarithmic:
1. Partitional complex construction: P(w, z) = exp(wC(z));
2. Partitional complex construction in which no two components are order-isomorphic:
P(w, z) =

k1
_
1 +
wz
k
k!
_
c
k
(c
k
:= k![z
k
]C(z));
3. Multiset construction: (i) total number of components (i.e., counted with multiplicity)
P(w, z) = exp
_
_
k1
w
k
k
C(z
k
)
_
_
;
(ii) number of distinct components (counted without multiplicity)
P(w, z) =

k1
_
1 +
wz
k
1 z
k
_
c
k
(c
k
:= [z
k
]C(z)),
when the dominant singularity of C is strictly inferior to 1;
4. Set construction:
P(w, z) = exp
_
_
k1
(1)
k1
k
w
k
C(z
k
)
_
_
,
when the dominant singularity of C is strictly less than 1.
Further renement of Theorem 1 is possible under slightly stronger conditions on H(z). This will
be discussed in the companion paper [20]; cf. also [19, Ch. 2].
7
3.2 Algebraic-logarithmic class
Next, let us consider generating functions of the form
P(w, z) =

n,m0
p
nm
w
m
z
n
=
1
(1 wC(z))
_
log
1
1 wC(z)
_
(C(0) = 0),
where N, 0, and + > 0. Dene the random variable
n
by its moment generating
function:
M
n
(s) =

m0
Pr
n
= m e
ms
=
[z
n
]P(e
s
, z)
[z
n
]P(1, z)
,
for suciently large n.
Denition. (1-regular function [14]) A generating function C(z) , z
q
(q = 0, 1, 2, . . .) analytic at
z = 0 is called 1-regular if (i) its Taylor expansion at z = 0 involves only non-negative coecients; (ii)
there exists a positive number < , being the radius of convergence of C(z), such that C() = 1.
Assume, without loss of generality, that C(z) is aperiodic, namely, C(z) , z
a
D(z
b
) for some generating
function D(z) and integers a, b 2.
To start with, let us rst suppose that C(z) is 1-regular and consider the equation
1
w
= C(z).
Since C(z) is 1-regular, we have C
() > 0, where satises C() = 1. By the inverse function

theorem, the above equation is locally invertible, viz, the equation has a unique solution (w) for
w 1 and (w) is analytic at w = 1. Of course (1) = .
The following lemma is fundamental to further investigations.
Lemma 1 Let P
n
(w) := [z
n
]P(w, z). The formula
P
n
(w) = A
(w)(w)
n
n
1
(log n)

0k
_
k
_
(log n)
k

0jk
g
j
() (log A(w))
kj
+
+O
_
[(w)[
n
n
2
(log n)
_
,
holds uniformly for w 1, where
A(w) =
1
w(w)C
((w))
, g
j
() =
d
j
dx
j
1
(x)
x =
.
Proof. Let us x a w, w 1. Expanding C(z) at z = (w), we get
C(z) =
1
w
(w)C
((w))
_
1
z
(w)
_
+

2
(w)
2
C
((w))
_
1
z
(w)
_
2
+ .
Thus
1
1 wC(z)
= A(w)
_
1
z
(w)
_
1
_
1 +A
1
(w)
_
1
z
(w)
_
+A
2
(w)
_
1
z
(w)
_
2
+
_
,
8
where
A
1
(w) =
(w)C
((w))
2C
((w))
, A
2
(w) =
2
(w)
_
C
((w))
2
4C
((w))
2

C
((w))
6C
((w))
_
.
For P(w, z), we get
_
1
1 wC(z)
_
_
log
1
1 wC(z)
_
= A
(w)

0j
_
j
_
(log A(w))
j
_
1
z
(w)
_
_
log
1
1 z/(w)
_
j
+
+O
_
_
1
z
(w)
_
+1
_
log
1
1 z/(w)
_
_
.
By the remark before the lemma, we can apply the singularity analysis [12]. Thus transferring to
coecients, we obtain
P
n
(w) = A
(w)

0j
_
j
_
(log A(w))
j

0kj
g
jk
()(log n)
k
+ O
_
[(w)[
n
n
2
(log n)
_
.
The double sum can be rearranged in descending powers of log n:
0j
_
j
_
(log A(w))
j

0kj
g
jk
()(log n)
k
= (log n)

0k
_
k
_
(log n)
k

0jk
g
j
() (log A(w))
kj
.
This completes the proof. 2
In particular, when = 0 and > 0, we obtain
P
n
(w) = A
(w)(w)
n
n
1
_
1 +O
_
n
1
__
;
when = 0, = 1,
P
n
(w) = (w)
n
n
1
_
1 +O
_
n
1
__
;
and when = 0 and > 0,
P
n
(w) = (w)
n
n
1
(log n)

1k
_
k
_
(log n)
k

1jk
g
j
(0) (log A(w))
kj
+
+O
_
[(w)[
n
n
2
(log n)
_
.
Note that the indices of the last two sums both begin at 1 since g
0
(0) = 1/(0) = 0.
Remark. From a computational point of view, the following expressions for g
k
(0) are useful.
g
k
(0) = k![x
k
]
1
(x)
= k!(1)
k1
[x
k1
]
1
(1 x)
9
= k!(1)
k1
[x
k1
]
(x) sin x
= k(1)
k1

0j
k2
2

(1)
j
2j
_
k 1
2j + 1
_
(k22j)
(0)
=

0j
k1
2

(1)
k1+j
2j
_
k
2j + 1
_
(k12j)
(1).
Thus, with M
n
(s) := P
n
(e
s
)/P
n
(1), we have, by Lemma 1,
M
n
(s) =
_
e
s
(e
s
)C
((e
s
))
C
()
_
(e
s
)
n
n
(1 +
,
(n)) , (7)
where
,
(n) =
_
O
_
n
1
_
, if ( > 0 and = 0) or ( = 0 and = 1);
O
_
(log n)
1
_
, if ( = 0 and 2) or ( > 0 and > 0),
uniformly for [s[ , > 0.
Dene two constants
1
=
1
C
()
, and
2
=
1
2
C
()
2
+
C
()
C
()
3

1
C
()
.
Theorem 5 Let F
n
(x) = Pr
n
<
1
n +x
2
n. Then F
n
(x) satises
F
n
(x) = (x) +O
_
1
n
+
,
(n)
_
,
uniformly in x, x R.
Proof. Using (7) and Applying Theorem 1. 2
Theorem 6 The k
th
moment of
n
satises
1
k!
m0
m
k
Pr
n
= m =
k
(n) +O(n
k
,
(n)),
where
k
(n) is a polynomial in n of degree k dened by
k
(n) := [s
k
] exp
_
nlog
(e
s
)
__
e
s
(e
s
)C
((e
s
))
C
()
_
,
for k = 1, 2, 3, . . . In particular, we have
E(
n
) =
1
n +L
1
+O(
,
(n))
Var(
n
) =
2
n +L
2
+O(
,
(n)) ,
where the two constants L
1
, L
2
are given by
L
1
:=
1
C
()
+
C
()
C
()
2
1
L
2
:=
2C
1
C
2
+
2
C
2
2
2
C
2
1
C
2
C
3
1
+C
1
C
2
2C
3
1
+2
2
C
2
2
2
C
1
C
3
+
2
C
4
1
2
2
C
2
1
C
2
+C
2
1
+C
2
1
2
2
C
4
1
,
(C
j
:= C
(j)
() for j = 1, 2, 3.)
10
Proof. This follows from (7) and Theorem 2. 2
These results apply to sequence constructions of both labelled and unlabelled structures
P(w, z) =
1
1 wC(z)
(C(0) = 0);
and to cycle constructions
P(w, z) = log
1
1 wC(z)
, P(w, z) =

k1
(k)
k
log
1
1 w
k
C(z
k
)
,
provided that the radius of convergence of C(z) in the last case is less than 1, where (k) denotes
Eulers totient function. It should be noted that for sequence constructions, we often have exponential
convergence rate for M
n
(s).
Remark. For Benders central limit theorem [1, Thm 1], the convergence rate is O
_
n
1
2
_
, provided
that the function A(s) in that theorem has bounded derivative for s near 0.
4 A classical central limit theorem
The following theorem is borrowed from [21] where we used it to derive the asymptotic normality of
the cost of constructing a random heap. Since the proof has already been given there, it is omitted
here. Without the explicit error term it is due to Haight [16].
The object of study here is sums of independent but not identically distributed random variables.
This problem has been extensively studied in the probability literature; see, for instance, [6, 27] [25,
Ch. VI] [17, 29].
Theorem 7 Let
n
n
be a sequence of random variables taking only non-negative integral values
with mean
n
and variance
2
n
. Suppose that the probability generating function P
n
(z) of
n
can be
decomposed as
P
n
(z) =

1jk
n
P
nj
(z),
for some sequence k
n
n
, where the P
nj
(z) are polynomials such that (i) each P
nj
(z) is itself a prob-
ability generating function of some random variable, say,
nj
(1 j k
n
); and (ii)
M
n
n
0,
where M
n
= max
1jk
n
deg P
nj
(z). Then the distribution of
n
is asymptotically Gaussian:
Pr
n
< x = (x) +O
_
M
n
n
_
,
uniformly with respect to x.
Proof. See [21].
11
Corollary 2 Let P
n
(z) be a probability generating function of the random variable
n
, n 1. Suppose
that, for each xed n 1, P
n
(z) is a polynomial whose roots are all located in and on the half-plane
'z 0. If
n
,
2
n
:= Var(
n
), then
n
satises
Pr
n
< x = (x) +O
_
1
n
_
,
for < x < , where
n
= E(
n
).
Proof. M = 2 in Theorem 7. 2
In particular, if all roots of P
n
(z) are non-positive, then the results of the corollary hold, cf. [18].
5 Examples
Let us discuss some typical examples from dierent applications.
Example 1. Cycles in permutations. A permutation is a set of cycles. The number of permutations
having m cycles is enumerated by the (signless) Stirling number of the rst kind s(n, m):
s(n, m) := [w
m
z
n
] exp
_
wlog
1
1 z
_
= [w
m
] (w(w + 1) (w +n 1)) (1 m n, n 1).
Both Theorems 3 and 7 apply to the random variable
n
, dened as the number of cycles in a random
permutation of 1, 2, . . . , n, where each of the n! permutations is equally likely. We obtain
Pr
n
log n
log n
< x = (x) +O
_
1
log n
_
,
a result originally due to Goncharov (without error term).
Many other examples in this same (exp-log) class can be found in [5, 13, 23, 33] [26, 2.4 and 2.5]
and [19, Ch. 5].
Example 2. Stirling numbers of the second kind. The generating function for the number of blocks
(marked by w) in set partitions satises
n,m0
S(n, m) w
m
z
n
n!
= e
w(e
z
1)
,
where S(n, m) denotes Stirling numbers of the second kind. Theorem 1 is not applicable since, as we
shall see, the mean and the variance are not of the same order. So we turn to Theorem 7. Harper [18]
showed that the polynomials
P
n
(z) :=

0mn
S(n, m) z
m
(n = 1, 2, 3, . . .),
have only simple non-positive roots. It remains to show that
n
, namely,
2
n
:= Var(
n
) =
b
n+2
b
n
b
n+1
b
n
1 ,
12
where b
n
:= n! [z
n
]e
e
z
1
are the Bell numbers. Using the asymptotic expression [9, Ch. 6]
b
n
=
e
n(+
1
1)1
+ 1
_
1

2
(2
2
+ 7 + 10)
24 n( + 1)
3
+O
_
2
n
2
__
,
where e
= n, it is not hard to show that
2
n
=
n
( + 1)
+O(1)
n
(log n)
2
;
so that Theorem 7 gives, dening Pr
n
= m := b
1
n
S(n, m),
Pr
n
< x = (x) +O
_
log n
n
_
,
uniformly with respect to x, where
n
= E(
n
) =
b
n+1
b
n
1 =
n
+O(1)
n
log n
.
The asymptotic normality is due to Harper [18]. Alternatively, we can write
Pr

n
n/
_
n/(( + 1))
< x = (x) +O
_
log n
n
_
,
but this formula cannot be further simplied by using the asymptotics of .
For other examples, see [31, 28].
Example 3. Cost of heap construction. Doberkat [11] shows that the probability generating functions
for the number of exchanges and for the number of comparisons used by Floyds algorithm to construct
a heap of size n have the form

1jn
p
n,j
(z), where p
n,j
(z) are polynomials of degree log
2
n.
Moreover, the mean and the variance are all linear. Thus Theorem 7 applies with M
n
= log n to
both cases [21]. More precisely, given a random permutation of size n, dening
n
as the number of
exchanges used by Floyds algorithm to construct a heap from this permutation, we have [21, Thm 4]
Pr
n
c
1
n
c
2
n
< x = (x) +O
_
log n
n
_
,
uniformly with respect to x, where c
1
= 2+
j1
j(2
j
1)
1
= 0.744033... and c
2
= 2
j1
j
2
(2
j
1)
2
= 0.261217... Similar result holds for the number of comparisons and other construction algorithms
preserving randomness in each step, cf. [27].
For other examples on algorithms, see [19, Ch. 1].
Example 4. Ordered set-partitions. The generating function for the number of blocks (marked by w)
in an ordered set-partition is (1 w(e
z
1))
1
. Obviously, e
z
1 is 1-regular. Theorem 5 applies. We
can also consider other constraints like parity on the size of each block and on the number of blocks,
cf. [8, p. 225226]. Many other examples can be found in [1, 14, 22] and [19, Ch. 7].
13
Example 5. Inversions in permutations. Given a permutation = (
1
,
2
, . . . ,
n
) of 1, 2, 3, . . . , n,
a pair
i
,
j
is called an inversion if i < j and
i
>
j
. Let
n
denote the number of inversions in a
random permutation of size n. Then it is easily seen that [24, 5.1.1]
Pr
n
= m =
1
n!
[z
m
]
_
(1 +z)(1 +z +z
2
) (1 +z +z
2
+ +z
n1
)
_
.
From which we obtain
E(
n
) =
n(n 1)
4
, and Var(
n
) =
n(n 1)(2n + 5)
72
, (8)
for n 1. Theorem 7 applies and we get, in view of (8),
Pr
n
n
2
/4
n
3/2
/6
< x = (x) +O
_
1
n
_
,
for all x.
Example 6. Erdos-Kac Theorem. Dene a random variable
n
by its probability distribution:
Pr
n
= m =
1
n
[k : 1 k n, (k) = m[ =
1
n
[z
m
]

1kn
z
(k)
,
where (n) denotes the number of distinct prime divisors of n, e.g., (23
4
5
6
7
8910
11
12
13
141516
) = 6.
It is known that E(
n
) = log log n + O(1), and Var(
n
) = log log n + O(1). The Erd os-Kac Theorem
states that
Pr
n
log log n
log log n
< x (x).
The convergence rate was rst conjectured by Le Veque and then proved by Renyi and Tur an [30] to
be O((log log n)
1
2
). Let us apply Theorem 1 to establish this fact.
Starting with the well-known Selbergs formula [32]
M
n
(z) := E(e
n
z
) = V (e
z
)(log n)
e
z
1
_
1 +O
_
1
log n
__
,
uniformly for [z[ , where V (t) is an entire function in the t-plane, we readily obtain
Pr
n
log log n
log log n
< x = (x) +O
_
1
log log n
_
,
uniformly with respect to x.
For many other examples, see [10] and [19, Ch. 9].
Example 7. Number of factors in square-free factorizations. Let f
n,k
denote the number of factor-
izations of n into k distinct factors greater than 1, n 2, k 1. Dene f
1,k
=
0,k
, where
a,b
is
Kroneckers symbol. Consider the bivariate generating function:
P(w, s) := 1 +

n2
n
s

m1
f
n,m
w
m
,
14
which equals to
P(w, s) =

n2
_
1 +
w
n
s
_
('s > 1, w C).
Dene a polynomial P
n
(w) := 1 +
2kn
m1
f
k,m
w
m
and a random variable
n
for which
Pr
n
= m =
[w
m
] P
n
(w)
P
n
(1)
.
We show in [19, Ch. 10] that
P
n
(w) =
ne
2
wlog n
2
(log n)
3/4
w
3/4
(1 +w)(w)
_
1 +O
_
1
log n
__
,
the error term being uniform with respect to w C (, 0]. Thus we obtain
M
n
(z) := E(e
n
z
) =
2 e
2
log n(e
z/2
1)3z/4
(e
z
)(1 +e
z
)
_
1 +O
_
1
log n
__
,
and Theorem 1 gives
Pr
log n
4
_
1
4
log n
< x = (x) +O
_
1
4
log n
_
,
for all x.
For other examples, see [19, Ch. 10].
Example 8. Hermite polynomials. The number of xed points (cycles of length 1), marked by w, in
the involutions is enumerated by
exp
_
wz +
z
2
2
_
=

n0
h
n
(w)
n!
z
n
,
where h
n
(w) are Hermite polynomials. The asymptotic behaviour of h
n
(w) is well-known [34, p. 200]:
h
n
(w) = 2
1
2
n
n/2
e
n/2+
n+1 ww
2
/4
_
1 +O
_
1
n
__
,
uniformly for all nite w. Dening
n
as the number of xed points in a random involution, we obtain
M
n
(s) = E(e
n
s
) = e
n+1 (e
s
1)
1
4
(e
2s
1)
_
1 +O
_
1
n
__
,
uniformly for [s[ . We obtain
Pr
n + 1
4
n + 1
< x = (x) +O
_
1
4
n
_
,
the O being uniform with respect to x, x R.
15
References
[1] E. A. Bender. Central and local limit theorems applied to asymptotic enumeration. Journal of
Combinatorial Theory, Series A, 15:91111, 1973.
[2] E. A. Bender and L. B. Richmond. Central and local limit theorems applied to asymptotic
enumeration. II. multivariate generating functions. Journal of Combinatorial Theory, Series A,
34:255265, 1983.
[3] E. A. Bender, L. B. Richmond and S. G. Williamson. Central and local limit theorems applied
to asymptotic enumeration. III. matrix recursions. Journal of Combinatorial Theory, Series A,
35:263278, 1983.
[4] E. A. Bender and L. B. Richmond. A generalisation of Canelds formula. Journal of Combina-
torial Theory, Series A, 41:5060, 1986.
[5] F. Bergeron, P. Flajolet, and B. Salvy. Varieties of increasing trees. In Proceedings CAAP92,
Lecture Notes in Computer Science, 581, pages 2448, Springer-Verlag, New York, 1992.
[6] P. Billingsley, Probability and Measure, Second edition, Wiley, New York, 1986.
[7] E. R. Caneld. Central and local limit theorems for the coecients of polynomials of binomial
type. Journal of Combinatorial Theory, Series A, 23:275290, 1977.
[8] L. Comtet. Advanced Combinatorics, The Art of Finite and Innite Expansions. D. Reidel
Publishing Company, Dordrecht-Holland, revised and enlarged edition, 1974.
[9] N. G. de Bruijn. Asymptotic Methods in Analysis. North-Holland, 1958. Reprinted by Dover,
1981.
[10] H. Delange. Sur des formules de Atle Selberg. Acta Arithmetica, 19:105146, 1971.
[11] E. E. Doberkat. An average case analysis of Floyds algorithm to construct heaps. Information
and Control, 61:114131, 1984.
[12] P. Flajolet and A. M. Odlyzko. Singularity analysis of generating functions. SIAM Journal on
Discrete Mathematics, 3:216240, 1990.
[13] P. Flajolet and M. Soria. Gaussian limiting distributions for the number of components in com-
binatorial structures. Journal of Combinatorial Theory, Series A, 53:165182, 1990.
[14] P. Flajolet and M. Soria. General combinatorial schemes: Gaussian limit distributions and expo-
nential tails. Discrete Mathematics, 114:159180, 1993.
16
[15] Z. Gao and L. B. Richmond. Central and local limit theorems applied to asymptotic enumera-
tion IV: multivariate generating functions. Journal of Computational and Applied Mathematics,
41:177186, 1992.
[16] J. Haigh. A neat way to prove asymptotic normality. Biometrika, 58:677678, 1971.
[17] P. Hall, Rates of Convergence in the Central Limit Theorem, Research Notes in Mathematics, 62,
Pitman Advanced Publlishing Company, Boston, 1982.
[18] L. H. Harper. Stirling behaviour is asymptotically normal. Annals of Mathematical Statistics,
38:410414, 1967.
[19] H.-K. Hwang. Theorèmes limites pour les structures combinatoires et les fonctions arithmetiques,
Thèse, Ecole polytechnique, 1994.
[20] H.-K. Hwang. On asymptotic expansions in the central and local limit theorems for combinatorial
structures. submitted.
[21] H.-K. Hwang and J.-M. Steyaert. On the number of heaps and the cost of heap construction.
Theoretical Computer Science, to appear.
[22] A. Knopfmacher, J. Knopfmacher, and R. Warlimont. Factorisatio numerorum in arithmetical
semigroups. Acta Arithmetica, 61:327336, 1992.
[23] J. Knopfmacher. Analytic Arithmetic of Algebraic Function Fields, volume 50 of Lecture Notes
in Pure and Applied Mathematics. Marcel Dekker, Inc., New York and Basel, 1979.
[24] D. E. Knuth. The Art of Computer Programming, Volume 3: Sorting and Searching. Addison
Wesley, Reading, Massachusetts, 1973.
[25] M. Loève, Probability Theory, Forth edition, Springer-Verlag, New York, 1977.
[26] H. M. Mahmoud. Evolution of Random Search Trees. John Wiley & Sons, New York, 1992.
[27] C. J. H. McDiamid and B. A. Reed. Building heaps fast. Journal of Algorithms, 10:352365,
1989.
[28] L. Mutafciev. Limit distributions related to the weighted Stirling numbers of the rst and second
kind. Collection: Limit Theorems in Probability and Statistics. Colloquia Mathematica Societatis
Janos Bolyai, 36:823842, 1982.
[29] V. V. Petrov. Sums of Independent Random Variables. Springer-Verlag, Berlin-Heidelberg, New-
York, 1975. Translated from the Russian by A. A. Brown.
17
[30] A. Renyi and P. Tur an. On a theorem of Erd os-Kac. Acta Arithmetica, 4:7184, 1958.
[31] A. Ruci nski and B. Voigt. A local limit theorem for generalized Stirling numbers. Revue Roumaine
de Mathematiques Pures et Appliquees, 35:161172, 1990.
[32] A. Selberg. Note on a paper by L. G. Sathe. Journal of the Indian Mathematical Society, 18:8387,
1954.
[33] M. Soria. Methode dAnalyse pour les Constructions Combinatoires et les Algorithmes. Thèse
d
Etat, Universite de Paris-Sud, 1990.

[34] G. Szego. Orthogonal Polynomials, volume XXIII of American Mathematical Society Colloquium
Publications. AMS, Providence, Rhode-Island, forth edition, 1977.
18

On Convergence Rates in The Central Limit Theorems For Combinatorial Structures

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

On Convergence Rates in The Central Limit Theorems For Combinatorial Structures

Uploaded by

Copyright:

Available Formats

On convergence rates in the central limit theorems

for combinatorial structures

Institute of Statistical Science

(0)(n). Dene the random variable

(it)[ are bounded for s close to 0. But the following theorem

() > 0, where satises C() = 1. By the inverse function

= n, it is not hard to show that

Etat, Universite de Paris-Sud, 1990.

You might also like