Professional Documents
Culture Documents
MAY 1989
Entropy Expressions and Their Estimators and Z positive definite. The entropy, denoted by H(NORM), is
for Multivariate Distributions given by
P P 1
NABIL ALI AHMED AND D. V. GOKHALE. MEMBER, IEEE H(N0RM) = - - ln(2n) + - In 121.
2 2 2
+ (1.1)
Ahstruet -Entropy expressions for several continuous multivariate dis- Let H(LN0RM) be the entropy of a multivariate lognormal
tributions are derived. Point estimation of entropy for the multinormal distribution given by
distribution and for the distribution of order statistics from Weinman's
exponential distribution is considered. The asymptotic distribution of the
uniformly minimum variance unbiased estimator for multinormal entropy
is obtained. Simulation results on convergence of the means and variances
of these estimators are provided.
where
INTRODUCTION
The entropy of a random variable X having density f(x) In=(In x1,. . . ,In x P ) ' x = ( x 1,. . ,x,,)'.
absolutely continuous with respect to the Lebesgue measure on
RJ' is given by: Then
P
H ( LNORM) = H( NORM) + p, . (14
r=l
where
Let H(L0GST) denote the entropy of a multivariate logistic
x=( X I , x2; . ,X J ' .
'
distribution given by
In receni years the concept of entropy has been used increasingly
in inferential statistics. Vasicek [12] constructed an entropy esti-
mate and a test of goodness of fit of univariate normality based
on that estimate. Dudewicz and Van der Meulen [3] proposed a
goodness-of-fit test of the uniform distribution based on maxi- r=l
mum entropy criteria. Gokhale [4] proposed a general form of a
goodness-of-fit test statistic for families of maximum entropy
distributions; particular cases of such families are normal, expo-
nential, double-exponential, gamma, and beta distributions.
Lazo and Rathie [7] derived and tabulated entropies for vari- - CO < x , < + OO,pr > 0 , - 00 < p, < + 0 0 .
ous univariate continuous probability distributions. This corre- Then
spondence extends their table and results to the entropy of
P
several families of multivariate distributions (Section I). Section
I1 deals with point estimation for the multivariate normal and the H(LOGST) = C In(p,)-ln(p!)+(p+l)A(p) (1.3)
I =1
distribution of the order statistics from the multivariate exponen-
tial distribution proposed by Weinman [13]. Two types of estima- where
tors are considered. One, denoted by H,(f), is a consistent
estimator and the second, denoted by H,(f),is the uniformly
minimum variance unbiased estimator. Section I11 deals with the
asymptotic distribution of the entropy estimators H*(f)and I 1 9 forp =l.
H, (f)for the multivariate normal distribution. To compare their
performances, means and variances obtained by Monte Carlo
simulation are tabulated. Let H(PARET0) denote the entropy of the multivariate Pareto
I. PARAMETRICENTROPY distribution of Type I1 (Arnold, [2]) given by
The expressions of entropy derived in this section often involve -(a+,)
digamma and trigamma functions, + ( a )= ( d / d a )In ('I a ) and a+i-1
+'(a) = (d/da)+(a),respectively (Magnus, [SI), and Euler's con- r=l
stant y = 0.5772156649.
The density function of the multivariate normal distribution is for x , > p , , i = 1,2,. . . , p .
given by
Then
H( PARETO)
x E R P , p E RP
r=l
Manuscript received July 3, 1987; revised July 22, 1988. This correspon-
dence was presented in part at the International Symposium on Information
and Coding Theory, University of Campinas, Campinas, Brazil. July 1987.
+
n(a+i-1)
1 1( a + i - 1 )
1
N. Ali Ahmed is with Bell Communications Research, 3 Corporate Place,
Piscataway. NJ 08854. USA.
D. V. Gokhale is with the Department of Statistics. University of California.
Riverside, CA 92521, USA.
[ I =1 (u+i)-112[ I =1 (a+i)-2]...[ I =1 (a+i)-p] '
Authorized licensed use limited to: UNIVERSITY OF SUSSEX. Downloaded on October 13, 2008 at 11:45 from IEEE Xplore. Restrictions apply.
IEEE TRANSACTIONSON INFORMATION THEORY, VOL. 35, NO. 3, MAY 1989 689
- Pa
(1.6)
Arnold [2] for the respective families of distribution), and those
for the distributions studied by Weinman [13] are given in his
reference. Hence for the sake of brevity we will not pursue this
topic here in more detail.
We first consider the UMVUE for H(N0RM).
where Definition I: Let V :( p x p ) be symmetric and positive defi-
1 nite. The random matrix V is said to follow the nonsingular
C= p-dimensional Wishart distribution with scale matrix E, and n
POP1 ' . .P p - 1 degrees of freedom, n 2 p , if the joint distribution of the distinct
elements of V is continuous with the density function of V given
The entropy H(EXPW) for the foregoing probability density by
function is
Let XI; . ., X p be the i.i.d. failure times from the multivariate and p ( V ) = 0, otherwise, where c is a numerical constant de-
exponential distribution due to Weinman [13] and Y1; . . , Yp be fined as
the corresponding order statistics. The joint density of the order
statistics of the multivariate exponential distribution (Weinman
[13]) consisting of the joint density of the random variables { y }
Authorized licensed use limited to: UNIVERSITY OF SUSSEX. Downloaded on October 13, 2008 at 11:45 from IEEE Xplore. Restrictions apply.
690 IEEE TRANSACTIONS ON INFORMATION THEORY. VOL. 35, NO. 3. MAY 1989
Theorem I : For X , : p X l , let X,,X2;..,X2;..,Xn be mu- The following lemma shows that the limit of the bias terms in
tually independent; let X, be normally distributed with parame- H,.(NORM) is equal to zero. Note that from (2.7) the bias B,,
ters 0 and Z, B>O, and let X = ( X , , X 2 ; . . , X , , ) ' , X : p x n , eauals
n 2 p. Define V = X X ' ; then V > 0 and V will have a Wishart
distribution with parameters Z, p , and n.
Proof: See Press [lo].
Lemma 2: We have B,, = 0.
Lemma I (Wijsman, [14]): The distribution of lVl/lZl is the
distribution of the product of p independent chi-squared vari- Proof: We have
ables with n, n - 1,. . ., n - p + 1 degrees of freedom, respec-
tively. Iim { B , , } = Iim
n+m
Theorem 2: The uniformly minimum variance unbiased esti-
mator for the parametric entropy of the multivariate normal
distribution is
t dt
The mean and the variance of In[X;,,+ - I ) ] are given by . (2.9)
n+l-i
E { &(NORM)} = H(N0RM)
The term in the bracket is Cauchy with parameters p = v ( n +
and 1- i ) and p = 0. Thus the last term of (2.9) is less than or equal
to l / ( n + 1- i ) :
var { H,(NORM) } =-
4 r=l
4' ( n+i-i).
~
Authorized licensed use limited to: UNIVERSITY OF SUSSEX. Downloaded on October 13, 2008 at 11:45 from IEEE Xplore. Restrictions apply.
IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 35, NO. 3, MAY 1989 691
DISTRIBUTION
111. ASYMPTOTIC OF H,(NORM) "I:= [A: ; theoretical entropy: 2.7907.
Note that from (2.1), (2.2), and the fact that In IS1 =In IVJ-
p In( n - 1) the random variable in &(NORM) is distributed as TABLE 111
THEORETICAL
( t )AND SIMULATED
(s) MEANS
AND VARIANCES
OF H,(NORM) AND H,(NORM)a
Sample Estimator
i=l Size TKJe Mean Variance
2.55852 0.1175100
The distribution of ln[X:n+r-l)] is well approximated by the 2.61510 0.1175100
normal distribution with appropriate mean and variance (Olshen, 2.32824 0.1316455
[9]). Hence H,(NORM) is also well approximated by the normal 2.38483 0.1316268
distribution with mean H(NORM), given by (1.1) and the vari- 2.59776 0.0350800
ance (2.6). 2.61510 0.0350800
To study the convergence properties of H,(NORM) and 2.52798 0.0331916
H,(NORM), we generated lo00 samples of different sizes from 2.54532 0.0331788
bivariate normal distributions with different covariance matrices 2.60486 0.0206200
2.61510 0.0206200
2. For each sample the estimates &(NORM) and H,(NORM) 2.55536 0.0206548
were calculated. Their means and variances for different sample 2.56500 0.0206537
sizes are given in Tables I-IV. From the tables it is seen that 2.61004 0.0101500
H,(NORM) converges slightly faster to the true value than 2.61510 0.0101500
"(NORM). The simulated variance of &(NORM) is slightly 2.58655 0.0104487
smaller than that of H,(NORM) as expected. This convergence 2.59161 0.0104384
behavior does not seem to be affected much by the increasing
correlation between the two variables. "Z = [ z] ; theoretical entropy: 2.6151.
Authorized licensed use limited to: UNIVERSITY OF SUSSEX. Downloaded on October 13, 2008 at 11:45 from IEEE Xplore. Restrictions apply.
692 IEEE TRANSACTIONS O N INFORMATION THEORY, VOL. 35. NO. 3. MAY 1989
Authorized licensed use limited to: UNIVERSITY OF SUSSEX. Downloaded on October 13, 2008 at 11:45 from IEEE Xplore. Restrictions apply.