Professional Documents
Culture Documents
9, SEPTEMBER 2001
AbstractUnlike low-rank matrix decomposition, which is fact was illustrated by Cattell and rigorously established under
generically nonunique for rank greater than one, low-rank three- increasingly milder conditions over a span of some 30 years by
and higher dimensional array decomposition is unique, provided
various authors [8][10], [14], [15], [20], [21], [25].
that the array rank is lower than a certain bound, and the correct
number of components (equal to array rank) is sought in the Low-rank decomposition of multidimensional arrays has
decomposition. Parallel factor (PARAFAC) analysis is a common been used as as a data analysis tool in an unusual variety
name for low-rank decomposition of higher dimensional arrays. of disciplines, e.g., [2], [5], [6], [10], [14], and [16], to
This paper develops CramrRao Bound (CRB) results for mention a few. More recently, it was shown to have exten-
low-rank decomposition of three- and four-dimensional (3-D and
4-D) arrays, illustrates the behavior of the resulting bounds, and sive application in signal processing and communications,
compares alternating least squares algorithms that are commonly including blind multiuser detection in direct-sequence code-di-
used to compute such decompositions with the respective CRBs. vision multiple-access (DS-CDMA) communications [25],
Simple-to-check necessary conditions for a unique low-rank multiple-invariance sensor array processing [24], blind beam-
decomposition are also provided.
forming in specular multipath [17], [26], and, in more general
Index TermsCramrRao bound, least squares method, ma- terms, blind diversity-combining in communications [22]. De-
trix decomposition, multidimensional signal processing. spite these wide-ranging applications, pertinent CramrRao
bounds (CRBs) for low-rank decomposition of multidimen-
I. INTRODUCTION sional arrays have been missing from the literature. The first
contribution of this paper is the development of CRBs for
pertinent CRBs (associated derivations are deferred to the minimum number of rank-one (three-way) components needed
Appendices). These bounds are discussed and illustrated in to decompose .
Section IV. Simple-to-check necessary uniqueness conditions Define an matrix with typical element
are developed in Section V. Multilinear ALS is compared with , matrix with , matrix
the CRB in Section VI. Conclusions are drawn in Section VII. with , and matrices with
Some notation conventions that will be used in this paper . Compact matrix representations of the model in (3) are
follow. made possible by employing the KhatriRao matrix product.
Transpose of . For example
Complex conjugate of .
Conjugate transpose of .
th column of . .. ..
th column of ( is the th row of ). . .
Kronecker product of and .
(4)
KhatriRao (column-wise Kronecker) product of
The superscript means that the matrix is of size
and .
and that the -index ( goes first in the product ) runs faster
the Hadamard (element-wise) product of and :
than the -index along its rows. By symmetry
;
Diagonal matrix constructed out of the th row of (5)
.
Kronecker delta: when , and and
when , and are integers.
(6)
Frobenius norm.
Matrix pseudo-inverse. Next, consider an 4-D array with typical
element
II. MULTIDIMENSIONAL LOW-RANK MODELING
Consider an matrix . rank if and only if (7)
can be written as a sum of but no less than rank-one
matrices (vector outer products) for , , , and .
Define a matrix with typical element
(1)
and matrix with . Similar to the
where is , and is . Note that we may assume three-way case, the KhatriRao product can be used to cast the
without loss of generality that the columns of and are lin- model in (7) in matrix form. For example
early independent; otherwise, at least one of the rank-one
(8)
components can be absorbed in the others. In general,
, but if , then constitutes a The KhatriRao product has the property
low-rank decomposition of . Let denote the th entry [13]; in addition, the order of KhatriRao multi-
of . A scalar view of the relationship is plications only affects the order of rows in the final result [13].
In particular, rank is not affected by the order in which the mul-
(2) tiplications are carried out.
Further generalizing to dimensions
a set of linearly dependent columns. Note that -rank is which certain simplifications are possible due to the fact that
always less than or equal to rank the noise covariance matrix can be written in convenient closed
, . -rank stands for Kruskal-rank [14]; the term form and an alternative scalar computation approach can be
was coined by Harshman and Lundy [11]. adopted. The real-parameter three-way case is the one that is
Theorem 1: (Sufficient condition for uniqueness of low-rank mostly encountered in applications of PARAFAC in other disci-
decomposition of -way arrays [21], [22]; cf. [14] for a basic plines. Under the Gaussian assumption, the CRB will be block
precursor result, and [25] for its complex counterpart). Consider diagonal in the noise and signal parameters. In the interest of
the -component -way array given in (9), and suppose that it brevity, we therefore assume that the noise variance is known
is irreducible (i.e., the rank of the -way array is ). If throughout the derivations.
A delicate point regarding these CRBs is the inherent per-
mutation and scale ambiguity. For example, in the three-way
(10) case (3), one can clearly reshuffle the components and/or set
, , and such that
then the -way array , , , , and remains unaffected. In other
has unique rank-one -way factors words
SNR (three-way array) Fig. 3. PARAFAC CRB: With versus without assuming Vandermonde
structure.
SNR (four-way array)
component matrices , , and is compared with the corre-
Unless otherwise specified, the matrices , , and were ran- sponding bound for a four-way array (Appendix B) obtained by
domly drawn from an i.i.d. standard Gaussian distribution, and augmenting the three-way model by adding another dimension
the CRB is averaged over all the unknown parameters of all ma- of size , , or . We have seen that higher dimen-
trices. sionality leads to a relaxed uniqueness condition (Theorem 1 ).
Fig. 1 is an illustration of the behavior of the CRB for Fig. 2 demonstrates that higher dimensionality also benefits in
low-rank decomposition of a complex-parameter three-way terms of CRB.
array (Appendix A) as the size of one dimension increases, with In Fig. 3, we compare the CRB of Appendix A with that of
SNR fixed at 10 dB. In this experiment, and are , Appendix C for a given three-way array with Vandermonde
whereas is augmented from to . Notice that structure in one dimension. The plot shows average CRB of
the number of parameters also increases with the sample size. the generators of the Vandermonde matrix. The bound in Ap-
However, the number of equations (data points) increases faster pendix A ignores the fact that Vandermonde structure is present
than the number of unknown parameters ( versus order of and is therefore above the bound in Appendix C. Comparisons
, respectively). One would then intuitively expect of this kind can help gauge the performance margin that can
that the CRB should decrease with increasing sample size along be gained by employing and exploiting additional model
any dimension. Although this seems to be the trend, it is not structurein this case, Vandermonde. An example could be
true in general. We have seen examples of datasets where the the choice of complex exponential (OFDMA-like) spreading
average CRB may actually increase before decreasing again as codes in the context of [25].
one adds rows to . A quadrilinear model can always be viewed as a trilinear
Fig. 2 illustrates the effect of adding an extra diversity dimen- model, i.e., in (8), let to obtain
sion. The CRB for a three-way array (Appendix A) with . This unfolding into a lower dimensional model
2078 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 49, NO. 9, SEPTEMBER 2001
Fig. 4. PARAFAC CRB: Model viewed as trilinear versus quadrilinear. Fig. 5. PARAFAC CRB: CDMA diversity-combining tradeoff.
Fig. 8. TALS performance vesus CRB. (a) A. (b) B. (c) C. (d) Average. Fig. 9. QALS performance versus CRB. (a) A. (b) B. (c) G. (d) H.
squares (LS) conditioned on previously obtained estimates for
the remaining matrices; proceed to update the other matrices;
repeat until convergence of the LS cost function (this is guaran-
teed for ALS algorithms, e.g., [25]). For example, quadrilinear
ALS (QALS) aims to
into play, and this explains the good behavior of ALS, which where , , and are the corresponding noise matrices.
aims for the ML solution. Observe that the gap between ALS Therefore, we can write the likelihood function of in three
and the CRB reduces quickly with increasing SNR (note that equivalent ways:
the plots are in log-log scale). The reasons for the nonzero gap
at finite SNR are as follows.
Practical experience [5] with trilinear ALS shows that the
global minimum is usually reached as long as the con-
stituent rank-one factors are not essentially collinear along
any dimension. Nevertheless, for finite SNR, there exist a
few inevitable bad runs, which hit a local minimum.
Even if the right minimum is reached, scale and, more im-
portantly, permutation matching may fail due to residual
noise. This effect is particularly pronounced at low SNR.
In practice, some prespecified tolerance threshold is used
where denotes the th column of and similarly for and
to terminate the ALS iteration. In certain situations, con-
.
vergence can be relatively slow, which may lead to prema-
As explained in Section III, a delicate point regarding the
ture termination.
CRB for the trilinear decomposition model is the inherent per-
Even though an analytical verification of these findings would mutation and scale ambiguity. To derive a meaningful CRB, we
be highly desirable from a theoretical standpoint, it seems to be assume that the first row of and is fixed (or normalized)
very difficult to carry through without making unrealistic as- to (this takes care of scale ambiguity) and further
sumptions, due to the iterative nature of ALS. The above Monte assume that the first row of is known and consists of distinct
Carlo results are perhaps sufficient from an applications view- elements (which subsequently resolves the permutation ambi-
point. guity). This way, the number of unknown complex parameters
is instead of . Similar to [3] and
VII. CONCLUSION [28], to simplify the CRB derivation, it is useful to introduce the
Low-rank decomposition of multidimensional arrays (in equivalent complex parameter vector
dimensions) has the remarkable property of uniqueness on
the basis of low-rank structure alone. It has found extensive
applications in diverse areas, including signal processing and
communications. This paper has contributed pertinent CRBs for Let us write the log-likelihood function as
low-rank decomposition of 3-D and 4-D arrays. Aside from per-
formance evaluation, these can be used as design tools for diver-
sity tradeoff studies. Easy-to-check necessary uniqueness con-
ditions for low-rank decomposition of multidimensional arrays
have also been developed. Finally, the performance of ALS al-
gorithms that are commonly used to compute low-rank decom-
position of multidimensional arrays has been assessed relative
to the respective CRBs. It was demonstrated by means of Monte
Carlo simulation that for typical problem sizes, ALS stays close
to what is theoretically achievable performance-wise. (19)
(16) (21b)
(17)
(21c)
(18)
2082 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 49, NO. 9, SEPTEMBER 2001
(21d)
(21e)
(21f)
where is the th unit coordinate vector. where the covariance matrix of and is given by
Although the computation of the FIM may look cumbersome, (25), shown at the bottom of the page. In (25), the row index and
some simplifications are possible. Note that although is column index of nonzero elements of the covariance matrix are
independent in three dimensions, i.e., indicated explicitly. and
can be obtained similarly.
(22) Furthermore, from (20) and (24), can be written as
(23)
with obvious notation. Let
(24) Then
CRB
..
.
..
. (25)
..
.
..
.
LIU AND SIDIROPOULOS: CRAMRRAO LOWER BOUNDS FOR LOW-RANK DECOMPOSITION 2083
(28)
for , , , ,
with . is i.i.d. (in , , , and )
zero mean, complex Gaussian noise of variance . Define ,
, , and as in Section II, and unfold in four equivalent
matrix representations:
(29)
(30)
(31)
The complex FIM is
(32)
(33)
Correspondingly, we can write the likelihood function in four
equivalent ways as
The partial derivatives of with respect to the unknown pa-
rameters are given by
(34)
(35)
whose size is .
where denotes the th column of and similarly for , The elements of can be computed as in
and . As mentioned in Section III, for the purpose of removing
sale and permutation ambiguity to obtain a meaningful bound,
we assume that the first row of , , , and are known.
Therefore, the unknown complex
parameter vector is (36a)
(36b)
2084 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 49, NO. 9, SEPTEMBER 2001
CRB
(36c)
(36d) APPENDIX C
CRB FOR LOW-RANK DECOMPOSITION OF 3-D ARRAYS WITH
VANDERMONDE STRUCTURE IN ONE DIMENSION
Consider the three-way array in (15), and assume
without loss of generality that is Vandermonde:
(36e)
(37)
..
.
(36f)
(36g)
The partial derivatives of with respect to and are
(36h)
APPENDIX D
CRB FOR REAL-PARAMETER LOW-RANK DECOMPOSITION OF
(36j) 3-D ARRAYS
The real case yields simpler CRB expressions, due to the fact
It is cumbersome to write out the covariance matrix of
that the noise covariance matrix can be written in convenient
and , but its computation can be done in the following
closed form, and an alternative scalar computation approach can
simple way. First, construct an auxiliary four-way array U and
be adopted. Consider the three-way array with
fill it with integers from 1 to as
typical element
Thus, the log-likelihood function is Then, it can be shown that the FIM in (38) is given by
(42)
(40) .. .. ..
. . . (45)
(41)
The CRB on the variance of any unbiased estimator is then
given by the inverse of (42), provided it exists.
From (39)(41), we can obtain
ACKNOWLEDGMENT
The authors would like to thank an anonymous reviewer for
constructive comments that helped improve the quality and
readability of this paper.
REFERENCES
[1] S. Basu and Y. Bresler, The stability of nonlinear least squares problems
and the CramrRao bound, IEEE Trans. Signal Processing, vol. 48,
pp. 34263436, Dec. 2000.
[2] J. Beltran, J. Guiteras, and R. Ferrer, Three-way multivariate calibration
procedures applied to high-performance liquid chromatography coupled
with fast-scanning spectrometry (HPLC-FSFS) detection. Determina-
tion of aromatic hydrocarbons in water samples, Anal. Chem., vol. 70,
pp. 19491955, 1998.
[3] A. van den Bos, A Cramr-Rao lower bound for complex parameters,
IEEE Trans. Signal Processing, vol. 42, p. 2859, Oct. 1994.
[4] J. W. Brewer, Kronecker products and matrix calculus in system
theory, IEEE Trans. Circuits Syst., vol. CAS-25, pp. 772781, Sept.
1978.
[5] R. Bro, PARAFAC: tutorial and applications, Chemometr. Intell. Lab.
Define the matrices Syst., vol. 38, pp. 149171, 1997.
[6] J. D. Carroll and J. Chang, Analysis of individual differences in mul-
N
tidimensional scaling via an -way generalization of Eckart-Young
decomposition, Psychometrika, vol. 35, pp. 283319, 1970.
[7] R. B. Cattell, Parallel proportional profiles and other principles for de-
termining the choice of factors by rotation, Psychometrika, vol. 9, pp.
267283, 1944.
[8] R. A. Harshman, Foundations of the PARAFAC procedure: Model and
conditions for an explanatory multi-mode factor analysis, UCLA
Working Papers in Phonetics, vol. 16, pp. 184, Dec. 1970.
[9] , Determination and proof of minimum uniqueness conditions for
PARAFAC1, UCLA Working Papers Phonetics, vol. 22, pp. 111117,
1972.
2086 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 49, NO. 9, SEPTEMBER 2001
[10] , PARAFAC2: mathematical and technical notes, UCLA Working [28] S. F. Yau and Y. Bresler, A compact CramrRao bound expression
Papers Phonetics, vol. 22, pp. 3344, 1972. for parametric estimation of superimposed signals, IEEE Trans. Signal
[11] R. A. Harshman and M. E. Lundy, The PARAFAC model for three-way Processing, vol. 40, pp. 12261229, May 1992.
factor analysis and multidimensional scaling, in Res. Methods Multi-
mode Data Anal., H. G. Law, C. W. Snyder, Jr, J. Hattie, and R. P. Mc-
Donald, Eds. New York: Praeger, 1984, pp. 122215.
[12] S. Kay, Fundamentals of Statistical Signal Processing. Upper Saddle
River, NJ: Prentice-Hall, 1993, vol. I, Estimation Theory.
[13] C. G. Khatri and C. R. Rao, Solutions to some functional equations
Xiangqian Liu received the B.S. degree from
and their applications to characterization of probability distributions,
Beijing Institute of Technology, Beijing, China,
Sankhya, Se. A, vol. 30, pp. 167180, 1968.
in 1997 and the M.S. degree from University of
[14] J. B. Kruskal, Three-way arrays: rank and uniqueness of trilinear de-
Virginia, Charlottesville, in 1999, both in electrical
compositions, with application to arithmetic complexity and statistics,
engineering. He is currently pursuing the Ph.D.
Lin. Algebra Applicat., vol. 18, pp. 95138, 1977.
[15] , Rank, decomposition, and uniqueness for 3-way and -way N degree with the Department of Electrical and
Computer Engineering, University of Minnesota,
arrays, in Multiway Data Analysis, R. Coppi and S. Bolasco,
Minneapolis.
Eds. Amsterdam, The Netherlands: North-Holland, 1988, pp. 718.
His research interests are in the area of signal pro-
[16] J. K. Lee, R. T. Ross, S. Thampi, and S. E. Leurgans, Resolution of
cessing for communications with focus on wireless
the properties of hydrogen-bonded tyrosine using a trilinear model of
multiuser communications.
fluorescence, J. Phys. Chem., vol. 96, pp. 91586192, 1992.
[17] X. Liu and N. Sidiropoulos, PARAFAC methods for blind beam-
forming: multilinear ALS performance and CRB, in Proc ICASSP,
Istanbul, Turkey, June 59, 2000.
[18] C. R. Rao, Minimum variance and the estimation of several parame-
ters, in Proc. Cambridge Phil. Soc., vol. 43, 1946, pp. 280283.
[19] E. Sanchez and B. R. Kowalski, Generalized rank annihilation factor Nicholas D. Sidiropoulos (M92SM99) received
analysis, Anal. Chem., vol. 58, pp. 496499, 1986. the Diploma in electrical engineering from the Aris-
[20] P. H. Schonemann, An algebraic solution for a class of subjective met- totelian University of Thessaloniki, Thessaloniki,
rics models, Psychometrika, vol. 37, p. 441, 1972. Greece, and the M.S. and Ph.D. degrees in electrical
[21] N. D. Sidiropoulos and R. Bro, On the uniqueness of multilinear de- engineering from the University of Maryland,
N
composition of -way arrays, J. Chem., vol. 14, pp. 229239, May College Park (UMCP), in 1988, 1990, and 1992,
2000. respectively.
[22] , On communication diversity for blind identifiability and the From 1988 to 1992, he was a Fulbright Fellow
N
uniqueness of low-rank decomposition of -way arrays, in Proc and a Research Assistant with the Institute for
ICASSP, Istanbul, Turkey, June 59, 2000. Systems Research (ISR), UMCP. From September
[23] , PARAFAC techniques for signal separation, in Signal Pro- 1992 to June 1994, he served his military service
cessing Advances in Wireless Communications, P. Stoica, G. Giannakis, as a Lecturer with the Hellenic Air Force Academy. From October 1993 to
Y. Hua, and L. Tong, Eds. Upper Saddle River, NJ: Prentice-Hall, June 1994, he was a Member of Technical Staff, Systems Integration Division,
2000, vol. 2, ch. 4. G-Systems Ltd., Athens, Greece. He has held Postdoctoral (from 1994 to
[24] N. D. Sidiropoulos, R. Bro, and G. B. Giannakis, Parallel factor analysis 1995) and Research Scientist (from 1996 to 1997) positions at ISR-UMCP,
in sensor array processing, IEEE Trans. Signal Processing, vol. 48, pp. before joining the Department of Electrical Engineering, University of Virginia
23772388, Aug. 2000. , Charlottesville, in July 1997 as an Assistant Professor. He is currently an
[25] N. D. Sidiropoulos, G. B. Giannakis, and R. Bro, Blind PARAFAC Associate Professor with the Department of Electrical and Computer Engi-
receivers for DS-CDMA systems, IEEE Trans. Signal Processing, vol. neering, University of Minnesota, Minneapolis. His current research interests
48, pp. 810823, Mar. 2000. are primarily in multi-way analysis and its applications in signal processing
[26] N. D. Sidiropoulos and X. Liu, Identifiability results for blind beam- for communications and networking.
forming in incoherent multipath with small delay spread, IEEE Trans. Dr. Sidiropoulos is a member of the Signal Processing for Communications
Signal Processing, vol. 49, pp. 228236, Jan. 2001. Technical Committee (SPCOM-TC) of the IEEE SP Society and currently
[27] P. Stoica and A. Nehorai, MUSIC, maximum likelihood, and serves as Associate Editor for IEEE TRANSACTIONS ON SIGNAL PROCESSING
Cramr-Rao bound, IEEE Trans. Acoust., Speech, Signal Processing, and the IEEE SIGNAL PROCESSING LETTERS. He received the NSF/CAREER
vol. 37, pp. 720741, May 1989. award for the Signal Processing Systems Program in June 1998.