Professional Documents
Culture Documents
IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 60, NO. 11, NOVEMBER 2014
I. I NTRODUCTION
0018-9448 2014 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
7113
7114
IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 60, NO. 11, NOVEMBER 2014
(1)
(2)
rUE = rUE hT Wh
(3)
(4)
7115
However, it cannot capture all practical characteristics of residual transceiver hardware impairments. Possible refinements,
and their respective implications on our analytical results and
observations, are outlined in Section VII.
C. Uplink System Model
The reciprocal UL channel is used for pilot-based channel estimation and data transmission; see Fig. 1 and
Sections IIIIV. Similar to (2), we consider a system model
with the received signal z C N at the BS being
z = h(d + tUE ) + rBS +
(6)
(7)
(8)
7116
IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 60, NO. 11, NOVEMBER 2014
(9)
A
H
MSE = tr R dAR d RA H + AZA
(12)
where the expectations that involve tUE , rBS in MSE =
2 } are computed by first having a fixed value of h and
E{hh
2
then average over h. The LMMSE estimator in (9) is achieved
by differentiation of (12) with respect to A and equating
to zero. This vector minimizes the MSE since the Hessian
is always positive definite. The error covariance matrix and
the MSE are obtained by plugging (9) into the respective
definitions.
Based on Theorem 1, the channel can be decomposed
as h = h + where h is the LMMSE estimate in (9)
and C N1 denotes the unknown estimation error.
Contrary to conventional estimation with independent
Gaussian noise (see [32, Ch. 15.8]), h and are neither
4 For example, the ith element of BS can be expressed as x = h ,
i
i i
r
which is the product of the ith channel coefficient h i CN (0, rii ) and an
BS
UE
independent variable i CN (0, r p ). The joint product
distribution
|x |
is complex double Gaussian with the PDF f (xi ) = 2 K 0 2 i , where
i
i = rii rBS pUE is the variance and K 0 () denotes the zeroth-order modified
Bessel function of the second kind [36].
independent nor jointly complex Gaussian, but only uncorrelated and have zero mean. The covariance matrices are
E{h h H } = RC and E{ H } = C where C was given in (11).
We remark that there might exist non-linear estimators
that achieve smaller MSEs than the LMMSE estimator in
Theorem 1. This stands in contrast to conventional channel estimation with independent Gaussian noise, where the
LMMSE estimator is also the MMSE estimator [34]. However,
the difference in MSE performance should be small, since the
dependent distortion noises are relatively weak.
Corollary 1: Consider the special case of R = I and
S = 0. The error covariance matrix in (11) becomes
pUE
I.
(13)
C = 1 UE
2
p (1 + tUE + rBS ) + BS
In the high UL power regime, we have
1
I.
lim C = 1
1 + tUE + rBS
p UE
(14)
7117
(15)
i=1
H
B
B
1
1
tr(C)
E
.
(16)
i
j
=
B
B
B
i=1
j =1
(r ) ,
i > j,
where is an arbitrary scaling factor. This model basically describes a uniform linear array (ULA) where the
correlation factor between adjacent antennas is given by |r |
(for 0 |r | 1) and the phase r describes the angle of
arrival/departure as seen from the array. The correlation factor
|r | determines the eigenvalue spread in R, while r determines
6 Since we have per-symbol energy constraints, what we really compare is
one system that has an average symbol energy of B pUE and one with pUE .
Fig. 3. Estimation error per antenna element for the LMMSE estimator
in Theorem 1 and the conventional impairment-ignoring MMSE estimator.
Transceiver hardware impairments create non-zero error floors.
tr(R)
.
2
NBS
(18)
7118
IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 60, NO. 11, NOVEMBER 2014
Fig. 4. Estimation error per antenna element for the LMMSE estimator in
Theorem 1 as a function of the pilot length B. The levels of impairments are
tUE = rBS = 0.052 and different temporal correlations are considered.
Fig. 5. Estimation error per antenna element for the LMMSE estimator in
Theorem 1 as a function of the number of BS antennas. Four different channel
covariance models are considered and tUE = rBS = 0.052 .
7119
DL capacity is
C
DL
T DL
= data E
Tcoher
max
f (s|HBS ) : E{s22 } p BS
I(s; y|H, H , H
BS
UE
)
(20)
7120
IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 60, NO. 11, NOVEMBER 2014
DL
Tdata
Tcoher
E
log2 1 + h H tBS D|h|2
+ rUE hh H
UL
T UL
data E
Tcoher
2 1
+ UE
I
h
pBS
2 1
+ BS
I
h
pUE
(22)
(23)
1 h
(tBS D|h|2 + pUE
BS I)
=
2
1
tBS D|h|2 + UE
I h 2
p BS
(24)
UL
wupper
=
( BS D|h|2 +
r
2
BS
I)1 h
p UE
2
BS
I)1 h2
p UE
(25)
in the uplink.8
Proof: The proof is given in Appendix C-A.
Note that the beamforming vector in (24) and receive
combining vector in (25) only depend on the channel vector h,
hardware impairments at the BS, and the receiver noise.
DL
Hardware impairments at the UE have no impact on wupper
UL
and wupper since their distortion noise essentially act as an
interferer with the same channel as the data signal; thus
filtering cannot reduce it.
The bounds in Lemma 1 are not amenable to simple
analysis, but the lemma enables us to derive further bounds
on the channel capacities that are expressed in closed form.
Theorem 2: The downlink and uplink capacities in
(20) and (21), respectively, are bounded as
DL
Tdata
G DL
log2 1 +
Tcoher
1 + rUE G DL
UL
Tdata
G UL
=
log2 1 +
Tcoher
1 + tUE G UL
N
i=1
DL
wupper
G UL =
log2 1 + h H tUE hh H
+ rBS D|h|2
2
UE
BS
N
BS
2
2
p t rii
1
UE
1 UE e
, (28)
G DL =
E
1
BS
BS
BS
BS r
BS r
p
p
ii
ii
t
t
t
i=1
CDL CDL
upper =
(26)
CUL CUL
upper
(27)
1
1
BS
r
2
BS
E1
2
BS
pUE rBSrii
, (29)
$ t x
and E 1 (x) = 1 e t dt denotes the exponential integral.
Proof: The proof is given in Appendix C-B.
These closed-form upper bounds provide important insights
on the achievable DL and UL performance under transceiver
hardware impairments. In particular, the following two corollaries provide some ultimate capacity limits in the asymptotic
regimes of many BS antennas or large transmit powers.
Corollary 2: The downlink upper capacity bound in (26)
has the following asymptotic properties:
DL
Tdata
N
DL
log2 1 + BS
(30)
lim Cupper =
Tcoher
t + rUE N
p BS
DL
Tdata
1
1
+
.
(31)
=
log
lim CDL
2
upper
N
Tcoher
rUE
Proof: The diagonal elements
rii > 0 i ,
% N 1of R satisfy
N
by definition, thus G DL i=1
=
as
pBS
BS
BS
t
t
for fixed N, giving (30). The positive diagonal elements also
DL
G DL
implies N1 G DL > 0 as N , thus 1+GUE G DL UE
DL 0
r
r G
as N which turns (26) into (31).
This corollary shows that the DL capacity has finite ceilings
when either the DL transmit power pBS or the number of BS
antennas N grow large. The ceilings depend on the impairment
parameters tBS and rUE , but the UE impairments are clearly
N times more influential. Note that even very small hardware
impairments will ultimately limit the capacity. In other words,
the ever-increasing capacity observed in the high-SNR and
large-N regimes with ideal transceiver hardware (see [9][13])
is not easily achieved in practice.
The next corollary provides analogous results for the UL.
Corollary 3: The uplink upper capacity bound in (27) has
the following asymptotic properties:
UL
Tdata
N
=
log
(32)
1
+
lim CUL
2
upper
Tcoher
rBS + tUE N
p UE
UL
Tdata
1
UL
(33)
lim Cupper =
log2 1 + UE .
N
Tcoher
t
Proof: This is proved analogously to Corollary 2.
As seen from Corollary 3, the UL capacity also has finite
ceilings when either the UL transmit power pUE or the
number of antennas N grow large. Analogous to the DL, the
UE impairments are N times more influential than the BS
impairments and thus dominate as N .
The upper bounds in Corollaries 2 and 3 show that the
DL and UL capacities are fundamentally limited by the
transceiver hardware impairments. To be certain of the cause
of these limits, we also need lower bounds on the channel
capacities.
7121
N
CDL CDL
lower =
CUL CUL
lower
T
where the beamforming vector vDL = [v 1DL . . . v DL
K r ] and the
T
receive combining vector vUL = [v 1UL . . . v UL
K r ] are functions
(38)
E{h H v} = |E {}|2 tr(R C) + O( N )
'
&
'
&
E{|h i |2 |v i |2 } = O(1)
(40)
(1 + d 1 tUE ) tr(R C)
=
tr A(|d + tUE |2 R + )A H
(41)
i=1
where
DL
Tdata
Tcoher
|E {}|2 +O 1
N
log2 1+
)
(
1
(1+rUE )E ||2 |E {}|2 +O 1 + N 1n
N
(42)
1
where is given in (41). The terms O 1 and O N 1n
N
vanish when N , while the other terms are strictly
positive in the limit.
Proof: The expression (42) is obtained from (34) by
plugging in the expressions in Theorem 4 and multiplying each
1
= pUE tr(R1 Z 1 R) = O(N 1 ). The interference
term by tr(RC)
UE }
E{IH
1
.
term becomes pBS tr(RC)
= O N 1n
Combining the upper bound in Corollary 2 with the lower
bound in Corollary 4, we have a clear characterization of
the DL capacity behavior when N . Both bounds
are independent of tBS in the limit, thus the transmitter
hardware of the BS plays little role in massive MIMO systems.
Contrary to the upper bound, the level of receiver hardware
impairments at the BS (rBS ) is present in the lower bound (42),
through A and in . However, the numerical results in
Section IV-C reveal that the asymptotic impact of BS impairments is negligible also in the lower bound. This can also be
11 We stress that the assumption in Theorem 4 that decoding is performed
without instantaneous CSI is only made to enable closed-form lower bounds.
The BS should certainly exploit the channel estimate h and the UE might
receive a downlink pilot signal that enables estimation of the effective channel
h H vDL . While this is relatively easy to handle with ideal hardware, where the
channel estimate and estimation error are independent (see [12]), the extension
to non-ideal hardware seems intractable due the statistical dependence between
the channel estimate and estimation error.
7122
IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 60, NO. 11, NOVEMBER 2014
2
E{h H vDL |H UE }
DL
DL
SINRlower (v ) =
2
&
'
N
2
%
E{I UE |H UE } UE
(1+rUE)E |h H vDL |2 |H UE E{h H vDL |H UE } +tBS E{|h i |2 |v iDL |2 |H UE }+ HpBS
+ pBS
(36)
i=1
2
E{h H vUL |H BS }
UL
UL
&
'
SINRlower (v ) =
2
'
&
2 I)vUL |H
BS
N
E (vUL ) H (QH +BS
%
UE
UL
H
UL
2
BS
H
UL
BS
BS
2
2
BS
E{h v |H } +r
(1+t )E |h v | |H
E{|h i | |v i | |H }+
p UE
i=1
(37)
DL
Tdata
1
=
log
lim CDL
1
+
.
(43)
2
lower
N
Tcoher
rUE
In this special case, the lower bound actually approaches the
upper bound in (31) asymptotically, and any DL capacity can
be achieved by making rUE sufficiently small. The opposite
is not true; setting rBS = 0 will not make the impact of
UE impairments vanish. We therefore conclude that the DL
capacity limit is mainly determined by the level of impairments
at the UE, both in the uplink estimation (tUE ) and the
downlink transmission (rUE )although the former connection
was not visible in the upper bound since it was based on
perfect CSI.
For the uplink, we have the following similar asymptotic
capacity characterization.
Corollary 5: Consider the UL with receive combining vec
tor v = h and H BS = . If E{QH 2 } O(N n ) for some
h2
n < 1, the lower capacity bound in (35) can be expressed as
CUL
UL
Tdata
Tcoher
|E {}|2 +O 1
N
log2 1+
)
(
1
1
2
UE
2
(1+t )E || |E {}| +O
+ N 1n
N
(44)
and O N 1n
where is given in (41). The terms O
vanish when N , while the other terms are strictly
positive in the limit.
Proof: The expression (44) is obtained from (35) by
plugging in the expressions from Theorem 4 and multiplying
1
= pUE tr(R1 Z 1 R) = O(N 1 ). The
each term by tr(RC)
{v H QH v}
1
interference term becomes pEUE
=
O
.
tr(RC)
N 1n
The upper bound in Corollary 3 and the lower bound in
Corollary 5 provide a joint characterization of the uplink
capacity when N grows large. The UE impairments manifest
the behavior in both bounds; the BS impairments are present
in (42) since depends on A and , but their impact vanish
when tUE 0. By making tUE appropriately small, we can
thus achieve any UL capacity as N grows large. We therefore
conclude that it is of main importance to have high quality
hardware at the UE, which is analog to our observations
for the DL. These observations are illustrated numerically
1
N
UE
7123
Fig. 7. Lower and upper bounds on the capacity for UE = 0.052 . The
impact of hardware impairments at the BS vanishes asymptotically.
Fig. 6. Lower and upper bounds on the capacity. Hardware impairments have
a fundamental impact on the asymptotic behavior as N grows large. (a) SNR:
20 dB. (b) SNR: 0 dB.
T UL
data
data
the UE.12 Furthermore, we assume Tcoher
= Tcoher
= 0.45,
which are the percentage of DL data and UL data. These
assumptions make the bounds for the DL and UL capacities
become identical, thus we can simulate the DL and UL
simultaneously.
Fig. 6 considers a spatially uncorrelated scenario with
R = I for different levels of impairments: tUE = rBS
{0, 0.052 , 0.152 }. The meaning of these parameter values was
discussed in Remark 1. Simulation results are given for SNRs
of 20 dB and 0 dB. The capacity with ideal hardware grows
without bound as N , while the lower and upper bounds
converge to finite limits under transceiver hardware impairments. The main difference between the two SNR values is
the convergence speed, while the upper bounds are exactly the
same and the lower bounds are approximately the same. Recall
that these bounds hold under any CSI HBS at the BS and HUE
at the UE; the lower bounds represent no instantaneous CSI in
the decoding step and the upper bounds represent perfect CSI.
Although the gap between these extremes is large for ideal
hardware, the difference is remarkably small under non-ideal
hardware due to the finite capacity limit (caused by distortion
noise) and the channel hardening that makes stochastic inner
product such as h H v become increasingly deterministic as N
grows large. Since a main difference between the lower and
upper bounds is the quality of the CSI, the small difference
shows that the estimation errors have only a minor impact
on the capacity; hence, the estimation error floors described
in Section III has no dominating impact in the large-N
regime.
The asymptotic capacity limits in Fig. 6 are characterized
by the level of impairments, thus the hardware quality has
a fundamental impact on the achievable spectral efficiency.
If the SNRs are sufficiently high (e.g., 20 dB), the majority
of the multi-antenna gain is achieved at relatively low N;
in particular, only minor improvements can be achieved by
having more than N = 100 antennas. Larger numbers are,
however, useful for inter-user interference suppression and
multiplexing; see Section VI. We need many more antennas
to achieve convergence at 0 dB SNR than at 20 dB, because
a 100 times larger array gain is required to compensate for
the lower SNR. Hence, we conclude that the massive MIMO
gains are much more attractive at higher SNRs (which matches
well with the results in Section III where 2030 dB SNR was
needed to achieve a close-to-perfect channel estimate). Therefore, we only consider an SNR of 20 dB it the remainder of this
section.
Fig. 7 considers the same scenario as in Fig. 6 but with
a fixed level of impairments UE = 0.052 at the UE and
different values at the BS. As expected from the analysis, the
lower and upper capacity bounds increase with BS , but the
difference is only visible at small N since the curves converge
to virtually the same value as N . This validates that
the impact of impairments at the BS vanishes as N grows
large.
Finally, we consider the capacity behavior for different
channel covariance models, namely the four propagation
scenarios described in Section III-B. The lower and upper
7124
IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 60, NO. 11, NOVEMBER 2014
Downlink power
UL
Tpilot
pBS
pUE
+
+
Tcoher BS
Tcoher UE
DL
Tpilot
UL
Tdata
pUE
Tcoher UE
Uplink power
(46)
where the ratios of DL and UL transmission are, respectively,
Fig. 8. Lower and upper bounds on the capacity as a function of the number
of BS antennas. Four different channel covariance models are considered and
the hardware impairments are characterized by BS = UE = 0.05.
UE
p BS
UL
UL p
+(T
+T
)
[Joule] (45)
pilot
data
BS
UE
DL =
UL =
DL
Tdata
DL + T UL
Tdata
data
UL
Tdata
DL + T UL
Tdata
data
(47)
.
(48)
EEDL =
DL
CDL
DL
Tpilot
p BS
Tcoher BS
DL
Tdata
+ N + + Tcoher
T UL p UE
UE
pilot
+ Tcoher
p BS
BS
(49)
and the uplink energy efficiency is
EEUL =
UL
CUL
DL
Tpilot
p BS
Tcoher BS
UL
Tpilot
p UE
+ Tcoher
UE
+ N +
UL
Tdata
p UE
+ Tcoher
UE
(50)
The EE of any suboptimal transmission scheme is obtained by
replacing the capacities CDL and CUL with the corresponding
achievable spectral efficiencies.
13 The efficiency of a specific amplifier depends on the transmit power, but
to facilitate analysis we assume that the amplifier is optimized jointly with the
transmit power to give a specific efficiency at the particular power level. The
efficiency also depends on the PAPR and acceptable distortion noise, which
are two properties that we also keep fixed when optimizing the EE.
14 This term can also model the overhead power consumption of the network
as a whole, which enables comparison of network architectures with different
BS density, amounts of backhaul signaling, etc.
DL
Tdata
1
. (51)
log2 1 + UE
Tcoher
r + tUE + rUE tUE
DL
UL
Tdata
1
.
log2 1 + UE
Tcoher
2t + (tUE )2
(52)
p BS , p UE 0
DL
DL
DL
DL
T
T
data
log2 1 +
1
2tUE +(tUE )2
Tcoher
UL UL
Tdata
N0
max EEUL
p UE ,N0
data
log2 1 +
(53)
tUE
Tcoher
UL UL
Tdata
(54)
where the lower bounds are achieved as N using the
power scaling law in Theorem 5.
7125
If > 0, the upper bounds in (53) and (54) are still valid,
but the asymptotic EEs are
lim
max
N p BS , p UE 0
(55)
7126
IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 60, NO. 11, NOVEMBER 2014
DL
Tpilot
Tcoher
Fig. 9. Achievable energy efficiency with ideal and non-ideal hardware for
fixed transmit power (t = 0), transmit power that decreases as 1/N t for t = 12 ,
and the transmit power that maximizes the EE. The EE is computed using the
lower bounds in Theorem 3 and are valid for both DL and UL transmissions.
= 0.05.
Fig. 9 shows the achievable DL/UL energy efficiencies
using the lower capacity bounds in Theorem 3. The levels of
impairments are set to tBS = rBS = tUE = rUE = 0.052 ; see
Remark 1 for the interpretation of these parameter values. The
transmit powers are either optimized numerically for maximal
EE at each N, fixed at the value that is optimal for N = 1, or
reduced from this value according to the power scaling law in
Theorem 5 with t = 12 . Fig. 9 shows that the EE is almost the
same in all three cases, but varies a lot with the circuit power
parameters and . If = 0, the EE increases monotonically
with N and eventually converge according to (53) and (54) in
Corollary 6. On the contrary, the EE has a unique maximum
when > 0 and then decreases towards zero. The maximum
is in the range of 5 N 50 in the figure, but the exact
is
position depends on the circuit power parameters. If +
15 As a reference, these numbers correspond to 18 W and 0.18 W, respectively, for a system with an effective bandwidth of 9 MHz, since then there
are 9 106 channel uses per second.
16 As a reference, this number corresponds to 200 mW, or 23 dBm, for a
system with an effective bandwidth of 9 MHz.
Fig. 10.
7127
17 Only one pilot channel use is allocated per UE in this section. For other
pilot lengths B > 1, one can construct up to B parallel pilot signals that
are orthogonal in space. This increases the pilot power per UE, but does not
increase the total number of orthogonal pilots.
7128
IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 60, NO. 11, NOVEMBER 2014
The following
theorem 'shows how interference terms of the
&
H
UL
type E |hl v |2 |H BS in (60) behave as N grows large.
Theorem 6: Assume that no instantaneous CSI is utilized
for decoding (i.e., H BS = H UE = ), interference is treated
O(1),
l U ,
where tUE is stochastic and A, are given in Theorem 4.
The lower capacity bound in (44) is generalized by replac1
) in the denominator by
ing the term O( N 1n
tr(ARl )
2
1
tr(R C)
+O
E
.
UE
2
H
tr(AR)
N
tr A(|d + t | R + )A
lU
(62)
Proof: The proof is given in Appendix C-F.
This theorem shows that the effective interference from a
co-user depends strongly on whether it interfered with the pilot
transmission of the UE under study or not. The interference
from co-users in U , which were silent when the UE under
study sent its pilot, vanishes asymptotically since the user
channels decorrelate with N. This is the classical type of
interference and is called regular interference in this section.
In contrast, the interference from co-users in U , which were
vlDL
where
is the beamforming vector for DL transmission to
UE l U from its (arbitrary) serving BS and h l is the channel
from that BS to the UE under study. For brevity, we will not
dive into the details since these require assumptions on the
decision making at other BSs. The general behavior is however
the same: UEs with parallel UL pilots cause non-vanishing
interference to each other in the DL, while the impact of
all other interfering DL transmissions vanish as N grows
large.
C. Numerical Illustrations
The impact of inter-user interference and pilot contamination on multi-cell systems with non-ideal hardware is now
studied numerically. We consider UL scenarios with spatially
tr(R)
uncorrelated channels, define the average SNR as pUE N
2 ,
UL
Tdata
Tcoher
BS
7129
tUE
tr(ARl )
2
lU
tr(AR)
(64)
7130
IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 60, NO. 11, NOVEMBER 2014
Fig. 14. Achievable UL spectral efficiency for an average user in the multicell scenario depicted in Fig. 13. Each UE has either a unique pilot signal
or the same pilots are reused in every cell. Pilot contamination degrades the
spectral efficiency under ideal hardware, while the impact on a system with
hardware impairments (tUE = rBS = 0.12 ) is negligible.
7131
(65)
v diag(e
j 1,t
,...,e
j N,t
)hd
(66)
N
N
i1 =1 i2 =1
UE
H
= p
if CO,
(67)
if SO,
7132
IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 60, NO. 11, NOVEMBER 2014
where the first term originates from the UE and the second
term is due to the BS having either a CO or SOs at each
antenna. Recall from Theorem 4 that E{|v H h|2 } = O(N)
and E{|h i |2 |v i |2 } = O(1). This means that a BS with a CO
causes distortion that scales as O(t N), while it only scales
as O(t) when having SOs.20 In other words, it appears to
be preferable to have independent oscillators at each antenna
element in massive MIMO systems, which was also noted
in [25]. This is also consistent with our results for additive
distortion noise: impairments at the UE are N times more
influential on the capacity, thus we can degrade the quality
of the BSs oscillators with N and only get a minor loss
in performance. This property is also positive for distributed
massive MIMO deployments where the antenna separation is
large and prevents the use of a CO. A major difference from
the additive distortion noise in Section II is that the distortion
in (67) increases linearly with the time t, thus it eventually
grows large and it becomes necessary to send pilot/calibration
signals more often to mitigate it [25].
The narrowband phase-noise analysis above only considered
a lower capacity bound, thus it is possible to achieve higher
rates. In particular, the analysis assumed uncompensated freerunning oscillators, while it might be better to track the phase
noise process at the BS; for example, by using previous
received signals, extra calibration signals (see [64] and references therein), and utilizing correlation between subcarriers
in multi-carrier systems. The tracking might be more accurate
for a CO since there are O(N) observations of a single
phase drift parameter, instead of O(N) observations for N
parameters as with SOs. Another important
aspectof phase
uplink pilot transmissions. The estimates are naturally imperfect, thus bi = c(bi + ei ) where ei is the estimation error and
c is the unknown common scaling factor.
Imperfect channel reciprocity has no impact on the UL
and is not expected to change anything fundamentally in
the DL. There is a loss in received signal power since the
beamforming direction is perturbed, but there is no extra
self-interference since all the CSI available at the receiving
UE is estimated in the downlink and thus reflects the actual
downlink channel hT Db . In other words, the lower capacity
bound in (34) is still valid if we replace h by Db h everywhere
and compute the expectations with respect to the actual
distributions. The beamforming vector vDL is now a function
This perturbation of vDL , as compared
of diag(b1 , . . . , b N )h.
to having perfect reciprocity, behaves like a channel estimation
error and its impact is expected to vanish as N grows large.
Moreover, it should only have a minor impact on the interuser interference in multi-cell scenarios, since the reciprocity
calibration errors are independent of the co-user channels.
VIII. C ONCLUSION
This paper analyzed the capacity and estimation accuracy of
massive MIMO systems with non-ideal transceiver hardware.
The analysis was based on a new system model that models
the hardware impairment at each antenna by an additive
distortion noise that is proportional to the signal power at
this antenna. This model has several attractive features: it is
mathematically tractable, it has been verified experimentally
in previous works, and it can be motivated theoretically in
systems that apply compensation algorithms to mitigate the
hardware impairments.
We proved analytically that hardware impairments create
non-zero estimation error floors and finite capacity ceilings
in the uplink and downlinkirrespective of the SNR and the
number of base station antennas N. This stands in contrast to
the very optimistic asymptotic results previously reported for
ideal hardware. Despite these discouraging results, we showed
that massive MIMO systems can still achieve a huge array
gain, in the sense that relatively high spectral efficiency and
energy efficiency can be obtained. Furthermore, we proved that
only the hardware impairments at the UEs limit the capacities
as N grows large. This implies that the hardware quality at
the BS can be decreased as N grows, which is an important
insight and might become a key enabler for future network
deployments.
In multi-cell scenarios, we proved that the detrimental effect
of inter-user interference and pilot contamination drowns in
the distortion noise if a simple pilot allocation algorithm
is used to avoid the strongest forms of pilot contaminated
interference. Many quantitative conclusions can be drawn from
the numerical results in Sections IIIVI; for example, that there
is little gain in having more than 100 antennas for a singleuser link, but additional antennas are useful to suppress interuser interference in multi-cell scenarios. The asymptotic limits
under non-ideal hardware are generally reached at much fewer
antennas than the asymptotic limits for ideal hardware, which
implies that we can expect practical systems to benefit from
B1 x
.
1 + x H B1 x
(68)
b
|h|2
b
1
b
E1
E
(69)
=
1
e ar
2
a|h| + b
a
ar
ar
$ t x
where E 1 (x) = 1 e t dt denotes the exponential integral.
Proof: Since = |h|2 has the exponential distribution
with mean value r , the expectation in (69) equals
*
*
b b
1 b
e/r
d = 2 e ar
1
e ar d x (70)
a + b r
a r
x
0
1
where the equality follows from a change of variable x =
a
b + 1. Straightforward integration and identification of the
exponential integral yield the right-hand side of (69).
Lemma 4: For any a, b C and non-zero c, d C, we
have
a
b |b| |c d| + |a b| .
(71)
c
d
|c| |d|
|c|
Proof: This is straightforward to prove by using that
|ad bc| = |ad bc + bd bd| |b| |c d| + |d| |a b|.
Lemma 5: Consider M arbitrary matrices M1 , . . . , M M
C NN and an Hermitian positive semi-definite matrix B
C NN . It follows that
|tr(M1 M M B)| tr(B)
M
+
Mi 2 .
(72)
i=1
(73)
Proof: The bound in (72) follows from that B has nonnegative eigenvalues and each matrix Mi cannot amplify these
by more than Mi 2 . The O(N)-scaling follows from (72) by
using the assumptions and tr(B) NB2 .
Lemma 6 [71, Lemma B.26]: Let B C NN be deterministic and x = [x 1 . . . x N ]T C N be a stochastic vector
of independent entries. Assume that E{x i } = 0, E{|x i |2 } = 1,
and E{|x i | } = < for 2q. Then, for any q 1,
q '
&
q q
2
E x H Bx tr(B) Cq tr(BB H )
42 + 2q
where Cq is a constant depending on q only.
(74)
7133
A PPENDIX B
A PPLICATION -R ELATED R ANDOM V ECTOR R ESULTS
Lemma 7: The channel estimate h can be decomposed as
(75)
h = A (d + tUE )I + Dr h +
where A is defined in (9) and the diagonal matrix Dr has
independent CN (0, rBS pUE )-entries such that rBS = Dr h.
For any realizations of tUE and Dr , the conditional distribution is
2
tUE , Dr CN 0, A( + S + BS
h|
(76)
I)A H
where = ((d + tUE )I + Dr )R((d + tUE )I + Dr ) H .
Proof:
This characterization follows directly from
Theorem 1 and the system model defined in Section II.
Lemma 8: For the channel h and its estimate h it holds that
2
(77)
E h H h (1 + d 1 tUE )tr(R C) = O(N)
2
|1 + d 1 tUE |tr(R C) = O(N). (78)
E |h H h|
Proof: Recall that h = A h(d + tUE ) + + rBS for
A = d RZ 1 . To prove (77), we expand the argument as
2
H
h h (1 + d 1 tUE )tr(R C) 4|h H A|2
2
+ 4|h H ArBS |2 +2|d +tUE|2 h H Ahd 1 tr(R C) (79)
by using the rule |a + b|q 2q1 (|a|q + |b|q ) (from Hlders
inequality) twice. Next, we observe that
2
E{|h H A|2 } = tr A(S + BS
I)A H R = O(N)
(80)
E{|h H ArBS |2 } = rBS pUE tr ARdiag A H R
+ rBS pUE
N
i=1
tr
A(|d
+
|
R+)A
= O(N) (83)
2
t
,
2
UE
E h2 tr A(|d +t |2 R+)A H = O(1) (84)
2 I.
where A is defined in (9) and = pUE rBS Rdiag + S + BS
7134
IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 60, NO. 11, NOVEMBER 2014
2 I)A H )
Proof: By injecting the term tr(A( + S + BS
that appeared in Lemma 7 and using the rule |a + b|q
2q1 (|a|q + |b|q ) (from Hlders inequality), we bound the
left-hand side of (83) as
2
2
UE 2
H
E h2 tr A(|d +t | R+)A
2
2
2
H
2E h2 tr A( + S + BS I)A
2
I)A H
+2E tr A( + S + BS
2
tr A(|d +tUE |2 R+)A H .
(85)
tr(A(
+
S
+
I)A
)
E h
2
BS
&
'
2
2
4C2 E tr(A( + S + BS
I)A H A( + S + BS
I) H A H )
'
&
2
(86)
4C2 A42 E + S + BS
I2F = O(N)
where the first inequality follows from applying Lemma 6
on (83) for fixed tUE , Dr (note that the fourth-order moment
is 4 = 2 for complex Gaussian variables), while the second inequality follows from applying Lemma 5 twice. The
2 is constant, A = O(1),
scaling O(N) follows since BS
2
2
S F = O(N), E{tr(S)} R2 S2 E{(d + tUE )I +
Dr 2F } = O(N), and E{tr( H )} R2 E{((d + tUE )I +
Dr )((d + tUE )I + Dr ) H 2F } = O(N) using Lemma 5.
Next, we characterize the second term in (85) as
2
2
H
UE 2
H
E tr A( + S + BS I)A tr A(|d +t | R+)A
= E tr A Dr RDrH + Dr R(d + tUE )
2
+ (d + tUE )RDrH A H p UE rBS tr ARdiag A H
2
2E tr A(Dr RDrH p UE rBS Rdiag )A H
&
'
2
UE 2
H
+ 4E |d + t | E tr ADr RA = O(N). (87)
where the equality follows from plugging in the expressions
for and and noting that the terms |d +tUE |2 tr(ARA H ),
2 tr(AA H ) cancel out. The inequality follows
tr(ASA H ), and BS
again from the rule |a + b|q 2q1 (|a|q + |b|q ). The
O(N)-scaling follows since the first term in (87) is upper
bounded by ( pUE rBS )2 tr(ARdiag A H ARdiag A H ) = O(N)
using Lemma 6 and some algebra, while E{|tr(ADr RA H )|2 }
RA H A22 E{|tr(Dr )|2 } = O(N) using Lemma 5. The expression (83) now follows from combining (85)(87).
2
UE
2
H
tr A(|d +t | R+)A
h
2
E
,
2
h
2
2
UE |2 R+)A H
E h
tr
A(|d
+
t
2
= O(1) (88)
tr AA H
where the first inequality follows from the rule (a b)(a +
b) = a 2 b2 and the second inequality is due to removal
of non-zero terms from the denominator. The numerator
scales as O(N) and the denominator
scales at least lin
early with N because tr AA H min ()tr AA H
E
= O(N 1 ) (89)
+
2
(B)(N
1)
h
B
min
2
k
1
UE
k
2k + 2k k2 !(tUE ) 2
|1+d t |
E
+
= O(N 2 )
2
4
(B)
(N
1)(N
2)
h
B
B
min
2
(90)
for any even integer k, where +
min (B) > 0 denotes the smallest
2
non-zero eigenvalue of B = BS AA H and N B = rank(B).
Proof: Using the conditional distribution of the channel
estimate in (76), it holds for any integer q > 0 that
|1 + d 1 tUE |k UE
|1 + d 1 tUE |k
=E E
E
t , Dr
2q
2q
h
h
2
2
2k (1 + |d 1 tUE |k )
1
E
E
(91)
2
2q
H q
+
U BH v2
min (BS AA )
2 = v H A( + S +
where the inequality follows from h
2
+
2
H
2
H
H
2
BS I)A v BS v AA v min (BS AA H )U BH v22 where
v CN (0, I) and U B C NN B is an orthogonal basis of the
span of AA H (note that this matrix is generally rank-deficient).
The expectations were separated since U BH v22 is independent
of the smallest non-zero eigenvalue. Furthermore, the rule
|a + b|q 2q (|a|q + |b|q ) from Hlders inequality was
applied on |1 + d 1 tUE |k . Note that U BH v CN (0, I) with a
dimension reduced from N to N B .
The final result in (89)(90) follows from E{|d 1 tUE |k } =
k UE k
2
2 !(t ) and that [72, Lemma 2.10] with m = 1 and n =
N B gives
1
,
if q = 1,
1
= N B 1 1
E
(92)
2q
H
,
if
q = 2,
U v2
(N B 1)(N B 2)
B
A PPENDIX C
C OLLECTION OF P ROOFS
A. Proof of Lemma 1
The DL capacity in (20) is upper bounded as
T DL
CDL data E
max
log2 (1 + SINR(w))
w(h) : w2 =1
Tcoher
(93)
7135
I
h
E {} = E h H tBS D|h|2 + UE
pBS
|h i |2
= G DL
E
(100)
=
2
BS
UE
2
i=1 t |h i | + BS
p
where
|hT w|2
SINR(w) =
N
%
tBS
i=1
2
UE
p BS
(94)
by assuming that the interference part of n is somehow canceled, perfect CSI is available, and exploiting the corresponding optimality of single-stream Gaussian signaling [2], [6].
We can write (94) as
SINR(w) =
w H h h T w
w H tBS D|h|2 + rUE h hT +
2
UE
I w
p BS
(95)
1 h
(tBS D|h|2 + rUE h hT + pUE
BS I)
w =
2
1
tBS D 2 + UE h hT + UE
I
h 2
BS
|h|
r
p
(96)
2
BS
I w
p UE
(97)
2
1
UE
H BS D
DL
h
+
I
h
2
|h|
t
Tdata
p BS
E log2 1+
2
1 (98)
Tcoher
DL
Tdata
T DL
E {m()} data m(E {}).
Tcoher
Tcoher
(99)
C. Proof of Theorem 4
We introduce the notation
tr(R C) =
= (1 + d 1 tUE )tr(R C)
= tr A(|d + tUE |2 R + )A H .
where
(101)
(102)
E h h E
2
h
h H h
h H h
= E
E
+
2
2
h
h
h H h
h H h
E
E
+ E . (103)
h
2
2
h
In order to prove the first part of the theorem,
we must
show that right-hand side of (103) behaves as O( N ). Using
Cauchy-Schwartz inequality and that tr(AA H ), we have
h H h
E
+ E
2
h
E{h2 } + E
= O( N ) (104)
tr AA H
h H h
h
E
E
h
h
2
2
2
|1 + d 1 tUE | h
+ tr(R C)E
. (105)
2
h
7136
IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 60, NO. 11, NOVEMBER 2014
(b)
= O (N 1 )
=O (1)
(107)
where Hlders inequality was used to separate the expectations. The scaling of the first square root follows from
1
1 ) for any
Lemma 10 and that 1 tr(AA
H ) = O(N
realization of tUE . The scaling of the second square root
follows from Lemma 9. By plugging these scaling expressions
into (103), we have proved (38).
Similarly, the equivalence in (39) follows if
|h H h|
2 |1 + d 1 tUE |2 (tr(R C))2
E
h
2
2
1 UE 2 2
2 |1 + d t | h
2
tr(R C) E
2
h
2
2 |1 + d 1 tUE |2 (tr(R C))2
|h H h|
(108)
+E
2
h
2
=O N
/
2
/
H
4
/
tr(AA ) /
h
0
2
0
(a)
= O (N)
(b)
= O (N 2 )
(c)
= O (N)
(109)
1
1 ) which is a
where (a) follows from 1 tr(AA
H ) = O(N
deterministic upper bound, (b) is characterized by Lemma 10,
and (c) follows from Lemma 9. The second term behaves as
.
2
/
/ H
1 UE
/E |h h| |1 + d t |tr(R C)
/
0
(d)
= O (N)
.
/
2
/
2|h H h|
2|1 + d 1 tUE |2 (tr(R C))2
/
+E
/E
4
4
h
h
/
2
0
2
= O( N )
(e)
= O (1)
(f)
= O (1)
(110)
h2
2 have same
by Cauchy-Schwartz inequality and that h22 , h
2
asymptotic scaling. By plugging these scaling expressions into
(108), we have proved (39).
Finally, the equivalence in (40) follows since
N
N
2
2
h
(a)
2
2
2 |h i |
4
E{|h i | |v i | } 0 = E
|h i |
2
2
h
h
i=1
2 i=1
.
.
/ N
/ N
/ |h i |4
2 /
(b) h
40
E
|h i |4 0
2
4
h
h
2
4
i=1
i=1
2
h
4
= E
h24
2
h
2
.
/,
'
&
(c) /
)
(
1
(d)
8 E
0 E h84 E h
= O(1)
4
4
h
(111)
|h i |2
2
h
2
and by inserting
[|h 1 | . . . |h N | ] /h
4
Cauchy-Schwarz inequality in (b) to bound the sum by h24 .
Next, (c) is obtained by applying Hlders inequality twice
8
2
8
and (d) follows
' that E{h4 } = O(N ) and E{h4 } =
& from
1
2
2
O(N ) and E 4 = O(N ) from Lemma 10.
h2
D. Proof of Theorem 5
The bounds in this theorem are derived using the capacity
lower bounds in Corollaries 4 and 5. We begin with the DL
and note that the arguments of the expectations in (42) have
deterministic upper bounds since
tr(R C)
||
(112)
pUE tr(ARA H )
for any realization of tUE . The dominated convergence theorem implies that we can take the limit N inside
the expectations.21 Next, we observe that scaling the pilot
power pUE proportionally to 1/N tUE for some tUE > 0 means
that N tUE pUE B as N for some 0 < B < .
Therefore, we have
N tUE tr(R C) = N tUE pUE tr RZ 1 R
2
I)1 R),
Btr(R(S + BS
(113)
(114)
(1+d 1UE ) Btr R(S + 2 I)1 R2
t
BS
= 1 (115)
= E
2
1
Btr R(S + BS I) R
lim E{||2 }
2 I)1 R
|1+d 1tUE |2 Btr R(S + BS
=E
= 1 + tUE
2 I)1 R
Btr R(S + BS
(116)
7137
tBS rBS
N
0 which corresponds to the condition in the corollary. The corresponding condition for the UL is obtained
( BS )2
analogously and gives rN 0.
Finally, we note that the noise terms (for n 12 ) and
BS
which holds for any tUE > 0. The goal is to make the interferUE }
UE }
E{IH
E{IH
ence term pBS tr(RC)
= pUE pBS tr(R
1 R) vanish asymptotically
Z
UE } = O(1), which is achieved
under the assumption that E{IH
if the denominator grows to infinity with N. We note that
2
tr(RZ 1 R) R
2 scales at least linearly with N. Hence,
Z
the product pUE pBS must reduce at a slower pace than linear
with N, which implies tBS + tUE= tsum < 1.
Finally, we need the O(1/ N ) terms in (51) to still
vanish as N . Some careful but lengthy algebra
reveals that the O(N) properties
in Lemmas 910 become
1 tUE
becomes O(1/N 2 2 ) while the O(1/ N ) in the denom1
inator becomes O(1/N 2 tUE ). These terms vanish if tUE < 12 ,
which finishes the proof for the DL.
The proof for the UL is analogous since the uplink capacity
bound in (44) has the same structure and contains the same
expectations as the downlink capacity.
F. Proof of Theorem 6
The interference expressions in (61) are proved similar to
Theorem 4. For the case l U we have
H h|
2 p UE a 2
|h H h|
2
2
UE
|h
p al
l
l
E
E l
h
2
2
h
2
2
2
pUE al2 h
2
+E
= O( N ) (120)
2
h
2
where = tr A(|d + tUE |2 R + )A H and al = tr(ARl ).
This follows since the first term in (120) equals
1
1
pUE al |h H h|
+ pUE al
|hlH h|
l
E
2
h2
.
.
/
2 /
2
UE a 2
/
/
p
h
l
l
2
H
pUE al 0E
+
0E |hl h|
2
4
h
h
2
2
=O ( N )
= O( N )
E. Proof of Corollary 7
Recall from the proof of Theorem 5 that the dominated
convergence theorem can be applied, which means that we
can take the limit N inside the expectations in the
DL capacity bound of (42) and UL capacity bound of (44).
If tBS and rBS grow with N, we obtain
(1 + d 1 UE ) tr(RR1 R)2
t
diag
=1
lim |E{}|2 = E
N
1
tr(RRdiag
R)
(117)
1
1
UE
2
|1 + d t | tr(RRdiag R)
= 1 + tUE .
lim E{||2 } = E
1
N
tr(RRdiag
R)
(118)
In the DL, we further note that
N
%
tBS E{|h i |2 |v i |2 }
i=1
tr(R C)
(121)
tr(AA H )
1
O(N ).
(
)
the case l) U in (61) follows from E |hlH vUL |2 =
(Next,
E (vUL ) H Rl vUL Rl 2 = O(1) since hl and vUL are
independent.
Finally, we note that the noise term in the denominator of
(44) would be
,
(
)
pUE E |h H v|2
1
E{v H QH v}
l
=
+O
(122)
UE
UE
p tr(R C)
p tr(R C)
N
lU
=O
=O (1)
tBS rBS
N
(119)
1
since rBS tr(R C) tr(RRdiag
R) = O(N) as N .
If this term should vanish asymptotically, it is sufficient that
ACKNOWLEDGMENT
The authors would like to thank Romain Couillet and
the anonymous reviewers for indispensable feedback on this
paper.
7138
IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 60, NO. 11, NOVEMBER 2014
R EFERENCES
[1] E. Bjrnson, J. Hoydis, M. Kountouris, and M. Debbah, Hardware
impairments in large-scale MISO systems: Energy efficiency, estimation, and capacity limits, in Proc. 18th Int. Conf. Digit. Signal
Process. (DSP), Jul. 2013, pp. 16.
[2] E. Telatar, Capacity of multi-antenna Gaussian channels, Eur. Trans.
Telecommun., vol. 10, no. 6, pp. 585595, 1999.
[3] X. Gao, O. Edfors, F. Rusek, and F. Tufvesson, Linear pre-coding
performance in measured very-large MIMO channels, in Proc. IEEE
VTC Fall, Sep. 2011, pp. 15.
[4] J. Hoydis, C. Hoek, T. Wild, and S. ten Brink, Channel measurements for large antenna arrays, in Proc. Int. Symp. Wireless Commun.
Syst. (ISWCS), Aug. 2012, pp. 811815.
[5] M. Medard, The effect upon channel capacity in wireless communications of perfect and imperfect knowledge of the channel, IEEE Trans.
Inf. Theory, vol. 46, no. 3, pp. 933946, May 2000.
[6] E. Bjrnson, P. Zetterberg, M. Bengtsson, and B. Ottersten, Capacity
limits and multiplexing gains of MIMO channels with transceiver
impairments, IEEE Commun. Lett., vol. 17, no. 1, pp. 9194, Jan. 2013.
[7] W. Zhang, A general framework for transmission with transceiver
distortion and some applications, IEEE Trans. Commun., vol. 60, no. 2,
pp. 384399, Feb. 2012.
[8] A. Mller, A. Kammoun, E. Bjrnson, and M. Debbah, Linear precoding based on polynomial expansion: Reducing complexity in massive
MIMO, [Online]. Available: http://arxiv.org/abs/1310.1806
[9] F. Rusek et al., Scaling up MIMO: Opportunities and challenges with
very large arrays, IEEE Signal Process. Mag., vol. 30, no. 1, pp. 4060,
Jan. 2013.
[10] T. L. Marzetta, Noncooperative cellular wireless with unlimited numbers of base station antennas, IEEE Trans. Wireless Commun., vol. 9,
no. 11, pp. 35903600, Nov. 2010.
[11] J. Jose, A. Ashikhmin, T. L. Marzetta, and S. Vishwanath, Pilot
contamination and precoding in multi-cell TDD systems, IEEE Trans.
Wireless Commun., vol. 10, no. 8, pp. 26402651, Aug. 2011.
[12] J. Hoydis, S. ten Brink, and M. Debbah, Massive MIMO in the UL/DL
of cellular networks: How many antennas do we need? IEEE J. Sel.
Areas Commun., vol. 31, no. 2, pp. 160171, Feb. 2013.
[13] H. Q. Ngo, E. G. Larsson, and T. L. Marzetta, Energy and spectral efficiency of very large multiuser MIMO systems, IEEE Trans. Commun.,
vol. 61, no. 4, pp. 14361449, Apr. 2013.
[14] T. Schenk, RF Imperfections in High-Rate Wireless Systems: Impact and
Digital Compensation. New York, NY, USA: Springer-Verlag, 2008.
[15] C. Studer, M. Wenk, and A. Burg, MIMO transmission with residual transmit-RF impairments, in Proc. ITG/IEEE Workshop Smart
Antennas (WSA), Feb. 2010, pp. 189196.
[16] P. Zetterberg, Experimental investigation of TDD reciprocity-based
zero-forcing transmit precoding, EURASIP J. Adv. Signal Process.,
vol. 2011, Art. ID 2011:137541.
[17] M. Wenk, MIMO-OFDM Testbed: Challenges, Implementations,
and Measurement Results (Microelectronics). Konstanz, Germany:
Hartung-Gorre, 2010.
[18] B. Gransson, S. Grant, E. Larsson, and Z. Feng, Effect of transmitter and receiver impairments on the performance of MIMO in
HSDPA, in Proc. IEEE 9th Int. Workshop Signal Process. Adv. Wireless
Commun. (SPAWC), Jul. 2008, pp. 496500.
[19] E. Bjrnson, P. Zetterberg, and M. Bengtsson, Optimal coordinated
beamforming in the multicell downlink with transceiver impairments,
in Proc. IEEE Global Commun. Conf. (GLOBECOM), Dec. 2012,
pp. 47754780.
[20] E. Bjrnson and E. Jorswieck, Optimal resource allocation in coordinated multi-cell systems, Found. Trends Commun. Inf. Theory, vol. 9,
nos. 23, pp. 113381, 2013.
[21] A. Mezghani, N. Damak, and J. A. Nossek, Circuit aware design of
power-efficient short range communication systems, in Proc. 7th Int.
Symp. Wireless Commun. Syst. (ISWCS), Sep. 2010, pp. 869873.
[22] J. Qi and S. Aissa, On the power amplifier nonlinearity in MIMO
transmit beamforming systems, IEEE Trans. Commun., vol. 60, no. 3,
pp. 876887, Mar. 2012.
[23] B. Maham and O. Tirkkonen, Transmit antenna selection OFDM
systems with transceiver I/Q imbalance, IEEE Trans. Veh. Technol.,
vol. 61, no. 2, pp. 865871, Feb. 2012.
[24] T. Koch, A. Lapidoth, and P. P. Sotiriadis, Channels that heat up, IEEE
Trans. Inf. Theory, vol. 55, no. 8, pp. 35943612, Aug. 2009.
[25] A. Pitarokoilis, S. K. Mohammed, and E. G. Larsson, Uplink performance of time-reversal MRC in massive MIMO systems subject to
phase noise, IEEE Trans. Wireless Commun., to be published. [Online].
Available: http://arxiv.org/abs/1306.4495
[26] C. Studer and E. G. Larsson, PAR-aware large-scale multi-user MIMOOFDM downlink, IEEE J. Sel. Areas Commun., vol. 31, no. 2,
pp. 303313, Feb. 2013.
[27] S. K. Mohammed and E. G. Larsson, Per-antenna constant envelope
precoding for large multi-user MIMO systems, IEEE Trans. Commun.,
vol. 61, no. 3, pp. 10591071, Mar. 2013.
[28] G. Caire, N. Jindal, M. Kobayashi, and N. Ravindran, Multiuser
MIMO achievable rates with downlink training and channel state
feedback, IEEE Trans. Inf. Theory, vol. 56, no. 6, pp. 28452866,
Jun. 2010.
[29] E. Bjrnson, M. Kountouris, M. Bengtsson, and B. Ottersten, Receive
combining vs. multi-stream multiplexing in downlink systems with
multi-antenna users, IEEE Trans. Signal Process., vol. 61, no. 13,
pp. 34313446, Jul. 2013.
[30] S. Cui, A. J. Goldsmith, and A. Bahai, Energy-constrained modulation optimization, IEEE Trans. Wireless Commun., vol. 4, no. 5,
pp. 23492360, Sep. 2005.
[31] H. Holma and A. Toskala, LTE for UMTS: Evolution to LTE-Advanced,
2nd ed. New York, NY, USA: Wiley, 2011.
[32] S. Kay, Fundamentals of Statistical Signal Processing: Estimation
Theory. Englewood Cliffs, NJ, USA: Prentice-Hall, 1993.
[33] J. H. Kotecha and A. M. Sayeed, Transmit signal design for optimal
estimation of correlated MIMO channels, IEEE Trans. Signal Process.,
vol. 52, no. 2, pp. 546557, Feb. 2004.
[34] E. Bjrnson and B. Ottersten, A framework for training-based
estimation in arbitrarily correlated Rician MIMO channels with
Rician disturbance, IEEE Trans. Signal Process., vol. 58, no. 3,
pp. 18071820, Mar. 2010.
[35] B. Hassibi and B. M. Hochwald, How much training is needed in
multiple-antenna wireless links? IEEE Trans. Inf. Theory, vol. 49, no. 4,
pp. 951963, Apr. 2003.
[36] N. ODonoughue and J. M. F. Moura, On the product of independent
complex Gaussians, IEEE Trans. Signal Process., vol. 60, no. 3,
pp. 10501063, Mar. 2012.
[37] J. Hoydis, M. Kobayashi, and M. Debbah, Optimal channel training in
uplink network MIMO systems, IEEE Trans. Signal Process., vol. 59,
no. 6, pp. 28242833, Jun. 2011.
[38] S. Wagner, R. Couillet, M. Debbah, and D. T. M. Slock, Large system
analysis of linear precoding in correlated MISO broadcast channels
under limited feedback, IEEE Trans. Inf. Theory, vol. 58, no. 7,
pp. 45094537, Jul. 2012.
[39] S. L. Loyka, Channel capacity of MIMO architecture using the
exponential correlation matrix, IEEE Commun. Lett., vol. 5, no. 9,
pp. 369371, Sep. 2001.
[40] E. Bjrnson, D. Hammarwall, and B. Ottersten, Exploiting quantized
channel norm feedback through conditional statistics in arbitrarily correlated MIMO systems, IEEE Trans. Signal Process., vol. 57, no. 10,
pp. 40274041, Oct. 2009.
[41] T. Yoo and A. Goldsmith, Capacity and power allocation for fading
MIMO channels with channel estimation error, IEEE Trans. Inf. Theory,
vol. 52, no. 5, pp. 22032214, May 2006.
[42] D.-S. Shiu, G. J. Foschini, M. J. Gans, and J. M. Kahn, Fading
correlation and its effect on the capacity of multielement antenna
systems, IEEE Trans. Commun., vol. 48, no. 3, pp. 502513,
Mar. 2000.
[43] H. Yin, D. Gesbert, M. Filippou, and Y. Liu, A coordinated approach
to channel estimation in large-scale multiple-antenna systems, IEEE J.
Sel. Areas Commun., vol. 31, no. 2, pp. 264273, Feb. 2013.
[44] A. Adhikary, J. Nam, J.-Y. Ahn, and G. Caire, Joint spatial division and
multiplexingThe large-scale array regime, IEEE Trans. Inf. Theory,
vol. 59, no. 10, pp. 64416463, Oct. 2013.
[45] R. Couillet and M. Debbah, Random Matrix Methods for Wireless
Communications. Cambridge, U.K.: Cambridge Univ. Press, 2011.
[46] R. Couillet, F. Pascal, and J. W. Silverstein, Robust estimates
of covariance matrices in the large dimensional regime, IEEE
Trans. Inf. Theory, doi: 10.1109/TIT.2014.2354045. [Online]. Available:
http://arxiv.org/abs/1204.5320
[47] N. Shariati, E. Bjrnson, M. Bengtsson, and M. Debbah, Lowcomplexity polynomial channel estimation in large-scale MIMO with
arbitrary statistics, IEEE J. Sel. Topics Signal Process., to be published.
[48] H. Huh, G. Caire, H. C. Papadopoulos, and S. A. Ramprashad,
Achieving massive MIMO spectral efficiency with a not-so-large
number of antennas, IEEE Trans. Wireless Commun., vol. 11, no. 9,
pp. 32263239, Sep. 2012.
[49] G. Caire and S. Shamai, On the capacity of some channels with
channel state information, IEEE Trans. Inf. Theory, vol. 45, no. 6,
pp. 20072019, Sep. 1999.
[50] G. Auer et al., How much energy is needed to run a wireless network? IEEE Wireless Commun., vol. 18, no. 5, pp. 4049,
Oct. 2012.
[51] J. Xu, L. Qiu, and C. Yu, Improving energy efficiency through
multimode transmission in the downlink MIMO systems, EURASIP
J. Wireless Commun. Netw., vol. 2011, Art. ID 2011:200.
[52] D. W. K. Ng, E. S. Lo, and R. Schober, Energy-efficient resource
allocation in OFDMA systems with large numbers of base station antennas, IEEE Trans. Wireless Commun., vol. 11, no. 9,
pp. 32923304, Sep. 2012.
[53] H. Yang and T. L. Marzetta, Total energy efficiency of cellular
large scale antenna system multiple access mobile networks, in Proc.
IEEE Online Conf. Green Commun. (OnlineGreenComm), Oct. 2013,
pp. 2732.
[54] E. Bjrnson, M. Kountouris, and M. Debbah, Massive MIMO and small
cells: Improving energy efficiency by optimal soft-cell coordination, in
Proc. 20th Int. Conf. Telecommun. (ICT), May 2013, pp. 15.
[55] S. Verd, On channel capacity per unit cost, IEEE Trans. Inf. Theory,
vol. 36, no. 5, pp. 10191030, Sep. 1990.
[56] G. Auer et al., (2012). D2.3: Energy efficiency analysis of the reference
systems, areas of improvements and target breakdown, FP7 EARTH,
Tech. Rep. INFSO-ICT-247733. [Online] https://www.ict-earth.eu/
[57] 3rd Generation Partnership Project, Further advancements for E-UTRA
physical layer aspects (Release 9), Tech. Rep. 3GPP TS 36.814,
Mar. 2010.
[58] C. Studer, M. Wenk, and A. Burg, System-level implications of
residual transmit-RF impairments in MIMO systems, in Proc. 5th Eur.
Conf. Antennas Propag. (EuCAP), Apr. 2011, pp. 26862689.
[59] N. N. Moghadam, P. Zetterberg, P. Hndel, and H. Hjalmarsson, Correlation of distortion noise between the branches of MIMO transmit
antennas, in Proc. IEEE 23rd Int. Symp. Pers. Indoor Mobile Radio
Commun. (PIMRC), Sep. 2012, pp. 20792084.
[60] A. Mezghani, M.-S. Khoufi, and J. A. Nossek, A modified MMSE
receiver for quantized MIMO systems, in Proc. ITG/IEEE Workshop
Smart Antennas (WSA), 2007.
[61] A. Demir, A. Mehrotra, and J. Roychowdhury, Phase noise in oscillators: A unifying theory and numerical methods for characterization,
IEEE Trans. Circuits Syst. I, Fundam. Theory Appl., vol. 47, no. 5,
pp. 655674, May 2000.
[62] D. Petrovic, W. Rave, and G. Fettweis, Effects of phase noise on OFDM
systems with and without PLL: Characterization and compensation,
IEEE Trans. Commun., vol. 55, no. 8, pp. 16071616, Aug. 2007.
[63] G. Durisi, A. Tarable, C. Camarda, and G. Montorsi, On the capacity
of MIMO Wiener phase-noise channels, in Proc. Inf. Theory Appl.
Workshop (ITA), Feb. 2013, pp. 17.
[64] H. Mehrpouyan, A. A. Nasir, S. D. Blostein, T. Eriksson,
G. K. Karagiannidis, and T. Svensson, Joint estimation of channel and
oscillator phase noise in MIMO systems, IEEE Trans. Signal Process.,
vol. 60, no. 9, pp. 47904807, Sep. 2012.
[65] Y. Nasser, M. Des Noes, L. Ros, and G. Jourdain, On the system
level prediction of joint time frequency spreading systems with carrier
phase noise, IEEE Trans. Commun., vol. 58, no. 3, pp. 839850,
Mar. 2010.
[66] R. Baldemair et al., Evolving wireless communications: Addressing the
challenges and expectations of the future, IEEE Veh. Technol. Mag.,
vol. 8, no. 1, pp. 2430, Mar. 2013.
[67] C. Shepard et al., Argos: Practical many-antenna base stations, in Proc.
18th Annu. Int. Conf. MobiCom, 2012, pp. 5364.
[68] K. Nishimori, K. Cho, Y. Takatori, and T. Hori, Automatic calibration method using transmitting signals of an adaptive array for TDD
systems, IEEE Trans. Veh. Technol., vol. 50, no. 6, pp. 16361640,
Nov. 2001.
[69] R. Rogalin, O. Bursalioglu, H. Papadopoulos, G. Caire, and A. Molisch,
Hardware-impairment compensation for enabling distributed largescale MIMO, in Proc. Inf. Theory Appl. Workshop (ITA), 2013,
pp. 110.
[70] J. W. Silverstein and Z. D. Bai, On the empirical distribution of eigenvalues of a class of large dimensional random matrices, J. Multivariate
Anal., vol. 54, no. 2, pp. 175192, 1995.
[71] Z. Bai and J. W. Silverstein, Spectral Analysis of Large Dimensional Random Matrices (Statistics), 2nd ed. Marrickville, Australia:
Science Press, 2009.
[72] A. M. Tulino and S. Verd, Random matrix theory and wireless
communications, Found. Trends Commun. Inf. Theory, vol. 1, no. 1,
pp. 1182, 2004.
7139