Digital Coherent Optical Receivers Algorithms and Subsystems

1164 IEEE JOURNAL OF SELECTED TOPICS IN QUANTUM ELECTRONICS, VOL. 16, NO.
5, SEPTEMBER/OCTOBER 2010
Digital Coherent Optical Receivers:

Algorithms and Subsystems
Seb J. Savory, Member, IEEE
(Invited Paper)
AbstractDigital coherent receivers have caused a revolution in maximum data per wavelength (5.1 Tb/s) [9], and the maximum
the design of optical transmission systems, due to the subsystems capacity distance product (112 Pbkm/s) [10], coherent detection
and algorithms embedded within such a receiver. After giving a also offers benefits at the network level. Not only can digital co-
high-level overview of the subsystems, the optical front end, the
analog-to-digital converter (ADC) and the digital signal process- herent receivers allow for more robust transmission, and hence,
ing (DSP) algorithms, which relax the tolerances on these sub- rerouting, but due to the presence of a local oscillator, the re-
systems are discussed. Attention is then turned to the compensa- ceiver is inherently frequency selective permitting new network
tion of transmission impairments, both static and dynamic. The architectures to be devised [11], [12].
discussion of dynamic-channel equalization, which forms a sig- In order to understand the operation of the digital coherent
nificant part of the paper, includes a theoretical analysis of the
dual-polarization constant modulus algorithm, where the control receiver, in this paper, we consider each of the subsystems re-
surfaces several different equalizer algorithms are derived, includ- quired. After giving a high-level overview of the subsystems
ing the constant modulus, decision-directed, trained, and the radi- in Section II, we discuss the optical front end, analog-to-digital
ally directed equalizer for both polarization division multiplexed converter (ADC) and the DSP algorithms necessary to overcome
quadriphase shift keyed (PDM-QPSK) and 16 level quadrature am- imperfections in the optical front end in Sections IIIV, respec-
plitude modulation (PDM-16-QAM). Synchronization algorithms
employed to recover the timing and carrier phase information are tively. Sections VI and VII are concerned with compensation
then examined, after which the data may be recovered. The paper of transmission impairments, including the problem of digital
concludes with a discussion of the challenges for future coherent polarization tracking and the behavior of the algorithms used.
optical transmission systems. Sections VIIIX discuss the synchronization algorithms em-
Index TermsDigital communication, polarization. ployed after which the data may be recovered. The paper con-
cludes with a discussion of the challenges for next generation
coherent optical transmissions systems.
I. INTRODUCTION
HE SYMBIOTIC combination of digital signal processing
T (DSP), coherent detection, and spectrally efficient modula-
tion formats has resulted in the digital coherent optical receiver.
II. SUBSYSTEMS OF A DIGITAL COHERENT RECEIVER
Due to advances in silicon technology, CMOS analog-to-digital In a digital coherent receiver, there are four key subsystems.
converters, having sampling rates commensurate with current 1) Optical front end, which linearly maps the optical field
optical line rates have become available [1], allowing ideas de- into a set of electrical signals.
vised in the 1990s [2][4] to be commercially developed [5]. 2) ADC, which converts from the electrical signals into a set
Not only does the digital coherent receiver offer improved sen- of discrete-time-quantized signals at the sampling rate.
sitivity, but also it has allowed systems to overcome optical im- 3) Digital demodulator, which converts the digital samples
pairments, such as chromatic dispersion or polarization mode into a set of signals at the symbol rate.
dispersion (PMD), which have hitherto hindered the deployment 4) Outer receiver, which includes error correction and whose
of high-speed systems [6], [7]. functionality is to optimally decode the demodulated sig-
While digital coherent receivers have allowed for record nals in order to produce the best estimate of the sequence
transmission results, such as maximum capacity (32 Tb/s) [8], of bits, which were encoded by the transmitter.
We shall focus on the first three of these subsystems, which
form the inner receiver, whose functionality is to produce a
synchronized channel, which is as close as possible to the
Manuscript received December 23, 2009; revised January 7, 2010; accepted
January 8, 2010. Date of publication May 17, 2010; date of current version information theoretic communication channel.
October 6, 2010. This work was supported by the Joint Regione Piemonte In order to discuss the DSP contained in the digital demodu-
(Italy)Politecnico di Torino Visiting Professor Scheme, and by the U.K. Engi- lator, we begin by considering the structural level design of the
neering and Physical Sciences Research Council.
The author is with the Department of Electronic and Electrical Engi- DSP, including the subsystems listed in Table I.
neering, University College London, London WC1E 7JE, U.K. (e-mail: While for a particular digital coherent receiver, the subsys-
ssavory@ee.ucl.ac.uk). tems employed may differ slightly from those detailed in Table I,
Color versions of one or more of the figures in this paper are available online
at http://ieeexplore.ieee.org. they give some indication as to the design choices, which
Digital Object Identifier 10.1109/JSTQE.2010.2044751 can be made at a structural level, such as the ordering of the
1077-260X/$26.00 2010 IEEE

SAVORY: DIGITAL COHERENT OPTICAL RECEIVERS: ALGORITHMS AND SUBSYSTEMS 1165
TABLE I
FUNCTIONAL SUBSYSTEMS IN A DIGITAL DEMODULATOR
Fig. 2. 28 port receiver.
estimation subsystem, however, other paths depend on the algo-

rithms employed. For example, feedback from the symbol esti-
mation and decoding subsystem is required for data-aided algo-
rithms, but not for blind algorithms. Likewise, for synchronous
sampling at the baud rate, feedback would be required from
the timing-recovery subsystem to the ADC subsystem, which
could be omitted for asynchronous sampling. In the subsequent
sections of this paper, we shall discuss each of these subsystems
independently, forming a basis for understanding the operation
of a digital coherent receiver employing feedback between the
subsystems.
III. PHASE AND POLARIZATION DIVERSE

OPTICAL COHERENT RECEIVER
The functionality of the optical front-end, being both phase
and polarization diverse, is to linearly map the incoming optical
field into the electrical domain [14]. To realize this function-
ality, the architecture is shown in Fig. 2, is often employed,
Fig. 1. Subsystems in a digital coherent receiver, including the possible feed- which employs a pair of 90 hybrids, one for each component
back paths.
of polarization.2
If we consider that the electric field of the incoming op-
tical signal is of the form [Ex , Ey ]T and the local oscillator
subsystems.1 Additional considerations at the structural level in- Elo , then following the photodiode, the three complex sig-
clude the combining or partitioning of subsystems, for example, nals will give rise to a total of nine real quantities, which

the carrier phase estimation may be separated into estimating are3 |Ex |2 ,|Ey |2 ,|Elo |2 , Re{Ex Elo }, Im{Ex Elo }, Re{Ey Elo },

the frequency offset and the residual carrier phase, whereas in Im{Ey Elo }, Re{Ex Ey }, and Im{Ex Ey }. Hence, if we con-
a conventional wireless communication system, it is possible to sider these product terms as the input to the subsystem, it is
perform joint carrier phase and symbol synchronization [13]. evident that we can model an optically diverse hybrid, by a lin-
We shall focus on the structural and algorithmic level rather ear scattering matrix Sh , which may be simplified yet further by
than the implementation level, where the finite resources, such noting that for a polarization beam splitter with high extinction
as machine precision and clock speed are considered. In order ratio, there will be no beating between the two components of
to discuss performance of the digital coherent receiver at the the polarization, and hence, Re{Ex Ey } and Im{Ex Ey } may be
algorithmic level, we will restrict our discussion to one possible neglected. Using the scattering matrix for the 90 hybrid [15],
realization, where the subsystems are arranged, as in Fig. 1, it can be shown that the eight outputs v = Sh u of the 28 port
including the inner receiver and the outer receiver, which
performs symbol estimation and decoding. 2 In Fig. 2, the power splitter can also be realized using a second polarization
As can be seen in Fig. 1, there are numerous feedback paths beam splitter, assuming the state of polarization of the local oscillator is correctly
between the subsystems. Some of these paths occur naturally, chosen.
3 In principle, it might be necessary to consider terms of the form E (t)E (t
such as between the phase estimation subsystem and frequency i j
), where i, j {x, y, lo} and is a delay, however, these may be neglected
in practice, since we note for practical systems, the phase noise is sufficiently
1 We will focus on a single-carrier system, rather than a multicarrier format, low, such that E lo (t) E lo (t ) and there will be no beating between the
such as orthogonal frequency division multiplexing (OFDM); however, the two components of the polarization due to the polarization beam splitter (PBS),
design methodology is similar. such that E x (t)E y (t) E x (t)E y (t ) 0, etc.
1166 IEEE JOURNAL OF SELECTED TOPICS IN QUANTUM ELECTRONICS, VOL. 16, NO. 5, SEPTEMBER/OCTOBER 2010

receiver are given by4 such that s = Hv = HSh u = [Re{Ex Elo }, Im{Ex Elo },
1
1 Re{Ey Elo }, Im{Ey Elo }, |Ex | + |Ey | + |Elo | ] , resulting
2 2 2 T
1
0 0 0 0 in five independent quantities, corresponding to the in-phase
2 4 8
and quadrature components of the two polarization, in addition
1 1 1
0 0 0 0 to the direct-detection term |Ex |2 + |Ey |2 + |Elo |2 . Using this
2 4 8

1 1 1 Re{Ex Elo } formalism, we can now consider the effect of a nonideal 28
0 0 0 0
8 Im{Ex Elo }
port receiver, which can be recast as a multiple input multiple

2 4

1 1 1 Re{Ey E } output (MIMO) DSP problem with the objective being to find
0 0 0 0 lo such that the signals s are given by
8 the matrix H,
v= 2 4 Im{Ey Elo
}
1 1 1 Re{Ex Elo }
0 0 0 0 |Ex |2
8 Im{Ex Elo }
2 4

=
1 1 1 |Ey | 2
s = Hv Re{Ey Elo
} . (5)
0 0 0 0
2 4 8 |Elo |2 Im{Ey Elo }

1 1 1
|Ex |2 + |Ey |2 + |Elo |2
0 0 0 0 u
2 4 8
This is an overdetermined MIMO system, and hence, if the
1 1 1
0 0 0 0 fidelity of the digital signal is sufficiently close to the ana-
2 4 8
log waveform, all of these imperfections, can in principle, be
Sh
compensated digitally, thereby relaxing the requirements on the
(1)
optical components.6 While blind source separation techniques,
where u is the input vector of product terms and Sh is the power
such as independent component analysis could be used [16], it
scattering matrix of the hybrid. These outputs are then fed into
is often possible to simplify this problem, reducing it to finding
a suitable electrical network, which may also be represented
two pairs of mutually orthogonal components, plus the direct-
by a scattering matrix. For a receiver based on balanced or
detection term. Furthermore, in general, we can create a good
single-ended photodiodes, the scattering matrix of the electrical based on our a priori knowledge of the sys-
estimate for H
network is given by
tem,7 reducing the problem to orthogonalizing pairs of signals,

1 b 0 0 0 0 0 0 due to the phase in the hybrid not being exactly 90 . In the
0 0 1 b 0 0 0 0 field of DSP, there are numerous techniques to achieve this for
Se = (2)
0 0 0 0 1 b 0 0 creating orthogonal components, such as the GramSchmidt or
0 0 0 0 0 0 1 b Lowdin orthogonalization algorithms, which we will discuss in
where b = 0 for the single-ended case, selecting alternate out- Section V-B.
puts, and b = 1 for the balanced case, selecting the difference
between pairs of outputs, and we note that IV. ANALOG-TO-DIGITAL CONVERSION
Having mapped the signal from the optical domain into the
Re{Ex Elo }
Im{Ex Elo
} electrical domain, the next stage is to convert the analog signals
Se (b = 1)v = Se (b = 1)Sh u = (3) into a set of digital signals. From a functional view, we can
Re{Ey Elo }
Im{Ey Elo
} consider the ADC to be made up of two subsystems, a sampler,
which samples the signal in time, converting the continuous-
such that all of the direct-detection terms |Ex |2 , |Ey |2 , and time analog signal into a discrete-time analog signal, followed
|Elo |2 , are canceled out, with the only remaining terms being the by a quantizer, which converts the discrete-time analog signal
coherently detected products. By rejecting the direct-detection into a finite set of values determined by the bits of resolution in
terms, the full potential of the coherent detection to enable a the ADC.8
frequency selective receiver is achieved.5 While this is the case As noted by Schvan et al., in order to implement a high-speed
for an ideal 90 hybrid [15], we note there is a matrix H given ADC, there are several options [1].
by 1) Flash, where the performance is limited by the clock dis-
tribution accuracy and the comparator characteristics.
1 1 0 0 0 0 0 0
0 0 1 1 0 0 0 0 2) Flash with track and hold, which reduces the demand on
comparator, with the performance now limited by the track
0 0 0 0 1 1 0 0
H= (4) and hold.
0 0 0 0 0 0 1 1

1 1 1 1 1 1 1 1
6 At present, however, the fidelity of the digital signal is typically only five
8 8 8 8 8 8 8 8
effective bits, and hence, may favor the use of precision balanced receivers
followed by four ADCs rather than the eight-port network with eight ADCs.
4 In practice, v S u due to the responsivity of the photodiodes and any 7 If a calibration procedure was included, this a priori knowledge could be
h
subsequent electrical gain, however, we neglect this scale factor for clarity. increased, thereby, minimizing the difference between a real hybrid and an
5 The cancelation of the direct-detection terms also allows for a greater dy- idealized hybrid.
namic range for the local-oscillator-to-signal ratio and also the cancelation of 8 In an ADC, timing jitter reduces the physical number of bits to an effective
the relative intensity noise (RIN) on the local oscillator. number of bits (ENOB).
3) Time interleaved, which uses lower speed ADCs, but re-

quires offset, delay, and gain correction.
Of these structures, it is the time-interleaved structure, which
lends itself toward CMOS implementation, and hence, integra-
tion with the DSP in a single application-specific integrated
circuit (ASIC) [17]. As noted, however, the use of a time-
interleaved structure requires that the offset, delay, and gain of
the lower speed ADCs be compensated, if spurious tones at the
clock rate of lower speed ADCs are to be avoided.9 Neverthe-
less, since this is a well understood issue for time-interleaved Fig. 3. Visual representation of the GramSchmidt and Lowdin orthogonal-
ization algorithms.
systems, there exist numerous algorithms to overcome these
imperfections, which can either be calibrated out or adaptively
compensated [18], [19]. delay of i = /TADC samples, followed by a fractional delay
Having established that it is possible to realize a high-speed TADC . While the basepoint delay is straight forward to imple-
ADC, we now consider the requirements for such a data con- ment, being a delayed version of the signal, in order to delay by
verter. For a digital communication system, which transmits a fraction of a sample period, interpolation is required, which
symbols at a rate S symbols per second, the minimum sam- we discuss further in Section VIII.
pling rate is S (in hertz). In general, however, for asynchronous
sampling a sampling rate of 2S Hz, is advantageous, giving B. Orthogonalization Algorithms
rise to two samples per symbol, thereby enabling digital timing
In order to recover the original signal using a receiver with a
recovery.10 While in practice, there may be a slight difference
suboptimal hybrid, it is often necessary to use orthogonalization
between clocks of the transmitter and the receiver, the signal
algorithms. In practice, these are applied following digitization,
may be resampled by interpolating of the digital signal, since
however, in principle, they could be applied via an analog electri-
as noted by Meyr et al. the digital receiver does not need to
cal network. To illustrate the effect orthogonalization for two dif-
have a clock frequency equal to the symbol rate l/T as does the
ferent algorithms, let us consider two transmitted signals t1 and
analog counterpart [13].
t2 , which are independent zero mean and of unit variance, such
that E{t1 t2 } = E{t1 }E{t2 } = 0 and E{t21 } = E{t22 } = 1. At
V. DESKEW AND ORTHONORMALIZATION
the receiver, the hybrid is imperfect, such that the angle be-
The functionality of this subsystem is to compensate for the tween the components is /2 2, and the received sig-
imperfections in the optical front end, where optical path lengths nals r1 and r2 are given by r1 = t1 cos() + t2 sin() and
may not be perfectly matched or the hybrid may not be ex- r2 = t2 cos() + t1 sin(). Let us also form the correlation ma-
actly 90 . Signal deskew compensates for the path length mis- trix R, which is
matches between the signals, synchronizing the digital signals 2
r1 r1 r2 1 sin(2)
with respect to one another in time. Following on from this, R= = . (6)
r2 r1 r22 sin(2) 1
the orthonormalization process compensates for imbalances in
amplitude and nonideal 90 hybrids. 1) GramSchmidt Orthogonalization: The GramSchmidt
algorithm creates a set of mutually orthogonal vectors, tak-
A. Deskew Algorithms ing the first vector as a reference against which all subsequent
vectors are orthogonalized [22]. The outputs g = [g1 , g2 ]T are
In a laboratory prototype, it is not atypical to observe tim-
determined from the received signals r = [r1 , r2 ]T , by forming
ing mismatches between the optical paths of tens of symbols,
the matrix product Gr, where G is given by
such that the resulting digital signals are not synchronized in
time. The timing mismatch between the signal paths may be 1 0
1 0
either measured or estimated by cross correlating the signals, G = r1 r2 1 = . (7)
sin(2) 1
with the aim of the deskew algorithm being the compensate for r12
these delays [21]. Since, in general, the timing delays will not After the signals have been orthogonalized, they are subse-
correspond to an integer number of samples, we note any delay quently normalized to unit power, i.e., the second vector is
may be written as = iTADC + TADC , being a basepoint multiplied by = r22 r1 r2 2 /r12 = cos(2). As can
clearly be seen in Fig. 3, the GramSchmidt algorithm leaves
9 While synchronous baud rate sampling offers a significant reduction in the one vector unchanged, requiring the second vector to be rotated
DSP complexity, such as halving the number of taps required to compensate by an angle 2.
chromatic dispersion, at present asynchronous sampling is the preferred solu- 2) Lowdin Orthogonalization: While the GramSchmidt al-
tion, with the current state of the art for a CMOS based ADC is 56 GSa/s,
providing 2 Sa/symbol for 28 Gbd PDM-QPSK [20]. gorithm has been shown to be effective in compensating for im-
10 While synchronous baud rate sampling offers a significant reduction in the balance in the 90 hybrid [23], [24], there are other approaches
DSP complexity, for example, the number of taps required to compensate chro- to orthogonalizing the signal. In the Lowdin algorithm, the aim
matic dispersion reduces by a factor of four, at present asynchronous sampling
is the preferred solution, with the current state-of-the-art for a CMOS-based is to create a set of vectors, which are in a least-mean squares
ADC is 56 GSa/s, providing 2 Sa/symbol for 28 Gbd PDM-QPSK [20]. sense, closest to the original vectors [25]. It can be shown that
if the correlation matrix is R, the optimal choice of the transfor- the dispersed signal.12 By considering the impulse response of
mation matrix L = R1/2 [26]. Often this can be achieved via the fiber, it can be shown [7], that a signal sampled every TADC
a singular value decomposition of the matrix, however, for the seconds can be recovered by applying a finite impulse response13
case of the 2 2 signals, considered herein is of the form filter to the signal with tap weights hcd [k] given by14
2
cos() tan(2) N 1
1
hcd [k] = exp j

k
cos(2) 2 cos() (11)
L= tan(2) cos()
(8) 2

2 cos() cos(2) where k [0, N 1], N is the number of taps given by15
N = || and = 22 Ltotal /TADC 2
. While the dispersion
which satisfies the equation L2 R = I as required. As can be compensating filter could also be realized using a shorter
seen in Fig. 3, in contrast to the GramSchmidt algorithm, both infinite-impulse response filter [31], the FIR realization is of-
vectors are equally displaced being rotated by an angle . ten preferred, since it may be implemented efficiently in the
While the GramSchmidt algorithm has been shown to be frequency domain, e.g., using an overlap and add method [32].
sufficient for current systems [23], [24], the algorithm increases The hardware efficient implementation of the chromatic dis-
the impact of quantization noise for the component, which is persion compensating filter is an on-going research topic with
displaced. In contrast, when symmetric methods are employed, alternative approaches, such as the subband equalizer structure
the quantization noise is equally distributed across components. having been recently explored [33].
As the spectrally density of the modulation formats increases,
alternatives to the conventional GramSchmidt procedure, such
B. Nonlinear Compensation
as the Lowdin [25] or that proposed by Schweinler and Wigner
[27] are likely to warrant investigation. For long distances, the nonlinear transmission may be mod-
eled by the Manakov equation, which is of the form [30]
VI. STATIC-CHANNEL EQUALIZATION A +N
= (D )A (12)
One of the key distinguishing features of a digital coherent re- z
ceiver is its ability to compensate for transmission impairments, is given by (10) and N
is given by
where D
in particular, chromatic dispersion and PMD [28], [29]. Much
of this is due to the properties of coherent detection, where the = j 8 P0 p(z)AH A
N (13)
optical field is mapped linearly into the digital domain. While 9
in principle equalization could be realized in one subsystem,
where P0 p(z) is the power profile of the signal along the link and
it is generally beneficial to partition the problem into static
is the nonlinear coefficient. By using the BakerCampbell
and dynamic equalization. Static equalization typically requires
Hausdorff formula [34], it can be shown that an approximate
large static filters, with dynamic equalization requiring a set of
solution over a single step of length L, is of the form
relatively short adaptive filters to compensate for time-varying
L
effects, such as polarization rotations and PMD [7].
A(z = L) exp (DL) exp N dz A(z = 0) (14)
0
A. Linear Compensation
which for constant dispersion and polarization independent loss
In the absence of nonlinearity, the effect of chromatic dis- gives p(z) = exp(z), and AH A 1. Hence, if we con-
persion on the pulse A(z, t) = [Ax (z, t), Ay (z, t)]T may be sider a step size, such that L 1, we find
modeled by the following differential equation11 [30]:
8 P0
A(z, t) A(z = 0) exp j exp(DL)A(z = L) (15)

= DA(z, t) (9) 9
z
i.e., to recover the original signal, linear compensation of the
where
chromatic dispersion is applied as per the previous section,
2
= j 2
D (10)
followed by an instantaneous nonlinear phase shift [35]. To
2 t2 date, much of the nonlinear compensation is based on (15),
with 2 being the group delay dispersion which is approximately with the step-size ranging from the entire length of a system
21 ps2 /km for standard single mode fiber. By solving (9) it can
be shown that A(z = 0, t) = exp(DL total )A(z = Ltotal , t),
12 The operator exp( Dz) may be defined such that exp(Dz)A =
and hence the original signal can in principle be recovered from F 1 {exp(F {D}z)F
{A}} where F denotes the Fourier transform.
13 The FIR filter is known by a number of names, including feed-forward
equalizer, transversal tap filter, however the principle is the same, with the
11 Herein, we have adopted the electrical engineering convention for Fourier output being a linear combination of delayed versions of the input.
14 This assumes no windowing is applied however in practice these weights
transforms and the associated electric field, such that the field is of the form
E(z, t) = A(z, t) exp(j[t z]) in contrast to the physicists convention could be convolved with the desired pulse shape for matched filtering.
that E(z, t) = A(z, t) exp(i[t z]). Conversion is easily achieved by 15 This is an upper bound for N , with an approximate lower bound given by
substituting j = i into the nonlinear Schrodinger equation given in [30]. N l b = |2 2 L to ta l /(T sy m T A D C )| where T sy m is the symbol period.
QPSK) signal, which we may represent mathematically as

x y
Etx (t) = [Etx (t), Etx (t)]T being of the form

k (ak + jbk )P (t kT )
Etx (t) = (17)
k (ck + jdk )P (t kT )

where ak , bk , ck , dk {1/ 2, 1/ 2}, and P (t) is the pulse
shape. If we normalize the pulse power19 and consider Nyquist
pulse shapes, then synchronous samples at the symbol rate
Etx [k] = Etx (t = kT ) may be written as follows:
Fig. 4. MIMO equalizer.
1
Etx [k] = ej [k ] j n /2 (18)
e
to just 2 km [36], with much of the current work employ-
ing one step per span.16 While the complexity may be re- where [k] = arg(ak + jbk ) and
duced using wavelets [38], nonlinear compensation based on 2
n = (arg(ck + jdk ) arg(ak + jbk )) .
one step per span may still be prohibitive to implement in
hardware. Nevertheless, it provides a benchmark against which One of the key features to note from this representation is that
simpler compensation schemes may be compared, such as the the two QPSK constellations have constant modulus, a feature,
three block models HammersteinWiener (N LN ), the Wiener- which we will exploit in the subsequent discussions.
Hammerstein (LN L) [39], with the challenge being to improve
the nonlinear tolerance of the system without making the DSP B. Channel Model
prohibitive to implement.
If we assume that the baud rate is sufficiently low that the
channel may be considered flat and polarization dependent loss
VII. DYNAMIC-CHANNEL EQUALIZATION
is negligible, then we may model the channel by a unitary 2 2
The purpose of this subsystem is to compensate for chan- matrix R, which rotates the horizontal and vertical states of po-
nel impairments, which are time varying, such as the state of larization at the transmitter to a pair of arbitrary, but orthogonal
polarization or the PMD of the system. The adaptive equalizer states20

required to compensate for these dynamic effects may be real- cos() ej sin()
ized using a set of four FIR filters [40], as illustrated in Fig. 4. R= (19)
ej sin() cos()
The functionality of the MIMO filter is to perform the inverse
Jones matrix of the dynamic channel,17 such that the outputs are where 2 and are the azimuth and elevation rotation angles,
given by respectively. Using this channel model, the received signal is of
the form Er x [k] = [Erxx [k], Eryx [k]]T = REtx [k], which gives
xout [k] = hH H
xx [k]xin [k] + hxy [k]yin [k]
cos() + ej n /2 ej sin()
Er x [k] = ej [k ] . (20)
yout [k] = hH H
y x [k]xin [k] + hy y [k]yin [k] (16) ej sin() + ej n /2 cos()
where for an N tap FIR filter, hxx , hy x , hxy , and hy y are vec- Given our desire to polarization demultiplex the signals, a key
tors of length N representing the tap weights,18 with xin and yin question is what choices of and result in the signal being
representing a sliding block of N samples to which the filter is successfully demultiplexed. One possible approach to achieving
applied, such that xin [k] = [xin [k], xin [k 1], . . . xin [k N ]]. this is the constant modulus algorithm (CMA), which we shall
In order to understand the salient features of the algorithms now discuss.
employed in dynamic-channel equalization, we reduce the prob-
lem to that of digital polarization tracking, which shall be the C. Constant Modulus Algorithm
focus of this section. In the 1980s, the CMA was proposed by Godard [44], as a
means of blind equalization for QPSK signals. While the use
A. Signal Model of the CMA for polarization multiplexed signals was not dis-
Let us consider that the modulation format of choice is po- cussed by Godard, who was concerned with 2-D modulation
larization division multiplexed quadriphase shift keyed (PDM- formats, it has subsequently been extensively been applied to
polarization division multiplexed optical communication sys-
16 Given, in general, only one wavelength channel is received, this results
tems, which are essentially 4-D formats. Nevertheless, with the
in a nonlinear equalizer, which compensates for intrachannel effects, such as
self-phase modulation. However, recent research indicates that the benefit af- 19 Later, we shall relax this assumption, however, we retain it here, since it
forded by such an equalizer decreases significantly in a wavelength division makes the mathematics easier to follow.
multiplexing (WDM) environment [37]. 20 In practice, there may also be a phase offset between the two polarization
17 Once the equalizer has converged, the tap weights allow the Jones matrix of
channels. While this will modify the shape of the resulting control surface
the channel to be estimated, hence, giving estimates of the residual dispersion for data aided algorithms, the salient features, such as the existence of local
PMD and PDL of the link [41][43]. minima are unchanged by the presence of this differential phase, and therefore,
18 In contrast to the static-channel equalizer, the memory of the equalizer is to simplify the presentation, we neglect this effect by setting the differential
typically of the order of ten symbols or less. phase to be zero.
notable exception of Kikuchi [45], there is very little discussion

in the literature, as to why this algorithm works. As such herein,
we shall aim to demonstrate how this algorithm achieves the
functionality of a digital polarization demultiplexing, coming
from the perspective of a control surface.
The dual polarization CMA aims to restore the QPSK signal
by minimizing the cost functions 2x = (1 |Erxx |2 )2 and 2y =
(1 |Eryx |2 )2 . Therefore, given the form of Er x [k], we can
show that the modulus of the signal is as follows:

2 1 + sin(2) cos( n/2)
|Er x [k]| = . (21)
1 sin(2) cos( n/2)
As expected, the total power for the two polarization is un-
changed by the unitary rotation, with only the ratio of the powers
changing. Using these results, we can, therefore, see that Fig. 5. Evolution of determined using a stochastic gradient algorithm with
= 0.01 with the dashed and thin solid line, and dashed lines denoting solution
n
2x = 2y = sin2 (2) cos2 . (22) of the difference (27) and differential (28) equations, respectively.
2

Given 2x = 2y in the subsequent, we shall only consider 2x for and close to this minima the function 2x is convex, and hence,
which we note there are two cases corresponding to n being odd if [k] is an estimate close to the minima a better estimate is
or even, which gives 2x as follows. obtained by [22]
1) For n even, 2x = sin2 (2) cos2 ().
2) For n odd, 2x = sin2 (2) sin2 (). d 2x
[k + 1] = [k] (26)
hence, if we average over sufficient data points, assuming n is d
equally likely to be odd or even, the resulting control signal is where determinesthe rate of convergence of the algorithm.
as follows: For the case, where 2x = 1/2 sin2 (2), the update rule gives
2 1
x = sin2 (2). (23) [k + 1] = [k] sin(4[k]). (27)
2
Thishasthe desired property that by minimizing the cost func- By inspection, we note that when sin(4[k]) = 0, [k + 1] =
tion 2x , we separate the two polarizations: [k], and hence, while exists due to the periodic nature of the
1) Erxx Etx x
for = m; sine function, it is not unique. Having established the existence
y
2) Er x Etx
x
for = /2 + m. of , in order to investigate the convergence, we must solve the
Let us consider that we are able to modify the polarization governing nonlinear difference equation. While we can achieve
rotation in the channel by varying , which could be achieved this numerically, a closed-form analytical solution is possible
in practice by using a polarization controller at the input to the if we begin by approximating the nonlinear difference equation
coherent receiver. In this case, a key question is to determine by a nonlinear differential equation
how we should update our initial estimate of , to ensure demul- d[k]
tiplexing is achieved, in the limit as time tends to infinity. Hence, = sin(4[k]) (28)
dk
if [k] is the kth estimate of , we wish to determine the value of
which has the following solution:
= lim [k]. (24)
k 2[0] 1
[k] = nint + arg(f ([0], k, )) (29)
In order to investigate the convergence, there are several issues 2 4
related to , which require following considerations.
where nint(x) denotes the nearest integer to x and
1) Does exist, i.e., does the algorithm converge?
2) Is unique? f ([0], k, )) = sin(4[0]) + j cosh(4k) cos(4[0])
3) How sensitive is to the initial conditions?
4) How fast does the algorithm converge to and is this + j sinh(4k)) sin(4[0]). (30)
rate of convergence sensitive to the initial conditions?
While there are numerous possible update algorithms [22], As can be seen in Fig. 5, a bifurcation occurs at /4 with two
much of the current research is focused on stochastic gradient very close initial starting positions leading to very different final
descent algorithms, which we now consider. solutions, being either = 0 or /2 in this case. The sensitiv-
1) Stochastic Gradient Descent: One approach is to observe ity of the final solution to the initial conditions near = /4
that at the minima is typical of nonlinear differential equations, leading to the so-

d 2x called butterfly effect, in chaos theory [46]. In addition to this
=0 (25) issue, the other key feature to observe is that the time taken to
d
convergence increases significantly in the region of the bifurca-

tion, where a singularity is observed.
While this may appear a trivial example, it highlights many of
the key issues associated with the convergence algorithms, such
as the solution of a nonlinear difference equation. Often it is not
possible to obtain a closed solution to the governing equations,
requiring approximations to be made, or the equations to be lin-
earized about an operating point [47]. While linearization gives
an insight as to tracking performance, it hides issues associ-
ated with acquisition, such as the presence of multiple minima,
leading to one of the most interesting and frustrating features of
the blind CMA algorithm when applied to polarization tracking, Fig. 6. Control surface J with = /6 and = /5.
namely the possibility that the two outputs lock on to the same
source [48], [49], being strongly affected by the choice of initial
taps weights [50]. It is for these reasons that we pursue a more E. Numerical Investigation
qualitative assessment of the equalizer, namely the 2-D control In order to confirm our theoretical insight, we wish to perform
surface from which many of the key issues associated with the a numerical simulation of the system, however prior to this, it
acquisition can be ascertained. is necessary to derive the update algorithm for the CMA with
stochastic gradient.
D. CMA With Arbitrary Initial Tap Weights 1) CMA Algorithm With Stochastic Gradient: The CMA
At the receiver the incoming signal is of the form attempts to minimize 2x = (1 |xout |2 )2 and 2y = (1
|yout |2 )2 , often using a stochastic gradient algorithm, such that,
Er x [k] = REin [k] (31) for example, the update algorithm for the taps hxx is given by
we will, however, apply an equalizer to this using a set of tap 2
hxx = hxx = hxx + x xout xout (37)
weights h, such that 2 hxx x hxx
Eeq [k] = hH Er x [k]. (32) where xout = hxx xin + hxy yin and is the convergence param-
eter. We define the complex conjugate derivative as follows [22]:
We consider that the signal has passed through an unknown
polarization rotation R given by 1
= +j (38)
h 2 Re{h} Im{h}
cos() exp(j) sin()
R= (33) which gives the following update algorithm for the taps:
exp(j) sin() cos()
hxx = hxx + x xin xout
with the CMA initialized as h0 given by
hxy = hxy + x yin xout
cos() exp(j) sin()
h0 = (34)
exp(j) sin() cos() hy x = hy x + y xin yout

such that the initial control surface J = 2x is as follows: hy y = hy y + y yin yout . (39)
1 2) Simulation Model: To investigate the occurrences of sin-

J = sin2 (2) sin2 (2) cos2 ( ) gularities numerically, we consider a PDM-QPSK system with
2
an optical to SNR (OSNR) commensurate with a bit error rate
1
sin(4) cos(2) sin(2) cos( ) (BER) of 1 103 . For the case of 100 Gigabit Ethernet (GbE)
2
system, this corresponds to 28 GBd PDM-QPSK signals with
1 an OSNR of 13.3 dB, measured using a noise bandwidth of
+ (1 cos2 (2) cos2 (2)). (35)
2 0.1 nm.
A set of four bit sequences, each of length 212 bits were used
As can be seen from Fig. 6, in the region of interest, the control to create a stream of 212 PDM-QPSK symbols.21 Both and
surface exhibits two distinct optima due to the two possible were swept from /2 to /2 in steps of /128, giving rise to
conditions, which result in the two polarization channels being a total of 16 641 simulation points. A convergence parameter of
correctly de-multiplexed. These two regions are separated by = 103 was used, cycling through the pattern a total of four
line defined by the maximum value of J, which by inspection times; the first three times being used to ensure the tap weights
of (35) takes the value of 1/2 for which
1 21 In the case of analyzing, the singularity sequences were based on rounding
= arg() (36)
2 2 random variables created using the MATLAB rand function, however, we also
explored using a 2 1 2 de Bruin sequence with a delay of 2 1 0 symbols between
where = cos(2) + j cos( ) sin(2). the four tributaries, which gave the same results.
Fig. 9. Control surface for PDM-QPSK with a DD-EQ.
Fig. 7. Analytical prediction (dashed lines) and simulation points (dots), where
singularity were observed with = /6 and = /5.
Fig. 8. Control surface for PDM-QPSK with a CMA-based equalizer. Fig. 10. Location of points with high BER (>102 ) c.f. BER = 10 3 target
and points, where a singularity occurs.
had fully converged, with the final pass being used to measure
BER and assess whether a singularity had occurred. the CMA will ensure that the signal is correctly polarization-
3) Agreement Between Simulation and Theory: As can be demultiplexed. Furthermore at the minima point not only is the
seen in Fig. 7, there is excellent agreement between the analyt- error gradient zero, but also the value is zero and hence the
ical predictions of where the singularity should occur and the steady state error is zero.
recorded location of where a singularity occurred. Given this, it 2) Decision Directed: In a DD-EQ, the output of the equal-
is clear that the control surface illustrated in Fig. 6 gives signif- izer is feed into a decision circuit, such that the cost function is
icant insight into the convergence of the algorithm. Given this, given by
we now propose to extend this very visual approach to algo- 2
csgn(Eeq
x
)
rithms, such as the decision-directed equalizer (DD-EQ) or one, 2x = x
Eeq (41)
which employs training sequences for PDM-QPSK. 2
where csgn(x) is given by
F. Control Surfaces for PDM-QPSK
1+j [Re(x) > 0, Im(x) > 0]
1) Constant Modulus Algorithm: While we have already in-
1j [Re(x) > 0, Im(x) < 0]
vestigated the CMA in the previous sections, in order to per- csgn(x) = (42)

1 + j [Re(x) < 0, Im(x) > 0]
form a comparison with the other methods, herein we derive the 1 j [Re(x) < 0, Im(x) < 0].
control surface. In all cases, we shall assume that a stochastic
gradient method is used to achieve convergence of the taps. As can be seen in Fig. 9, the DD-EQ exhibits features not
The cost function is as follows: seen in the CMA surface, namely the presence of nonoptimal
2x = (1 |Eeq | )
x 2 2
(40) local minima. In principle, such an equalizer may become stuck
in a local minima, and hence, fail to converge to the correct
which gives the control surface illustrated in Fig. 8. output. Nevertheless, if the equalizer is preconditioned by using
As previously discussed, this exhibits multiple minima, how- a CMA to bring the tap weights close to the desired minima,
ever, notwithstanding the issue associated with the singularity, e.g., /4 < < /4, then the DD-EQ will converge.
Fig. 12. Constellation diagrams normalized to unitpower for QPSK

(with unit
circle shown) and 16-QAM (with circles of radius 0.2, 1 and 1.8 shown).
Fig. 11. Control surface for PDM-QPSK with an equalizer employing training
sequences.
To investigate further the issue associated with the local min-

ima exhibited by the control surface, we perform a numeri-
cal simulation with the same parameters, as those detailed in
Section VII-E, other than the angular resolution, which was
decreased from /128 to /64 to reduce simulation time.
As can been seen in Fig. 10, the control surface gives excellent
insight into the convergence of the equalizer with the triangular
plateaus resulting in a regions of high BER,22 and the boundaries
between regions predicting the location of the points, which give
Fig. 13. Control surface for PDM16-QAM with a CMA-based equalizer.
rise to a singularity.
3) Training Based: Thus far the algorithms are essentially
blind in that they exploit properties of the modulation format, 1) Constant Modulus Algorithm: While a 16-QAM signal
such as the constant modulus, but have no knowledge of what is clearly not a constant modulus format, the CMA may be
data was transmitted. An alternative approach is to train the used for preconvergence. As can be seen from Fig. 13, while
equalizer with a known training sequence such that the equalizer there is a local minima at the desired location, the error term
now has complete knowledge of the transmitted data.23 is nonzero at the optimum with min{2x } 0.33 in this case.
In this case, if the original data was x, then equalizer attempts Since, the error term does not tend to zero at the optimum, if
to minimize the function a stochastic gradient method is used, the equalizer continues to

2x = x Eeqx 2
. (43) adapt erroneously, introducing noise into the signal, and hence,
degrading the performance. As such it is desirable to modify the
control surface to ensure that the error signal tends to zero at
As can be seen from Fig. 11, in contrast to the previously the optimum point, resulting in the radially directed equalizer
considered cases, the control surface is smooth without local (RDE).
minima in the range considered. Given this control surface, one 2) Radially Directed Equalizer: In order to overcome the
would expect that the convergence properties would be the best limitations associated with the CMA [52], experimental inves-
of those considered, albeit at the expense of requiring redundant tigations by Winzer et al. [53], [54] used an RDE [55], [56],
symbols in order to train the equalizer.24 which has an error signal given by
G. Control Surfaces for PDM16-QAM 2x = (R02 r2 )2 (44)
Having developed this new insight into the control surfaces where r = |Eeq x
| and R0 is given by [56]
for PDM-QPSK, it is possible to extend this approach to other 2
if r < 0.6 then
modulation formats, such as PDM16-QAM. As can be seen in point belongs to group 1 and R02 = 0.2
Fig. 12, while 16-QAM does not have constant modulus, it may else
be considered to be made up of three shells, which is a property if r2 > 1.4 then
that a blind equalizer may exploit [51]. point belongs to group 3 and R02 = 1.8
else
22 Given the expected BER is 10 3 , we define high BER as one, which is point belongs to group 2 and R02 = 1.0
greater that 10 2 . end if
23 We also assume that the linewidth of the laser is sufficiently low that the
end if
carrier phase can be considered constant over the set of training symbols. with the signal normalized to unit power on each polarization.
24 It should be noted, however, that in current communication systems, there
are known framing symbols, which could be exploited to assist the equalizer As can be seen from Fig. 14, while minimum value of the con-
without reducing the data throughput. trol surface is zero, it is well behaved only over a small region, for
Fig. 14. Control surface for PDM16-QAM with an RDE. Fig. 16. Control surface for PDM16-QAM with an equalizer employing
training sequences.
single tap equalizer case becomes

hxx = hxx + x xin xout
hxy = hxy + x yin xout

hy x = hy x + y xin yout

hy y = hy y + y yin yout (45)
where x = 1 |xout | and y = 1 |yout | . Such equalizer
2 2
have been used extensively [7], [57] [59], in order to increase

the robustness of systems to PMD with current commercial
Fig. 15. Control surface for PDM16-QAM with a DD-EQ. systems offering the ability to compensate for a mean differential
group delay (DGD) of 25 ps for a 43 Gb/s system using 10.7 Gbd
PDM-QPSK. While, in principle, the amount of PMD, which
example, the surface has multiple saddle points for = /4. can be compensated is unlimited [60], the limitation is often the
To overcome this limitation, Winzer et al. [54] suggested the complexity of the receiver [61].
use of the CMA to bring the equalizer toward the region of con-
vergence, and once the CMA has converged, switching to the
RDE; since at the local minima, for the RDE unlike the CMA, I. Overcoming the Singularity Problem
the error term is zero. While this is not immediately intuitive
from the mathematics, the examination of the control surfaces Of the algorithms currently employed for dynamic equaliza-
gives insight as to why this is a sensible scheme. tion, the two outputs are converged independently. As such,
3) Decision Directed: As for the case of PDM-QPSK, it is there is a possibility of the two outputs converging to the same
also possible to implement a DD-EQ, which has the control source. At present, several methods have been explored to over-
surface illustrated in Fig. 15. While, in principle, this algorithm come this limitation, including the use of a cross correlation
has the minimum tracking error in the presence of noise, the term in the cost function [48], multiuser CMA [62], independent
region of convergence is the smallest, such that either CMA or component analysis [49], magnitude-bounded blind source sep-
RDE are required to precondition the equalizer. aration [63], or by the sequentially converging the outputs [50];
4) Training Sequences: Hitherto we have considered blind however, the optimal solution remains an open research prob-
algorithms, however, if a training sequence is available, this may lem. As noted by Haykin Signal processing is at its best when
be used to train the equalizer. As can be seen in Fig. 16, the use it successfully combines the unique ability of mathematics to
of training sequences creates a smooth control surface with a generalize with both the insight and prior information gained
global minima at the desired location, being identical in form to from the underlying physics of the problem at hand [64], and
Fig. 11, obtained for PDM-QPSK data. therefore, one solution is to consider the general transfer matrix
for the channel [65]
H. Frequency-Dependent Dynamic Channel T () = exp(j())V ()P U () (46)
Thus far, we have assumed that the dynamic channel is es- where U () and V () are unitary matrices, P is the matrix,
sentially flat, such that it may be reduced to one of polariza- which represents a partial polarizer (i.e., PDL), and () is the
tion tracking. Nevertheless, PMD, for example, results in a common phase, which may be a function of , due to residual
frequency-dependent dynamic channel, and therefore, in this chromatic dispersion. The form of T () suggests the equalizer
section, we consider extending the analysis to compensating structure illustrated in Fig. 17, where px and py compensate for
such impairments. If we consider the MIMO CMA with a the residual chromatic dispersion, and polarization dependent
stochastic gradient algorithm, which being very similar to the loss with conjugate symmetry imposed on u and v to ensure
For the case of linear interpolation, the sample y[k] will be a

linear combination of x[i] and x[i + 1], such that
y[k] = y(mk , k ) = x[mk ] + k (x[mk + 1] x[mk ]) (49)
where k is the fractional delay, mk is the basepoint, given
by mk = kwk + k , and wk is our estimate of Tsym /TADC .
As the sample point is incremented, the revised estimates of
Fig. 17. Modified MIMO equalizer. basepoint mk +1 and the fractional delay k +1 are given by
mk +1 = mk + k + wk
k +1 = [k + wk ]m o d1 . (50)
they model a unitary matrix, i.e., if uxx = (uxx [1], uxx [2],
. . . uxx [N ]), then uy y = (uxx [N ], uxx [N 1], . . . uxx [1]), and Linear interpolation forms the basis of higher order approx-
uxy = (uxy [1], uxy [2], . . . uxy [N ]), then uy x = (uxy [N ], imations, such as cubic interpolation or piecewise parabolic
uxy [N 1], . . . uxy [1]). While the structure of the equal- interpolation [68], [69]. Nevertheless, as the degree of the ap-
izer in Fig. 17 appears significantly more complex than the orig- proximation is increased, it does not necessarily follow that the
inal MIMO equalizer of Fig. 4, functionally they are identical, quality of the approximation improves with a study by Chang
albeit with constraints applied to the tap weights. In principle, et al. [70], indicating that piecewise parabolic interpolation out-
it is possible to update all of the tap weights simultaneously performs linear or cubic interpolation.
resulting in an adaptive cascaded filter. However, the discussion
of such filters is outside of the scope of this paper. B. Timing Recovery
Digital timing recovery has been a topic of extensive re-
VIII. INTERPOLATION AND TIMING RECOVERY search [13] with both non-data-aided [71] and data-aided [72]
Having equalized for the channel impairments, it becomes algorithms being employed. In order to simplify the interaction
possible to compensate for the difference between the symbol between the subsystems, we shall restrict our consideration to
clock and the ADC sampling rate. As previously mentioned, nondata-aided methods. One such algorithm aims to maximize
in principle, it is entirely possible to operate the entire ASIC the squared modulus |y(mk , k )|2 of the interpolated signal
using only the ADC sampling rate, however, an alternative is to y[k] = y(mk , k ) [13], [70]. Differentiating |y(mk , k )|2 with
consider partitioning the ASIC, such that one part operates using respect to time gives the error signal as follows:
the ADC clock and the second part uses a derived symbol clock.
d|y(mk , k )|2
The task of the interpolating subsystem is to obtain samples y[k] e(mk ) =
at time t = t0 + kTsym given samples x[i] at t = iTADC . dt

(y(mk + 1, k ) y(mk 1, k ))
2Re y(mk , k )
A. Interpolation 2Tsym
Interpolation draws on the mathematical theory of approxi- (51)
mation theory [66], using Lagrange polynomials or splines to
with this error signal going to zero when the signal is synchro-
interpolate a sampled function [67]. To illustrate the principle,
nized. We may use this error signal e(mk ) to update wk , our
if x[i] are our samples at t = iTADC + TADC , where i is an
estimate of Tsym /TADC , such that
integer and 0 < 1, then it is possible to create a continuous-
time approximation of the form
N 1
wk +1 = wk + c[i]e(mk i ) (52)
y(t) = x[i]i (t [iTADC + TADC ]) (47) i=0
i
where the error signal is filtered by an FIR filter of length N
where i (t) are a set of interpolation functions.25 Hence, if we with coefficients c[i], which incorporate the convergence factor
resample this signal at a time t = kTsym + Tsym , where k is for this stochastic gradient method.26
an integer and 0 < 1, then the samples y[k] will be given
by IX. FREQUENCY ESTIMATION

y[k] = x[i]i (kTsym + Tsym [iTADC + TADC ]) Given the structural design of the DSP, we have elected to
i separate the bulk frequency estimation from the carrier recovery.
(48) This not only reduces the amount of phase, which the carrier
such that the output is a linear combination of the inputs, and recovery subsystem has to track, but also it improves the efficacy
hence, may be realized as an FIR filter. of the carrier recovery, since many phase estimation schemes
25 Over an infinite-time window, these take the form of sinc(x) = sin(x)/x 26 One common approach used by Chang et al. [70] is to implement a
as per the celebrated ShannonNyquist sampling theorem, however, for a finite- proportionalintegral controller, such that c[0] = K i + K p with all other co-
time window may be realized as Lagrange polynomials or B-splines. efficients being K i .
are only unbiased in the presence of zero frequency offset. If X. CARRIER RECOVERY
the input signal is of the form
In order to estimate the residual carrier phase, feedforward
xin [k] = xsym [k] exp(j[[k] + 2f kTsym ]) (53) techniques are often preferable [79], [80]. As with many of the
frequency-estimation algorithms, the carrier phase may be es-
the task of this subsystem is to estimate f .
timated using an fourth-order nonlinearity in order to remove
the QPSK modulation, giving the estimate of the phase as fol-
A. Differential Phase-Based Methods
lows [81]:
Given the dominance of PDM-QPSK as a modulation format, !
we shall first begin with discussing algorithms suitable for this 1
N
4
[k] = arg w[n]xin [k + n] (59)
modulation format, prior to discussing higher order modulation 2N + 1
n =N
formats. For QPSK data, we note that
(xin [k]xin [k 1])4 exp(4j[k]). (54) where w[n] is a weighting function, which depends on the ratio
of the additive white Gaussian noise to the laser phase noise [82]
In the absence additive noise, 4 has a circular Gaussian dis- (in the absence of laser phase noise w[n] = 1, and the estimator
tribution,27 due to the laser phase noise with mean 8f Tsym , becomes that given by Viterbi and Viterbi [81]). The result of
such that the probability density function (pdf) is of the form the weighting function is to apply a Wiener filter to estimate
exp( cos(4 8f Tsym )) the phase noise [80], [82], which can approach the performance
f (4) = (55) of an ideal maximum a priori (MAP) estimator of the phase.
2I0 ()
Using these approaches for a 1 dB penalty, a linewidth up to
where is related to the linewidth of the laser. Using the pdf, 28 MHz may be tracked for 28 GBd PDM-QPSK [82]. While
we can estimate the parameter of interest using the maximum other estimators such as the Kalman filter have been investi-
likelihood technique [73], which gives an estimate of f as gated [83], the efficiency with which the algorithm may be im-
follows: plemented in hardware requires consideration. One particularly
!
1 N hardware-efficient phase estimator is the Barycenter algorithm
4
f = arg (xin [k]xin [k 1]) (56) in which the order of the operations in (59) are reversed to give
8Tsym
k =1 [84]
which is the form of the estimator proposed by Leven et al. [74].
Alternatively, another estimator may be obtained by reversing 1
N
$ %
[k] = w[n] arg x4in [k + n] . (60)
the order of the operations, and iteratively, estimating the fre- 2N + 1
n =N
quency offset [75]
" # As modulation formats move beyond QPSK to QAM, the
arg (xin [k]xin [k 1])4 requirements on the laser linewidth become increasingly strin-
f [k] = f [k 1](1 ) + gent [85]. Nevertheless using conventional wireless approaches
8Tsym
(57) such as decision-directed phase locked loops, have enabled a
where is a convergence factor and f [k] is the kth estimate linewidth of 1 MHz to be tracked digitally for 14 GBd PDM-
of the frequency estimate with the advantage of this method 16-QAM [52], [54]. Furthermore, for differential 16-QAM and
being that it may be implemented using a hardware-efficient 64-QAM, it has been shown that a digital phase-locked loop
realization [75]. can compensate for a residual frequency offset of 1% of the
symbol rate [86]. While the digital phase-locked loop presents
B. Spectral Methods challenges for CMOS-based parallel implementation, hardware-
efficient carrier-recovery schemes have also been proposed with
For QPSK data, f may be estimated by [76]
similar performance [87].

N In addition to the errors due to the residual phase noise, there
f = arg max x4in [k] exp(8jkf Tsym ) (58) is also the possibility of a cycle slip, which can have a catas-
f
k =1 trophic effect on the performance. As discussed by Taylor [82],
which may be implemented by observing the peak in the spec- in order to reduce the probability of a cycle slip to that of the
trum of x4in . In practice, the number of frequency points may be corrected the BER = 1018 , the laser linewidth may need to be
insufficient to give an accurate estimate of the frequency, how- reduced by two orders of magnitude, e.g., 600 kHz for 28 GBd
ever, an iterative method may be used to improve this estimate, PDM-QPSK.28 In order to minimize the impact of cycle slips,
with this method being equally applicable to QAM formats [77]. differential decoding may be employed. While this has the effect
While these are essentially feedforward techniques, feedback of increasing the BER by as much as a factor of two, and hence,
techniques employing a frequency-controlled loop may also be incurring a modest penalty (<1 dB), this often outweighs advan-
used, having the advantage that they are agnostic to the modu- tage of relaxed linewidth requirements and simplified symbol
lation format [13], [78]. decoding.
27 The circular Gaussian distribution is also known as the Tikhonov distribu- 28 These stringent linewidth requirements may be optimistic due to the pres-
tion and the von Mises distribution. ence of equalization enhanced phase noise [88].
XI. SYMBOL ESTIMATION AND DECODING draws on methods developed for wireless communication sys-
tems (such as OFDM, e.g., [91][93]), however, the channel is
Following carrier recovery, the signal may be decoded by
the outer receiver. This could take the form of a soft-decision fundamentally different due to the distributed nature of the non-
linear channel and the presence of non-Gaussian noise [94]. Not
forward error correction (FEC) using a Galois field correspond-
only is the optical channel distributed in nature, having both dis-
ing to the symbol alphabet, or symbol estimation followed by
hard-decision FEC. In current systems, which are based on hard- tributed nonlinearities and noise due to the period amplification
of the signal, but also the lasers have nonnegligible phase noise.
decision decoding of binary data, symbol estimation and bit de-
The task of designing CMOS-based photonic digital modems,
coding is required. For rectangular constellations, such as QAM,
this may be achieved by applying a series of decision thresholds which allow the information theoretic capacity of the nonlinear
dispersive MIMO communication channel with non-Gaussian
to the in-phase and quadrature components separately. While
this corresponds to the maximum likelihood symbol estimation and signal dependent noise will challenge the community for
for a system limited by additive white Gaussian noise (AWGN), years to come.
by using nonrectangular decision boundaries, it is possible to
improve the performance for systems limited by phase noise, XIII. CONCLUDING REMARKS
both linear and nonlinear [89]. In this paper, we have attempted to outline the subsystems
and algorithms, which are required to realize a digital coherent
XII. CHALLENGES FOR DIGITAL COHERENT optical receiver. As systems move beyond PDM-QPSK toward
TRANSMISSION SYSTEMS higher level modulation formats or to multicarrier techniques,
a natural evolution will be for DSP to be employed at both the
In this final section, we shall consider future challenges for
transmitter and receiver, such that the structural design of the
digital coherent transmission systems in which DSP is not only
DSP may differ from that presented. Nevertheless, many of the
applied at the receiver, but also the transmitter in order to opti-
subsystems outlined herein, such as channel equalization and
mize the overall performance.
carrier recovery are likely to be present in future photonic dig-
The first challenge is that of integration of the subsystems, in-
ital modems. While the commercialization of photonic digital
cluding photonic integration of optical components in the trans-
receivers has begun, it is clear that there remain much research
mitter and receiver allowing the size, cost, and power consump-
to be done in order to allow a digital coherent optical commu-
tion of the digital coherent modems to be reduced. In addition
nication system to achieve the information theoretic nonlinear
to photonic integration, digital integration of data converters
channel capacity, while offering the functionality that the oper-
and associated DSP, is highly desirable. At present, one of the
ators require for future optical networks.
major barriers to the deployment of coherent systems are the
resources required to develop an ASIC and while field pro-
ACKNOWLEDGMENT
gramable gate arrays (FPGA) offer a means of evaluating al-
gorithms, the lack of integrated data converters make their use The author would like to thank Politecnico di Torino for
prohibitive for commercial systems.29 The integration of data their support over the summer 2009, when the initial work on
converters with an ASIC, whose functionality may be controlled the control surfaces was carried out. He would also like to
via firmware (such as an FPGA) would be a key enabling tech- thank Prof. P. Bayvel and Prof. P. Poggiolini for stimulating
nology for future digital coherent systems, paving the way for discussions, and Dr. B. Thomsen, Mr. D. Millar, and Mr. I.
software-defined transceivers. The final level of integration is Fatadin for their comments on this paper.
at the information theoretic level, combining modulation and
demodulation with the associated coding and decoding.30 REFERENCES
The second challenge is that of algorithm design. This in- [1] P. Schvan, J. Bach, C. Fait, P. Flemke, R. Gibbins, Y. Greshishchev,
cludes not only devising algorithms, which can allow the com- N. Ben-Hamida, D. Pollex, J. Sitch, S.-C. Wang, and J. Wolczanski, A
plexity and power consumption of the transceiver to be reduced, 24GS/s 6b ADC in 90nm CMOS, in Proc. Digest Tech. Papers. IEEE Int.
Solid-State Circuits Conf., 2008 (ISSCC), Feb., pp. 544634.
employing, for example, baud rate sampling, but also to increase [2] F. Derr, Coherent optical QPSK intradyne system: Concept and digital
the robustness of systems to imperfections in the photonic com- receiver realization, J. Lightw. Technol., vol. 10, no. 9, pp. 12901296,
ponents, such as laser phase noise or nonideal hybrids. While Sep. 1992.
[3] J. Winters, Equalization in coherent lightwave systems using a fraction-
the algorithms outlined in this paper are well-suited to current ally spaced equalizer, J. Lightw. Technol., vol. 8, no. 10, pp. 14871491,
SDH/SONET data payloads, the DSP would need to be fun- Oct. 1990.
damentally redesigned to facilitate the detection of bursts of [4] S. Betti, F. Curti, G. De Marchis, and E. Iannone, A novel multilevel
coherent optical system: 4-quadrature signaling, J. Lightw. Technol.,
data, and hence, minimizing the loss of data due to the finite- vol. 9, no. 4, pp. 514523, Apr. 1991.
acquisition time. Most of the current research on algorithms [5] H. Sun, K.-T. Wu, and K. Roberts, Real-time measurements of a 40 Gb/s
coherent system, Opt. Exp., vol. 16, no. 2, pp. 873879, 2008.
[6] S. J. Savory, G. Gavioli, R. I. Killey, and P. Bayvel, Electronic compensa-
29 Primarily, due to a data throughput of >1 Tb/s being required for a 100 GbE
tion of chromatic dispersion using a digital coherent receiver, Opt. Exp.,
system (6 bits of resolution 4 56 GSa/s), however, the power consumption vol. 15, no. 5, pp. 21202126, 2007.
and cost of this hybrid solution is currently at least an order of magnitude higher [7] S. J. Savory, Digital filters for coherent optical receivers, Opt. Exp.,
than an integrated solution. vol. 16, no. 2, pp. 804817, 2008.
30 One example of this is the turbo equalizer, which combines a MAP receiver [8] X. Zhou, J. Yu, M.-F. Huang, Y. Shao, T. Wang, P. Magill, M. Cvijetic,
with low-density parity codes [90]. L. Nelson, M. Birk, G. Zhang, S. Ten, H. Matthew, and S. Mishra, 32Tb/s
(320114Gb/s) PDM-RZ-8QAM transmission over 580km of SMF-28 [35] E. Ip and J. Kahn, Compensation of dispersion and nonlinear impair-
ultra-low-loss fiber, in Proc. Opt. Fiber Commun. Conf., Mar. 2009, ments using digital backpropagation, J. Lightw. Technol., vol. 26, no. 20,
paper PDPB4. pp. 34163425, Oct. 15, 2008.
[9] C. Schmidt-Langhorst, R. Ludwig, D.-D. Gross, L. Molle, M. Seimetz, [36] F. Yaman and G. Li, Nonlinear impairment compensation for
R. Freund, and C. Schubert, Generation and coherent time-division de- polarization-division multiplexed WDM transmission using digital back-
multiplexing of up to 5.1 Tb/s single-channel 8-PSK and 16-QAM sig- ward propagation, IEEE Photon. J., vol. 1, no. 2, pp. 144152, Aug.
nals, in Proc. Opt. Fiber Commun. Conf, Mar. 2009, paper PDPC6. 2009.
[10] M. Salsi, H. Mardoyan, P. Tran, C. Koebele, G. Charlet, and S. Bigo, [37] S. J. Savory, G. Gavioli, E. Torrengo, and P. Poggiolini, Impact of inter-
155x100Gbit/s coherent PDM-QPSK transmission over 7,200km, in channel nonlinearities on a split-step intrachannel nonlinear equalizer,
Proc. Eur. Conf. Opt. Commun, Sep. 2009, paper PD2.5. IEEE Photon. Technol. Lett., vol. 22, no. 10, pp. 673675, May 2010.
[11] H. Rohde, S. Smolorz, E. Gottwald, and K. Kloppe, Next generation [38] G. Goldfarb and G. Li, Efficient backward-propagation using wavelet
optical access: 1 Gbit/s for everyone, in Proc. Eur. Conf. Opt. Commun., based filtering for fiber backward-propagation, Opt. Exp., vol. 17, no. 11,
Sep. 2009, paper 10.5.5. pp. 88158821, 2009.
[12] P. J. Anslow, C. R. S. Fludger, S. Savory, I. Hardcastle, and J. Fells, [39] D. T. Westwick and R. E. Kearney, Identification of Nonlinear Physiolog-
Frequency selective coherent receiver for Agile Networks, in Proc. Eur. ical Systems. Piscataway, NJ: IEEE, 2003.
Conf. Opt. Commun, Sep. 2006, paper Mo4.2.4. [40] R. Raheli and G. Picchi, Synchronous and fractionally-spaced blind
[13] H. Meyr, M. Moeneclaey, and S. A. Fechtel, Digital Communication equalization in dually-polarized digital radio links, in Proc. IEEE Int.
Receivers: Synchronization, Channel Estimation, and Signal Processing. Conf. Commun., 1991 (ICC), Conf. Record., Jun., vol. 1, pp. 156
New York: Wiley, 1997. 161.
[14] L. Kazovsky, Phase- and polarization-diversity coherent optical tech- [41] F. Hauske, M. Kuschnerov, B. Spinnler, and B. Lankl, Optical perfor-
niques, J. Lightw. Technol., vol. 7, no. 2, pp. 279292, Feb. 1989. mance monitoring in digital coherent receivers, J. Lightw. Technol.,
[15] M. Seimetz and C.-M. Weinert, Options, feasibility, and availability of 2 vol. 27, no. 16, pp. 36233631, Aug. 15, 2009.
4 90 degree hybrids for coherent optical systems, J. Lightw. Technol., [42] S. Woodward, L. Nelson, M. Feuer, X. Zhou, P. Magill, S. Foo, D. Hanson,
vol. 24, no. 3, pp. 13171322, Mar. 2006. H. Sun, M. Moyer, and M. OSullivan, Characterization of real-time PMD
[16] S. Haykin, Neural Networks and Learning Machines. Englewood Cliffs, and chromatic dispersion monitoring in a high-PMD 46-Gb/s transmission
NJ: Prentice-Hall, 2009. system, IEEE Photon. Technol. Lett., vol. 20, no. 24, pp. 20482050,
[17] C. Vogel and H. Johansson, Time-interleaved analog-to-digital convert- Dec. 15, 2008.
ers: Status and future directions, in Proc. 2006 IEEE Int. Symp. Circuits [43] J. Geyer, F. Hauske, C. Fludger, T. Duthel, C. Schulien, M. Kuschnerov,
Syst., 2006 (ISCAS), pp. 33863389. K. Piyawanno, D. van den Borne, E.-D. Schmidt, B. Spinnler, H. de
[18] J. Elbornsson, F. Gustafsson, and J.-E. Eklund, Blind equalization of time Waardt, B. Lankl, and B. Schmauss, Channel parameter estimation for
errors in a time-interleaved ADC system, IEEE Trans. Signal Process., polarization diverse coherent receivers, IEEE Photon. Technol. Lett.,
vol. 53, no. 4, pp. 14131424, Apr. 2005. vol. 20, no. 10, pp. 776778, May. 15, 2008.
[19] M. Jridi, L. Bossuet, B. Le Gal, and D. Dallet, New adaptive calibration [44] D. Godard, Self-recovering equalization and carrier tracking in two-
method for time interleaved analog to digital converters, in Proc. IEEE dimensional data communication systems, IEEE Trans. Commun.,
Northeast Workshop Circuits Syst., 2007 (NEWCAS), Aug., pp. 932935. vol. 28, no. 11, pp. 18671875, Nov. 1980.
[20] I. Dedic, 56 GS/s ADC: Enabling 100GbE, in Proc. Opt. Fiber Commun. [45] K. Kikuchi, Polarization-demultiplexing algorithm in the digital coherent
Conf., 2010, p. OThT6. receiver, in Proc. Digest 2008 IEEE/LEOS Summer Topical Meetings,
[21] T. Tanimura, S. Oda, T. Tanaka, T. Hoshida, Z. Tao, and J. Rasmussen, Jul., pp. 101102.
A simple digital skew compensator for coherent receiver, in Proc. Eur. [46] L. Smith, Chaos: A Very Short Introduction. London, U.K.: Oxford
Conf. Opt. Commun., Sep. 2009, paper 7.3.2. Univ. Press, 2007.
[22] S. Haykin, Adaptive Filter Theor. Englewood Cliffs, NJ: Prentice-Hall, [47] M. Silva and M. Miranda, Tracking issues of some blind equalization
2001. algorithms, IEEE Signal Process. Lett., vol. 11, no. 9, pp. 760763, Sep.
[23] I. Fatadin, S. J. Savory, and D. Ives, Compensation of quadrature imbal- 2004.
ance in an optical QPSK coherent receiver, IEEE Photon. Technol. Lett., [48] S. Kun and Z. Xudong, A SE-CMA based blind equalization for MIMO
vol. 20, no. 20, pp. 17331735, Oct. 15, 2008. systems, in Proc. 7th Int. Conf. Signal Process., 2004 (ICSP), Aug./Sep.,
[24] S. H. Chang, H. S. Chung, and K. Kim, Impact of quadrature imbalance vol. 2, pp. 16741677.
in optical coherent QPSK receiver, IEEE Photon. Technol. Lett., vol. 21, [49] H. Zhang, Z. Tao, L. Liu, S. Oda, T. Hoshida, and J. Rasmussen, Polariza-
no. 11, pp. 709711, Jun. 1, 2009. tion demultiplexing based on independent component analysis in optical
[25] I. Mayer, On Lowdins method of symmetric orthogonalization, Int. J. coherent receivers, in Proc. Eur. Conf. Opt. Commun, Sep. 2008, paper
Quantum Chem., vol. 90, no. 1, pp. 6365, Oct. 5, 2002. Mo.3.D.5.
[26] D. Scofield, Note on Lowdin orthogonalization and square root of a [50] L. Liu, Z. Tao, W. Yan, S. Oda, T. Hoshida, and J. Rasmussen, initial
positive self-adjoint matrix, Int. J. Quantum Chem., vol. 7, no. 3, pp. tap setup of constant modulus algorithm for polarization de multiplexing
561568, 1973. in optical coherent receivers, in Proc. Opt. Fiber Commun. Conf, Mar,
[27] H. C. Schweinler and E. P. Wigner. (1970). Orthogonalization meth- 2009, paper OMT2.
ods. J. Math. Phys., [Online]. 11(5), pp. 16931694. Available: [51] J. Treichler, M. Larimore, and J. Harp, Practical blind demodulators for
http://link.aip.org/link/?JMP/11/1693/1 high-order QAM signals, Proc. IEEE, vol. 86, no. 10, pp. 19071926,
[28] M. G. Taylor, Coherent detection method using DSP for demodulation of Oct. 1998.
signal and subsequent equalization of propagation impairments, IEEE [52] I. Fatadin, D. Ives, and S. J. Savory, Blind equalization and carrier phase
Photon. Technol. Lett., vol. 16, no. 2, pp. 674676, Feb. 2004. recovery in a 16-QAM optical coherent system, J. Lightw. Technol.,
[29] D. Crivelli, H. Carter, and M. Hueda, Adaptive digital equalization in the vol. 27, no. 15, pp. 30423049, Aug. 1, 2009.
presence of chromatic dispersion, PMD, and phase noise in coherent fiber [53] P. J. Winzer and A. H. Gnauck, 112-Gb/s polarization-multiplexed 16-
optic systems, in Proc. IEEE Global Telecommun. Conf. (GLOBECOM), QAM on a 25-GHz WDM grid, in Proc. Eur. Conf. Opt. Commun., Sep.
Nov. 2003Dec. 2004, vol. 4, pp. 25452551. 2008, paper Th.3.E.5.
[30] G. P. Agrawal, Lightwave Technology: Telecommunication Systems. [54] P. Winzer, A. Gnauck, C. Doerr, M. Magarini, and L. Buhl, Spec-
New York: Wiley Interscience, 2005. trally efficient long-haul optical networking using 112-Gb/s polarization-
[31] G. Goldfarb and G. Li, Chromatic dispersion compensation using dig- multiplexed 16-QAM, J. Lightw. Technol., vol. 28, no. 4, pp. 547556,
ital IIR filtering with coherent detection, IEEE Photon. Technol. Lett., Feb. 15, 2010.
vol. 19, no. 13, pp. 969971, Jul. 1, 2007. [55] W. Sethares, G. Rey, and J. Johnson C.R., Approaches to blind equaliza-
[32] M. Kuschnerov, F. Hauske, K. Piyawanno, B. Spinnler, M. Alfiad, tion of signals with multiple modulus, in Proc. Int. Conf. Acoust., Speech,
A. Napoli, and B. Lankl, DSP for coherent single-carrier receivers, Signal Process., 1989 (ICASSP), May, vol. 2, pp. 972975.
J. Lightw. Technol., vol. 27, no. 16, pp. 36143622, Aug. 15, 2009. [56] M. Ready and R. Gooch, Blind equalization based on radius directed
[33] K.-P. Ho, Subband equaliser for chromatic dispersion of optical fibre, adaptation, in Proc. Int. Conf. Acoust., Speech, Signal Process., 1990
Electron. Lett., vol. 45, no. 24, pp. 12241226, Nov. 19, 2009. (ICASSP), Apr., vol. 3, pp. 16991702.
[34] R. Gilmore, Baker-Campbell-Hausdorff formulas, J. Math. Phys., [57] C. Fludger, T. Duthel, D. van den Borne, C. Schulien, E.-D. Schmidt,
vol. 15, no. 12, pp. 20902092, 1974. T. Wuth, J. Geyer, E. De Man, K. Giok-Djan, and H. de Waardt, Coherent
equalization and POLMUX-RZ-DQPSK for robust 100-GE transmis- [83] L. Pessoa, H. Salgado, and I. Darwazeh, Performance evaluation of phase
sion, J. Lightw. Technol., vol. 26, no. 1, pp. 6472, Jan. 1, 2008. estimation algorithms in equalized coherent optical systems, IEEE Pho-
[58] C. Laperle, B. Villeneuve, Z. Zhang, D. McGhan, H. Sun, and ton. Technol. Lett., vol. 21, no. 17, pp. 11811183, Sep. 1, 2009.
M. OSullivan, WDM Performance and PMD tolerance of a coherent 40- [84] S. Hoffmann, R. Peveling, T. Pfau, O. Adamczyk, R. Eickhoff, and R. Noe,
Gbit/s dual-polarization QPSK transceiver, J. Lightw. Technol., vol. 26, Multiplier-free real-time phase tracking for coherent QPSK receivers,
no. 1, pp. 168175, Jan. 1, 2008. IEEE Photon. Technol. Lett., vol. 21, no. 3, pp. 137139, Feb. 1, 2009.
[59] O. Bertran-Pardo, J. Renaudier, G. Charlet, P. Tran, H. Mardoyan, M. Salsi, [85] S. Zhang, P. Kam, C. Yu, and J. Chen, Laser linewidth tolerance of
and S. Bigo, Experimental assessment of interactions between nonlinear decision-aided maximum likelihood phase estimation in coherent optical
impairments and polarization-mode dispersion in 100-Gb/s coherent sys- M -ary PSK and QAM systems, IEEE Photon. Technol. Lett., vol. 21,
tems versus receiver complexity, IEEE Photon. Technol. Lett., vol. 21, no. 15, pp. 10751077, Aug. 1, 2009.
no. 1, pp. 5153, Jan. 1, 2009. [86] I. Fatadin, D. Ives, and S. J. Savory, Compensation of frequency offset
[60] E. Ip and J. Kahn, Digital equalization of chromatic dispersion and po- for differentially encoded 16- and 64-QAM in the presence of laser phase
larization mode dispersion, J. Lightw. Technol., vol. 25, no. 8, pp. 2033 noise, IEEE Photon. Technol. Lett., vol. 22, no. 3, pp. 176178, Feb. 1,
2043, Aug. 2007. 2010.
[61] B. Spinnler, Complexity of algorithms for digital coherent receivers, in [87] T. Pfau, S. Hoffmann, and R. Noe, Hardware-efficient coherent digital
Proc. Eur. Conf. Opt. Commun., Sep. 2009, paper 7.3.6. receiver concept with feedforward carrier recovery for M -QAM con-
[62] A. Vgenis, C. Petrou, C. Papadias, I. Roudas, and L. Raptis, Nonsingular stellations, J. Lightw. Technol., vol. 27, no. 8, pp. 989999, Apr. 15,
constant modulus equalizer for PDM-QPSK coherent optical receivers, 2009.
IEEE Photon. Technol. Lett., vol. 22, no. 1, pp. 4547, Jan. 1, 2010. [88] W. Shieh and K.-P. Ho, Equalization-enhanced phase noise for coherent-
[63] T. Oktem, A. Erdogan, and A. Demir, Adaptive receiver structures for detection systems using electronic digital signal processing, Opt. Exp,
fiber communication systems employing polarization-division multiplex- vol. 16, no. 20, pp. 15 71815 727, 2008.
ing, J. Lightw. Technol., vol. 27, no. 23, pp. 53945404, Dec. 1, 2009. [89] A. P. T. Lau and J. M. Kahn, Signal design and detection in presence of
[64] S. Haykin, Signal processing: Where physics and mathematics meet, nonlinear phase noise, J. Lightw. Technol., vol. 25, no. 10, pp. 3008
IEEE Signal Process. Mag, vol. 18, no. 4, pp. 67, Jul. 2001. 3016, 2007.
[65] H. Hurwitz and R. C. Jones, A new calculus for the treatment of optical [90] I. B. Djordjevic, L. L. Minkov, L. Xu, and T. Wang, Suppression of
systems, J. Opt. Soc. Amer., vol. 31, no. 7, pp. 493495, 1941. fiber nonlinearities and PMD in coded-modulation schemes with coherent
[66] M. J. D. Powell Approximation Theory and Methods, Cambridge, U.K. detection by using turbo equalization, J. Opt. Commun. Netw., vol. 1,
Cambridge Univ. Press, 1982. no. 6, pp. 555564, 2009.
[67] P. Prandoni and M. Vetterli, From Lagrange to Shannon. and back: An- [91] S. L. Jansen, I. Morita, T. C. W. Schenk, N. Takeda, and H. Tanaka,
other look at sampling [DSP Education], IEEE Signal Process. Mag., Coherent optical 25.8-Gb/s OFDM transmission over 4160-km SSMF,
vol. 26, no. 5, pp. 138144, Sep. 2009. J. Lightw. Technol., vol. 26, no. 1, pp. 615, Jan. 2008.
[68] F. M. Gardner, Interpolation in digital modems. Part I: Fundamentals, [92] X. Liu, F. Buchali, and R. Tkach, Improving the nonlinear tolerance
IEEE Trans. Commun., vol. 41, no. 3, pp. 501507, Mar. 1993. of polarization-division-multiplexed CO-OFDM in long-haul fiber trans-
[69] L. Erup, F. M. Gardner, and R. A. Harris, Interpolation in digital modems. mission, J. Lightw. Technol., vol. 27, no. 16, pp. 36323640, Aug. 15,
Part II: Fundamentals, IEEE Trans. Commun., vol. 41, no. 6, pp. 998 2009.
1008, Jun. 1993. [93] Y. Ma, Q. Yang, Y. Tang, S. Chen, and W. Shieh, 1-Tb/s single-channel
[70] S. H. Chang, H. S. Chung, and K. Kim, Digital non-data-aided sym- coherent optical OFDM transmission over 600-km SSMF fiber with sub-
bol synchronization in optical coherent intradyne reception, Opt. Exp., wavelength bandwidth access, Opt. Exp., vol. 17, no. 11, pp. 94219427,
vol. 16, no. 19, pp. 15 09715 103, 2008. 2009.
[71] F. Gardner, A BPSK/QPSK timing-error detector for sampled receivers, [94] R.-J. Essiambre, G. J. Foschini, G. Kramer, and P. J. Winzer, Capacity
IEEE Trans. Commun., vol. COM-34, no. 5, pp. 423429, May 1986. limits of information transport in fiber-optic networks, Phys. Rev. Lett.,
[72] K. Mueller and M. Muller, Timing recovery in digital synchronous data vol. 101, no. 16, p. 163901, 2008.
receivers, IEEE Trans. Commun., vol. COM-24, no. 5, pp. 516531,
May 1976.
[73] H. L. Van Trees, Detection, Estimation, and Modulation Theory, Part I.
New York: Wiley Interscience, 2001.
[74] A. Leven, N. Kaneda, U.-V. Koc, and Y.-K. Chen, Frequency estimation
in intradyne reception, IEEE Photon. Technol. Lett., vol. 19, no. 6,
pp. 366368, Mar. 15, 2007.
[75] S. Hoffmann, S. Bhandare, T. Pfau, O. Adamczyk, C. Wordehoff, Seb J. Savory (M07) was born in Stirling, U.K.,
R. Peveling, M. Porrmann, and R. Noe, Frequency and phase estima- in 1973. He received the M.Eng., M.A., and Ph.D.
tion for coherent QPSK transmission with unlocked DFB lasers, IEEE degrees in engineering from the University of
Photon. Technol. Letts, vol. 20, no. 18, pp. 15691571, Sep. 15, 2008. Cambridge, Cambridge, U.K., in 1996, 1999, and
[76] M. Morelli and U. Mengali, Feedforward frequency estimation for PSK: 2001, respectively, and the M.Sc. degree in math-
A tutorial review, Eur. Trans. Telecommun., vol. 9, no. 2, pp. 103116, ematics from the Open University, Milton Keynes,
1998. U.K., in 2007.
[77] M. Selmi, Y. Jaouen, and P. Ciblat, Accurate digital frequency offset His interest in optical communications began in
estimator for coherent PolMux QAM transmission systems, in Proc. Eur. 1991, when he joined Standard Telecommunications
Conf. Opt. Commun., Sep. 2009, paper P3.08. Laboratories, Harlow, U.K., prior to being sponsored
[78] K. Piyawanno, M. Kuschnerov, B. Spinnler, and B. Lankl, Fast and though his undergraduate and postgraduate studies by
accurate automatic frequency control for coherent receivers, in Proc. Nortel. On completion of his Ph.D. degree in 2000, he joined Nortels Harlow
Eur. Conf. Opt. Commun., Sep. 2009, paper 7.3.1. Laboratories as Senior Research Engineer, where he was engaged in research
[79] R. Noe, Phase noise-tolerant synchronous QPSK/BPSK baseband-type into digital signal processing and advanced optical transmission systems. In
intradyne receiver concept with feedforward carrier recovery, J. Lightw. 2005, he joined the Optical Networks Group at University College London
Technol., vol. 23, no. 2, pp. 802808, Feb. 2005. (UCL), where he held a Leverhulme Trust Early Career Fellowship from 2005
[80] E. Ip and J. Kahn, Feedforward carrier recovery for coherent optical 2007, being appointed as a University Lecturer in 2007. From June 2009June
communications, J. Lightw. Technol., vol. 25, no. 9, pp. 26752692, 2010 he was also a Visiting Professor at the Politecnico di Torino, Italy. His
Sep. 2007. research interests include digital coherent transceivers, optical transmission sys-
[81] A. J. Viterbi and A. M. Viterbi, Nonlinear estimation of PSK-modulated tems and subsystems, digital signal processing, and nonlinear systems.
carrier phase with application to burst digital transmission, IEEE Trans. Dr. Savory is a Chartered Engineer and an Associate Editor for the IEEE
Inf. Theory, vol. 29, no. 4, pp. 543551, Jul. 1983. PHOTONICS TECHNOLOGY LETTERS. He also serves on the technical program
[82] M. G. Taylor, Phase estimation methods for optical coherent detection us- committee for the Optical Fiber Communication Conference, the European
ing digital signal processing, J. Lightw. Technol., vol. 27, no. 7, pp. 901 Conference on Optical Communication, and the IEEE Photonics Society An-
914, Apr. 1, 2009. nual Meeting.

Digital Coherent Optical Receivers Algorithms and Subsystems

Uploaded by

Document Information

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Digital Coherent Optical Receivers Algorithms and Subsystems

Uploaded by

Copyright:

Available Formats

1164 IEEE JOURNAL OF SELECTED TOPICS IN QUANTUM ELECTRONICS, VOL. 16, NO.

Digital Coherent Optical Receivers:

1077-260X/$26.00 2010 IEEE

Fig. 2. 28 port receiver.

estimation subsystem, however, other paths depend on the algo-

III. PHASE AND POLARIZATION DIVERSE

3) Time interleaved, which uses lower speed ADCs, but re-

QPSK) signal, which we may represent mathematically as

notable exception of Kikuchi [45], there is very little discussion

convergence increases significantly in the region of the bifurca-

1 2) Simulation Model: To investigate the occurrences of sin-

Fig. 9. Control surface for PDM-QPSK with a DD-EQ.

Fig. 12. Constellation diagrams normalized to unitpower for QPSK

To investigate further the issue associated with the local min-

G. Control Surfaces for PDM16-QAM 2x = (R02 r2 )2 (44)

single tap equalizer case becomes

have been used extensively [7], [57] [59], in order to increase

For the case of linear interpolation, the sample y[k] will be a

You might also like