University of Toronto
slide 1 of 70
D.A. Johns, 1997
Adaptive equalization for data communications was proposed by R.W. Lucky at Bell Labs in 1965. The LMS algorithm was developed by Widrow and Hoff in the 1960s for neural-network adaptation.
[Figure: adaptive filter H(z). u(n) and y(n) are the filter's input and output; δ(n) and e(n) are the reference and error signals.]
Think of the adaptive algorithm as an optimizer that finds the best set of fixed filter coefficients, i.e. the set minimizing the power of the error signal.
Useful in cockpit noise cancelling, fetal heart monitoring, acoustic noise cancelling, echo cancelling, etc.
The sinusoid's frequency and amplitude are unknown. If H(z) is adjusted such that its phase plus the delay equals 360 degrees at the sinusoid's frequency, the sinusoid is cancelled while the noise is passed. The noise might be a broadband signal that should be recovered.
Adaptation Algorithm
Optimization might be performed by:
perturb some coefficient in H(z) and check whether the power of the error signal increased or decreased. If it decreased, go on to the next coefficient. If it increased, switch the sign of the coefficient change and go on to the next coefficient. Repeat this procedure until the error signal is minimized.
This approach is a steepest-descent algorithm, but it is slow and not very accurate. The LMS (least-mean-square) algorithm is also a steepest-descent algorithm, but it is more accurate and simpler to realize.
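As a rough sketch, the perturb-and-test procedure above can be tried on a toy system-identification problem. The 3-tap target h_true, the step size, and the training length here are illustrative assumptions, not values from the slides:

```python
import numpy as np

rng = np.random.default_rng(0)
h_true = np.array([0.3, 1.0, -0.2])      # "unknown" system to be matched

u = rng.choice([-1.0, 1.0], size=2000)   # training input
d = np.convolve(u, h_true)[:len(u)]      # reference signal

def error_power(p):
    """Mean squared error when filtering u with coefficients p."""
    y = np.convolve(u, p)[:len(d)]
    return np.mean((d - y) ** 2)

p = np.zeros(3)
step = 0.05
for _ in range(200):                      # repeat until the error is minimized
    for i in range(len(p)):
        before = error_power(p)
        p[i] += step                      # perturb coefficient i
        if error_power(p) > before:       # error increased: switch the sign
            p[i] -= 2 * step
            if error_power(p) > before:   # worse in both directions: undo
                p[i] += step
```

After the sweeps, p sits within about one perturbation step of h_true, which illustrates why the method is slow and only as accurate as the step size.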
Steepest-Descent Algorithm
Minimize the power of the error signal, E[e²(n)]. The general steepest-descent update for filter coefficient p_i(n) is:

p_i(n+1) = p_i(n) - μ ∂E[e²(n)]/∂p_i
[Figure: E[e²(n)] plotted against p_i. Where ∂E[e²(n)]/∂p_i > 0, the update moves p_i down toward the optimum p_i*.]
Steepest-Descent Algorithm
In the two-dimensional case:

[Figure: contours of E[e²(n)] (out of page) over the (p1, p2) plane, with the minimum at (p1*, p2*).]
LMS Algorithm
Replace the expected squared error with the instantaneous squared error, and let the adaptation time-constant smooth out the result.
p_i(n+1) = p_i(n) - μ ∂e²(n)/∂p_i
p_i(n+1) = p_i(n) - 2μ e(n) ∂e(n)/∂p_i
Sign-sign LMS:

p_i(n+1) = p_i(n) + 2μ sgn(e(n)) sgn(φ_i(n))

where φ_i(n) is the gradient signal (equal to the state x_i(n) for a linear combiner). However, the sign-data and sign-sign algorithms have gradient misadjustment and may not converge! These LMS variants have different dc offset implications in analog realizations.
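As a sketch, full LMS and sign-sign LMS can be compared on a toy 3-tap identification problem. The system h_true, step sizes, and run lengths are illustrative assumptions, not from the slides:

```python
import numpy as np

rng = np.random.default_rng(1)
h_true = np.array([0.3, 1.0, -0.2])      # unknown system (illustrative)
u = rng.choice([-1.0, 1.0], size=5000)
d = np.convolve(u, h_true)[:len(u)]      # reference signal delta(n)

def adapt(update, mu):
    p = np.zeros(3)
    x = np.zeros(3)                      # tapped delay line of u(n)
    for n in range(len(u)):
        x = np.roll(x, 1)
        x[0] = u[n]
        e = d[n] - p @ x                 # e(n) = delta(n) - y(n)
        p = p + update(mu, e, x)
    return p

full_lms  = lambda mu, e, x: 2 * mu * e * x                      # LMS
sign_sign = lambda mu, e, x: 2 * mu * np.sign(e) * np.sign(x)    # sign-sign LMS

p_lms = adapt(full_lms, mu=0.01)
p_ss  = adapt(sign_sign, mu=0.001)
```

Both converge toward h_true here, but the sign-sign update moves a fixed 2μ per step regardless of the error size, so it leaves more residual coefficient jitter than full LMS.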
H(z) is an LTI system where the signal-flow-graph arm corresponding to coefficient p_i is shown explicitly. h_um(n) is the impulse response from the input u to the multiplier input m, and h_ny(n) is the impulse response from the multiplier output n to the output y. The gradient signal with respect to element p_i is u(n) convolved with h_um(n), convolved with h_ny(n).
Gradient Example
[Figure: signal-flow graph with gains G1, G2, G3 and states v_lp(t), v_bp(t) producing y(t), together with the duplicated network that generates the gradient signal ∂y(t)/∂G1.]
[Figure: adaptive linear combiner. States x_i(n) weighted by p_i(n) form y(n) = Σ p_i(n) x_i(n); the error is e(n) = δ(n) - y(n).]
Only the zeros of the filter are being adjusted, so there is no need to check for filter stability (though the adaptive algorithm itself could go unstable if μ is too large). The performance surface is guaranteed unimodal (i.e. there is only one minimum, so there is no need to worry about getting stuck in a local minimum). The performance surface becomes ill-conditioned as the state signals become correlated (or have large power variations).
Performance Surface
The correlation of two states is determined by multiplying the two signals together and averaging the output. Uncorrelated (and equal-power) states result in a hyper-paraboloid performance surface and a good adaptation rate. Highly correlated states imply an ill-conditioned performance surface: more residual mean-square error and longer adaptation time.
[Figure: performance surface E[e²(n)] (out of page) over the (p1, p2) plane, with the minimum at (p1*, p2*).]
Adaptation Rate
Quantify the performance surface using the state-correlation matrix:

R = | E[x1x1]  E[x1x2]  E[x1x3] |
    | E[x2x1]  E[x2x2]  E[x2x3] |
    | E[x3x1]  E[x3x2]  E[x3x3] |

The eigenvalues λ_i of R are all positive real and indicate the curvature along the principal axes. For adaptation stability, 0 < μ < 1/λ_max, but the adaptation rate is determined by the least steep curvature, λ_min. The eigenvalue spread indicates the performance surface conditioning.
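The state-correlation matrix R and its eigenvalue spread can be estimated numerically. A sketch, where the filters used to create the correlated states are my own illustrative choice:

```python
import numpy as np

rng = np.random.default_rng(2)
n = 10000

# Uncorrelated, equal-power states: independent white sequences
X_good = rng.standard_normal((3, n))

# Correlated states: filtered versions of a single white source
w = rng.standard_normal(n)
X_bad = np.vstack([w,
                   np.convolve(w, [0.5, 0.5])[:n],
                   np.convolve(w, [0.25, 0.5, 0.25])[:n]])

def eig_spread(X):
    R = (X @ X.T) / X.shape[1]       # estimate of E[x_i x_j]
    lam = np.linalg.eigvalsh(R)      # positive real eigenvalues of R
    return lam.max() / lam.min(), lam.max()

s_good, _ = eig_spread(X_good)
s_bad, lam_max = eig_spread(X_bad)
mu_max = 1.0 / lam_max               # stability bound: 0 < mu < 1/lam_max
```

The uncorrelated states give an eigenvalue spread near 1 (well-conditioned), while the correlated states give a spread orders of magnitude larger (ill-conditioned).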
Adaptation Rate
The adaptation rate might be 100 to 1000 times slower than the time-constants in the programmable filter. Typically the same μ is used for all coefficients, since the orientation of the performance surface is not usually known. A large value of μ results in larger coefficient bounce; a small value of μ results in slow adaptation. Often a gear-shift is used: a large μ at start-up, then a switch to a smaller μ during steady state. One might also need to detect whether to gear-shift again.
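The gear-shift idea can be sketched with LMS on a noisy toy problem. The system, noise level, and schedule below are all illustrative assumptions:

```python
import numpy as np

rng = np.random.default_rng(4)
h_true = np.array([0.3, 1.0, -0.2])
u = rng.choice([-1.0, 1.0], size=4000)
# noisy reference: adaptation can no longer drive the error exactly to zero
d = np.convolve(u, h_true)[:len(u)] + 0.05 * rng.standard_normal(len(u))

def run_lms(mu_of_n):
    p = np.zeros(3)
    x = np.zeros(3)
    for n in range(len(u)):
        x = np.roll(x, 1)
        x[0] = u[n]
        e = d[n] - p @ x
        p = p + 2 * mu_of_n(n) * e * x
    return p

# Gear-shift: large mu at start-up, small mu in steady state
p_gear = run_lms(lambda n: 0.05 if n < 500 else 0.002)
# Large mu throughout: fast convergence but larger coefficient bounce
p_fast = run_lms(lambda n: 0.05)
```

Both runs converge, but the gear-shifted run settles with noticeably less residual coefficient bounce because its steady-state μ is 25 times smaller.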
[Figure: tapped-delay-line adaptive linear combiner. Input u(n), states x_1(n) ... x_N(n), coefficients p_1(n) ... p_N(n), output y(n) = Σ p_i(n) x_i(n), error e(n) = δ(n) - y(n).]
[Figure: feedforward equalizer (FFE). u(n) filtered by H(z) gives y(n), which is sliced to the output data; the error e(n) is formed against the reference δ(n).]

The reference signal δ(n) is equal to a delayed version of the transmitted data. The training pattern should be chosen so as to ease adaptation; a pseudorandom sequence is common. The above is a feedforward equalizer (FFE) since y(n) is not directly created using the derived output data.
FFE Example
Suppose the channel, Htc(z), has impulse response 0.3, 1.0, -0.2, 0.1, 0.0, 0.0, ...

We want to force y(1) = 0, y(2) = 1, y(3) = 0. With only three taps it is not possible to also force all other y(n) = 0.
FFE Example
y(1) = 0 =  1.0 p1 + 0.3 p2 + 0.0 p3
y(2) = 1 = -0.2 p1 + 1.0 p2 + 0.3 p3
y(3) = 0 =  0.1 p1 - 0.2 p2 + 1.0 p3

Solving results in p1 = -0.266, p2 = 0.886, p3 = 0.204. Now the impulse response through both channel and equalizer is:

0.0, -0.08, 0.0, 1.0, 0.0, 0.05, 0.02, ...
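The three zero-forcing equations of this example can be checked numerically; a small sketch using numpy, where the matrix A simply restates the equations:

```python
import numpy as np

# Zero-forcing conditions from the example, written as A p = b:
#   y(1) = 0 =  1.0 p1 + 0.3 p2 + 0.0 p3
#   y(2) = 1 = -0.2 p1 + 1.0 p2 + 0.3 p3
#   y(3) = 0 =  0.1 p1 - 0.2 p2 + 1.0 p3
A = np.array([[ 1.0,  0.3,  0.0],
              [-0.2,  1.0,  0.3],
              [ 0.1, -0.2,  1.0]])
b = np.array([0.0, 1.0, 0.0])

p = np.linalg.solve(A, b)          # approx. [-0.266, 0.886, 0.204]

# Combined channel + equalizer impulse response
h = np.array([0.3, 1.0, -0.2, 0.1])
overall = np.convolve(h, p)        # approx. [-0.08, 0, 1, 0, 0.05, 0.02]
```

The convolution confirms that the three forced samples are exactly 0, 1, 0 while small residual ISI (-0.08, 0.05, 0.02) remains elsewhere.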
FFE Example
Although ISI is reduced around the peak, slight ISI is introduced at other points (better overall). The above is a zero-forcing equalizer, which usually boosts noise too much. An LMS adaptive equalizer instead minimizes the mean-squared error signal (i.e. it finds low ISI and low noise). In other words, it does not boost noise, at the expense of leaving some residual ISI.
Equalization: Decision-Directed

[Figure: channel Htc(z) followed by the FFE H(z) and a slicer; the recovered output data provides the reference.]

After training, the channel might change during data transmission, so adaptation should be continued. The reference signal is set equal to the recovered output data. As much as 10% of decisions might be in error, but correct adaptation will still occur.
Equalization: Decision-Feedback

[Figure: channel Htc(z) followed by a slicer whose output data feeds back through H2(z) to form y_DFE(n), which is subtracted from y(n); the error is e(n).]

Decision-feedback equalizers make use of the detected data in directly creating y(n). They enhance noise less, since the derived output data is used to cancel ISI. The error signal can be obtained either from a training sequence or decision-directed.
DFE Example
Assume signals of 0 and 1 (rather than -1 and +1); this makes the example easier to explain. Suppose the channel, Htc(z), has impulse response 0.0, 1.0, -0.2, 0.1, 0.0, 0.0, ...
[Figure: DFE with feedback filter H2(z); the sliced output data is fed back and subtracted to form y_DFE(n), and the error e(n) is taken at the slicer input against the reference δ(n).]

Assuming correct operation, output data = input data. e(n) is the same for both FFE and DFE, and e(n) can be either training- or decision-directed.
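A sketch of this DFE on the example channel. For indexing simplicity the leading 0.0 of the impulse response (a pure delay) is dropped; the random data and run length are illustrative:

```python
import numpy as np

rng = np.random.default_rng(3)
a = rng.integers(0, 2, size=200).astype(float)   # transmitted 0/1 data
h = np.array([1.0, -0.2, 0.1])                   # main tap + postcursor taps
r = np.convolve(a, h)[:len(a)]                   # received signal with ISI

post = np.array([-0.2, 0.1])                     # postcursor taps fed back
dec = np.zeros(len(a))
for n in range(len(a)):
    isi = 0.0
    for k, c in enumerate(post, start=1):        # ISI predicted from past decisions
        if n - k >= 0:
            isi += c * dec[n - k]
    y = r[n] - isi                               # DFE output
    dec[n] = 1.0 if y > 0.5 else 0.0             # slice to {0, 1}
```

With no noise and correct past decisions, the feedback exactly cancels the postcursor ISI, so y(n) = a(n) and the decisions reproduce the transmitted data.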
[Figure: combined equalizer. FFE H1(z) ahead of the slicer, DFE H2(z) fed back from the output data.]

Y/N = H1
Y/X = Htc H1 + H2

So the noise N passes only through the FFE, while the signal X sees the channel, FFE and DFE.
[Figure: sampled pulse response showing precursor ISI before the main cursor and postcursor ISI after it.]

The FFE can deal with both precursor and postcursor ISI; the DFE can only deal with postcursor ISI. However, the FFE enhances noise while the DFE does not. When both adapt, the FFE tries to add little boost by pushing precursor ISI into postcursor ISI (an allpass-like response).
Equalization: Decision-Feedback
The multipliers in the decision-feedback equalizer can be simple, since the detected data takes only a small number of levels (i.e. +1, 0, -1); more taps can be used if needed. An error in a decision will propagate through the ISI cancellation (error propagation). It is more difficult if Viterbi detection is used, since the output is not known until about 16 sample periods later (early estimates are needed). The performance surface might be multimodal, with local minima, if changing the DFE affects the output data.
Fractionally-Spaced FFE
The feedforward filter is often an FFE sampled at 2 or 3 times the symbol rate, i.e. fractionally spaced (sampled at T/2 or T/3).
Advantages: allows the matched filter to be realized digitally and to adapt to channel variations (not possible with symbol-rate sampling); also allows simpler timing-recovery schemes (the FFE can take care of phase recovery).
Disadvantage: costly to implement; full higher-speed multiplies and a higher-speed A/D are needed.
The front end may have to accommodate twice the input range! The DFE can restore baseline wander, but a lower-frequency pole implies a longer DFE. Alternatively, line codes with no dc content can be used.
IMPULSE INPUT

[Figure: impulse input 0 1 0 0 0 0 ... applied to a DFE with feedback filter (1/2)z^-1 + (1/4)z^-2 + (1/8)z^-3.]
STEP INPUT

[Figure: step input 0 1 1 1 1 1 ... applied to a DFE with feedback filter (1/2)z^-1 + (1/4)z^-2 + (1/8)z^-3.]
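The DFE feedback filter used in these figures, (1/2)z^-1 + (1/4)z^-2 + (1/8)z^-3, can be checked directly. A small sketch; the input vectors restate the impulse and step sequences:

```python
import numpy as np

fb = np.array([0.5, 0.25, 0.125])        # taps of (1/2)z^-1 + (1/4)z^-2 + (1/8)z^-3

def fb_out(d):
    """Feedback-filter contribution for a decision sequence d."""
    g = np.concatenate(([0.0], fb))      # include the one-sample delay
    return np.convolve(d, g)[:len(d)]

y_imp  = fb_out(np.array([0., 1., 0., 0., 0., 0.]))  # -> 0, 0, 0.5, 0.25, 0.125, 0
y_step = fb_out(np.array([0., 1., 1., 1., 1., 1.]))  # settles at 0.5+0.25+0.125
```

For the step of past decisions the feedback contribution settles at the filter's dc gain (7/8 here), which is how the DFE can stand in for the dc content removed by ac coupling.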
[Figure: dc-restoration loop with an integrator 1/(1 - z^-1) driven by the error signal.]

The integrator time-constant should be faster than the ac-coupling time-constant. This effectively forces the error to zero with feedback. It may be difficult to stabilize if there is too much in the loop (i.e. AGC, A/D, FFE, etc.).
Analog Equalization
Analog Filters
Switched-capacitor filters
+ Accurate transfer functions
+ High linearity, good noise performance
- Limited in speed
- Requires anti-aliasing filters

Continuous-time filters
- Moderate transfer-function accuracy (requires tuning circuitry)
- Moderate linearity
+ High speed
+ Good noise performance
[Figure: analog adaptive linear combiner. Input u(t), output y(t) = Σ p_i(t) x_i(t), error e(t) = δ(t) - y(t).]
p_i(t) = 2μ ∫₀ᵗ e(τ) x_i(τ) dτ

Only the zeros of the filter are being adjusted, so there is no need to check for filter stability (though the adaptive algorithm itself could go unstable if μ is too large).
[Figure: chain of integrators 1/s producing the state signals x_1(t), x_3(t), ... from the input u(t).]

For a white-noise input, all states are uncorrelated and have equal power.
[Figure: coefficient-control options for p_i; digital coefficient control can be worse (potential for large coefficient jitter).]

In switched-capacitor filters, some type of multiplying DAC is needed. The best fully-programmable filter approach is not clear.
[Figure: LMS update path with dc offset sources m_i added to the signals w_i(k).]

At high speeds, offsets might even be larger than the signals (say, 100mV signals and 200mV offsets). DC offset effects are worse for ill-conditioned performance surfaces.
[Figure: digitally controlled coefficient driving a D/A converter.]

Experimentally verified (at low frequency): an analog adaptive filter works with DC offsets more than twice the size of the signal.
[Table: dc-offset-induced error terms for the LMS variants (LMS, SD-LMS, SE-LMS, SS-LMS); the residual error strongly depends on the algorithm used.]
[Figure: residual MSE (dB) versus step size (10^-6 to 10^-1) for LMS, SE-LMS and SS-LMS; LMS shows no gradient misalignment.]
[Figure: cable-equalizer frequency response built from boost stages of the form w_i·s/(s + p_i).]

The equalizer was optimized for 300m of cable; it works well with shorter lengths by tuning. The tuning control is found by looking at the slope of the equalized waveform. Max boost was 40dB. The system included dc recovery circuitry. The bipolar circuit operated at up to 300Mb/s.
[Figure: experimental PR4 system. {1, 0} data passes through a 1-D block and a noisy channel to give r(t); the equalizer output y(t) drives a PR4 detector producing the detected data â(k); the error e(k) is formed against an ideal PR4 generator output y_i(t). Switch S1 selects training, S2 selects tracking.]
The channel was modelled by a 6th-order Bessel filter with 3 different responses (3MHz, 3.5MHz and 7MHz) at 20Mb/s data. A 200-tap FIR filter was used to find the set of fixed poles for the equalizer. The equalizer is a 6th-order filter with fixed poles and 5 adjustable zeros (one zero left at infinity for high-frequency roll-off).
[Figure: equalizer pole-zero plot and output waveform y (V) versus time (ns).]

Poles and zeros are depicted in the digital domain for the equalizer filter. Residual MSE was -31dB.
e(k) =  1 - y(t)    if y(t) > 0.5
e(k) =  0 - y(t)    if -0.5 ≤ y(t) ≤ 0.5
e(k) = -1 - y(t)    if y(t) < -0.5

All are sampled at the decision time (this assumes perfect clock recovery). Potential problem: an AGC failure might cause y(t) to always remain below 0.5, in which case adaptation will force all coefficients to zero (i.e. y(t) = 0). The zeros were initially mistuned to give significant eye closure.
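The three-level decision-directed error above can be written directly; a small sketch (the function name pr4_error is mine):

```python
def pr4_error(y):
    """Decision-directed error for PR4 target levels {-1, 0, +1}."""
    if y > 0.5:          # nearest target is +1
        return 1.0 - y
    if y < -0.5:         # nearest target is -1
        return -1.0 - y
    return 0.0 - y       # nearest target is 0
```

Note how the failure mode described above shows up here: if AGC failure keeps y below 0.5 in magnitude, every error pulls y toward the 0 target and the coefficients collapse.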
[Figures: initially mistuned and adapted eye diagrams (y in V versus time in ns), and gain responses (V/V) of the channel, equalizer, overall system and PR4 target versus normalized radian frequency.]
Residual MSE = -25dB. Note that a large equalizer boost is needed at high frequency; this case probably needs better equalization (perhaps move all the poles together and let the zeros adapt).
BiCMOS Transconductor
[Figure: BiCMOS transconductor schematic with input devices M1-M2, bipolar core Q1-Q4, devices M3-M8, tuning voltages VC1 and VC2, and bias VBIAS.]

Two styles were implemented: 2-quadrant tuning (F-Cell), and 4-quadrant tuning by cross-coupling the top input stage (Q-Cell).
Biquad Filter
[Figure: Gm-C biquad using transconductors Gmi, Gm21, Gm22, Gm12, Gmb and capacitors 2C at nodes X1 and X2.]

fo and Q are not independent due to finite output conductance. The 4-quadrant transconductor is used only where needed.
[Table: measured results, including areas of 0.14mm x 0.05mm and 0.36mm x 0.164mm, 10mW at 5V, tuning ranges of 10MHz-230MHz at 5V and 9MHz-135MHz at 3V, Q range 1 to infinity, 0.21 Vrms, 28dB and 21dB, output 3rd-order intercept points of 23dBm, 20dBm, 18dBm and 10dBm, and SFDR of 35dB, 26dB, 26dB and 20dB.]
[Figure: bandpass output voltage (mV) versus time (ns), with samples taken 2.5ns from the nominal zero-crossing.]

Fo control: sample the output pulse shape at the nominal zero-crossing and decide if it is early or late (cutoff frequency too fast or too slow, respectively). Q control: sample the bandpass output at the lowpass nominal zero-crossing and decide if the peak is too high or too low (Q too large or too small).
Experimental Setup
[Figure: experimental setup. A data generator drives the pulse-shaping filter chip; comparator outputs feed up/down logic, counters and DACs that close the fo and Q tuning loops.]

The off-chip logic used an external 12-bit DAC. The input was 100Mb/s NRZI data, 2Vpp differential. The comparator clock was the data clock (100MHz) time-delayed by 2.5ns.
[Figures: measured output waveforms.]
Summary
Adaptive filters are relatively common; LMS is the most widely used algorithm. Adaptive linear combiners are almost always used. Use combiners that do not have poor performance surfaces. The most common digital combiner is the tapped FIR filter.
Digital Adaptive:
more robust and well suited for programmable filtering
Analog Adaptive:
best suited for high-speed, low dynamic range applications; less power. Very good at realizing arbitrary coefficients when only a frequency change is needed. Be aware of DC offset effects.