Professional Documents
Culture Documents
ASSP-34,
NO. 5 , OCTOBER 1986 1153
dTr q-1
JOHN P. PRINCEN, STUDENT MEMBER, IEEE, AND ALANBERNARDBRADLEY
Abstract-A single-sidebandanalysis/synthesissystemisproposed
which provides perfect reconstruction aofsignal froma set of critically
sampledanalysissignals.Thetechniqueisdevelopedintermsof a
weighted overlap-add methodof analysis/synthesis and allows overlap x(n) %nl
between adjacent time windows. This implies that time domain aliasing
is introduced in the analysis; however, this aliasing is cancelledin the
synthesis process, and the system can provide perfect reconstruction. k=O...K-1
Achieving perfect reconstruction places constraints on the time domain
window shape which are equivalent to those placed on the frequency Fig. 1. Analysidsynthesis system framework.
domain shape of analysis/synthesis channels used in recently proposed
critically sampled systems based on frequency domain aliasing cancel-
ily expressed in the frequency domain. Reconstruction can
lation [7],[8]. In fact, a duality exists between the new technique and be obtained if the composite analysidsynthesis channel
the frequency domain techniques of [7] and 181. The proposed tech- responses overlap and add such that their sum is flat in
nique is more-efficient than frequency domain designs for a given num- the frequency domain. Any frequency domain aliasing in-
ber of analysis/synthesis channels, and can provide reasonably band- troduced by representing the narrow-band analysis signals
limited channel responses. The technique could be particularly useful
at areduced sample rate mustbe removed in the synthesis
in applicationswherecriticallysampledanalysislsynthesis is desir-
able, e.g., coding. bank [2], [4].
A second description of an analysislsynthesis system is
in ‘terms of a block transform approach (Fig. 2). Analysis
I. INTRODUCTION involves windowing the signal and transforming the win-
Analysls window
scribed in terms of the complex channel structure. They
Input signal :, ; /
I
produce real channel signals and can be described using
channel structures based on single-sideband (SSB) modu-
* lation [2].
n=mM n
An SSB analysis/synthesis system is shown in Fig. 3 .
U As will be shown, it is possible to develop a time domain
I=/ Analysis description of the SSB system which results in a block
transform implementation that is basically the same as the
transform implementation of complex filter banks; how-
ever, the DFT is replaced by appropriately defined DCT
and DST. Just as a duality exists between the time and
[ INV. TRANSFORM I frequency domain descriptions of complex analysis/syn-
thesis systems, a duality also exists between time and fre-
quency domain descriptions of SSB systems. This means
that the form of the time domain aliasing introduced in a
critically sampled SSB system with overlapped windows
Overlap and Add is the same as that of the frequency domain aliasing intro-
---
duced in a critically sampled SSB system with overlap-
Reconstructed signal ped channel responses.
In this paper a new critically sampledSSB analysidsyn-
n=rnoM n
thesis system is described which allows overlapped win-
dows to be used. Overlap occurs between adjacent win-
Fig. 2. A block transform description of an analysisisynthesis system.
dows.Timedomainaliasing distortion is introduced in
the analysis; however, this distortion is cancelled in the
In coding applications it is desirable that the analysis/ synthesis process.
synthesis system be designed so that the overall sample Since the technique is a critically sampled SSB system,
rate at the output of the analysis bank is equal to the sam- the second section of this paper defines such a system and
ple rate of the input signal. In the case ofa uniform filter states the efficient weighted overlap-add (OLA) [2], [3],
bank with K uniquechannelsignalsall of equal band- [5] analysis and synthesisequations.The analysis and
width, each channel signal must be decimated by K. Sys- synthesis involve both sine and cosine transforms, and it
tems which satisfy this condition are described as being is shown that representation and subsequent reconstruc-
critically sampled. tion of a finite sequence from the appropriately defined
Consider the design of critically sampled analysis/syn- sine or cosine transforms results in time domain aliasing.
thesis systems using the two approaches outlined above. The form of this aliasing and the effect of the introduction
For the frequency domain approach, if a system satisfies of a time offset in the modulation functions is described
the overlapcondition for reconstruction, then critical in Section 111.
sampling will always introduce frequency domain alias- Based on this discussion, it is shown that perfect recon-
ing, except in the case where the analysis and synthesis struction by time domain aliasing cancellation is possible
filters have rectangular responses of width equal to the from a critically sampled set of analysis signals. The con-
channel spacing. A dual condition exists for the time do- straints on the time domain windows are emphasized, and
main approach. If the time domain overlap requirement is the final section discusses the design of suitable windows.
satisfied, a critically sampled system will always intro- Two examples are given. The first is similar to the win-
duce time domain aliasing, exceptif the analysis and syn- dows commonly used in transform coding [l] and has a
thesis windows are rectangular and their widths (in sam- small amount of overlap. The second contains maximum
ples) are equal to the decimation factor. allowable overlap, and the resulting channel frequency re-
It is possible, however, to develop critically sampled sponse shows that the bandwidth of such designs is close
analysis/synthesis systems which have overlapped chan- to the bandwidth of designs used in subband coding of
nel frequency responses and also providereconstruc- speech.
tion. One such technique is known as quadrature mirror The advantage of the new technique is that critically
filtering (QMF) and was originally proposed by Croisier sampled analysis/synthesis can be performed with overlap
et al. [6] for the case of a two-channel system. Using the between adjacent analysis and synthesis windows. This is
basic QMF principle, a number of authors have extended not possible using conventional block transform tech-
the result to allow an arbitrary number of channels [7], niques and implies that narrower channel frequency re-
[8]. Overlap is restricted to occur only between adjacent sponse can be obtained, while allowing critical sampling.
channel filters. For these techniques, frequency domain
aliasing is introducedin the channel signals; however, the 11. SSB ANALYSI~SYNTHESIS
systems are designed so that this distortion is cancelled in Consider a uniformly spaced, evenly stacked, critically
the synthesis filter bank. The techniques cannot. be de- sampled SSB analysis/synthesis (Fig. 3) [l], [2]. The
PRINCEN ANDBRADLEY:ANALYSISiSYNTHESIS FILTERBANKDESIGN 1155
-
1
cos(mn/l) K cos(wo(n+nO)i
-1
.
sin(wo(n+no)) sin(mni2) '
0
A set of K rin(wo(ntno))
Unique channel
- signals for an
eveniy stacked
system
.
cos(mW2) K cos(wKlq(n+no))
Magnitude +
Only K / 2 1 of the K analysis channels are unique; how -
ever, in the following development the system will be de-
scribed as a K-channel system, and the full K channels
will be retained. This is more convenient when consid-
ering a block transform implementation.As shown in Fig.
4,half-width channels occur at k = 0 and k = K / 2 . If the
full-width channels are decimated by M , then the half-
-n -211 /K 0 211 /K n Freq.
width channels can be decimated by 2 M . Critical sam-
.piing implies that the overall output sample rate equals
Fig. 4. A bank of real bandpass filters. the input sample rate, so
M = -K
channelresponsesarethelow-passequivalentofthe 2'
bandpass filter bank shown in Fig. 4. The analysis chan-
nel signals are of course real, and the center frequencies From Fig. 3 and (1) the analysis signals can be written as
of the K channels are
sin E) = 0, m even
Both h (n) and f(n) are assumed to be finite impulse re-
sponsefiltersandexistonly for 0 s n s P - 1 . The
delay of P - 1 has been added to make the development
cos E) = 0, m odd.
straightforward.
To express the analysis as a block transform operation,
make the substitution - Y = mM - n [2], [3], [ 5 ] and use
1156 IEEE
TRANSACTIONS ON ACOUSTICS.
SPEECH.
SIGNAL
ANDPROCESSING.
VOL. ASSP-34, NO. 5, OCTOBER 1986
P- 1
* c x(mM + r ) h ( P - 1 - r) Substituting r = n - moM, the synthesized signal at out-
r?
r=O
put block time mO can be determined as the windowed
inverse transform of Xk(mO)plus overlapping terms from
sin K (r + no + s)).
2 other time segments [ 2 ] ,[ 3 ] , [ 5 ] ,i.e.,
m
-%n(r)= xm([rlK)
where [rIKrepresents the residue of r modulo K.
Note that
Tm(r) = xm(r), r = 0 .* *
(5)
It can be seen that for a given value of the block time m
only one of the terms in ( 5 ) is nonzero. For m even, the
channel signals Xk(m) are the appropriately defined DCT
of the windowed signal segment near time n = mM, mod-
ified by (- l)mk. Similarly, form odd, the channel signals
are the modified DST of the windowed sequence near n
= mM.
A block transform implementation of the synthesis pro-
cedure is also possible. Assuming that &(m) = Xk(m) and That is, form even,Ym(r)is the inverse DCT of the mod-
using Fig. 3 and (l),the synthesis procedure can be writ- ified channel signals; while for m odd, it is the inverse
ten as DST of the modified signals. The index r can be inter-
preted as reduced modulo K.
m Using (9) or (lo), we can write
m = -m Ym ((mo - m)M + r)
c
m = -m Xk(m)sin @ ) f ( n - mM)]. (6)
This can be identified as the transform operation in (8), be thought of as time domain aliasing which results from
and
hence,
from (8) and (11) inadequately
an sampled
frequency
domain. The
time
do-
m
main aliasing which is introducedcanbe controlled to
some extentby introducing a phase factor in the transform
a,,(r) = C f((m0 - r n ) +~ r)
m = -m kernels.
Consider first the DCT of a K-point sequence, defined
’ ym((mo- m)M +
r), r =0 M - 1. as
(12) K- 1
(14) Now
K- 1
Also, y”,(r) is the inverse DCT or DSTof the modified
channel signals, Le., (-- l ) ” k k ( , ) . Since (-- 1)2mk = 1, k=O
ym(r)is the sequence obtained from the forward and then
the reverse DCT or DSTof (14). Synthesis can be imple- n = sK,s integer
mented using the block transform approach shown in Fig. = other integer n,
2. Thechannelsignalsaremodifiedby (- andan
inverse DCT or DST transform is applied to form the se- hence,
quence y,(r). This sequence is multiplied by the synthesis f ‘ ( n ) = f(n)/2 + R(K - n - 2no)/2, 2no integer
window f ( n ) and overlapped and added to shifted output
from the previous block time. In most applications, the (1 8)
analysis/synthesis procedure can be simplified by remov- where the time indexes areinterpreted as reduced modulo
ing the modifications (- l)mk[3]. K.
In order to determine the relationship between f ( n ) and Equation (18) shows that the resulting sequence is equal
x ( n ) , it is necessary to investigate the properties of a se-
to the original sequence plus an aliasing term which is a
quence recovered from the cosine and sine transforms of shifted replica of the original sequence, time reversed.
a rea1 sequence. The relative shift between the two sequences can be con-
trolled by the phase factor no.
111. PROPERTIES OF SINEAND COSINETRANSFORMS Similarly, for the DST let
If we have a K-point real sequence, it is possible to
represent it by K unique points in the frequency domain
and recover the original sequence from these frequency
domain points. However, if a K-point sequence is repre-
sented by either the sine or cosine transforms used in the
SSB analysis/synthesis procedures, fewer than K unique and
frequency domain points are determined, and hence, the
origipal sequence cannot be recovered from the frequency
domain samples. The inverse transform yields adistorted
replica of the original sequence. In fact, thedistortion can Equations (19) and (20) are the transform operations re-
1158 IEEE TRANSACTIONS ON ACOUSTICS, SPEECH, AND ‘SIGNAL PROCESSING, VOL. ASSP-34,
NO. 5 , OCTOBER 1986
quired for analysis and synthesis when the block time is The first term in (24) is the desired signal component,
odd. while the second term is due to time domain aliasing.
It can be shown that For reconstruction we require
f ’ ( n ) = f ( n ) / 2 - 2 (K - n - 2no)/2, 2no integer. f ( r + M ) & ( M - 1 - r) + f ( r )
(2 1) *h(2M-l-r)=2, r=O*-*M-l
Again, time indexes are interpreted as reduced moduloK. (254
In this case the aliasing term is equal in magnitude to and
that in the cosine transform; however,it has opposite sign.
It can be seen that for a critically sampled SSB analysis/ f(r) h(r + 2no - 1) - f ( r M ) +
synthesis system, time domain aliasing is introduced. The
next section shows that careful design of the windows h ( n ) -K(M+~+~~,-I)=o, ~ = O . . . M - I .
andf(n) and selection of a particular value of no allows (25b)
this distortion to be cancelled in the OLA synthesis.
Equation (25a) is the well-known condition that adjacent
analysis/synthesis windows must add so that the result is
Iv. PERFECT RECONSTRUCTION FROM CRITICALLY
SAMPLED ANALYSIS SIGNALS flat [2], [3]. This is the time domain dualof the condition
that the frequency response of the filter bank be flat.
As discussed, jjm(r)is the sequence obtained by a for- Equation (25b) must be satisfied so that time domain
ward and then inverse DCT or DST of the sequence (14). aliasing introduced by the frequency domain representa-
Using the reconstruction equations developed in the pre- tion is cancelled between adjacent time segments. This
vious section, i.e., (18) and (21), and substituting (14), condition is the dual of the condition for adjacent band
jjmo(r)can be expressed in termsofthewindowedse- frequency domain aliasing cancellation in the frequency
quence as domain approaches [7],[8]. To satisfy both these condi-
tions, choose
jj%(r) = gm0(r)/2+ (- l)mogm,(K - Y - 2no)/2. (22)
f(r) = h (r).
The recovered sequence is the sum of the original win-
dowed sequence and an alias term. The sign of the alias Note that
term is dependent on the block time mo (whether a DCT h(r) = K(Y), Y = 0 * * . K - 1,
or DST is applied).
For the case where mo is even, (22) and (14) can be and if
used in (13) to give the recovered signal segment at n = .h(K- 1 -r)=h(r), r = O . - . K - 1,
Q M in terms of the input signal.
then (25a) becomes
imo(r)= f(r + M ) {K(K- 1 - r - M )
+ +
f 2 ( r M ) f 2 ( r ) = 2, r =0 - -
M - 1 (26a)
* 2%-1(r + M ) - K(r + +M 2no - 1 ) and (25b) becomes
* 3%- 1(K - r - 2no)}/2 f(r) f(-2M - 1 + Y + 2no)
- f(r + M)J;(-M - 1 + r + 2no) = 0,
r=O*..M- 1. (26b)
+ K ( r + 2no - 1) Zm0(K- r - 2n0)}/2.
If we let
(23)
2no = M + 1, (27)
Noting that M = K/2 and then
+
Tm0-](r M ) = i&,(r), r =0 + - * M - 1, f(--~-1+r+2%)=f(r), ~ = o . - . M - I ,
(23) becomes and
imo(r)= Zm(r) ( f ( r + M ) K(M - 1 - r) f(-2M - 1 + I- + 2no) = f(r + M ) ,
+ h(2M - r - 1 ) f ( r ) } / 2 r = O - * . M - 1,
+ fm(K - r -2no) (f(r) h(r + 2no - 1 ) satisfying (26b).
when mo
A similar development is possible for the case
- f(r + M ) K(M + r + 2no - 1)}/2, is odd. Equation (25a) would remain the same, and the
signs on the aliasing components of (25b) would be in-
r = O . . + M - 1. (24) verted.
PRINCEN AND BRADLEY:ANALYSISISYNTHESIS FILTER BANKDESIGN 1159
1.6
W
0 0.8
'i.
:
2
0
0
TIME
1 .I
0 -
n
4
-n -
3n n
2 4
FREQUENCY (a)
WINDOW
0
COEFFICIENTS
'I
c1.
RECONSTRUCTION
dow for a 32-band system.
TABLE I
FOR Two POSSIBLE DESIGNS
r
-
0
1
2
3
4
5
6
7
8
9
10
11
12
0
0
0
0
WHICH GIVE
IN A 32-BAND SYSTEM
r WINDOW
DESIGN 1
0
0
0.50000
0.86603
1.11803
1.32287
1.41421
1 ,41421
1 .41421
16
TIME
Fig. 6 . (a) A 20-point window for a 32-band system. (b) A 32-point win-
PERFECT
IESIGNS
DESIGN2
0.18332
0.26722
0.36390
0.47162
0.58798
0.70847
0.82932
0.94533
1.05202
1.14558
1.22396
1.28610
1.33307
32
L' 0
\
\ I
-
n
4
-7l
2
FREQUENCY [ w )
(b)
-
3n
4
n
ing systemandappropriatewindowdesigns forthese [3] R. E. Crochiere, “A weighted overlap-add method of short time Fou-
analysis/synthesis,” ZEEE Trans. Acoust., Speech, Signal Pro-
systems is a subject for future research. We can say, how- rier
cessing, vol. ASSP-28, pp. 99-102, Feb. 1980.
ever, that the techniquepresented allows a number of de- [4] J . B. Allen and L. R. Rabiner, “A unified approach to short-time Fou-
sign choices not possible with other critically sampled rieranalysisandsynthesis,” Proc. IEEE, vol. 65,pp.1558-1564,
Nov.1977.
analysidsynthesis techniques. [5] M. R. Portnoff, “Implementation of the digital phase vocoder using
the fast Fourier transform,” IEEE Trans. Acoust., Speech, Signal Pro-
VI. CONCLUSION cessing, vol. ASSP-24, pp. 243-248, June 1976.
[6] A. Croisier et al., “Perfect channel splitting by use of interpolation/
In this papera new critically sampledSSB a n a l y d s y n - decimationltree decomposition techniques,” presented at the Int.’Conf.
thesis system which provides perfect reconstruction has Inform. Sci./Syst., Patras, Greece, Aug. 1976.
been developed. The technique allows overlap between [7] J. Rothweiler, “Polyphase quadrature filters-A new subband coding
technique,” presented at the ICASSP, Boston, MA, 1983.
adjacent analysis and synthesis windows, and hence, time [8] P. L. Chu, “Quadrature mirror filter design for an arbitrary numberof
domain aliasing i s introduced in the analysis. However, equal bandwidth channels,” IEEE Trcms. Acoust., Speech,Signal Pro-
this aliasing is cancelled in thesynthesis process. Achiev- cessing, vol. ASSP-33, pp. 203-218, Feb. 1985.
ing aliasing cancellation places time domain constraints
on the analysis and synthesis windows. The proposed sys-
tem was developed using the OLA method of analysis/
synthesis, which also provides an efficient implementa- John P. Princen (S’83) was born in Melbourne,
Australia, on September 12, 1961. Hereceived the
tion. The technique is the dual ofrecently developed crit- B.Eng.degreeincommunicationsengineering
ically sampled frequency domain designs based on alias- with distinction from the Royal Melbourne Insti-
ing canceilation.Itadds considerable flexibility in the tute of Technology (R.M.I.T.), Melbourne, in
1984 and has completed the M.Eng. program in
design of transform coders, can provide reasonably band- thearea of analogspeechscrambling,alsoat
limited channel responses, and is more efficient than cor- R.M.I.T.
responding frequency domain designs for a given number Hisresearchinterestsincludedigitalsignal
processing and digital communciations.
of bands.
ACKNOWLEDGMENT
The authors wouldlike to thank A. Johnson for his
helpful comments, and also the reviewers, whose sugges- Alan Bernard Bradley was born in Melbourne,
Australia, in 1946. He received the B.E. degree
tions contributed significantly to the quality of the final in electrical engineering from Melbourne Univer-
manuscript. sity in 1968 and the M.E.Sc. degree in electrical
engineering from Monash University in 1972.
REFERENCES Since1973 he hasbeenworkingasLecturer
and Senior Lecturer in the Department of Com-
[I] J. M.Triboletand R. E. Crochiere, “Frequencydomaincoding of municationandElectronicEngineeringatthe
speech,” IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP: Royal Melbourne Institute of Technology, Mel-
27, pp. 512-533, Oct. 1979. bourne. His research interests are in the field of
[2] R. E. Crochiere and L . R. Rabiner, Multirate Digital Signal Process- digital signal processing, with particular emphasis
ing. EnglewoodCliffs, NJ:Prentice-Hall,1983. on speech analysis and coding.