You are on page 1of 6

IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 41, NO.

9, SEPTEMBER 2006 2077

A 120-MHz–1.8-GHz CMOS DLL-Based Clock


Generator for Dynamic Frequency Scaling
Jin-Han Kim, Student Member, IEEE, Young-Ho Kwak, Student Member, IEEE,
Mooyoung Kim, Student Member, IEEE, Soo-Won Kim, and Chulwoo Kim, Senior Member, IEEE

Abstract—A delay-locked loop (DLL)-based clock generator of DLLs. However, large internal node capacitances of the fre-
for dynamic frequency scaling has been developed in a 0.35- m quency multiplier limit the operating speed of the clock gen-
CMOS technology. The proposed clock generator can generate erator. Finally, the multiplying DLL (MDLL) in [14] was pro-
clock signals ranging from 120 MHz to 1.8 GHz and change the
frequency dynamically in a short time. If the clock generator scales posed to be highly integrated in clock and data recovery unit,
its output frequency dynamically by programming with the same of which the high-frequency portion of jitter can be filtered out
last bit, it takes only one clock cycle to lock. In addition, the clock by injection locking using a slave oscillator. The multiplication
generator inherits advantages of a DLL. The proposed DLL-based factor of the MDLL can be changed from 1 to 10.
clock generator occupies 0.07 mm2 and has a peak-to-peak jitter In this paper, a DLL-based clock generator is demonstrated
of 6.6 ps at 1.3 GHz.
in a 0.35- m CMOS technology. The proposed clock generator
Index Terms—Clock generator, delay-locked loop (DLL), DLL- can multiply input frequency dynamically, which enables dy-
based frequency multiplication, fast lock, low jitter. namic voltage scaling and can increase the maximum operating
frequency owing to the proposed frequency multiplier. The pro-
posed frequency multiplier has small internal node capacitances
I. INTRODUCTION
and is controlled by simple enabling logic circuits. As a result,
the generator can operate from 120 MHz to 1.8 GHz with var-
ious input signals and multiplication factors. Hence, the pro-
C OMPARED to most phase-locked-loop (PLL)-based clock
generators and local oscillators [1]–[9], delay-locked loop
(DLL)-based counterparts exhibit less jitter and phase noise be-
posed clock generator can be used in clock and data recovery
applications or in a dynamic frequency scaler in microproces-
cause of no jitter accumulation. This is true even under severe sors, which needs a fast lock during frequency change.
supply noise which is becoming common and critical in many This paper is organized as follows. Section II briefly
SoCs. Furthermore, they show stable operation with process, overviews the architecture of proposed frequency multiplier.
voltage and temperature (PVT) variations, are easier to design, Section III presents the circuit implementation of the frequency
and occupy smaller area due to a simpler loop filter [10]–[13]. multiplier concretely. The proposed frequency multiplier
Recently, several DLL-based clock generators and a local oscil- consists of transition detector and edge-combiner. Section IV
lator have been proposed to overcome the difficulty of frequency describes the operation of DLL-based clock generator using the
multiplication with DLLs and utilize the several inherent ad- proposed frequency multiplier for dynamic frequency scaling.
vantages of DLLs over PLLs [10]–[14]. The DLL-based local Experimental results, die photo, and performance comparisons
oscillator for PCS applications consumes large power and area to previous works are shown in Section V.
and the frequency multiplication factor cannot be changed once
II. ARCHITECTURE OF PROPOSED FREQUENCY MULTIPLIER
the LC-tank value is chosen [10]. The frequency synthesizer in
[11] multiplies input clock frequency by a fixed multiplication To overcome the problems of conventional DLL-based clock
factor, 9, and generates a 1-GHz off-chip clock signal with an generators, we have proposed a new one. The key idea of
external 50- pull-up resistor [11]. The multiplying DLL pro- the proposed DLL-based clock generator is shown in Fig. 1.
posed in [12] uses multiplexers (MUXes) which cause jitter due The architecture of the proposed DLL-based clock generator
to a fixed supply voltage. In addition, the duty cycle of the fre- features a programmable multiplication factor controller and
quency multiplied output clock is not 50% which is required a high-frequency frequency multiplier. The buffered outputs
in many high-performance applications. The DLL-based clock of the voltage-controlled delay line (VCDL) are applied to the
generator proposed in [13] provides a low-jitter on-chip clock inputs of the frequency multiplier. The multiplication factor
signal with a variable multiplication factor. Its frequency-mul- controller generates select signals for both the MUX and the
tiplied output clock has a 50% duty cycle even with a non-50% transition detector. Appropriate phases for DLL locking are
duty cycle input. It also overcomes the limited locking range selected by the MUX. Whenever each buffered VCDL output
rises, the transition detector generates a short period pulse,
Manuscript received August 24, 2005; revised April 20, 2006. This work
as shown in Fig. 2. The edge combiner puts the short pulses
was supported by Grant R08-2003-000-10899-0 from the Basic Research Pro- together and toggles the phase of output at every negative
gram of the Korea Science and Engineering Foundation and IT-SOC Promotion edge of the short pulses. Thus, the multiplier output clock
Group. signal toggles at every rising edge of signal . Fig. 2 shows
The authors are with the Department of Electronics Engineering, Korea Uni-
versity, Seoul 136-713, Korea (e-mail:ckim@korea.ac.kr). an example of frequency multiplication by two. Even though
Digital Object Identifier 10.1109/JSSC.2006.880609 the duty cycle of signals is not 50%, the output clock signal
0018-9200/$20.00 © 2006 IEEE
2078 IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 41, NO. 9, SEPTEMBER 2006

2
Fig. 3. Block diagram of proposed frequency multiplier ( 4).

Fig. 1. Block diagram of DLL-based clock generator.

Fig. 4. Simulated waveform which shows the signal propagation failure at


1.6 GHz without signal ordering in the edge-combiner.

will arrive faster than the other which comes after three-inverter
delay. As a result, at the rising edge of output of buffer, the
Fig. 2. Waveforms of frequency multiplication by two. NAND gate generates the negative narrow pulse corresponding
to the three-inverter delay. Second, the edge-combiner consists
of symmetric NANDs, inverters, and a toggle pulsed latch (TPL).
does toggle at 50% duty cycle because each virtual pulse is The negative pulses are transferred through three stages of
generated at every rising edge of the signals. If the VCDL two-input AND gates which is constructed by a symmetric
has delay cells, then the output clock frequency can be NAND and an inverter. Node A results in an AND operation of
expressed as the negative pulses. Therefore, whenever any buffered VCDL
output feeds the transition detector, a short pulse is generated
at node A. Finally, the TPL toggles its output signal as long
as the negative pulse is generated at node A. If a pulse signal
(1) propagating through the edge combiner arrives at a certain
internal node during the transition time of a previous pulse
where is the frequency of the reference signal and the signal, a malfunction may occur, which limits the operation
multiplication factor can be chosen dynamically by the frequency of the edge combiner. To prevent this problem,
multiplication factor controller. the number of fan-in of a symmetric NAND gate is limited to
two. In addition, outputs of the transition detector are paired
III. CIRCUIT IMPLEMENTATION OF FREQUENCY MULTIPLIER to guarantee maximum pulse separation as shown in Fig. 3,
To enhance the operating frequency of the frequency which reduces signal transition overlap and increases the op-
multiplier while maintaining dynamic programmability, the erating frequency of the frequency multiplier by 15%. Fig. 4
frequency multiplier is proposed as shown in Fig. 3. The circuit shows the failing case when the transition detector outputs
implementation of each block is as follows. First, the transition are connected without signal ordering to the edge combiner.
detector consists of three inverters and a three-input NAND If two pairs of adjacent inputs, ( and ) and ( and ),
gate as shown in Fig. 3. Enable signals ( ) for adjusting the feed to NAND gates and , respectively, overlap between
multiplication factor are fed to the NAND gates. Each buffered adjacent signals reduces the voltage levels of nodes x and y.
VCDL output ( ) is then fed to a NAND gate and three inverters In the same manner, voltage reduction and overlap in nodes x
at the same time. If signal rises, one input of a NAND gate and y results in voltage level degradation of node i. As a result,
KIM et al.: A 120-MHZ–1.8-GHZ CMOS DLL-BASED CLOCK GENERATOR FOR DYNAMIC FREQUENCY SCALING 2079

Fig. 5. Edge-combiner circuits. (a) Symmetric NAND. (b) TPL.

Fig. 7. Schematics of (a) charge-pump circuit, (b) charge-pump bias circuit,


and (c) voltage regulator.
Fig. 6. Detailed block diagram of DLL-based clock generator.

DLL-based clock generator was implemented in a 0.35- m


pulses at node A are not generated properly. Fig. 5 shows the single-poly four-metal CMOS process with a 3.3-V supply.
circuit implementation of edge-combiner. Each component was The phase detector (PD) with reset circuitry has increased the
controlled by the regulated supply for supply noise suppression. locking range by initial condition as proposed in [12]. The
A pull-down path of a symmetric NAND gate is divided into lock range for the DLL is from 240 to 500 MHz. A charge
two paths and input connections are switched for symmetric pump with replica bias controls the charge pump current for
short propagation. The operation of TPL is as follows. During adaptive bandwidth. Schematics of a charge pump and its bias
the short pulse width, the nMOS pass transistor turns on and circuit are shown in Fig. 7. The charge pump is similar to that
transmits input data ( ) to output. Furthermore, the output is introduced in [15] and can make the voltages of nodes
inverted and fed back to input data until next pulse is triggered. and n1 equal. Then the offset is canceled. The bias current is
Consequently, TPL can be considered a toggle-edge-triggered controlled by the last bit, which was compared through PD
flip-flop. The pMOS transistor (M1) of TPL prevents internal as shown in Fig. 7(b). The bandwidth is determined by the
node X from voltage drop of . Also, it provides a negative VCDL gain ( ) and charge pump current. The VCDL gain
setup time when input data changes from logic “Low” to logic is inversely proportional to charge pump current. Therefore,
“High”. The increased size of the nMOS pass transistor can as the position of the last bit decreases, the VCDL gain will
have a better drive strength at node X than the equivalent size be reduced, and the charge pump current should be reduced to
of transmission gate. Thus, the total propagation time through maintain the bandwidth. Delay of VCDL is controlled by the
TPL determines the maximum frequency of the frequency mul- regulated supply voltage. The regulator consists of a two-stage
tiplier. Because data transfers through only an nMOS transistor current mirror as shown in Fig. 7(c) [16]. The bandwidth of the
and three inverters, the frequency multiplier maintains dynamic regulator was controlled by the bias voltage of . Thus, the
frequency programmability and operates at high frequency regulated voltage swing results in the adaptive bandwidth ac-
compared to the frequency multiplier in [4], which suffers from cording to input frequency and independent of the supply noise.
large internal node capacitance. Finally, there are the MUX and the frequency multiplier. The
MUX selects the last bit which connects to the PD according
IV. DLL-BASED CLOCK GENERATOR to the control signal. Hence, the selected number of delay
Fig. 6 shows the overall block diagram of the frequency cell outputs determines the maximum frequency of multiplied
multiplier for dynamic frequency scaling. The proposed output by (1). For example, if the multiplication factor is 3 in
2080 IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 41, NO. 9, SEPTEMBER 2006

Fig. 9. Input and multiplied output.

Fig. 8. Dynamic frequency multiplication example, where D represents


“disable”.

Fig. 8, the MUX selects the output of the sixth tap as the last bit
and the number of delay cell is six. The maximum frequency
of multiplied output is input frequency . In addition, if
the multiplication factor controller enables the first, third, and
fifth transition detectors, the frequency of the multiplied output
will be halved: input frequency . Therefore, when the
frequency multiplier scales its output frequency dynamically by
Fig. 10. Jitter histogram of the output.
programming with the same last bit, it takes only one cycle time
for locking, which is critical for dynamic power management
for system-level power reduction. Other combinations marked
in gray in Fig. 8 can be chosen, but in those cases, some cycle
time is wasted before the multiplied output clock is generated.
For example, if the multiplication factor is changed from 1 to
2 with the last bit of 8 and the multiplication factor controller
enables the second transition detector instead of the third, the
first multiplied pulsewidth is narrower than desired and should
be discarded.
To reduce static phase offset and mismatch of delay path, the
replica bias charge pump and dummy delay cells in the VCDL
are used. Furthermore, a symmetric NAND gate is used in the
phase detector and layout of the proposed clock generator was
carefully done to reduce mismatch.
Fig. 11. Dynamic frequency scaling example (from 1.6 GHz to 400 MHz and
then to 800 MHz).
V. EXPERIMENTAL RESULT
Input and output waveforms of the frequency multiplier
are shown in Fig. 9. The measured waveforms of Fig. 9 show Inside the chip, the sinusoidal input signal becomes a square
a 1.3-GHz frequency multiplier output from an input refer- wave after passing through several buffers and then feeds the
ence frequency of 325 MHz, giving a multiplication factor of DLL-based clock multiplier. The measured output waveform
four. The measured sinusoidal input signal is the output of looks triangular due to the weak driving capability of the output
a signal generator which feeds the device under test (DUT). driver. The measurement result with a quiet supply shows
KIM et al.: A 120-MHZ–1.8-GHZ CMOS DLL-BASED CLOCK GENERATOR FOR DYNAMIC FREQUENCY SCALING 2081

Fig. 12. Performance comparison of the DLL-based frequency multipliers.

TABLE I
PERFORMANCE SUMMARY

VI. CONCLUSION
In this paper, a DLL-based clock generator for dynamic fre-
quency scaling was proposed. This clock generator has been
designed in a 0.35- m CMOS process and could increase the
maximum operating frequency owing to the proposed frequency
Fig. 13. Die photo. multiplier. The proposed frequency multiplier has small internal
node capacitances and was controlled by simple enabling logic
circuits. As a result, the generator can operate from 120 MHz
peak-to-peak jitter of 6.6 ps and rms jitter of 1.8 ps in Fig. 10. to 1.8 GHz with various input signal and multiplication factors.
Fig. 11 shows that the frequency multiplier output changes Since the multiplied output was made from each rising input
from 1.6 GHz to 400 MHz and then to 800 MHz with one cycle signal, output signal maintain 50% duty cycle ratio independent
lock time. The minimum and maximum multiplied output fre- of duty cycle ratio of the input signal. In addition, with plain dig-
quencies are 120 MHz and 1.8 GHz with input frequencies of ital logic for frequency adjustment, the multiplication factor can
240 MHz and 450 MHz, respectively. The triangles in Fig. 12 be changed dynamically with fast lock time. In case of the same
are simulated maximum operating frequency of the proposed last bit, it takes only one-cycle to lock during frequency scaling.
multiplier featuring programmability for dynamic frequency Since the DLL includes the voltage regulator and replica bias,
scaling. The proposed clock multiplier increases the operating the generator can suppress the supply noise and have adap-
frequency significantly in a given technology as shown in tive bandwidth. The implemented DLL-based clock generator
Fig. 12. The clock generator die photo is shown in Fig. 13. shows that for a reference signal of 325 MHz and a multipli-
The clock generator occupies 0.07 mm in a 0.35- m CMOS cation factor of four, the measured peak-to-peak cycle-to-cycle
process. The performance of the implemented clock generator jitter is 6.6 ps at 1.3 GHz. The proposed clock generator can
is summarized in Table I. be used in clock and data recovery applications or in dynamic
2082 IEEE JOURNAL OF SOLID-STATE CIRCUITS, VOL. 41, NO. 9, SEPTEMBER 2006

frequency scalers for low-power microprocessors which need a Young-Ho Kwak (S’04) received the B.S. degree
fast lock during frequency change. in electronics engineering from Korea University,
Seoul, Korea, in 2004, where he is currently working
toward the Ph.D. degree in electronics and computer
REFERENCES engineering.
[1] I. W. Young, J. K. Greason, and K. L. Wong, “A PLL clock generator His research interests are in clock and data re-
with 5 to 110 MHz of lock range for microprocessors,” IEEE J. Solid- covery for high-speed communications, high-speed
State Circuits, vol. 34, no. 11, pp. 1599–1607, Nov. 1992. serial link, and PLL design.
[2] J. Alvarez, H. Sanchez, G. Gerosa, and R. Countryman, “A wide-band- Mr. Kwak was the recipient of the IEEE Seoul Sec-
width low-voltage PLL for power PC microprocessors,” IEEE J. Solid- tion Student Paper Contest Award in 2005.
State Circuits, vol. 30, no. 4, pp. 383–391, Apr. 1995.
[3] V. R. von Kaenel, D. Aebischer, C. Piguet, and E. Dijkstra, “A 320
MHz, 1.5 mW @ 1.35 V CMOS PLL for microprocessor clock gen-
eration,” IEEE J. Solid-State Circuits, vol. 31, no. 11, pp. 1715–1722,
Nov. 1996. Mooyoung Kim (S’04) received the B.S. degree
[4] J. C. Rudell, J. J. Ou, T. Cho, G. Chien, F. Brianti, J. A. Weldon, and P. in electrical computer engineering from Korea
R. Gray, “A 1.9-GHz wideband IF double conversion CMOS receiver University, Seoul, Korea, in 2004, where he is
for cordless telephone applications,” IEEE J. Solid-State Circuits, vol. currently working toward the Ph.D degree in the area
32, no. 12, pp. 2701–2088, Dec. 1997. of integrated circuits and systems.
[5] V. R. von Kaenel, “A high-speed, low-power clock generator for a mi- His research interests include mixed-mode circuit
croprocessor application,” IEEE J. Solid-State Circuits, vol. 33, no. 11, and system design.
pp. 1634–1639, Nov. 1998.
[6] D. W. Boerstler, “A low-jitter PLL clock generator for microprocessors
with lock range of 340–612 MHz,” IEEE J. Solid-State Circuits, vol.
34, no. 4, p. 513, Apr. 1999.
[7] V. Gutnik and A. P. Chandrakasan, “Active GHz clock network using
distributed PLLs,” IEEE J. Solid-State Circuits, vol. 35, no. 11, pp.
1553–1560, Nov. 2000.
[8] R. Magoon, A. Molnar, J. Zachan, G. Hatcher, and W. Rhee, “A single- Soo-Won Kim received the B.S. degree in electronics
chip quad-band (850/900/1800/1900 MHz) direct conversion GSM/ engineering from Korea University, Seoul, Korea, in
GPRS RF transceiver with integrated VCOs and fractional-n synthe- 1974, and the M.S. and Ph.D. degrees in electrical en-
sizer,” IEEE J. Solid-State Circuits, vol. 37, no. 11, pp. 1710–1720, gineering from Texas A&M University, College Sta-
Nov. 2000. tion, in 1983 and 1987, respectively.
[9] T. Olsson and P. Nilsson, “A digitally controlled PLL for SoC applica- He joined the Department of Electronics Engi-
tions,” IEEE J. Solid-State Circuits, vol. 39, no. 5, pp. 751–760, May neering at Korea University as an Assistant Professor
2004. in 1987. Since 1989, he has been a Professor in
[10] G. Chien and P. R. Gray, “A 900-MHz local oscillator using a DLL- Department of Electronics Engineering at Korea
based frequency multiplier technique for PCS applications,” IEEE J. University, Korea. His research area is in design
Solid-State Circuits, vol. 35, no. 12, pp. 1996–1999, Dec. 2000. of mixed mode IC, RF PLL, and high-speed and
[11] D. Foley and M. P. Flynn, “CMOS DLL-based 2-V 3.2-ps jitter 1-GHz low-power digital systems.
clock synthesizer and temperature-compensated tunable oscillator,”
IEEE J. Solid-State Circuits, vol. 36, no. 3, pp. 417–423, Mar. 2001.
[12] R. Farjad-Rad, W. Dally, H.-T. Ng, R. Senthinathan, M.-J. E. Lee,
R. Rathi, and J. Poulton, “A low-power multiplier DLL for low-jitter Chulwoo Kim (S’98-M’02-SM’06) received the
multigigahertz clock generation in highly integrated digital chips,” B.S. and M.S. degrees in electronics engineering
IEEE J. Solid-State Circuits, vol. 37, no. 12, pp. 1804–1812, Dec. from Korea University, Seoul, Korea, in 1994 and
2002. 1996, respectively, and the Ph.D. degree in electrical
[13] C. Kim, I.-C. Hwang, and S.-M. Kang, “A low-power small-area 7.28 6 and computer engineering from the University of
ps jitter 1 GHz DLL-based clock generator,” IEEE J. Solid-State Cir- Illinois at Urbana-Champaign in 2001.
cuits, vol. 37, no. 11, pp. 1414–1420, Nov. 2002. While at the University of Illinois, during
[14] R. Farjad-Rad, A. Nguyen, J. Tran, T. Greer, J. Poulton, W. Dally, J. 1997–2000, he was a research assistant with the
Edmondson, R. Senthinathan, R. Rathi, M.-J. Lee, and H.-T. Ng, “A Coordinated Science Laboratory. In 1999, he also
33-mW 8-Gb/s CMOS clock multiplier and CDR for highly integrated worked as a summer intern at the Design Technology
I/Os,” IEEE J. Solid-State Circuits, vol. 39, no. 9, pp. 1553–1561, Sep. at Intel Corporation, Santa Clara, CA. In May
2004. 2001, he joined IBM Microelectronics Division, Austin, TX, where he was
[15] J. Maneatis, “Low-jitter process-independent DLL and PLL based on involved in Cell processor design. Prior to joining IBM, he was a Research
self-biased techniques,” IEEE J. Solid-State Circuits, vol. 31, no. 11, Staff Member with the University of California, Santa Cruz, CA, in 2001.
pp. 1723–1732, Nov. 1996. Since September 2002, he has been with the Department of Electronics and
[16] S. Sidiropoulos, “Adaptive bandwidth DLLs and PLLs using regulated Computer Engineering, Korea University, where he is currently an Assistant
supply CMOS buffers,” in Symp. VLSI Circuits Dig. Tech. Papers, Jun. Professor. His current research interests are in the areas of wireline transceiver,
2000, pp. 124–127. high-speed I/O, PLL/DLL, dynamic power management, and network on chip.
Dr. Kim received the HumanTech Thesis Contest Bronze Award from Sam-
Jin-Han Kim (S’04) received the B.S. and M.S. de- sung Electronics Corporation (1996), the International Low-Power Design Con-
grees in electronics engineering from Korea Univer- test Award at the IEEE International Symposium on Low-Power Electronics and
sity, Seoul, Korea, in 2002 and 2005, respectively, Design (2001), the Design Automation Conference (DAC) Student Design Con-
where he is currently working toward the Ph.D. de- test Award (2002), and Semiconductor Research Corporation (SRC) Inventor
gree in electronics and computer engineering. Recognition Award (2002).
His research interests are in PLL/DLL design for
high-speed communications and panel display driver
IC design.
Mr. Kim was the recipient of the IEEE Seoul Sec-
tion Student Paper Contest Award in 2004.

You might also like