Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
1, JANUARY 2011
165
V. CONCLUSION
We have presented a reconfigurable SRAM architecture for
low-power mobile multimedia applications. The proposed architecture
uses spatial voltage scaling where cells storing lower-order bits of
an image pixel are operated at a lower voltage, and the higher-order
cells are operated at nominal voltage. The results demonstrate that
45% power savings can be obtained with marginal image quality
degradation.
REFERENCES
Index TermsAll-digital phase-locked loop (ADPLL), digital filter, digitally controlled oscillator (DCO), frequency interpolation, locking algorithm.
I. INTRODUCTION
In a system-on-chip (SOC) era, it is common that a chip is embedded
with its own clock generator. Phase-locked loop (PLL) is an important
clocking device for these digital systems. Traditionally, a PLL is partly
made of some analog components. However, an analog control signal
is often subject to digital switching noise so that more design efforts are
needed when operating in a noisy digital environment. Moreover, the
loop filter in traditional analog PLLs is usually composed of passive
devices, such as resistors and capacitances, leading to not only large
area but also low portability to different processes. On the contrary, an
all-digital phase-locked loop (ADPLL) has not only higher noise immunity but also a number of other attractive features, including better
testability, programmability, and ease of integration into digital systems. With the aid of digital filtering techniques, the passive devices
could be avoided to save the cost and increase the portability of the
ADPLLs.
Numerous works on ADPLL have been presented in the past [1][4],
[6], [9][12], in which the digitally controlled oscillator (DCO) is at
the heart. Unlike the traditional analog voltage controlled oscillator
(VCO), the oscillator in ADPLL is digitally controlled so that some
quantization error exists. High-resolution design of DCO is thus pursued to catch up with the performance of analog VCO where the resolution can be defined as the minimum delay difference between the
clock periods of two adjacent frequencies a DCO can generate. How to
design the fine-resolution delay cell becomes the early challenge of the
DCO design. Several types of DCOs have been analyzed and summarized in [12]. In [8], the average resolution of DCO could be enhanced
to as fine as 1 ps with the insertion of more delay cells and control lines,
implying that the performance bottleneck of ADPLL is now shifting to
the other components such as the phase detectors (PDs) and/or the loop
filters.
In addition to the DCO, the resolution of an ADPLL is also dictated
by the phase-frequency detector (PFD), or simply PD in some ADPLLs. Recently, the design of a high-resolution PD that is able to distinguish a minute phase difference is becoming more desirable. Sheng
Manuscript received January 15, 2009; revised May 18, 2009. First published
October 13, 2009; current version published December 27, 2010.
The authors are with the Department of Electrical Engineering, National
Tsing-Hua University, Hsin-Chu 30013, Taiwan (e-mail: hjhsu00@larc.ee.
nthu.edu.tw; syhuang@ee.nthu.edu.tw).
Digital Object Identifier 10.1109/TVLSI.2009.2030410
166
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 19, NO. 1, JANUARY 2011
et al. [6] used pulse amplifier and signal extender to amplify the small
phase difference, i.e., 5 ps, to be detected. Tierno et al. [11] inserted a
meta-stability filter to enhance the resolution of proposed PFD. In [12],
we discovered that a latch-based sense amplifier could be modified to
serve as a good PD in which the minimum resolvable phase difference
is as small as 1 ps.
Since the two critical components, i.e., the PD and the DCO, have
performed almost as well as their analog counterparts, the digital loop
filter or called controller in the ADPLL turns into a new focal point. The
digital loop filter, in some sense, seems like the bridge between the PD
and the DCO. It converts the phase information received from the PD
into a digital code sequence that directs the increase or the decrease
of the frequency of the DCO. This is especially difficult for ADPLLs
since most PDs only provide the phase polarity, not the exact phase
magnitude. Whether the frequency of DCO can adequately react to the
phase polarity has a large influence on performance parameters such as
jitter and phase noise.
In this brief, we present an ADPLL that reuses the high-resolution
DCO and the PD proposed in [12]. We further improve its jitter performance by a four-step frequency/phase locking procedure: 1) frequency
locking, 2) preliminary phase locking, 3) jitter reduction by a suppressive filter, and finally 4) phase error reduction by frequency interpolation. Experimental results show that the proposed scheme can reduce
the jitter to a level as small as 7.28 ps, which is only 3% away from the
ideal jitter amount (7.06 ps) of a free-running DCO, when the output
clock is operating at 600 MHz.
The rest of this paper is organized as follows. Section II provides the
overview of our ADPLL architecture and the designs of the DCO and
the PD. Section III discusses the details of the four-step locking procedure. Section IV presents our experimental results. Finally, Section V
concludes.
II. THE ADPLL ARCHITECTURE
A. Overall Architecture
Fig. 1 shows the architecture of the proposed ADPLL. In integerADPLL, the frequency of the output clock signal generated by the DCO
is divided by times to produce a divided clock signal. The parameter
will be referred to as the frequency multiplicative factor in the sequel. When the frequency of an ADPLL is locked (i.e., the frequency
of divided clock has the same frequency as the reference clock), the
output clock frequency is times the reference clock frequency.
The DCO is often regarded as the most critical part among these
components because it dictates the frequency range and the resolution
that an ADPLL can synthesize. It is controlled by 12 b in our design,
dividing into two parts: a 6-b coarse control code and a 6-b fine control code. The PD is to compare the phase polarity between the divided
clock signal and the reference clock signal. It generates two mutually
exclusive signals: lead and lag to show whose edge is leading. Since
the bandwidth of the DCO is quite large, the frequency detector is to
compare the divided clock with the reference clock so as to make the
frequency of DCO gradually approach the target frequency. Once the
frequency of DCO has been tuned as close as possible to the target frequency, signal frequency_lock is asserted and the phase-locking process
begins. The controller performs a phase-locking algorithm, mimicking
the function of loop filter in an analog PLL. In a number of reference
clock cycles, the control code will gradually converge to a small range,
indicating that the phase has also been locked to the reference clock
and the target frequency has been synthesized.
B. Digitally Controlled Oscillator
The DCO architecture we use in this design follows the one discussed in [12]. The fine-tuning delay cell comprises an array of tunable
metaloxidesemiconductor (MOS) capacitance, resulting in a highresolution, uniform, and monotonic delay profile of DCO. Such properties are contributive to the frequency detection and phase locking and
simplify the design of the digital loop filter. The average resolution of
one coarse step and one fine step is summarized in Table I. It is notable
that the delay range of the fine-tuning delay cell can always cover a
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 19, NO. 1, JANUARY 2011
167
TABLE I
MEASURED CHARACTERISTICS OF THE DCO
coarse-tuning step even after manufacturing so that the DCO does not
have any uncovered region within the supported frequency range.
C. Phase Detector
We adopt the sense-amplifier-based PD proposed in [12] since it can
almost eliminate the dead zone, (which is the minute range of phase
difference a PD cannot properly react). The phase polarity of two input
clocks is sent to both frequency detector and phase-locking controller.
Experimental results show that the performance will not be affected
even when there is severe process variation.
III. LOCKING PROCEDURE OF THE ADPLL
The locking procedure is the most contributive part in this work as
compared to previous ADPLL designs. It is composed of four steps
shown as Fig. 2. In step 1, the frequency tuning range of DCO should be
narrowed down to the proximity of the target frequency so as to facilitate the subsequent phase locking. This is achieved by tuning the coarse
control code of DCO under the guidance of the frequency detector. In
step 2, we start the preliminary phase locking with the reference clock.
The phase will be locked quickly but roughly, implying that the jitter
could still be large. Next, a suppressive digital filter is adopted in step
3 to further reduce the jitter. Finally, an interpolation-based locking algorithm is used to make the phase error as small as possible. In the
following sections, we will introduce the detail of each step. In the sequel, the reference clock signal in our ADPLL is denoted as ref_clk,
and the divided clock signal is denoted as div_clk. We will mention
these two signals frequently in the rest of our discussion.
A. Frequency Detector
Since the initial frequency of the DCO could be far from the target
frequency, we should draw it to closer to the target one first. This is
done by a binary search process to select an adequate coarse control
code so that the phase locking in the following steps can be achieved by
tuning fine control code only. In the frequency detector, the most critical operation is the frequency comparisonan operation that checks if
ref_clk or div_clk has a higher frequency. The one with a larger number
of edges within a preset period is assumed to have a higher frequency.
Since the accuracy of the edge counting is limited to the setup/hold
time constraint of a flip-flop, we further use the information derived
by our 1-ps-resolution PD to achieve a higher accuracy [12]. The frequency_lock signal will be triggered once the binary search process is
completed. In other words, we have found the two neighboring coarse
control codes surrounding the target frequency. Therefore, the delay
difference between the current clock period and the target one is within
the delay of one coarse step, said about 230 ps by the measurement results in TSMC18 process.
B. Preliminary Phase-Locking Scheme
Before starting phase locking, the DCO is reset first to align the phase
of div_clk with the ref_clk. Fig. 3 shows an example relationship between the phase error (which relates to the amount of time difference
between the two rising edges of signals div_clk and ref_clk) and the
control code. It is notable that if div_clk leads ref_clk, then the phase
error polarity is defined as positive. Under such a condition, the DCOs
frequency will be decreased until the phase error eventually becomes
negative. Very often, the frequency of the DCO is now underestimated
and another cycle of catching up is started. In that cycle, the DCOs
frequency is increased gradually until the phase error turns positive
again. If such an oscillation pattern appears, we can say that the phase
of div_clk is locked to that of ref_clk at least roughly. It can be seen
from the illustration that the target frequency is approximately located
at the middle value of the maximal and minimal control codes.
In light of the previous observation, we can quicken the
phase-locking process by a forward prediction on the control code.
The idea is that the maximal and minimal control codes always occur
when the polarity changes. To take advantage of this phenomenon,
we record one previous control code at all times. Whenever there is a
polarity change, the next control code will be set to be the average of
the current control code and the previous recorded one. Fig. 4 shows
an example, the phase polarity changes at point B so that the next
control code, which is #3, will be set to the average of current control
code, which is #2, and the previous recorded control code, which is
#1. The recorded control code is then updated as the control code #3.
As we can see, the phase is quickly locked and the fluctuations of
the phase error and the control code are limited to a narrower range,
implying that the resulting ADPLL would have a lower jitter. Then we
168
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 19, NO. 1, JANUARY 2011
16
16
f=3
= 163
3 fN
+ 13
16
3 fN +1 :
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 19, NO. 1, JANUARY 2011
169
TABLE II
PERFORMANCE COMPARISON
Fig. 10. (a) Phase alignment after phase locked. (b) Jitter measurement of the
ADPLL (@600 MHz).
Fig. 11. Jitter performance under different output frequencies (reference clock
frequency fixed at 1 MHz).
Measurement results of jitter performance under different output frequencies are presented in Fig. 11. The reference clock frequency is
fixed and the output frequency is changed by tuning the multiplicative number N . It is found that the interpolation-based locking scheme
can not only reduce the phase error but also further reduce the jitter.
The jitterPk-Pk and the jitterRMS of ADPLL are 72 and 10.1 ps at
100 MHz and 68 and 8.81 ps, respectively, in our free-running DCO.
The jitterRMS is small than 0.44%UI even when the output frequency
varies from 100 to 600 MHz, indicating that the ADPLL can achieve
low-jitter performance all over its operation range. Experimental results reveal that the jitter of our ADPLL is very close to that of free
running DCO no matter what output frequency is synthesized.
Table II shows the characteristics of our ADPLL and its comparison with some previous works. All of them can provide a wide range
170
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, VOL. 19, NO. 1, JANUARY 2011
of multiplicative numbers for frequency multiplication or clock generation. Measurement results show that our ADPLL can correctly function under varying input reference clock frequencies and multiplicative
numbers. In general, our measured jitter is comparable to the analog
PLL and clearly outperforms the previous ADPLLs.
V. CONCLUSION
ADPLL has been regarded inferior to its analog counterpart from
the performance point of view. More recently, ADPLL design has progressed significantly. However, problems such as inadequate DCOs
resolution and large jitter remain to be solved. In this brief, we presented an ADPLL with an ultralow-jitter characteristic. There are three
major contributions we made in this paper. 1) We use a predictive
scheme to expedite the preliminary phase-locking procedure. 2) We
further successfully reduce the jitter amount by a simple and inexpensive suppressive digital loop filter, which incorporates only counting
and thresholding to reduce the sensitivity of a PLL during the second
phase-locking stage. 3) We propose to use frequency interpolation to
virtually improve the resolution of a DCO beyond its physical limitation. Better jitter performance and phase error can thus be achieved.
Measurement results of a test chip show that the root mean square
(RMS) jitter is less then 0.44%UI when the output clock frequency
varies from 100 to 600 MHz. Operated at 600 MHz, the jitter is only
7.28 ps, which is only 3% away from that of the free-running DCO
(7.06 ps).
REFERENCES
[1] J. -S. Chiang and K. -Y. Chen, The design of an all-digital phase
locked loop with small DCO hardware and fast phase lock, IEEE
Trans. Circuits Syst. II, Analog Digit. Signal Process., vol. 46, no. 7,
pp. 945950, Jul. 1999.
[2] C.-C. Chung and C.-Y. Lee, An all-digital phase-locked loop for highspeed clock generation, IEEE J. Solid-State Circuits, vol. 38, no. 2, pp.
347351, Feb. 2003.
[3] T. Watanabe and S. Yamauchi, An all-digital PLL for frequency multiplication by 4 to 1022 with seven-cycle lock time, IEEE J. Solid-State
Circuits, vol. 38, no. 2, pp. 198204, Feb. 2003.
[4] T. Olsson and P. Nilsson, A digitally controlled PLL for SOC applications, IEEE J. Solid-State Circuits, vol. 39, no. 5, pp. 751760, May
2004.
[5] G. Yan, C. Ren, Z. Guo, Q. Ouyang, and Z. Chang, A self-based PLL
with current-mode filter for clock generation, in Proc. IEEE Int. SolidState Circuit Conf., Feb. 2005, pp. 420421.
[6] D. Sheng, C.-C. Chung, and C.-Y. Lee, An all-digital phase-locked
loop with high-resolution for SoC applications, in Proc. IEEE Int.
Symp. Very Large Scale Integr. (VLSI) Design Autom. Test, Apr. 2006,
pp. 14.
[7] P.-L. Chen, C.-C. Chung, J.-N. Yang, and C.-Y. Lee, A clock generator with cascaded dynamic frequency counting loops for wide multiplication range applications, IEEE J. Solid-State Circuits, vol. 41, no.
6, pp. 12751285, Jun. 2006.
[8] S. Duo, C.-C. Chung, and C.-Y. Lee, An ultra-low-power and portable
digitally controlled oscillator for SoC applications, IEEE Trans. Circuits Syst. II, Exp. Briefs, vol. 54, no. 11, pp. 954958, Nov. 2007.
[9] R. Staszewski and P. Balsara, All-digital PLL with ultra fast settling,
IEEE Trans. Circuits Syst. II, Exp. Briefs, vol. 54, no. 2, pp. 181185,
Feb. 2007.
[10] J. Zhuang, Q. Du, and T. Kwasniewski, A 4 GHz low complexity
ADPLL-based frequency synthesizer in 90 nm CMOS, in Proc. IEEE
Custom Integr. Circuits Conf., Sep. 2007, pp. 543546.
[11] J.-A. Tierno, A.-V. Rylyakov, and D.-J. Friedman, A wide power
supply range, wide tuning range, all static CMOS all digital PLL in
65 nm SOI, IEEE J. Solid-State Circuits, vol. 43, no. 1, pp. 4251,
Jan. 2008.
[12] H.-J. Hsu, C.-C. Tu, and S.-Y. Huang, A high-resolution all-digital
phase-locked loop with its application to built-in speed grading for
memory, in Proc. IEEE Int. Symp. Very Large Scale Integr. (VLSI)
Design Autom. Test, Apr. 2008, pp. 267270.
[13] CIC Referenced Flow for Mixed-Signal IC Design, Chip Implementation Center (CIC), Taiwan, 2008, Document no. CIC-DSD-RD-08-02.