Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
VLSIII
Global
Clocking
ClkA
Hctor Snchez
Mark McDermott
Tcycle
Tjitter
ClkD
Foil # 1
Class Agenda
Clocking Overview
Clock Generation
Clock Distribution
Clock Uncertainty
Clock Skew
Clock Jitter
Clock Regeneration
Tunable global clock buffers
Local clock buffers
Clocking with SOI Technology
Key learnings
Future Clocking Issues
Foil # 2
Clocking Overview
Foil # 3
Clocking Overview
Foil # 4
Clocking
Element
ClkA
Domain
Clocking
Element
ClkD
Domain
Logic block
ClkA
Tcycle
Tuncertainty
ClkD
Foil # 5
Foil # 6
Clocking Overview
Foil # 7
Clock Generation
There are two techniques used to synchronize the clocks in a high performance
system: Phase Locked Loop (PLL) or a Delay Locked Loop (DLL)
The PLL is used to phase synchronize (and probably multiply) the system
clock WRT to a reference clock (internal or external).
PLL features:
Frequency Multiplication to run processor at faster speed than memory interface.
Skew reduction. The reference clock is aligned to the feedback clock.
Possible stability issues with PLL due to 2nd or 3rd order loop behavior.
The DLL is used to delay synchronize the system clock to a reference clock.
DLL Features:
DLLs typically do not multiply an input clock, although high performance DLLs
have been designed to implement limited frequency multiplication.
DLLs are inherently stable. Think of only trying to align delay not delay and
frequency as in a PLL.
Foil # 8
4*XX
4*XX/N
4*XX*F/N
EE 382M Class Notes
Foil # 9
Clock Distribution
Foil # 10
Reference
Clock
Core Clock
PLL
Global
Clock
Buffer
Gclk
Local
Clock
Buffer
Clk
Clk
System
Clock
Gclk
Global
Clock
Buffer
Gclk
Foil # 11
Local
Clock
Buffer
Clk
Local
Clock
Buffer
Clk
Latch/FF
Latch/FF
Latch/FF
Latch/FF
Latch/FF
Latch/FF
Clock Distribution
Delay
Skew
Grid
High 15x
Low
sub100 ps
Low-Med
Trees
Low 1x
High
100s ps
Low
Serpentine
Very High
30x
High
100s ps
Low
Spine
High 10x
Lowsub100ps
Med
Foil # 12
Tapered H-Tree
EE 382M Class Notes
Foil # 13
Foil # 14
Foil # 15
Foil # 16
Foil # 17
Foil # 18
Foil # 19
Foil # 20
Foil # 21
Foil # 22
Clock Uncertainty
Foil # 23
Clock Uncertainty
Foil # 24
Clock Skew
Foil # 25
Clock Skew
Foil # 26
Clock Skew
System
Clock
Core Clock
PLL
Global
Clock
Buffer
Gclk
Local
Clock
Buffer
ClkA
Latch/FF
ClkB
Latch/FF
Gclk
Global
Clock
Buffer
Gclk
Local
Clock
Buffer
ClkC
Local
Clock
Buffer
ClkD
Latch/FF
Latch/FF
ClkA
Foil # 27
Tskew
ClkD
Wire coupling
Coupling will be different on different clock routes
RC mismatch
Clock routes not all of equal length
Latches not all equal distance from LCB (local clock buffer)
Inductance of high speed, low resistance lines changes edge-rates.
Unequal buffering can cause additional skew due to rise timedependent delay in buffers.
Foil # 28
Frequency
(MHz)
Process
Itanium
800
.18
PowerPC
1000
.22
UltraSPARC III
800
80ps Al wires, no
deskew
.18
Alpha
600
72ps Al wires, no
deskew
.13
Foil # 29
Foil # 30
Foil # 31
Clock Deskewing
This can result in clock jitter. Careful analysis is required to validate the
benefits.
Foil # 32
Foil # 33
Clock Jitter
PLL Jitter
Wire coupling
Changing data can alter coupling in different cycles
Foil # 34
Clock Jitter
System
Clock
Core Clock
PLL
Global
Clock
Buffer
Gclk
ClkA
Local
Clock
Buffer
Latch/FF
ClkB
Latch/FF
Gclk
Global
Clock
Buffer
Gclk
Local
Clock
Buffer
ClkC
Local
Clock
Buffer
ClkD
Latch/FF
Latch/FF
ClkA
Tcycle
Tjitter
ClkD
Foil # 35
Foil # 36
H. Sanchez, ISSCC2006
Foil # 37
PLL
Foil # 38
Clock distribution
buffers are exposed to
Vdd noise
Large di/dt events of
microprocessor Core
can cause substantial
Vdd noise that in turn
affects the
instantaneous phase
relationships of clocks
Addition of MIM decap
shows a measured 40%
reduction in peak-peak
phase change due to
large transient currents
related to
microprocessor Core
activity.
Clock Regeneration
The global clock buffers (GCB) are used to regenerate the clock
signal(s) to a region or cluster in the chip. They are typically
designed with skew adjustment control.
The local clock buffers (LCB) are used to regenerate the clock
signal(s) to functional blocks in each cluster. The LCB usually
contains logic which allow the clock signals to be gated on or off
to reduce power.
Foil # 39
Core
Clock
Gclk
SpeedUp1
SpeedUp2
SpeedUp3
Foil # 40
Gclk
Clk
En1
Clk#
En2
ClkDelayed
En3
ClkDelayed#
En4
Foil # 41
Foil # 42
Floating Body
Gate
tSi 150 nm
tOX
Oxide
N+
tSi
N+
Oxide
tBOX
P Substrate
C. T. Chuang et al., SOI Circuit Design for High-Performance
CMOS Microprocessors, 2001 IEEE ISSCC Microprocessor Design Workshop
Foil # 43
TDfall
L-H
12
10
H-L
TDrise
H-L
0.0
0.5
VBS-fall
-0.1
0.4
L-H
-0.2
H-L
0.3
0.2
-0.3
-0.4
VBS-rise
0.1
-0.5
0.0
-0.6
14
0.6
16
Time (sec)
Time (sec)
Igate reduces history effect due to control of floating body ! ( not shown above )
M. M. Pelella et al., VLSI-TSA, 1999 via
C. T. Chuang et al., SOI Circuit Design for High-Performance
CMOS Microprocessors, 2001 IEEE ISSCC Microprocessor Design Workshop
Foil # 44
Foil # 45
Foil # 46
WHAT ???
Yes, GOOD and BAD.
Bad if you are used to thinking that transistors behave like the lumped
empirical models they are based on.
GOOD, because it only prepares you as a designer to accept variability and
learn to deal with it. Ultimately, if its not PDSOI, it will be some other nonclassical device that will require you to learn to adapt
Foil # 47
Foil # 48
Foil # 49
Foil # 50
Future Directions:
Tuned LC tanks, Standing Wave Oscillators
Foil # 51
Foil # 52
Foil # 53
Foil # 54
Foil # 55
Foil # 56
Foil # 57
References
Foil # 59
Key Learnings
Foil # 60
Summary
Foil # 61
References
Foil # 62
Backup
Foil # 63
Acronyms
Foil # 64
C1
C2
Ref
UP
R
Charge
Bias
Pump
Gen
PFD
Fdbk
DN
VCO
VCNTL
NBIAS
1/N
Clock Network
Foil # 65
Foil # 66
Foil # 67
Phase/Freq.
Detector
i(S) +
e(S)
Filter
Ip
2
VCO
Ko
F(S)
Fo(S)
Divider
fdbk(S)
1
M
Foil # 68
o(S)
2
S
K I
1
F =
2 M C
n
= R C1 F n
Foil # 69
Foil # 70
PLL Parameters
Foil # 71
PLL Parameters
Foil # 72
Foil # 73
Differential VCO
Output Period = N * tdelay
Differential
Delay Stage
VCNTRL
+
-
+
-
+
-
+
-
+
-
NBIAS
Output
Amplifiers
- +
- +
- +
- +
- +
Foil # 74
Rvcr
Rvcr
Rvcr
VCNTL
OUT2
OUT1
OUT2#
OUT1#
IN1#
IN1
NBIAS
IN2
IN2#
VSS
Advantages:
Differential signals more immune to noise
Insensitive to switching trip-point inaccuracy
EE 382M Class Notes
Foil # 75
i = C V
T
t delay = C V
i
Frequency =
where
i
C V N
Foil # 76
Frequency
Best Case
Ko2
Typical
Ko1
Worst Case
fmax
fmin
Foil # 77
Volts
1
N
Frequency &
Phase Locked
Post Divider
PLL Core
1
M
1
P
Fvco
M
F in
F vco =
N
M
F
F =
N P
out
Feedback Divider
EE 382M Class Notes
Foil # 78
Fout
in
Books:
Phase Locked Techniques, F.M. Gardner, 1979.
Phase Lock Loops, Design, Simulation and Applications, R. Best, 1999.
Phase Lock Loop Circuit Design, Dan Wolaver, 1991.
Articles:
Charge-Pump Phase-Lock Loops, F.M. Gardner, IEEE Trans. On
Comm., Vol. Com-28. No. 11, November 1980.
A PLL Clock Generator with 5 to 110 MHz of Lock Range for
Microprocessors, I.A. Young, IEEE Journal of Solid-State Circuits, Vol.
27, No. 11, November 1992.
Low-Jitter Process-Independent DLL and PLL Based on Self-Biased
Techniques, John G. Maneatis, IEEE Journal of Solid-State Circuits, Vol.
31, No. 11, November 1996.
Foil # 79