Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Folding
Lan-Da Van (), Ph. D. Department of Computer Science National Chiao Tung University Taiwan, R.O.C. Fall, 2010
ldvan@cs.nctu.edu.tw http://www.cs.nctu.tw/~ldvan/
Outline
Introduction Folding Transformation Register Minimization Techniques Register Minimization in Folded Architecture Conclusions
Lan-Da Van
VLSI-DSP-6-2
Introduction (1/2)
Systematically determine the control circuits in DSP architectures by folding transformation, where multiple algorithm operations are time-multiplexed to a single functional unit. Use for synthesis of DSP architectures that can be operated at single or multiple clocks. Use to reduce the number of hardware functional units (FUs) by a factor of N at the expense of increasing computation time by a factor of N. Lead to an architecture that uses a large number of registers and thus present the register minimization technique.
Lan-Da Van VLSI-DSP-6-3
Introduction (2/2)
Lan-Da Van
VLSI-DSP-6-4
Outline
Introduction Folding Transformation Register Minimization Techniques Register Minimization in Folded Architecture Conclusions
Lan-Da Van
VLSI-DSP-6-5
U, V: nodes (operations) of the original DFG HU, HV: nodes (functional units) of the folded DFG W(x): x-th iteration of node W e U V: an edge e from node U to noe V w(e): # of delays of the edge e Folding factor N # of operations that share one FU An ordered set of operations that executed by the same FU the position of an operation U in folding set is actually the folding order of U The folding set are typically obtained from a scheduling and allocation algorithm (ref. Appendix B) The folding set represents underlying folding transformation
Lan-Da Van VLSI-DSP-6-6
Folding set
DF (U V ) [ N (l w(e))] v ] [ Nl P u] U
Negative value of folding equation DF is possible before retiming the folding equations.
Nw(e) P v u U
Lan-Da Van
VLSI-DSP-6-7
U(l)
V(l+w(e))
N folded PU+DF
N folded
HU(Nl+u)
HV(N(l+w(e))+v)
VLSI-DSP-6-8
Lan-Da Van
addition and multiplication require 1 and 2 u.t. respectively. 1-stage adders and 2-stage pipelined multipliers are available.
Lan-Da Van
VLSI-DSP-6-9
Lan-Da Van
VLSI-DSP-6-10
Retiming (1/3)
What situations will be suffered if the folding equation DF is negative? Retiming (moving delay elements) the original DFG prior to folding Constraint: e DF(UV)= Nwr(e)PU +vu>=0 -----(1) Substitute wr(e)=w(e)+r(V)r(U) into (1) e r(U)r(V)<= DF(UV)/N
Since the retiming values of the nodes are restricted to be integers, the above equations can be rewritten as
r(U)r(V)<=DF(UV)/N
Lan-Da Van
VLSI-DSP-6-11
Retiming (2/3)
Example: DF(12)=Nw(e)-PU+vu=0-1+1-3=-3
Lan-Da Van
VLSI-DSP-6-12
Retiming (3/3)
Lan-Da Van
VLSI-DSP-6-13
Outline
Introduction Folding Transformation Register Minimization Techniques Register Minimization in Folded Architecture Conclusions
Lan-Da Van
VLSI-DSP-6-14
Lifetime Analysis
Lifetime analysis is a procedure used to compute the minimum number of registers required to implement a DSP algorithm in hardware.
In lifetime analysis, the number of live variables at each time unit is computed, and the maximum number of live variables at any time unit is determined. Forward-backward register allocation technique
Lan-Da Van
VLSI-DSP-6-15
Periodicity Implicit
VLSI-DSP-6-16
adg beh c f i
ihgfedcba
Matrix Transpose
ifchebgda
Lan-Da Van
VLSI-DSP-6-17
Lan-Da Van
VLSI-DSP-6-18
Lan-Da Van
VLSI-DSP-6-21
Outline
Introduction Folding Transformation Register Minimization Techniques Register Minimization in Folded Architecture Conclusions
Lan-Da Van
VLSI-DSP-6-22
Lan-Da Van
VLSI-DSP-6-23
Lan-Da Van
VLSI-DSP-6-24
Lan-Da Van
VLSI-DSP-6-25
Lan-Da Van
VLSI-DSP-6-27
Folding Factor = 4
Lan-Da Van
VLSI-DSP-6-29
Retiming
DF(UV) = Nw(e) Pu + v - u DF(12) = 4(1) 1 + 1 3 = 0 DF(23) = 4(1) 1 + 0 3 = 5 DF(24) = 4(1) 1 + 2 3 = 2 DF(31) = 4(1) 1 + 3 3 = 1 DF(41) = 4(2) 1 + 1 3 = 0
Lan-Da Van
VLSI-DSP-6-31
Folding Factor = 2
Lan-Da Van
VLSI-DSP-6-33
Conclusions
Present a systematic transformation of timemultiplexed architectures Explore folding techniques to reduce # of functional units Explore register minimization technique to reduce # of registers
Lan-Da Van
VLSI-DSP-6-34
References
K. K. Parhi, VLSI Digital Signal Processing Systems: Design and Implementation, Wiley, 1999. S. Y. Huang, Handout of text book, 2004.
Lan-Da Van
VLSI-DSP-6-35