Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
3, MARCH 2009
Abstract—Among the different methods of reducing power for power (Pleak ) in CMOS designs is primarily due to subthresh-
core-based system-on-chip (SoC) designs, the voltage-island tech- old leakage [2] and can be expressed compactly as
nique has gained in popularity. Assigning cores to the different
supply voltages and floorplanning to create contiguous voltage −VT
islands are two important steps in the design process. We propose Pleak = I0 e nVth VDD (2)
a new application-driven approach to voltage partitioning and
island creation with the objective of reducing overall SoC power, where I0 is the leakage current coefficient that embodies a num-
area, and floorplanner runtime. Given an application power-state ber of factors, VT is the threshold voltage, Vth is the thermal
machine (PSM), we first identify the suitable range of supply voltage, and n is the subthreshold swing parameter. Projections
voltages for each core. Then, we generate the discrete voltage for the off-state current indicate an increase of approximately
assignment table using a heuristic technique. Next, we describe
a methodology of reducing the large number of available choices five times per generation [3]. Thus, leakage power is now a
from the voltage assignment table down to a useful set using critical design concern for many very large scale integration
the application PSM. We partition the cores into islands, using a systems.
cost function that gradually shifts from a power-based assignment The overall goal of any design is to meet a certain perfor-
to a connectivity-based one. Compared with previously reported mance goal while consuming the minimum possible energy.
techniques, a 9.4% reduction in power and 8.7% reduction in
area are achieved using our approach, with an average runtime The energy of a digital design is given by
improvement of 2.4 times.
E = (Pdyn + Pleak ) × Dcrit
Index Terms—Low-power design, power-state machine (PSM),
voltage-island optimization. where Dcrit is the critical-path delay of the design of the form
Ki VDD
I. I NTRODUCTION Dcrit =
i
(VDD − VT )α
Authorized licensed use limited to: The University of British Columbia Library. Downloaded on July 31, 2009 at 12:50 from IEEE Xplore. Restrictions apply.
SENGUPTA AND SALEH: APPLICATION-DRIVEN VOLTAGE-ISLAND PARTITIONING FOR SYSTEM-ON-CHIP DESIGN 317
Fig.1. SoC with (a) single voltage island and with (b) three voltage islands.
Authorized licensed use limited to: The University of British Columbia Library. Downloaded on July 31, 2009 at 12:50 from IEEE Xplore. Restrictions apply.
318 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 28, NO. 3, MARCH 2009
Authorized licensed use limited to: The University of British Columbia Library. Downloaded on July 31, 2009 at 12:50 from IEEE Xplore. Restrictions apply.
SENGUPTA AND SALEH: APPLICATION-DRIVEN VOLTAGE-ISLAND PARTITIONING FOR SYSTEM-ON-CHIP DESIGN 319
Authorized licensed use limited to: The University of British Columbia Library. Downloaded on July 31, 2009 at 12:50 from IEEE Xplore. Restrictions apply.
320 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 28, NO. 3, MARCH 2009
the set of edges; Φ is the set of edge weights; and Γ is the set
of state weights. Each state Si ∈ Π describes the state of the
cores of a SoC at any given time instant, and each edge ai,j ∈
E represents a valid transition from state Si to Sj . We further
assume that such transitions are controlled by inputs from the
power manager as well as by external inputs [39]. In Fig. 7, Init
is the initial state of the system, from which the chip can change
to S1 , S2 , or S3 . Each state in the PSM describes the operating
mode of the cores in the SoC. For example, the cores can be in
any of the three operating modes: Active state (full operational
mode), Idle state (no activity state) and Sleep state (power shut
off). In every state Si , each core is in one of these three states.
As shown in Fig. 7, C1 is in Idle state in S1 and in Active state
in S4 . We assign weights w(ai,j ) ∈ Φ to the edges from state
Si to states Sj (Si , Sj ∈ Π and i = j) such that
w(ai,j ) = 1 (3)
j Fig. 8. Power, delay, and EDP versus VDD for a core.
with w(ai,j ) = 0 when no edge exists between Si and Sj . performance IC’s, there are a few blocks with high activity
To represent the relative time period, the SoC remains in while others have relatively low activity. Cores having low
a particular state; we also assign weights W (Si ) ∈ Γ to the activity (hence, higher leakage power) should be implemented
individual states. The larger the weight, the greater is the using HVT with no power gating. Cores having high activity
likelihood that the SoC will be in that state within a certain (and, thus, higher dynamic power) should be designed using
time window. Thus, the PSM provides an application-driven SVT or LVT [30], and lower supply voltages should be de-
abstraction of the different power modes, in which the cores signed with HVT power-gating transistors. Additionally, (5)
might operate, and of their respective probabilities. will be used later to distinguish between high and low power
Since the choice of SVT, LVT, and HVT depends upon the consuming cores in the method described in Section V.
activity of the circuit, we must extract activity information from
the PSM. In order to do so, we compute the overall activity IV. C HOICE OF S UPPLY V OLTAGES
factor of each core (Ci ) as αi and the overall idle factor as βi
The next step in the design process is to choose a finite
n
n
number of supply voltages from the range of supply voltages
[w(aj,k ) × W (Sk ) × mi,k ]
j=1 k=1 for each core and generate the voltage assignment table. As an
α1 =
n example, consider a SoC with eight cores (C1 , C2 , . . . , C8 ).
W (Sr ) Assume the chip-level supply voltage to be VDDH ∗
= 1.2 V.
r=1
n
n We first determine the threshold voltage for each core with the
[w(aj,k ) × W (Sk ) × ni,k ] technique described in the previous section. With a target delay
j=1 k=1 i
β1 = for each core Ci , we can calculate VDDL . It should be noted
n
i ∗
W (Sr ) that the minimum value of VDDL is restricted to be VDDH /2 in
r=1 order to ensure proper operation of the gates. Next, we identify
where a useful intermediate operating point VDDM i
by selecting the
1, if Ci is Active in state Sk optimal value of one of the various products of power and
mi,k =
0, otherwise delay, such as the power–delay product (PDP, or energy),
1, if Ci is Idle in state Sk energy–delay product (EDP) [40], or the power–energy product
ni,k = (4)
0, if Ci is Sleep or Active state Sk . [41]. Each of these quantities gives rise to a different optimal
i
VDDM value for the core. For the purposes of this paper, the one
We then use αi and βi to select the suitable threshold voltages we use is the optimal PDP value, as shown in Fig. 8. These types
for each core. The power Pi can be computed using the overall of graphs can be generated for a core if an initial calibration
activity, idle, and sleep factors of the core Ci as point is available with the delay, dynamic power, and leakage
Pi = αi PA,i + βi PI,i + (1 − αi − βi )PS,i (5) power for a given (VDD , VT ) pair. The accuracy has been shown
to be within 10%–20% of the actual values across process and
where PA,i , PI,i , and PS,i are the average power consump- temperature ranges [35].
i
tions of stand-alone core Ci in Active, Idle, and Sleep modes, The optimum energy point can be located between VDDL
∗
respectively. Therefore, power Pi represents the actual power and VDDH , depending on how leaky the core is, the degree of
contribution of core Ci to the total chip power for the given activity, and the target critical-path delay. The actual position
i
PSM. After using (4) to calculate the activity of each core, of VDDM for three different ring oscillator circuits with two
we decide the suitable VT and the range of supply voltages different activity factors is shown in Fig. 9. In general, with in-
i i
over which the core is permitted to operate. In typical high- crease in activity, VDDM is closer to VDDL . For example, when
Authorized licensed use limited to: The University of British Columbia Library. Downloaded on July 31, 2009 at 12:50 from IEEE Xplore. Restrictions apply.
SENGUPTA AND SALEH: APPLICATION-DRIVEN VOLTAGE-ISLAND PARTITIONING FOR SYSTEM-ON-CHIP DESIGN 321
Authorized licensed use limited to: The University of British Columbia Library. Downloaded on July 31, 2009 at 12:50 from IEEE Xplore. Restrictions apply.
322 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 28, NO. 3, MARCH 2009
TABLE III
VOLTAGE ASSIGNMENT TABLE
TABLE II
VOLTAGE–ENERGY TABLE cases: m = 1, 2, and 3. For example, in the first square, 54 nJ is
the total energy, assuming only one island with VDD = 1.2 V.
The square with the lowest chip energy is 37.2 nJ, and the
resulting supply voltages are 0.6, 0.8, and 1.2 V. We now
refer to these as permissible voltages. As discussed before, the
lowest two supply voltages were not the best choice to minimize
power.
We have found, after running a large number of cases, that
the error incurred by the greedy algorithm, as well as setting
i i
VDDL to VDDM (for the nonmonotonic cores to make them
monotonic), is relatively small. On average, our solution is
greater than the globally optimal energy by 1% when three
energy and the corresponding supply voltage in lines L13–L15. islands are chosen and is indistinguishable from the globally
To explore other solutions, each of the m − 1 supply voltages optimal solution when the number of islands is four or more.
in V is replaced by νp . If νp gives a better solution, we update In the aforementioned example, the greedy heuristic technique
array V and the energy value esum(m, p) in lines L16–L18. actually gives the correct globally optimal solution.
The final solution using our heuristic technique for choosing Using these permissible voltages, we can now construct the
m voltages from a set of n possible supply voltages is e∗min voltage assignment table from Table II to produce Table III.
in line L19. Although the approach in [28] and our approach Note that if a core’s lowest supply voltage is not permissible, it
require the same time complexity, the space complexities of must be replaced by a higher voltage that is on the permissible
our technique and in [28] are, respectively, quadratic and linear. list. The highest voltage is always available in this case and
To implement this algorithm, we need to know the energy is included in the table. Moreover, note that, during voltage
values of the cores at all the n supply voltages. At this stage, assignment, cores can be assigned to only the values listed in
any core may be assigned to any of these voltages, as long Table III.
as the timing constraints are satisfied. Hence, we generate the
voltage–energy table as shown in Table II from Table I. How-
i VI. V OLTAGE A SSIGNMENT FOR F LOORPLANNING
ever, due to the additional constraint on VDDL , not all possible
supply voltages are considered for all cores. For example, in the So far, only the voltage assignment table has been generated
case of C3 , we omit the energy at VDD = 0.6 V as the minimum but the actual assignment of voltages to cores has yet to be
i i
energy is reached at VDDM = 0.7 V. Thus, VDDL is reset to this performed. In typical high-performance ICs, there are many
value. Similar effects are seen on C2 and C4 . blocks; however, only a few blocks have high activity while
Fig. 11 shows the step-by-step results of applying the pro- others have relatively low activity [42]. Based on the power
posed technique to the example in Table II. The supply voltages consumption of cores in a few industrial chips, the number of
for the voltage islands are included with each energy value. We high power consuming blocks is 10%–20% of the total number
have shown the results from our approach for three different of blocks in the SoC. Using (5), we compute the power for each
Authorized licensed use limited to: The University of British Columbia Library. Downloaded on July 31, 2009 at 12:50 from IEEE Xplore. Restrictions apply.
SENGUPTA AND SALEH: APPLICATION-DRIVEN VOLTAGE-ISLAND PARTITIONING FOR SYSTEM-ON-CHIP DESIGN 323
core and order them from the highest to lowest power. Starting TABLE IV
POWER VERSUS LEVEL SHIFTERS
with N cores, the VDD assignment is carried out in two phases.
1) High-Power Blocks (Phase I): Select k out of N cores
that have the largest contribution to the total chip power.
Assign the lowest available VDD from Table III to each
of the high-power cores and go to Phase II. On the
next iteration of Phase I, move the core that results in
the lowest increase in power to its next highest VDD .
Repeat on subsequent iterations until all voltage options
are exhausted (deterministic approach).
2) Low-Power Blocks (Phase II): Voltage assignment for
the remaining N−k cores that have a relatively smaller
contribution to the overall power is performed using a
cost function (described in the next section) that considers
both connectivity and power (heuristic approach).
Authorized licensed use limited to: The University of British Columbia Library. Downloaded on July 31, 2009 at 12:50 from IEEE Xplore. Restrictions apply.
324 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 28, NO. 3, MARCH 2009
TABLE V
SIMULATION RESULTS ON BENCHMARK CIRCUITS
decap placement to meet the power-grid noise budget. Since chip was assumed to be proportional to the number of cores.
there is no floorplan available during this step, we use a simple Next, we used the heuristic technique (as in Section IV) for
1-D power grid that satisfies the worst-case IR drop for each choosing the permissible supply voltages that could be used for
island [43]. For each voltage assignment, dynamic IR-drop realizing voltage islands on the chip. This produced the voltage
simulation is carried out, and the decap is adjusted until the assignment table.
voltage drop satisfies the allotted budget (5%–10%) for each We applied our technique on 11 benchmark circuits using the
grid. The total amount of decap is then translated to the amount HotSpot Floorplan Tool that included the LAPACK package
of white space using [47]. The experiments were run using an Intel Pentium 4
2.4-GHz machine with 2-GB RAM running on Debian Linux
Cdecap (version 2.6.18).
Adecap = . (8)
Cox The flow presented in [15] is our main comparison point
Finally, we attempt floorplanning with the partitioned cores due to the availability of the floorplanning tool and on their
to create voltage islands. In each island, we reserve additional use of the same benchmark circuits. As the voltage assignment
white space based on (8) for decap placement after completing table is the starting point in [15], we use our table, generated in
the floorplanning operation. If a feasible floorplan is obtained, Section IV, as a starting point for their design flow. Based on the
then we consider it as the desirable solution. If not, we go relative importance of power and area, a number of coefficients
back to Phase I and repeat the process. A 2-D power-grid noise must be assigned to the fitness function in [15]. In our approach,
analysis, as described in [44], can be performed on the final we have given equal importance to both power and area by
floorplan to ensure that IR-drop violation does not exist in each assigning χ1 = χ2 = 0.5. Similar priorities were given to the
island. coefficients in the fitness function of the genetic algorithm in
their technique.
Table V compares the runtime and the quality of results
VII. E XPERIMENTAL R ESULTS
(number of voltage islands, area, and power) of our approach
This section presents the results of our proposed approach. with [15]. Both methods were required to use the same number
We have applied our methodology on a number of MCNC and of islands. Columns 10–12 compare the savings of our proposed
GSRC benchmark circuits. For all the circuits, a 90-nm CMOS technique against that in [15]. Our power and area values are
design kit was used in order to model dynamic power, leakage significantly better than those obtained using the flow in [15],
power, threshold voltage, power, and area of level shifters. with an average improvement of 8%–10%. The runtime is
As the designs in the benchmarks were not originally in- 2.4 times better. This is due to the reduced number of floorplan
tended for voltage islands, additional information was added. iterations required to generate a feasible floorplan. Note that
A representative (but artificial) PSM was generated for each voltage assignments far from the optimal are often attempted
benchmark using an approach similar to that in [45]. The power in the genetic algorithm. Our approach is specifically targeted
consumption and delay of each core at 1.2 V was randomly as- to avoid such floorplan attempts, as it only adds to the runtime
signed between 2 W and 1 mW, similar to the approach used in [48]. The incremental cost is relatively small for ordering the
[46]. The activity of each core was derived from the PSM using cores and finding the preferable voltage assignments for which
the techniques described in Section III. Appropriate threshold floorplanning should be attempted. In the case of xerox, the
voltages (LVT, SVT, and HVT) were assigned to the cores two techniques produced the same power and area solutions,
based on their activity factors, and the optimal voltage table was although our technique was much faster than that in [15]. For
generated. The possible number of supply voltages allowed on the circuit n10, the power consumption was slightly higher for
Authorized licensed use limited to: The University of British Columbia Library. Downloaded on July 31, 2009 at 12:50 from IEEE Xplore. Restrictions apply.
SENGUPTA AND SALEH: APPLICATION-DRIVEN VOLTAGE-ISLAND PARTITIONING FOR SYSTEM-ON-CHIP DESIGN 325
Fig. 13. Floorplan of n50 using (a) technique in [15] and the (b) proposed
approach.
our result. Such anomalies are possible for the smaller circuits.
In Fig. 13, we compare the floorplans of n50 obtained using
the technique in [15] and our approach. Both floorplans use six
islands; however, the power consumption and cores assigned to
the islands were different. Our solution shown in Fig. 13(b) has
an area that is 5.8% less and a power that is 9.7% less than their
corresponding result of Fig. 13(a).
Investigating the thermal effect on the floorplan of each
Fig. 14. Detailed floorplan of n100 illustrating final adjustments. (a) Floor-
benchmark, we found that the average temperature in our case plan of n100 with potential IR-drop problem. (b) Floorplan of n100 without
is approximately 9%–10% lower than that in [15]. Additionally, IR-drop problem.
the peak temperature was approximately 13% lower. The reason
is that our approach assigns more cores to islands with lower explored using power as the priority, and then, a heuristic
supply voltages. This leads to an overall reduction in the approach is used for remaining cores based on a power and con-
dynamic and leakage powers, which, in turn, lowers the on-chip nectivity cost function. We then attempt floorplanning for each
average and peak temperature [49]. assignment solution. We compared our results with existing
We also inspected the floorplan solutions of all the bench- work on a number of benchmark circuits and obtained improve-
marks and noticed that, in some cases, there were blocks that ments of 9.4% in power, 8.7% in area, and 2.4 times in runtime.
belonged to one island but were surrounded on three sides
by other islands. This problem can be seen in circuit (n100), ACKNOWLEDGMENT
as shown in Fig. 14(a). The voltage-island design not only
requires placement of blocks belonging to similar islands in The authors would like to thank the reviewers for many
a contiguous fashion but also must ensure that there are no valuable suggestions and comments that have significantly en-
power-grid design problems due to the final placement. Such hanced the quality of this paper.
floorplans may produce an IR-drop problem when a 2-D power-
grid analysis is performed. To avoid such potential problems, R EFERENCES
we attempt to move this block to other adjacent islands and [1] S. Borkar, “Design challenges of technology scaling,” IEEE Micro, vol. 9,
no. 4, pp. 23–29, Jul. 1999.
assess the incremental cost in power, level shifters, and power- [2] L. Wei, K. Roy, and V. K. De, “Low voltage low power CMOS design
grid design. The lowest cost solution is used, as long as only two techniques for deep submicron ICs,” in Proc. Conf. VLSI Des., 2000,
sides are adjacent to other islands. The solution selected for this pp. 24–29.
[3] B. Chatterjee, M. Sachdev, S. Hsu, R. Krishnamurthy, and S. Borkar,
case is shown in Fig. 14(b) and indicates the final adjustment to “Effectiveness and scaling trends of leakage control techniques for sub-
a different island. 100 nm CMOS technologies,” in Proc. Symp. Low-Power Electron., 2003,
pp. 122–127.
[4] D. Markovic, V. Stojanovic, B. Nikolic, M. A. Horowitz, and
VIII. C ONCLUSION R. W. Brodersen, “Methods of true energy-performance optimization,”
IEEE J. Solid-State Circuits, vol. 39, no. 8, pp. 1282–1293, Aug. 2004.
In this paper, we have proposed a comprehensive, [5] V. Zyuban and P. N. Strenski, “Balancing hardware intensity in micro-
application-driven, and floorplan-aware methodology for processor pipelines,” IBM J. Res. Develop., vol. 47, no. 5/6, pp. 585–598,
voltage-island design. First, we identified meaningful supply Sep. 2003.
[6] G. E. Tellez, A. Farrahi, and M. Sarrafzadeh, “Activity-driven clock de-
voltages for each core and constructed a voltage assignment sign for low power circuits,” in Proc. ICCAD, Nov. 2005, pp. 62–65.
table using a greedy heuristic technique. Next, the voltage [7] M. Srivastav, S. S. S. P. Rao, and H. Bhatnagar, “Power reduction tech-
assignment to the cores was attempted in a two-stage approach. niques using multi-Vt libraries,” in Proc. IWSOC, Jul. 2005, pp. 363–367.
[8] D. E. Lackey, P. S. Zuchowski, T. R. Bednar, D. W. Stout, S. W. Gould,
By observing that cores in a SoC can be categorized as high and J. M. Cohn, “Managing power and performance for system-on-chip
and low power blocks, a deterministic sequence of options is designs using voltage islands,” in Proc. ICCAD, 2002, pp. 195–202.
Authorized licensed use limited to: The University of British Columbia Library. Downloaded on July 31, 2009 at 12:50 from IEEE Xplore. Restrictions apply.
326 IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, VOL. 28, NO. 3, MARCH 2009
[9] T. Kobayashi and T. Sakurai, “Self-adjusting threshold-voltage scheme [38] [Online]. Available: http://www.unifiedpowerformat.com/
(SATS) for low-voltage high-speed operation,” in Proc. CICC, May 1994, [39] L. Benini, R. Hodgson, and P. Siege, “System-level power estimation and
pp. 271–274. optimization,” in Proc. ISLPD, Aug. 1998, pp. 173–178.
[10] K. Choi, R. Soma, and M. Pedram, “Dynamic voltage and frequency [40] R. Gonzalez, B. M. Gordon, and M. A. Horowitz, “Supply and threshold
scaling based on workload decomposition,” in Proc. ISLPED, Aug. 2004, voltage scaling for low power CMOS,” IEEE J. Solid-State Circuits,
pp. 174–179. vol. 32, no. 8, pp. 1210–1216, Aug. 1997.
[11] S. Mutoh, T. Douseki, Y. Matsuya, T. Aoki, S. Shigematsu, and J. Yamada, [41] D. Sengupta and R. Saleh, “Generalized power–delay metrics in deep
“A 1 V power supply high-speed digital circuit technology with multi submicron CMOS design,” IEEE Trans. Comput.-Aided Design Integr.
threshold voltage CMOS,” IEEE J. Solid-State Circuits, vol. 30, no. 8, Circuits Syst., vol. 26, no. 1, pp. 183–189, Jan. 2007.
pp. 847–854, Aug. 1995. [42] J. Cai, Y. Taur, S. Huang, D. J. Frank, S. Kosonocky, and
[12] R. Puri, D. Kung, and L. Stok, “Minimizing power with flexible voltage R. H. Dennard, “Supply voltage strategies for minimizing the power
islands,” in Proc. ISCAS, May 2005, pp. 21–24. of CMOS processors,” in Proc. IEEE Symp. VLSI Technol. Dig. Tech.
[13] W. Mak and J. Chen, “Voltage island generation under performance re- Papers, 2002, pp. 102–103.
quirement for SoC designs,” in Proc. ASP-DAC, Jan. 2007, pp. 798–803. [43] D. Sengupta and R. Saleh, “Constrained-based voltage island partition-
[14] M. Popovich, E. G. Friedman, M. Sotman, and A. Kolodny, “On-chip ing,” in Proc. IEEE MWSCAS, Aug. 2007, pp. 1050–1053.
power distribution grids with multiple supply voltages for high perfor- [44] S. Zhao, K. Roy, and C. K. Koh, “Decoupling capacitance allocation and
mance integrated circuits,” in Proc. GLSVLSI, Apr. 2005, pp. 2–7. it’s application to power-supply noise-aware floorplanning,” IEEE Trans.
[15] W. L. Hung, G. M. Link, Y. Xie, N. Vijaykrishnan, N. Dhanwadaf, and Comput.-Aided Design Integr. Circuits Syst., vol. 21, no. 1, pp. 81–92,
J. Conner, “Temperature aware voltage islands architecting in system-on- Jan. 2002.
chip design,” in Proc. ICCD, Oct. 2005, pp. 689–694. [45] R. P. Dick, D. L. Rhodes, and W. Wolf, “TGFF: Task graphs for
[16] J. Hu, Y. Shin, N. Dhanwada, and R. Marculescu, “Architecting volt- free,” in Proc. IEEE Int. Work. Hardware/Software Codesign, Mar. 1998,
age islands in core-based system-on-a chip designs,” in Proc. ISLPED, pp. 97–101.
Aug. 2004, pp. 180–185. [46] C. H. Tsai and S. M. Kang, “Cell-level placement for improving substrate
[17] H. Wu, M. D. F. Wong, I. Liu, and Y. Wang, “Placement-proximity- thermal distribution,” IEEE Trans. Comput.-Aided Design Integr. Circuits
based voltage island grouping under performance requirement,” IEEE Syst., vol. 19, no. 2, pp. 253–266, Feb. 2000.
Trans. Comput.-Aided Design Integr. Circuits Syst., vol. 26, no. 7, [47] “The LAPACK project,” LAPACK-Linear Algebra PACK-age. [Online].
pp. 1256–1269, Jul. 2007. Available: http://www.netlib.org/lapack
[18] H. Wu, I. Liu, M. D. F. Wong, and Y. Wang, “Post-placement volt- [48] D. Sengupta and R. Saleh, “Application-driven floorplan-aware voltage
age island generation under performance requirement,” in Proc. ICCAD, island design,” in Proc. DAC, Jun. 2008, pp. 155–160.
Nov. 2005, pp. 309–316. [49] J. C. Ku, M. Ghoneima, and Y. Ismail, “The importance of including ther-
[19] Y. Cai, B. Liu, Q. Zhaou, and X. Hong, “A thermal aware floorplanning mal effects in estimating the effectiveness of power reduction techniques,”
algorithm supporting voltage islands for low power SoC design,” in Proc. in Proc. CICC, Sep. 2005, pp. 301–304.
PATMOS, 2005, pp. 257–266.
[20] W.-P. Lee, H.-Y. Liu, and Y.-W. Chang, “Voltage island aware floorplan-
ning for power and timing optimization,” in Proc. ICCAD, Nov. 2006,
pp. 389–394.
[21] Y. Cai, B. Liu, J. Shi, Q. Zhou, and X. Hong, “Power delivery aware Dipanjan Sengupta (S’05) received the B.Sc. and
floorplanning,” in Proc. ISQED, 2007, pp. 350–355. B.Tech. degrees from University of Kolkata, Kolkata,
[22] H. Wu, M. D. F. Wong, and I. Liu, “Timing-constrained and voltage- India, and the M.A.Sc. degree from University of
island-aware voltage assignment,” in Proc. DAC, 2006, pp. 429–432. British Columbia, Vancouver, BC, Canada, where
[23] H. Wu and M. D. F. Wong, “Improving voltage assignment by outlier he is currently working toward the Ph.D. degree in
detection and incremental placement,” in Proc. DAC, 2007, pp. 459–464. the System-on-Chip (SoC) Laboratory, Department
[24] L. Guo, C. Yici, Z. Qiang, and H. Xianlong, “Logic and layout aware of Electrical and Computer Engineering.
voltage island generation for low power design,” in Proc. ASP-DAC, His research interest includes low-power SoC de-
Jan. 2007, pp. 666–671. sign, with particular emphasis on voltage-island de-
[25] ITRS. [Online]. Available: http://public.itrs.net sign and power-gating techniques.
[26] C. C. Chang, J. Cong, M. Romesis, and M. Xie, “Optimality and scalabil-
ity study of existing placement algorithms,” IEEE Trans. Comput.-Aided
Design Integr. Circuits Syst., vol. 23, no. 4, pp. 537–548, Apr. 2004.
[27] P. Wielage and K. Goossens, “Network on silicon: Blessing or nightmare,”
in Proc. Euromicro Symp. DSD, 2002, pp. 196–200. Resve A. Saleh (S’78–M’79–SM’03–F’06) received
[28] W.-P. Lee, H.-Y. Liu, and Y.-W. Chang, “A provably good approximation the B.S. degree in electrical engineering from
algorithm for power optimization using multiple supply voltages,” in Carleton University, Ottawa, ON, Canada, and the
Proc. DAC, 2007, pp. 887–890. M.S. and Ph.D. degrees in electrical engineering
[29] Z. Y. Yang, J. Wang, R. P. Dick, and L. Shang, “TAPHS: Thermal-aware from the University of California, Berkeley.
unified physical-level and high-level synthesis,” in Proc. ASP-DAC, 2006, He was a Founder of Simplex Solutions which de-
pp. 879–885. veloped computer-aided design (CAD) software for
[30] M. Keating, D. Flynn, R. Aitken, A. Gibbons, and K. Shi, Low Power deep submicrometer digital design verification. Prior
Methodology Manual. New York: Springer-Verlag, 2007. to starting Simplex, he spent nine years as a Professor
[31] T. McPherson, R. Averill, D. Balazich, K. Barkley, S. Carey, Y. Chan, with the Department of Electrical and Computer
Y. H. Chan, R. Crea, A. Dansky, R. Dwyer, A. Haen, D. Hoffman, Engineering, University of Illinois, Urbana. He also
A. Jatkowski, M. Mayo, D. Merrill, T. McNamara, G. Northrop, taught for one year at Stanford University, Stanford, CA. He was with Mitel
J. Rawlins, L. Sigal, T. Slegel, D. Webber, P. Williams, and F. Yee, Corporation, Ottawa, ON, Canada; Toshiba Corporation, Japan; Tektronix,
“760 MHz G6 s/390 microprocessor exploiting multiple Vt and copper Beaverton, OR; and Nortel, Ottawa. He is currently a Professor and the Natural
interconnects,” in Proc. ISSCC, Feb. 2000, pp. 96–97. Sciences and Engineering Research Council/PMC Sierra Chairholder with the
[32] J. Kao, S. Narendra, and A. Chandrakasan, “Subthreshold leakage model- System-on-Chip (SoC) Laboratory, Department of Electrical and Computer
ing and reduction techniques,” in Proc. ICCAD, Nov. 2002, pp. 141–148. Engineering, The University of British Columbia, Vancouver, BC, Canada, in
[33] A. Wang, A. Chandrakasan, and S. V. Kosnocky, “Optimal supply the field of SoC design and test. He has published over 100 journal articles
and threshold scaling for subthreshold CMOS circuits,” in Proc. IEEE and conference papers. He coauthored a book entitled “Design and Analysis of
Comput. Soc. Annu. Symp. VLSI, 2002, pp. 5–9. Digital Integrated Circuit Design: In deep submicrometer technology.”
[34] E. Maccii, M. Pedram, and F. Somenzi, “High-level power modeling, Dr. Saleh received the Presidential Young Investigator Award in 1990 from
estimation, and optimization,” IEEE Trans. Comput.-Aided Design Integr. the National Science Foundation in the U.S. He is a Professional Engineer
Circuits Syst., vol. 17, no. 11, pp. 1061–1079, Nov. 1998. of British Columbia. He served as General Chair (1995), Conference Chair
[35] T. Koyagi, M. Fukui, and R. Saleh, “Delay macromodeling and estimation (1994), and Technical Program Chair (1993) for the Custom Integrated Circuits
for RTL,” in Proc. ISCAS, May 2008, pp. 2430–2433. Conference. He held the positions of Technical Program Chair, Conference
[36] R. A. Bergamaschi and Y. W. Jiang, “State-based power analysis for Chair, and Vice-General Chair of the International Symposium on Quality in
system-on-chip,” in Proc. DAC, 2003, pp. 638–641. Electronic Design (2001), and has served as Associate Editor of the IEEE
[37] [Online]. Available: http://www.si2.org/?page=766 TRANSACTIONS ON CAD.
Authorized licensed use limited to: The University of British Columbia Library. Downloaded on July 31, 2009 at 12:50 from IEEE Xplore. Restrictions apply.