Sei sulla pagina 1di 42

Nanotechnology: Spatial

Computing Using Molecular


Electronics

Mihai Budiu
joint work with
Seth Copen Goldstein
Dan Rosewater
Intersection of Three Areas
Nanotechnology Reconfigurable
computing

Computer
architecture
SSS April 20, 2001 2
Prophecies, A Risky Endeavor
There is no reason anyone would want a computer in their home.
--- Ken Olson

I think there is a world


market for maybe five There is not the slightest
computers. indication that nuclear
--- T. J. Watson energy will ever be
obtainable.
--- Albert Einstein

640K ought to be enough for everybody.


--- Bill Gates

I will propose this semester.


--- Anonymous
SSS April 20, 2001 3
Moore’s Law

SSS April 20, 2001 4


Moore’s Second Law

X 1000$
generation

Plant cost Mask cost

SSS April 20, 2001 5


Our Proposal
Nanotechnology Reconfigurable
+ cheap Computing
+ high-density _ + defect tolerant
+ low-power + high performance
– unreliable – low density
_ ++++
_ + +
_

Computer architecture
+ vast body of knowledge
– expensive
– high-power
SSS April 20, 2001 6
Paradigm Shift

Executable Configuration

Complex fixed chip Dense, regular structure


+ +
Program Configuration
SSS April 20, 2001 7
Outline
 Introduction
• Reconfigurable computing
• Nanotechnology
• Nano-architecture proposal
• Preliminary results
• Conclusions and Future Work

SSS April 20, 2001 8


Reconfigurable Computing
• Back to ENIAC-style computing

• Synthesize one machine to solve one


problem
SSS April 20, 2001 9
Island-Style RC Architecture
Interconnection
network

Universal gates
and/or
storage elements

Programmable Switches

SSS April 20, 2001 10


Main RC Ingredient: RAM Cell
0
a0 0 data a0
0 a1 a1 & a2
a1
1

Universal gate = RAM

data in
0
control

Switch controlled by a 1-bit RAM cell

SSS April 20, 2001 11


Place and Route

int reverse(int x)
{
int k,r=0;
for (k=0; k<64; k++)
r |= x&1;
x = x >> 1;
r = r << 1;
}
}
int func(int* a,int *b)
{
int j,sum=0;
for (j=0; *a>0; j++)
sum+=reverse(*b

SSS April 20, 2001 12


Kernel Speedup Using PipeRench
Times Over 300Mhz UltraSparc-II

1000

189.7

100 63.3 57.1


42.4
26.0 29.0
15.5
11.3 12.0
10

t
s
2D
c

n
EA

Po er
T
R

n
di

ou
C
AT

FI

ee

v
T-
or

ID
D

O
pC
SSS April 20, 2001 13
qu
C
C

N
Defect Tolerance

Despite having >70% of the


chips defective, Teramac works
flawlessly.

Compilation has two phases:


• defect detection through
self-testing
• placement for
defect-avoidance

SSS April 20, 2001 14


Outline
 Introduction
 Reconfigurable computing
• Nanotechnology
• Nano-architecture proposal
• Preliminary results
• Conclusions and Future work

SSS April 20, 2001 15


Nanotechnology

SSS April 20, 2001 16


Predicted Features
• Low Power: 1010 gates use less than 2 W
(compare to 3x107 transistors using 100 W in
CMOS)
• Low cost
(nanocents/gate)
• Small size
(105 factor area gain)

Nano-RAM cell
.
In yellow: a CMOS RAM cell
SSS April 20, 2001 17
Nano-wires
• carbon nanotubues, Si, metal
• >2nm diameter, up to mm length
• excellent electrical properties

A carbon nanotube: one molecule


SSS April 20, 2001 18
Nano-switch

SSS April 20, 2001 19


Nano-switch Between
Nano-wires

SSS April 20, 2001 20


Self-assembly

SSS April 20, 2001 21


No Complex Irregular
Structures

SSS April 20, 2001 22


No Three-Terminal Devices

SSS April 20, 2001 23


Diode-resistor Logic

V AND

A B VDD

B Input 1 Output
A *^ B Input 2

A*B

Nano-implementation Electrical equivalent


SSS April 20, 2001 24
Nanoscale Latches

Provide:
• signal restoration (amplification)
• clocking (synchronization)
• memory
data out

D
clock

SSS April 20, 2001 25


High Defect Rate

SSS April 20, 2001 26


Outline
 Introduction
 Reconfigurable computing
 Nanotechnology
• Nano-architecture proposal
• Preliminary results
• Conclusions and future work

SSS April 20, 2001 27


The nanoBlock (3-in to 3-out Logic)
CMOS
Inputs
+Vdd

clk
Gnd

Gnd
clk
Outputs

SSS April 20, 2001 28


Interconnecting nanoBlocks
Switch block

SSS April 20, 2001 29


Global View

SSS April 20, 2001 30


Many Clusters = nanoFabric
cluster

long-lines
Control

SSS April 20, 2001 31


Compilation
int reverse(int x)
{
int k,r=0;
for (k=0; k<64; k++)

1. Program r |= x&1;
x = x >> 1;
r = r << 1;
}
}

Computations
3. Split-phase Abstract & local storage
Machines
Unknown latency ops.

5. Configurations placed
independently

7. Placement on chip

SSS April 20, 2001 32


Outline
 Introduction
 Reconfigurable Hardware
 Nanotechnology
 Nano-architecture proposal
• Preliminary results
• Conclusions and Future work

SSS April 20, 2001 33


A Limit Study of Performance
A graph of the whole program execution:

Basic block Control-flow transfer

Memory write

Memory read Memory word

SSS April 20, 2001 34


units

50000

0
100000
150000
200000
09 250000
12 9.
9.c g o
om

SSS April 20, 2001


pr
es
s
13
0.
13 li
2.
ijp
eg
ad
pc
m
_d
ad
pc
m
_e
ep
ic
g7 _e
21
_Q
g7 _d
21
Area

_Q
_e
gs
m
_d
gs
m
(106 units/cm2 available)

_e
jp
eg
_d
jp
eg
_e
m
pe
code area

g2
_ d
memory area

35
Typical Program Graph (g721_e)
Memory reads
Control flow transfer

100% code cluster

100% memory cluster


SSS April 20, 2001 36
Typical Program Graph (g721_e)
Memory reads
Control flow transfer

code

memcpy
memory
SSS April 20, 2001 37
Program Graph After Inlining memcpy

memcpy

SSS April 20, 2001 38


times slower than native

0
2
3
4
5
6
7
8
9
10
09

-1
1
11
12 9.
9. go
co
m
pr
es
s
13
0.

SSS April 20, 2001


13 li
2.
i jp
eg
ad
pc
m
_d
ad
pc
m
_e
ep
i c_
g7 e
21
_Q
g7 _d
21
_Q
_e
1 clock/square

gs
m
_d
gs
m
_e
jpe
g_
d
jpe
g_
m e
pe
Application Slowdown

g2
_d
5 clocks/square

39
How Time Is Spent
No caches: reads expensive
100%
90%
80%
70%
60% idle
percent

50% execution
40% control flow
30% register traffic
20%
10%
0%
No speculation
21 d
_e

_d
ad _d

_e

_d

_e
ad e g
co .g o

g7 c _e
13 0.li

_d

_e
s

_
es

g2
_Q

_Q
m

m
ijp

eg

eg
m

m
13
9

i
pr

pc

pc

ep
09

gs

gs
2.

pe
jp

jp
21
m

m
g7
9.
12

SSS April 20, 2001 40


Future Work
• Better nano-devices

• More accurate hardware models in


simulations

• Compilation technology
SSS April 20, 2001 41
Conclusions
• Electronic nanotechnology promises to
transcend the limitations of CMOS
• Nanofabrics are very well suited to
reconfigurable computation
• 109-gate designs can be managed through
hierarchies of abstract machines

SSS April 20, 2001 42

Potrebbero piacerti anche