Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
in Cloud Infrastructure
Jie Liu
Microsoft Research
liuj@microsoft.com
Physical Awareness
Physical Data
Physical Interface
Energy-efficient computing
Context-aware services
Mining from physical and mobile data
Stream data management and processing
Embedded devices and networking
Novel sensors, circuits, and RF comm.
RF Proximity sensor
LittleRock
Cloud-offloaded
GPS
NFC Ring
FM-based indoor location
Genomotes
Data center
asset tracking
Energy Expenditure
Cost Analysis
Data centers can cost between
$10M and $20M per Megawatt
Total Equipment
cost per server
12%
46%
23%
19%
Datacenter
operating cost per
server
gasoline
power
Generators
Power grid
PUE=
PDU
Racks
UPS
Transformer
air
CRAC
water
Water chillers
160
140
120
100
80
60
40
20
0
Sleep
Idle
20%
40$
60%
80%
100%
CPU Utilization
Login Rate
5
Connections
1200
1000
3
800
2
600
400
200
20
40
60
80
100
Time in hours
120
computing
140
1400
Login rate (per second)
temporal
spatial
160 0
physical
Christos Kozyrakis, Aman Kansal, Sriram Sankar, and Kushagra Vaid, Server Engineering Insights for Large-Scale Online Services, in IEEE Micro, IEEE, July 2010
Chieh-Jan Mike Liang, Jie Liu, Liqian Luo, Andreas Terzis, and Feng Zhao, RACNet: A High-Fidelity Data Center Sensing Network, in Proceedings of The 7th ACM
Conference on Embedded Networked Sensor Systems (SenSys 2009),, November 2009
Traditional Operations
Over cooled
Under utilized
-33% cooling
-4% lighting
-15% UPS loss
-10% air handling (fans)
-35% power supply
-85% underutilization
-40% inefficient applications
3% energy for
useful work
30kW
to the data
center
16kW
to the server
9.5kW
to the
application
0.9kW
to the
customer
REDUCE
REUSE
RENEW
Measure
Control
Adaptive cooling
economizers
load distribution
VM migrations
Improve
Data Center
Efficiency
trend
correlation
dependencies
abnormality
bottlenecks
Model
Plan
capacity
provisioning
allocation
consolidation
Design
facilities
server hardware
networking
applications
Sense Everything
Asset location
Heat distribution
Electrical wiring
Network wiring
Air flow
Power consumption
Service types
Cooling Systems
Collect, archive,
and understand
operations data
Server utilization
Processor
Network
Power
consumption
Storage
Weather
Electricity availability & price
Networking
Server performance
The easy
The hard
The ugly
Visualization
Static provisioning
The Easy
Eye-Balling
Capacity planning
Statistics
Monte Carlo Simulation
Change management
before
middle
bottom
after
Generation 1
~2 PUE
Colocation
Server
Capacity
20 year Technology
2007
2008
2011+
Generation 2
Generation 3
Generation 4
Containment
Modular
Density
Rack
Density and Deployment
Minimized Resource Impact
Containers, PODs
Scalability & Sustainability
Air & Water Economization
Differentiated SLAs
The Hard
Single Domain Modeling
Clear Hierarchy
Dynamic Control
Load Placement
Power Capping
Data centers oversubscribe their
power capacity.
Statistically speaking, the
aggregated power will not exceed
circuit capacities.
At rear events when the power
exceeds capacity, server activities
must be capped.
Time
Lead Acid Battery Charging Curve
Actuation:
DVFS
Shut down unimportant
servers/tasks
Power (Watt)
Power Tracking
3
2.5
2
1.5
1
0.5
0
0
5
10
Time (hours)
15
Dynamic Provisioning
The number of active servers follow the workload
Login Requests
Clients
Dispatch Server
Pick a CS
Connection
Load reporting
Connection
Server
Backend Servers:
Authentication, address book, etc.
Load Forecasting
Seasonal Data Regression
X
Mon. Two
weeks ago
Last Mon.
Number of Connections
X X
x 10
Observed value
Forecasted value
4.5
4
3.5
3
2.5
2
20
40
60
80
100
Time (Hours)
120
140
160
X
Today
X
X
11:30 12:00
Long term dependency +
Local adjustments
1200
Observed value
Forecasted value
1000
800
600
400
200
20
40
60
80
100
Time (Hours)
120
140
160
Load Balancing
Load Skewing
pi
N
1
1
( i )
K
K N tot
Ltot (t )
Ltot (t )
User requests
Load
Dispatcher
Load
Dispatcher
Li (t )
Li (t )
N i (t )
N i (t )
Di (t )
User requests
Di (t )
N tgt
Algorithm Performances
Algorithm
Energy (KWH)
Savings
Denials
478
---
331
30.8%
3,711,680
343
28.2%
799,120
367
23.3%
597,520
375
21.5%
48,160
60
60
FB
FS
FSS
FB
FBS
RLS
55
Number of Active Servers
55
50
45
40
35
50
45
40
35
30
30
25
25
0
16
24
Time (Hours)
32
40
48
16
24
Time (Hours)
3032
40
Gong Chen, Wenbo He, Jie Liu, Suman Nath, Leonidas Rigas, Lin Xiao, and Feng Zhao, Energy-aware server provisioning and load dispatching for connectionintensive internet services, in NSDI'08, Berkeley, CA, USA, 2008
48
VM VM
Server-12
VM VM
Server-1j
Rack
VM VM
Server-11
Service Virtualization
Soft Actuations
The Ugly
Cross-Domain Modeling
Interference
VM Interference
Co-located
applications
App 1
App 2
Core-private
cache
Processor
Shared Cache
Memory
Bandwidth
Memory
DRAM
Shared
resource
contention
Static partitioning
Up to 40% degradation
was observed among
Google applications*
120
Normalized Performance
Degradation (%)
Up to 125%
lbm
100
mcf
80
bzip2
60
povray
40
20
0
Vs lbm Vs mcf Vs bzip2
Vs
povray
Co-located Application
on Intel Core 2 Duo
*The impact of memory subsystem resource sharing on datacenter applications, Tang et al., ISCA 2011
lbm
Tunable Cache
Intensity
Cache Sets
Cache Ways
VM
Core 1
Core 2
Shared Cache
50
40
30
20
10
0
lbm
gcc
mcf
soplex
omnetpp
bzip2
gobmk
povray
perlbench
libquant
hmmer
sjeng
Performance: Bytes
accessed per second
Sriram Govindan, Jie Liu, Aman Kansal, and Anand Sivasubramaniam, Cuanta: Quantifying Effects of Shared On-chip Resource
Interference for Consolidated Virtual Machines, in ACM Symposium on Cloud Computing (SOCC), October 2011
Alan Roytman, Aman Kansal, Sriram Govindan, Jie Liu, and Suman Nath, PACMan: Performance Aware Virtual Machine Consolidation, in
10th International Conference on Autonomic Computing (ICAC), June 2013
Interference-Aware VM Consolidation
Given n jobs and m machines each with k cores
Job degradation is specified over all job sets
The max degradation should be less than D
Every set of jobs has (energy) cost w(S)
Heuristic:
()
=1
Polynomial when k = 2
NP-hard when k > 2
Polynomial time approximation
ln()
Interference-Aware VM Migration
Given an existing assignment and G allowable
migrations, minimize the total cost of the new assignment
after migration.
Polynomial when k = 2
NP-hard when k > 2
NP-hard to approximate
Greedy Heuristics:
Select the worst degraded
VM on a server.
Migrate to the server that
causes least interference
Repeat until G is exhausted.
VM Power Capping
Software Energy Estimation
Estimate VM power consumption from performance counters
Linear regression with whole machine (HW) power meters
Power consumption
Energy Model Error
Component Dynamic
Energy
Performance
counters
20
60
CPU
Memory
Disk
18
16
20
Measured
Estimate
Error
50
14
18
16
14
40
12
10
Watts
Watts
Watts
12
30
8
20
6
4
10
2
0
10
50
100
150
Time(s)
200
250
300
50
100
150
Time(s)
200
250
300
25
2500
Aman Kansal, Feng Zhao, Jie Liu, Nupur Kothari, and Arka
2000
Computing (SOCC), June 2010
CPU
Memory
Bhattacharya,
Disk
2500
Total
App 1
App 2
2000
VM Performance Accounting
CPU
DISK
19
Benchmark Number
17
Benchmark numbers 2 to 19
are SPEC CPU 2006 INT and FP
benchmarks (ones that compile
without Fortran).
15
13
11
9
7
5
3
1
0
2
3
Error (%)
servers
time
Time
Settings reflected by OS
Command received
by agent
~20ms
< 1ms
< 1ms
Command reaches
destination server
200-350ms
<40-60ms in
current
implementation (using
user-level code)
OS changes
setting in hardware
time
(not to scale)
Power decreases
REMOTE
POWER
PANEL
RACK
PDU
SERVER
UPS
Magntic Breaker
Thermal Breaker
Data Center
Controller
Papp-1(t)
Papp-n(t)
VM
VM
Ptier-n(t)
Tier Level
Controller n
VM
VM
Ptier-1(t)
Tier Level
Controller 1
VM
VM
Ptier-n(t)
Tier Level
Controller n
VM
VM
Harold Lim, Aman Kansal, and Jie Liu, Power Budgeting for Virtualized Data Centers, in 2011 USENIX Annual Technical Conference (USENIX ATC '11), June 2011
Experimental Results
40 VMs on 10 servers
3 Priorities (stock trader (high), web service (mid), SPEC CPU (low))
Take battery charging as complimentary power consumer
Workload
(%)
100
MSN Messenger
demand traces
50
0
Time (s)
Uncapped
Total Power
Power (Watt)
1150
MPC
Controller
1100
1050
1000
0
1000
2000
3000
Time (s)
4000
5000
6000
Physical
Hierarchy
Controller
Total Power
Budget
The Complete
What
We HavePicture
Discussed So Far
-67% power plant
-10% transmission and distribution
-33% cooling
Fossil fuel
-4% lighting
-15% UPS loss
-10% air handling (fans)
-35% power supply
-85% underutilization
-40% inefficient applications
>99% of initial energy
lost in conversion
100kW
to the power
plant
30kW
to the data
center
16kW
to the server
9.5kW
to the
application
0.9kW
to the
customer
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
Fuel Cells
Comparing prices - Electricity vs. Natural Gas
(Energy Equivalent)
0.14
0.12
USD
0.10
0.08
Electricity
0.06
Natural gas
0.04
0.02
0.00
Natural gas grid is 100 times more reliable than electrical grid.
Gas grid failure is more graceful.
Natural gas is easier to store than electrical or kinetic energy.
It is Green!
150 kV 15kV
Medium
voltage switch
board
15kV 480 V
Low voltage
switch board
Genera tor
TVSS
Connection
to street
header
ATS
Manifold
distribution to
each row
No electrical distribution
No backup generators
No central UPS
Branch
circuit
distribution
SE RVE R
SE RVE R
SE RVE R
SE RVE R
SE RVE R
SE RVE R
SE RVE R
Fue l Ce ll
SE RVE R
SE RVE R
SE RVE R
SE RVE R
SE RVE R
SE RVE R
Pressure
regulation
600
500
Power on
400
300
200
100
Power
cord on
Time (seconds)
0.0
0.9
1.9
2.8
3.8
4.7
5.6
6.6
7.5
8.5
9.4
10.4
11.3
12.2
13.2
14.1
15.1
16.0
16.9
17.9
18.8
400
300
200
100
0
Server crash;
Blue screen;
Core dump;
and Restart
Time (seconds)
0
5
10
15
20
25
30
35
40
45
50
55
60
66
71
76
81
86
91
96
Power (Watt)
500
On Going Work
Workload Power
Consumption
Server Power
Consumption
I(k)
PS(k)
Fuel Cell
V(k)
-
+
U(k)
-
Server
Internal
Parasitic
PW(k)
Server
Provisioning
Energy storage selection
and sizing
Power capping/tracing
Dynamic coordination
Conclusions
Data centers are large, complex cyber-physical systems
and a very rich research space.
Acknowledgements
Sriram Govindan
Sean James
Mike Liang
Aman Kansal
Suman Nath
Eric Peterson
Bodhi Priyantha
Sriram Shankar
Lin Xiao
Feng Zhao
Arka Bhattacharya
Gong Chen
Christos Faloutsos
Wenbo He
Oliver Kennedy
Lei Li
Harold Lim
Xue Liu
Chenyang Lu
Ana Carolina Reikstin
Alan Roytman
Abu Sayeed Saifullah
Andreas Terzis
Qiang Wang
Thank you!