Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Xiuduan Fang
Department of Computer Science
University of Virginia
September 19, 2008
Outline
• Overview
Hypothesis
Contributions & Publications
• Motivation
• Theoretical component:
Design and evaluate algorithms to support file
transfers on circuit-switched networks
• Experimental component:
Implement and demonstrate architecture for
internetworking circuit-switched networks with the
Internet
• Conclusions & Future work
2
Hypothesis
Circuit-switched networks, with dynamic call-
by-call bandwidth sharing and support for
heterogeneous-rate circuits, can be used
efficiently to support file transfers, and can be
evolved gradually into the existing Internet.
Dissertation organization
end-to-end circuits?
Yes No
Theoretical component Experimental component
Call-admission control (CAC): Internetworking architecture
rate allocation
minimum file size
3
Key Contributions
• File transfers on a hybrid architecture
Constructed analytical models
Provided insights on how to design admission control
Proposed a novel heterogeneous rate-allocation
scheme to lower file-transfer delay
• Internetworking architecture
Designed and implemented a gateway to interconnect
circuit networks with the Internet
Characterized the gateway performance
4
Publications
• Ph.D. dissertation:
X. Fang and M. Veeraraghavan, On using circuit-switched
networks for file transfers,” accepted to IEEE Globecom, New
Orleans, LA, Nov. 2008.
X. Fang, M. Veeraraghavan, M. E. McGinley, and R. W. Gisiger,
“An overlay approach for enabling access to dynamically
shared backbone GMPLS networks,” in Proc. of IEEE
ICCCN2007, Honolulu, Hawaii, Aug. 2007.
X. Fang and M. Veeraraghavan, “On using a hybrid architecture
for file transfers,” Submitted to IEEE Transactions on Parallel
and Distributed Systems, 2008.
• MS thesis:
M. Veeraraghavan, X. Fang, and X. Zheng, “On the suitability of
applications for GMPLS networks,” in Proc. of IEEE Globecom,
San Francisco, CA, Nov. 2006.
X. Fang, X. Zheng, and M. Veeraraghavan, “Improving web
performance through new networking technologies,” IEEE
ICIW'06, Guadeloupe, French Caribbean, February 23-25, 2006.
5
Outline
• Overview
Hypothesis
Contributions & Publications
Motivation
• Theoretical component:
Design and evaluate algorithms to support file
transfers on circuit-switched networks
• Experimental component:
Implement and demonstrate architecture for
internetworking circuit-switched networks with the
Internet
• Conclusions & Future work
6
Motivation
• Why File Transfers on Circuit Networks?
Packet switching is considered better than circuit
switching for file transfers
Pros: high throughput under light loads
Cons:
– Unpredictable delays
– Proportional fairness but no temporal fairness
7
Dissertation Organization
end-to-end circuits?
Yes No
Theoretical component: Experimental component:
File transfers on Interconnect circuit networks
a hybrid architecture with the Internet
Call blocking for circuit network? Designed a gateway
Yes No Implemented software
Analytical model
Homogeneous Heterogeneous
Submitted to TPDS Analytical model Analytical model
Blocked calls rerouted Simulation model Simulation model
to the Internet path
Fairness issue
Accepted by Globecom2008
For large files, waiting for high-speed circuit s is a better
option than being immediately rerouted to Internet path
Hybrid Architecture - Example
Yes No
Theoretical component: Experimental component:
File transfers on Interconnect circuit networks
a hybrid architecture with the Internet
Call blocking for circuit network? Designed a gateway
Yes No Implemented software
Analytical model
Homogeneous Heterogeneous
Submitted to TPDS Analytical model Analytical model
Blocked calls rerouted Simulation model Simulation model
to the Internet path
Fairness issue
Accepted by Globecom2008
For large files, waiting for high-speed circuits is a better
option than being immediately rerouted to Internet path
Call-blocking Circuit Network
• Goal: design efficient connection-admission control
(CAC) algorithms
Metrics: file-transfer delay and utilization
• Block call if circuit is unavailable; reroute to Internet
• Our focus:
What is an appropriate minimum file size?
Serve files sized x > minimum file size, Â, via the
circuit network
What is an appropriate circuit rate, r, for a file transfer?
11
Analytical Model
Internet
N
¸0
x>Â
Y ¸0 1
routing decision Link L
…
capacity C
n
Circuit network
Assumptions:
• Single class
homogeneous rate allocation
m circuits; per-circuit rate, r=C/m
• Call arrival process: Poisson with rate, ¸0 [Paxson95]
• Call holding times: Pareto distribution [Crovella97]
[Paxson95] V. Paxson and S. Floyd, "Wide area traffic: the failure of Poisson modeling," Networking, IEEE/ACM Transactions on , vol.3, no.3, pp.226-244, Jun 1995
[Crovella97] M. E. Crovella and A. Bestavros, Self-Similarity in World Wide Web Traffic: Evidence and Possible Causes, IEEE/ACM Transactions on Networking, 5(6):835--846.
Key Insights
Combine M/G/m/m loss model & TCP delay model
Erlang-B formula: input the number of channels, m, & traffic load;
output: call blocking probability and utilization
TCP model: bottleneck link rate, round-trip time, packet loss rate
[Padhye98]
Two criteria to select Â
Delay-based (user-perspective): compare delay estimates
across two paths
Utilization (service provider-perspective): make circuit-setup
overhead a small fraction (e.g., 10%) of circuit file-transfer delay
Define a metric to quantify mean delay reduction
1
R = s-1 (E[Ttcp(x)]-E[Tcircuit(x)])¢fX(x)dx
Compute mopt (ropt = C/ mopt) & Âopt that maximize R
[Padhye98] J. Padhye, V. Firoiu, D. Towsley, and J. Kurose, “Modeling TCP throughput: A simple modeland its empirical validation,” in Proceedings of the ACM SIGCOMM, Aug. 1998, pp. 303–314.
Key Results
• To maximize R, ropt should be much higher than effective
throughput on the Internet path
e.g., Internet path: bottleneck link rate = 100 Mb/s,
RTT = 50 ms, packet loss rate = 1%
) effective throughput = 1.9 Mb/s
Circuit path: link capacity = 10 Gb/s, call-setup delay = 1 sec
) ropt = 63 Mb/s & Âopt = 75 MB
If r = 2 Mb/s ) Â = 4.5 MB
) Files of size (4.5 MB, 75MB) will get lower delay on circuits
But, mean delay will increase; hence directed to Internet
• Load sensitive: under low loads,
Larger per-call circuit rate, ropt
Larger ropt ) Larger minimum file size, Âopt
Relax utilization criterion to decrease Âopt
• RTT sensitive: Larger ropt & Âopt for short-RTT path
14
Dissertation Organization
end-to-end circuits?
Yes No
Theoretical component: Experimental component:
File transfers on Interconnect circuit networks
a hybrid architecture with the Internet
Call blocking for circuit networks? Designed a gateway
Yes No Implemented software
Accepted by Globecom2008
For large files, waiting for high-speed circuits is a better
option than being immediately rerouted to Internet path
Homogeneous Rate Allocation
• Key question: how much bandwidth should be allocated for
each file transfer so that the system performance is optimized
in terms of mean response time at a given effective utilization?
• Metrics: mean response time
16
M/G/m queueing model
• Goal: compute per-call circuit rate, ropt (i.e., C /mopt)
• Input:
A set of m = {1, 10, 100, 1000}
Link capacity C = 10 Gb/s
) r = {10Gb/s, 1Gb/s, 100Mb/s, 10Mb/s}
Call setup delay = 1 sec
Bounded-Pareto parameters
) the first two moments of service time
Traffic load 2 (0, 1)
• Output:
Effective utilization: call-setup delay overhead
Mean waiting time
17
Numerical Results
A complete-partitioning system
19
Analytical model
• Multiple separate M/G/m subsystems
Basis for classifying calls: cutoff points, Â1,…, Ân-1?
Bandwidth allocation per subsystem, C1, …, Cn?
Ideal per-call circuit rate for each class, r1, …, rn?
• To compute optimal operating point that minimizes mean
response time:
Mathematica optimization package
e.g., for a 2-class system
Start with an initial value for Â1
Determine C1, C2 & r1, r2
Vary Â1 to study its impact
• Fairness:
Fairness ratio: ratio of mean slowdown of 2 classes
20
Slowdown: ratio of waiting time to service requirement
File-size distribution parameters:
Smallest file size: l = 1 MB
Largest file size: u = 1 TB
Cutoff point: Â = 1000 MB
21
Fairness Ratio (small-file to large-file)
Homogeneous system
(at all utilization levels)
Heterogeneous system
23
Key Results
• Complete-partitioning heterogeneous rate
allocation
Large files allocated high-rate circuits
Lowers mean response time
Treats small files more fairly when compared with
complete-sharing
Requires a network management system to monitor
traffic load & dynamically update partitions
• Conditional priority scheme improves the fair
treatment between long-path and short-path
calls
24
Dissertation Organization
end-to-end circuits?
Yes No
Analytical model
Homogeneous Heterogeneous
Submitted to TPDS Analytical model Analytical model
Simulation model Simulation model
Blocked calls rerouted
to the Internet path Fairness issue
Accepted by Globecom2008
For large files, waiting for high-speed circuit s is a better
option than being immediately rerouted to Internet path
Experimental Component
• Motivation:
It is expensive to deploy a new networking technology on an
end-to-end basis
As link speeds increase, high-capacity circuit switches are
cheaper than packet switches
Circuit-switched (CS) networks operated in shared mode )
admission control (AC) phase
Connectionless (CL) networks have no admission control
phase
So internetworking CL + shared CS is a challenge
[Cerf74] V. G. Cerf and R. E. Kahn, “A protocol for packet network intercommunication,” IEEE Transactions on Communications, vol. 22, no. 5, pp. 637–648, May 1974.
[Clark88] D. D. Clark, “The design philosophy of the DARPA Internet protocols,” in SIGCOMM. Stanford, CA: ACM, Aug. 1988, pp. 106–114. 27
Internetworking Architecture
R
Web client IP-router core Web server
subnetwork
R R
S R R R S
S R
CAG
R R R S
CAG CAG
enterprise R R enterprise
network network
regional Circuit regional
network subnetwork
network
Connectionless
Connectionless
28
Internetworking Architecture
R
Web client IP-router core Web server
subnetwork
R R
S R R R S
S R
CAG
R R R S
CAG CAG
enterprise R R enterprise
network network
regional Circuit/ regional
network subnetwork
network
29
Gateway Design
• Start with an open-source Web proxy software
package called Squid
• Data-plane:
Base functionality provided by Squid
Integrated Circut-TCP (removes Slow Start, receive-side
autotuning)
30
Gateway Design contd.
• Unpredictable rate across connectionless (CL) segments
• But fixed-rate across circuit-switched (CS) segments
• What if these are mismatched?
• Need buffering within gateways
• Buffers are finite: so possibility of losses?
• Squid implementation: back-pressure mechanism;
Data not read from incoming TCP buffer if Squid buffer
(controlled by read_ahead_gap) is full
Latter is full if outgoing TCP buffer is full
• Leads to circuit utilization problems
• Answer: main memory or disk buffering in gateways +
multiplexing on circuits
31
Experimental Hypothesis
A modified version of Squid software can be used
as a gateway to interconnect circuit-switched
networks and connectionless packet-switched
networks for reliable file transfers, and can support
an effective throughput of 460 Mb/s when executed
on a Linux 2.6.20 host with a 2.8GHz Xeon
processor and 1 GB memory.
32
Experimental setup to test
if there is buffer overflow
• NIC speeds: CHEETAH NIC (NIC2) = GbE, Internet NIC (NIC1) ¸ 100 Mb/s
• Circuit (zelda1 $ zelda4) rate=155Mb/s, link (zelda4 $ zelda5) rate=1Gb/s
• Control link rate on zelda1 ! zelda2 path to mismatch sending and
receiving rates
• The parameter read_ahead_gap controls CAG’s application buffer for each
flow, read_ahead_gap = 16 KB (default value)
33
CAG zelda1’s forwarding rate CAG zelda1’s CPU and memory usage
• Key results:
No packet loss in buffers within
CAGs due to a back-pressure
mechanism
35
Other Experiments & Analysis
• Measured maximum forwarding rate
Stress test by using long flows: 460 ± 4.75 Mb/s
36
Conclusions
• File transfers on circuit-switched (CS) networks
Advantage relative to packet switching: predictable service time
• Packet switching (PS) better for small file transfers
Call setup delay >> Transfer time (link rates ↑, transfer time ↓)
Predictability not a concern when absolute delays are low
• Hence hybrid architecture: PS for small; CS for large
Internet path Considered in Not
Circuit metrics routing decision considered
network operation
Call blocking √ X
Call queueing X √
Call admission control algorithms designed to be fair across
small-file, large-file & across short-path, long-path
37
Conclusions contd.
• Designed a gateway called CAG to interconnect
connectionless networks with circuit networks
CAG implements all sub-layers of the network layer with
data-plane and control-plane (admission control)
CAG supports reliable file transfers
File transfers need high-speed links on whole path
Gradually evolving circuit-switched networks for access
(current bottleneck) will lead to improved performance
38
Future Work
• More sophisticated bandwidth-sharing schemes
Currently studied a complete partitioning scheme
To avoid sensitivity to network management system
performance as is the case with partitioning
• Hardware-based implementation of CAG with the
support of disk buffering for in-transit data
Current software implementation could slow down
effective transmission rates
39
Questions from Form G111
Thank you!
Questions?
41
Questions from Form G111 -
42
Questions from Form G111 -
Success criteria
• Has the student adequately defined the measure(s) of success to be
used to evaluate the work? Is there a well defined metric with a goal?
Does the metric adequately represent the desired success criteria?
• Success criteria
Theoretical work: use a hybrid architecture for file transfers
Call blocking circuit network: optimal design parameters to maximize mean
delay reduction
Call queueing circuit network: optimal design parameters to minimize mean
response time at a given effective utilization
Experimental work: designed an internetworking gateway called CAG
CAG supports reliable file transfers
Improved circuit utilization
Measured maximum forwarding rate of CAG
• Metrics
Theoretical work: file-transfer delay, utilization, mean delay reduction,
fairness ratio
Experimental work: reliable file transfer, circuit utilization, forwarding rate,
user-perceived throughput
44
Questions from Form G111 -
Solution
• Is the approach taken well executed? Does it appear to be correct?
Is the work technically challenging? Does the student utilize
appropriate professional standards?
45
Questions from Form G111 -
• To what extent is the work innovative? Has the student taken a risk
in applying the chosen approach?
46
Questions from Form G111 -
Broader implications
47
Background –
High-Speed Circuit-Switching
• Data-plane technologies
Switching: Time Division Multiplexing (TDM) &
Wavelength Division Multiplexing (WDM)
Mapping: to carry Ethernet frames via SONET signals or
WDM lightpaths
• Control plane: Generalized MultiProtocol Label
Switched (GMPLS)
Three components: signaling, routing, & management
Bandwidth sharing mode: immediate-request (IR)
• Equipment examples:
SONET switches: Sycamore SN16000
WDM switches: Adva/Movaz RayExpress OADM
48
Layers in OSI reference model
AL: Application Layer DLL or L: Data-Link Layer
TL: Transport Layer PHY or P: Physical Layer
• Service time:
• Effective utilization:
Capacity:
• Whole system:
Effective utilization:
53
Model Validation & Verification
[Jain91] R. Jain, The Art of Computer Systems Performance Analysis: Techniques for Experimental Design, Measurement, Simulation, and Modeling, New York, Wiley-Interscience, 1991.
54
Related Work –
File Transfers on Other Testbeds
• Other testbeds: Large file transfers high per-circuit
rate, long holding time Coarse-Grained Sharing
(CGS)
• Our interest: Fine-Grained Sharing (FGS) for all files
coarse fine
Different bandwidth sharing modes
55
Circuit-switched networks:
Signaling for call setup
Connection setup (Dest: III-B;
BW: OC1; Timeslot: a, 1) II
b a
Host a
I III Host
I-A c b
b
d c III-B
c
a IV V
Dest. Next hop d
Routing
table III-* IV
1
Y N
x>Â
2
¸ Simple
 & rx
N-1 sum
CAC
N allocate a circuit reject
with rate rx
Â: crossover file size
rx: allocated bandwidth
Switch model
Y Simple sum: N
rx < available bandwidth
accept reject
58
File Transfers on
Other Circuit/VC Networks
• Has ATM implemented file transfers with a guaranteed
service? No.
Service classes on the ATM layer
Hard QoS for multimedia applications
59
Analytical Model (cont.)
M/G/m/m loss model
Erlang-B: compute call blocking probability, Pb, and utilization, Ub,
given the number of channels, m, and traffic load, ½
0 1
RD Link L, capacity C
…
N
The computation of ½:
¸: aggregate call arrival rate offered to the switch
1/¹: mean call holding time
60
The Derivation of
Offered Traffic Load, ½
File size: Pareto distribution Serve files with Original call
size x >  arrival rate, N¢¸0
®: shape, k: scale
where
Choose
62
Numerical Results - Input Parameters
File size distribution: shape, ®=1.0009, scale,
k=1000bytes
Circuit:
Link capacity C=10Gbps
Original call arrival rate, N¢¸0=1100calls/second
Mean call setup delay, E[Tsetup]=1second
Round trip time, RTT=50ms
Utilization factor, ¯=10
Internet path
Bottleneck link rate, r=100Mbps
Packet loss rate, Ploss=1%
Round trip time, RTT=50ms
63
Numerical Results:
Utilization-based crossover file size Impact of Per-circuit Rate on Â
m=100
For m=10, 100, and 1000 (i.e, per-circuit rate is 1Gb/s, 100Mb/s, and 10Mb/s)
Âu is the limiting factor Simplifies the computation of Â
65
Numerical Results: Impact of
Per-circuit Rate on File-transfer Delay
File transfer delay over circuits or the Internet
66
Numerical Results: Impact of
Per-circuit Rate on Mean Delay Reduction
67
Design and Implementation
• Gateway:
Receive a Web request
Started with an open-
source Web proxy
software package called Y N
Cache miss
Squid
Integrated RSVP-TE
signaling client module Y Circuit to N
parent Serve the
into Squid to initiate circuit
request
set up/release
Integrated Circut-TCP
(removes Slow Start,
Serve the Fork a process to
receive-side autotuning)
request via attempt a circuit setup;
Added monitoring module the circuit
to watch circuit usage. Meanwhile, serve the
Initiate circuit release if request via the IP-
idle for time >T router core subnetwork
68
Experiments to Measure
User-perceived Response Time
• Two sets
Controllable experiments by loading specific-
size files on a Web server
Operational Web sites
Web server parameters RTT (ms) Bottleneck file size Latencies (s)
via the local RTT (MB)
name Internet (ms) direct CHEETAH
location Proxy
71
Background –
High-Speed Circuit-Switched Networks
US: DOE’s UltraScience net, CHEETAH, Internet2 Dynamic Circuit network
Europe: UKLight (UK), SURFnet (Netherland), VIOLA (Germany), MUPBED
Canada: CA*net 4
Japan: JGN
Oak Ridge, TN
SN16000 Raleigh, NC
To Cray X1
GbE/
OC192 Control
10GbE H zelda4 SN16000
card Card
card H zelda5 GbE/
OC192 Control
10GbE
card Card H wukong
card
Atlanta, GA
OC-192 OC-192
SN16000
H zelda1
GbE/
OC192 Control H zelda2
10GbE
card Card
card H zelda3
CHEETAH: Circuit-switched High-speed
End-to-End Transport ArcHitecture