[Figure 3: Data transfer rates over time for different executions of the sort program. Each column of panels plots Input (MB/s), Shuffle (MB/s), and Output (MB/s) against time in seconds, for (a) normal execution, (b) no backup tasks, and (c) 200 tasks killed.]

original text line as the intermediate key/value pair. We used a built-in Identity function as the Reduce operator. This function passes the intermediate key/value pair unchanged as the output key/value pair. The final sorted output is written to a set of 2-way replicated GFS files (i.e., 2 terabytes are written as the output of the program).

As before, the input data is split into 64MB pieces (M = 15000). We partition the sorted output into 4000 files (R = 4000). The partitioning function uses the initial bytes of the key to segregate it into one of R pieces.

Our partitioning function for this benchmark has built-in knowledge of the distribution of keys. In a general sorting program, we would add a pre-pass MapReduce operation that would collect a sample of the keys and use the distribution of the sampled keys to compute split-points for the final sorting pass.

Figure 3 (a) shows the progress of a normal execution of the sort program. The top-left graph shows the rate at which input is read. The rate peaks at about 13 GB/s and dies off fairly quickly since all map tasks finish before 200 seconds have elapsed. Note that the input rate is less than for grep. This is because the sort map tasks spend about half their time and I/O bandwidth writing intermediate output to their local disks. The corresponding intermediate output for grep had negligible size.

The middle-left graph shows the rate at which data is sent over the network from the map tasks to the reduce tasks. This shuffling starts as soon as the first map task completes. The first hump in the graph is for the first batch of approximately 1700 reduce tasks (the entire MapReduce was assigned about 1700 machines, and each machine executes at most one reduce task at a time). Roughly 300 seconds into the computation, some of this first batch of reduce tasks finish and we start shuffling data for the remaining reduce tasks. All of the shuffling is done about 600 seconds into the computation.

The bottom-left graph shows the rate at which sorted data is written to the final output files by the reduce tasks. There is a delay between the end of the first shuffling period and the start of the writing period because the machines are busy sorting the intermediate data. The writes continue at a rate of about 2-4 GB/s for a while. All of the writes finish about 850 seconds into the computation. Including startup overhead, the entire computation takes 891 seconds. This is similar to the current best reported result of 1057 seconds for the TeraSort benchmark [18].

A few things to note: the input rate is higher than the shuffle rate and the output rate because of our locality optimization; most data is read from a local disk and bypasses our relatively bandwidth-constrained network. The shuffle rate is higher than the output rate because the output phase writes two copies of the sorted data (we make two replicas of the output for reliability and availability reasons). We write two replicas because that is the mechanism for reliability and availability provided by our underlying file system. Network bandwidth requirements for writing data would be reduced if the underlying file system used erasure coding [14] rather than replication.
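As an illustration of the operators described above, a minimal sketch of the sort benchmark's Map and Identity Reduce is given below, written against the Mapper/Reducer interface used in the paper's word-count example. The class names, the 10-byte key extraction via substr, and the exact Emit signatures are our assumptions, not code from the paper.

  #include <string>
  #include "mapreduce/mapreduce.h"

  // Map: emit a short sorting key extracted from each record, with the
  // original text line as the value. (Assumes the sort key occupies the
  // first 10 bytes of the record, as in TeraSort-style input.)
  class SortMapper : public Mapper {
   public:
    virtual void Map(const MapInput& input) {
      const std::string& line = input.value();
      Emit(line.substr(0, 10), line);  // assumed string-accepting Emit
    }
  };
  REGISTER_MAPPER(SortMapper);

  // Reduce: an Identity operator that passes every intermediate
  // key/value pair through unchanged to the output.
  class IdentityReducer : public Reducer {
   public:
    virtual void Reduce(ReduceInput* input) {
      while (!input->done()) {
        Emit(input->value());
        input->NextValue();
      }
    }
  };
  REGISTER_REDUCER(IdentityReducer);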

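The general-sorting variant mentioned above (collect a key sample in a pre-pass, then derive split-points for the final sorting pass) could look like the following sketch. The RangePartitioner class and its interface are entirely hypothetical; the paper does not specify a partitioner API. It assumes the sample contains at least R keys.

  #include <algorithm>
  #include <string>
  #include <vector>

  // Hypothetical range partitioner: given a key sample (e.g., gathered by
  // a pre-pass MapReduce), compute R-1 split points and assign each key to
  // one of R pieces so that piece i holds keys in [split[i-1], split[i]).
  class RangePartitioner {
   public:
    RangePartitioner(std::vector<std::string> sample, int r) : r_(r) {
      std::sort(sample.begin(), sample.end());
      // Pick R-1 evenly spaced keys from the sorted sample as split points.
      for (int i = 1; i < r_; i++) {
        splits_.push_back(sample[i * sample.size() / r_]);
      }
    }

    // Returns the piece number in [0, R) for the given key.
    int Partition(const std::string& key) const {
      return std::upper_bound(splits_.begin(), splits_.end(), key) -
             splits_.begin();
    }

   private:
    int r_;
    std::vector<std::string> splits_;
  };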
