
Uses some of the slides for Chapters 3 and 5 accompanying Grama, Gupta, Karypis, and Kumar, Introduction to Parallel Computing, Addison Wesley, 2003. http://www-users.cs.umn.edu/~karypis/parbook

Preliminaries: Decomposition, Tasks, and Dependency Graphs


A parallel algorithm should decompose the problem into tasks that can be executed concurrently. A decomposition can be illustrated as a directed graph (a task dependency graph), with nodes = tasks and edges = dependencies: the result of one task is required for processing the next.

The degree of concurrency determines the number of tasks that can actually run in parallel. The critical path is the longest path in the dependency graph; its length is a lower bound on the program's runtime.
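
Since the critical path bounds the runtime from below, it is worth seeing how it can be computed. A minimal C++ sketch (my own illustration, not from the slides; the task weights and the example graph are invented):

    #include <cstdio>
    #include <utility>
    #include <vector>

    // Longest (critical) path in a DAG of tasks, given per-task work
    // and dependency edges, via relaxation in topological order.
    int criticalPath(int n, const std::vector<int>& work,
                     const std::vector<std::pair<int, int>>& deps) {
        std::vector<std::vector<int>> succ(n);
        std::vector<int> indeg(n, 0), finish(n, 0);
        for (auto [u, v] : deps) { succ[u].push_back(v); ++indeg[v]; }

        std::vector<int> ready;
        for (int t = 0; t < n; ++t)
            if (indeg[t] == 0) { ready.push_back(t); finish[t] = work[t]; }

        int longest = 0;
        while (!ready.empty()) {
            int u = ready.back(); ready.pop_back();
            if (finish[u] > longest) longest = finish[u];
            for (int v : succ[u]) {
                // v cannot start before u finishes
                if (finish[u] + work[v] > finish[v]) finish[v] = finish[u] + work[v];
                if (--indeg[v] == 0) ready.push_back(v);
            }
        }
        return longest;  // lower bound on the parallel runtime
    }

    int main() {
        // Hypothetical graph: tasks 0 and 1 feed task 2; task 2 feeds task 3.
        std::vector<int> work = {2, 3, 1, 4};
        std::vector<std::pair<int, int>> deps = {{0, 2}, {1, 2}, {2, 3}};
        std::printf("critical path = %d\n", criticalPath(4, work, deps));  // 3+1+4 = 8
        return 0;
    }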

Example: Multiplying a Dense Matrix with a Vector


The computation of each element of the output vector y is independent of all other elements. Based on this, a dense matrix-vector product can be decomposed into n independent tasks, one per row.

Observations: while the tasks share data (namely, the vector b), they do not have any control dependencies, i.e., no task needs to wait for the (partial) completion of any other. All tasks are of the same size in terms of the number of operations. Is this the maximum number of tasks we could decompose this problem into?
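
A minimal sketch of this decomposition (my own illustration, not from the slides), with one C++ thread standing in for each task; the matrix contents and size are made up:

    #include <cstdio>
    #include <thread>
    #include <vector>

    int main() {
        const int n = 4;  // hypothetical size
        std::vector<std::vector<double>> A(n, std::vector<double>(n, 1.0));
        std::vector<double> b(n, 2.0), y(n, 0.0);

        // One independent task per row: y[i] = sum_j A[i][j] * b[j].
        // Tasks read A and b without synchronization; each writes a distinct y[i].
        std::vector<std::thread> tasks;
        for (int i = 0; i < n; ++i)
            tasks.emplace_back([&, i] {
                double s = 0.0;
                for (int j = 0; j < n; ++j) s += A[i][j] * b[j];
                y[i] = s;
            });
        for (auto& t : tasks) t.join();

        for (int i = 0; i < n; ++i) std::printf("y[%d] = %g\n", i, y[i]);  // 8 each
        return 0;
    }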

Multiplying a dense matrix with a vector: 2n CPUs available

[Figure: a finer decomposition into 3n tasks. Tasks 1..2n each compute half of one row's dot product; Tasks 2n+1..3n add the two partial results of each row.]

On what kind of platform will we have 2n processors?
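
Under the reading of the figure given above, a sketch of the finer-grained version (again my own illustration, with invented data): 2n threads compute half-row partial dot products, and n more threads combine them once both halves are ready.

    #include <array>
    #include <cstdio>
    #include <thread>
    #include <vector>

    int main() {
        const int n = 4;  // hypothetical size (assumed even here)
        std::vector<std::vector<double>> A(n, std::vector<double>(n, 1.0));
        std::vector<double> b(n, 2.0), y(n);
        std::vector<std::array<double, 2>> part(n);  // two partial sums per row

        // Tasks 1..2n: each computes half of one row's dot product.
        std::vector<std::thread> mul;
        for (int i = 0; i < n; ++i)
            for (int h = 0; h < 2; ++h)
                mul.emplace_back([&, i, h] {
                    double s = 0.0;
                    for (int j = h * n / 2; j < (h + 1) * n / 2; ++j)
                        s += A[i][j] * b[j];
                    part[i][h] = s;
                });
        for (auto& t : mul) t.join();  // dependency edges: adds need both halves

        // Tasks 2n+1..3n: combine the two partial sums of each row.
        std::vector<std::thread> add;
        for (int i = 0; i < n; ++i)
            add.emplace_back([&, i] { y[i] = part[i][0] + part[i][1]; });
        for (auto& t : add) t.join();

        for (int i = 0; i < n; ++i) std::printf("y[%d] = %g\n", i, y[i]);
        return 0;
    }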

Granularity of Task Decompositions


The size of the tasks into which a problem is decomposed is called its granularity. Granularity affects parallelization efficiency: if tasks are too fine-grained, coordination and communication overhead dominates; if they are too coarse, there may not be enough concurrency to keep all processors busy.

Guidelines for good parallel algorithm design


Maximize concurrency.
Spread tasks evenly across processors to avoid idling and to achieve load balance.
Execute tasks on the critical path as soon as their dependencies are satisfied.
Minimize communication between processors, or overlap it with computation.

Evaluating a parallel algorithm


Speedup
S = Tserial / Tparallel

Efficiency
E = S / #processors

Cost
C = #processors * Tparallel

Scalability
Speedup (or efficiency) as a function of the number of processors, with the problem size fixed.
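
A worked example with hypothetical numbers: if Tserial = 100 s and Tparallel = 30 s on 4 processors, then S = 100/30 ≈ 3.3, E = 3.3/4 ≈ 0.83, and C = 4 · 30 = 120 s. Since the cost (120 s) exceeds Tserial (100 s), the parallel algorithm does more total work than the serial one.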

Simple example: summing n numbers on p CPUs


Parallel time = Θ((n/p) · log p)
Serial time = Θ(n)
Speedup = Θ(p / log p)
Efficiency = Θ(1 / log p)
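
To get a feel for Θ(1/log p) efficiency, take p = 1024 as a made-up figure: log2 1024 = 10, so the efficiency is on the order of 1/10, i.e., only about a tenth of the ideal p-fold speedup is achieved, and the gap widens as p grows.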

Parallel multiplication of 3 matrices


A, B, C: square N × N matrices.
We want to compute A × B × C in parallel. How do we achieve maximum performance?

Try 1
2-CPU parallelization:
[Figure: TEMP = A × B is computed first, then Result = TEMP × C.]

Try 2
3-CPU parallelization:
[Figure: an intermediate matrix C' is computed and then multiplied to give Result.]

Is this a good way?
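
On one reading of the Try 1 figure (my interpretation; the slides do not spell it out), both CPUs cooperate on TEMP = A × B by splitting its rows, wait for each other, and then cooperate on Result = TEMP × C the same way. A minimal C++ sketch, with made-up sizes and data:

    #include <cstdio>
    #include <functional>
    #include <thread>
    #include <vector>

    using Mat = std::vector<std::vector<double>>;

    // Multiply rows [r0, r1) of X by Y, storing them into Z.
    static void mulRows(const Mat& X, const Mat& Y, Mat& Z, int r0, int r1) {
        int n = (int)X.size();
        for (int i = r0; i < r1; ++i)
            for (int j = 0; j < n; ++j) {
                double s = 0.0;
                for (int k = 0; k < n; ++k) s += X[i][k] * Y[k][j];
                Z[i][j] = s;
            }
    }

    int main() {
        const int n = 4;  // hypothetical size
        Mat A(n, std::vector<double>(n, 1.0)), B = A, C = A;
        Mat temp(n, std::vector<double>(n)), result(n, std::vector<double>(n));

        // Stage 1: TEMP = A * B, rows split between the two CPUs.
        std::thread t1(mulRows, std::cref(A), std::cref(B), std::ref(temp), 0, n / 2);
        std::thread t2(mulRows, std::cref(A), std::cref(B), std::ref(temp), n / 2, n);
        // Join before stage 2: the simplest way to ensure TEMP is complete
        // (conservative here, since each result row needs only its own TEMP row).
        t1.join(); t2.join();

        // Stage 2: Result = TEMP * C, same row split.
        std::thread t3(mulRows, std::cref(temp), std::cref(C), std::ref(result), 0, n / 2);
        std::thread t4(mulRows, std::cref(temp), std::cref(C), std::ref(result), n / 2, n);
        t3.join(); t4.join();

        std::printf("result[0][0] = %g\n", result[0][0]);  // 16 for all-ones inputs
        return 0;
    }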

Dynamic task creation: simple master-worker


Identify the independent computations, e.g. for 2 × 2 blocks with T = A × B:

T(1,1) = A(1,1)*B(1,1) + A(1,2)*B(2,1)
T(1,2) = A(1,1)*B(1,2) + A(1,2)*B(2,2)
Result(1,1) = T(1,1)*C(1,1) + T(1,2)*C(2,1)

Each product is an independent task; each sum can run once its operands are ready.

Create a queue of tasks. Each process picks a task from the queue and may insert new tasks into the queue, as sketched below.
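
A minimal C++ sketch of such a shared task queue (assumed details, not the lecturer's code): workers pop tasks, run them, and may submit follow-up tasks; a count of outstanding tasks tells everyone when to stop.

    #include <condition_variable>
    #include <cstdio>
    #include <functional>
    #include <mutex>
    #include <queue>
    #include <thread>
    #include <vector>

    std::queue<std::function<void()>> tasks;
    std::mutex m;
    std::condition_variable cv;
    int pending = 0;  // tasks queued or currently running

    void submit(std::function<void()> t) {
        { std::lock_guard<std::mutex> lk(m); tasks.push(std::move(t)); ++pending; }
        cv.notify_all();
    }

    void worker() {
        for (;;) {
            std::function<void()> t;
            {
                std::unique_lock<std::mutex> lk(m);
                cv.wait(lk, [] { return !tasks.empty() || pending == 0; });
                if (tasks.empty()) return;  // nothing queued, nothing running: done
                t = std::move(tasks.front()); tasks.pop();
            }
            t();  // the task body may call submit() to create new tasks
            { std::lock_guard<std::mutex> lk(m); --pending; }
            cv.notify_all();
        }
    }

    int main() {
        // Hypothetical workload: a multiply task dynamically spawns the
        // sum task that depends on its result.
        submit([] { std::puts("A(1,1)*B(1,1)");
                    submit([] { std::puts("T(1,1) += ..."); }); });
        std::vector<std::thread> pool;
        for (int i = 0; i < 4; ++i) pool.emplace_back(worker);
        for (auto& w : pool) w.join();
        return 0;
    }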

A hypothetical implementation with multiple threads


Let's assign a separate thread to each multiply/sum operation. How do we coordinate the threads?

Initialize each thread so that it knows its dependencies, and build producer-consumer logic for every thread.
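
One way to build that producer-consumer logic (a sketch under my own assumptions, using C++ promises/futures; the names and the 1 × 1 "matrices" are invented): the thread that produces an intermediate value fulfills a promise, and any thread that needs the value blocks on the corresponding future.

    #include <cstdio>
    #include <future>
    #include <thread>

    int main() {
        // Hypothetical 1x1 "matrices": Result = A * B * C.
        double A = 2.0, B = 3.0, C = 4.0;

        std::promise<double> tPromise;  // will hold T = A * B
        std::shared_future<double> tReady = tPromise.get_future().share();

        // Producer thread: computes T and publishes it.
        std::thread producer([&] { tPromise.set_value(A * B); });

        // Consumer thread: blocks until T is available, then computes T * C.
        std::thread consumer([&] {
            double t = tReady.get();              // waits for the producer
            std::printf("Result = %g\n", t * C);  // 24
        });

        producer.join();
        consumer.join();
        return 0;
    }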
