EMC Presentation
April 2005
Outline
Motivation to study file-based I/O
Profile-driven partitioning for parallel file I/O
I/O Qualification Laboratory @ NU
Areas for future work
[Figure: diagram with regions labeled Air, Mine, Soil]
Reduced the runtime of a Monte Carlo scattered light simulation by 98% on a 16-node Silicon Graphics Origin 2000
Matlab-to-C compilation
Hot-path parallelization
[Figure: Speedup; series: 64-vector, 1024-vector]
Limits of Parallelization
For compute-bound workloads, Beowulf clusters can be used effectively to overcome computational barriers
Middleware (e.g., MPI and MPI-IO) can significantly reduce the programming effort on parallel systems
Multiple clusters can be combined using Grid middleware (Globus Toolkit)
For file-based I/O-bound workloads, Beowulf clusters and Grid systems are presently ill-suited to exploit the potential parallelism present on these systems
File-intensive applications
Access file-based datasets frequently
Large number of file operations
Introduction
Storage architectures
Direct Attached Storage (DAS)
Storage device is directly attached to the computer
I/O Partitioning
[Figure: an I/O-intensive application (P) and its disks under two layouts: Data Striping and Data Partitioning]
I/O Partitioning
I/O is parallelized at both the application level (using MPI and MPI-IO) and the disk level (using file partitioning)
Ideally, every process will only access files on local disk (though this is typically not possible due to data sharing)
How to recognize the access patterns? Profile-guided approach
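The difference between striping and partitioning can be illustrated with a small sketch. This is not the presenters' implementation: the function names, the round-robin striping rule, the convention that process p's local disk is disk p, and the toy access profile are all assumptions made for the example.

```python
from collections import Counter

def stripe_disk(block, num_disks):
    """Round-robin striping (assumed rule): block i lives on disk i mod num_disks."""
    return block % num_disks

def partition_disk(block, accesses):
    """Place a block on the local disk of the process that touches it most.

    `accesses` maps block -> list of process ranks that accessed it
    (a hypothetical profile format); process p's local disk is disk p.
    """
    counts = Counter(accesses[block])
    # Most frequent accessor wins; ties go to the lowest rank.
    return min(counts, key=lambda rank: (-counts[rank], rank))

def local_fraction(placement, accesses):
    """Fraction of accesses served from the accessing process's own disk."""
    total = sum(len(ranks) for ranks in accesses.values())
    local = sum(
        1
        for block, ranks in accesses.items()
        for rank in ranks
        if placement[block] == rank
    )
    return local / total

# Toy profile: 2 processes, 4 blocks; block 3 is shared between them.
accesses = {0: [0, 0], 1: [0], 2: [1, 1], 3: [1, 0, 1]}
striped = {b: stripe_disk(b, 2) for b in accesses}
partitioned = {b: partition_disk(b, accesses) for b in accesses}

print(local_fraction(striped, accesses))
print(local_fraction(partitioned, accesses))
```

In this toy profile the shared block 3 keeps the partitioned layout from being fully local, which mirrors the data-sharing caveat above.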
Profile Generation
Run the application
Generate a partition for every process
Optimal partitioning is NP-complete, so we develop a greedy algorithm
We have found we can use partial profiles to guide partitioning
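The slides do not spell out the greedy algorithm, so the following is only one plausible sketch of a greedy heuristic driven by a profile. The profile format (chunk -> rank -> access count), the per-partition `capacity` limit, and the name `greedy_partition` are assumptions for illustration.

```python
def greedy_partition(profile, num_procs, capacity):
    """Assign each chunk to exactly one process's partition.

    profile:  dict chunk_id -> dict rank -> access count (hypothetical format)
    capacity: preferred maximum number of chunks per partition (assumed constraint)
    """
    # Visit (count, chunk, rank) candidates from most to least accessed.
    candidates = sorted(
        (
            (count, chunk, rank)
            for chunk, per_rank in profile.items()
            for rank, count in per_rank.items()
        ),
        key=lambda c: (-c[0], c[1], c[2]),
    )
    partitions = {rank: [] for rank in range(num_procs)}
    placed = set()
    for count, chunk, rank in candidates:
        if chunk in placed or len(partitions[rank]) >= capacity:
            continue
        partitions[rank].append(chunk)
        placed.add(chunk)
    # Chunks whose preferred partitions were all full go to the least-loaded
    # partition, even if that exceeds the preferred capacity.
    for chunk in profile:
        if chunk not in placed:
            rank = min(partitions, key=lambda r: (len(partitions[r]), r))
            partitions[rank].append(chunk)
            placed.add(chunk)
    return partitions

profile = {
    "c0": {0: 10, 1: 1},
    "c1": {0: 8},
    "c2": {0: 7, 1: 6},
    "c3": {1: 5},
}
result = greedy_partition(profile, num_procs=2, capacity=2)
print(result)
```

In this example the shared chunk `c2` ends up with process 1 because process 0's partition is already full when `c2` is considered. Since the heuristic only reads access counts, it can be run unchanged on a profile that covers only part of an execution.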
NPB2.4/BT
  Computational fluid dynamics
  Generates a file (~1.6 GB) dynamically and then reads it back
  Writes/reads sequentially in chunks of 2040 bytes
SPEChpc96/seismic
  Seismic processing
  Generates a file (~1.5 GB) dynamically and then reads it back
  Writes sequential chunks of 96 KB and reads sequential chunks of 2 KB
Tile-IO
  Parallel Benchmarking Consortium
  Tile access to a two-dimensional matrix (~1 GB) with overlap
  Writes/reads sequential chunks of 32 KB, with 2 KB of overlap
Perf
  Parallel I/O test program within MPICH
  Writes a 1 MB chunk at a location determined by rank, no overlap
Mandelbrot
  An image processing application that includes visualization
  Chunk size is dependent on the number of processes
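As a small illustration of the simplest pattern in this list, the Perf access pattern can be modeled as one fixed-size chunk per rank. The rule `offset = rank * chunk` is an assumption about how the location is "determined by rank"; the helper names are hypothetical.

```python
CHUNK = 1 << 20  # 1 MB chunk, as stated for Perf

def perf_extent(rank, chunk=CHUNK):
    """Half-open byte range [start, end) written by `rank`,
    assuming offset = rank * chunk."""
    start = rank * chunk
    return start, start + chunk

def overlaps(a, b):
    """True if half-open byte ranges a and b share at least one byte."""
    return a[0] < b[1] and b[0] < a[1]

extents = [perf_extent(r) for r in range(4)]
any_overlap = any(
    overlaps(extents[i], extents[j])
    for i in range(len(extents))
    for j in range(i + 1, len(extents))
)
print(extents, any_overlap)
```

Under this assumption no two ranks touch the same bytes, so each rank's chunk could be placed on its local disk with no sharing. Tile-IO, by contrast, has overlapping regions that more than one process accesses.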
Beowulf Cluster
[Figure: cluster diagram with P2-350MHz nodes and RAID nodes]
Hardware Specifics
DAS configuration
Linux box, Western Digital WD800BB (IDE), 80GB, 7200RPM
Write/Read Bandwidth
[Figure: write and read bandwidth (MB/sec) for NPB2.4/BT and SPECHPC/seis; series: Unix Write, Unix Read, MPI-IO Write, MPI-IO Read, P-IO Write, P-IO Read]
Write/Read Bandwidth
[Figure: write and read bandwidth (MB/sec) for MPI-Tile, Perf, and Mandelbrot; series: MPI write, MPI read, PIO write, PIO read]
[Figure: MPI-IO vs. PIO; recoverable labels: Perf, Mandelbrot]
[Figure: model vs. real results against seek distance in number of blocks (1 to 32); access size = 1 block, number of accesses = 1000]
[Figure: simulation diagram; recoverable labels: FileSystem, I/O traces, Disk Sim]
[Figure: model vs. real results against number of processors (9, 16, 25)]
Hardware Specifications
[Figure: bandwidth (MB/s) by storage architecture: SAN-direct-ATA, SAN-direct-SCSI, SAN-direct-FC, NAS-joulian, SAN-joulian, NAS-ATA, NAS-SCSI, NAS-FC]
Publications
1. Profile-guided File Partitioning on Beowulf Clusters, Journal of Cluster Computing, Special Issue on Parallel I/O, to appear 2005.
2. Execution-Driven Simulation of Network Storage Systems, Proceedings of the 12th ACM/IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTS), October 2004, pp. 604-611.
3. Profile-Guided I/O Partitioning, Proceedings of the 17th ACM International Symposium on Supercomputing, June 2003, pp. 252-260.
4. Source Level Transformations to Apply I/O Data Partitioning, Proceedings of the IEEE Workshop on Storage Network Architecture And Parallel IO, Oct. 2003, pp. 12-21.
5. Profile-Based Characterization and Tuning for Subsurface Sensing and Imaging Applications, International Journal of Systems, Science and Technology, September 2002, pp. 40-55.
[Figure: cluster network diagram; recoverable labels: RAID, 100 Mbit/s, 1 Gbit/s, 31 sub-nodes (joulian.hpcl.neu.edu), 8 sub-nodes (keys.ece.neu.edu), dual server, P2P]