Documenti di Didattica
Documenti di Professioni
Documenti di Cultura
Jeremy Kepner, Tim Currie, Hahn Kim, Bipin Mathew, Andrew McCabe, Michael Moore, Dan Rabinkin, Albert Reuther, Andrew Rhoades, Lou Tella and Nadya Travinin
This work is sponsored by the Department of the Air Force under Air Force contract F19628-00-C-002. Opinions, interpretations, conclusions and recommendations are those of the author and are not necessarily endorsed by the United States Government.
Outline
Introduction
System Software Results Summary
Slide-2 Quicklook
LiMIT
Processing Challenge
Can we process radar data (SAR & GMTI) in flight and
provide feedback on sensor performance in flight?
Slide-4 Quicklook
Matlab*P PVL
User Interface
Lab-Wide Usage
Ballistic Missile Defense Laser Propagation Simulation Hyperspectral Imaging Passive Sonar Airborne Ground Moving Target Indicator (GMTI) Airborne Synthetic Aperture Radar (SAR)
MatlabMPI
Slide-5 Quicklook
QuickLook Concept
28 CPU Bladed Cluster Running pMatlab
RAID Disk Recorder
Data Files
Outline
Introduction
System Software Results Summary
ConOps Ruggedization Integration
Slide-7 Quicklook
Concept of Operations
~1 seconds = 1 dwell Record Streaming Data Copy to Bladed Cluster Process on Bladed Cluster Process on SGI RAID Disk Recorder
Gbit Ethernet (1/4x RT rate)
Timeline
~30 Seconds
2 Dwell ~2 minutes
2 Dwells ~1 hour
To Other Systems
Vibration Tests
X-axis, 13 CPU/13 HD
60 50 40 30 20 10 0
16 32 ,4 54 ,5 33 4 30 4, 19 4, 8 28 4, 52 36 ,5 65 2 19 8, 4 02 1, 8 12
0dB = 1.4G (above normal) -3dB = ~1.0G (normal) -6dB = ~0.7G (below normal)
Tested in all 3 dimensions Ran MatlabMPI file based communication test up 14 CPUs/14 Hard drives Throughput decreases seen at 1.4 G
Slide-9 Quicklook
Thermal Tests
Temperature ranges
Test range: -20C to 40C Bladecenter spec: 10C to 35C Successfully cooled to -10C Failed at -20C Cargo bay typically 0C Used duct to draw outside air to cool cluster inside oven Successfully heated to 40C Outside air cooled cluster to 36C
Cooling tests
Heating tests
Slide-10 Quicklook
Mitigation Strategies
IBM Bladecenter is not designed for 707s operational environment Strategies to minimize risk of damage:
1. Power down during takeoff/ landing
Avoids damage to hard drives Radar is also powered down
Slide-11 Quicklook
Integration
SGI RAID System Scan catalog files, select dwells and CPIs to process (C/C shell) Assign dwells/CPIs to nodes, package up signature / aux data, one CPI per file. Transfer data from SGI to each processors disk (Matlab) IBM Bladed Cluster Nodes process CPIs in parallel, write results onto node 1s disk. Node 1 processor performs final processing Results displayed locally
Bladed Cluster
NODE 1 P1 VP1 VP2 VP1 VP2 Gigabit Connection
rcp
SGI
P2
NODE 14
P1
RAID
P2
Slide-12 Quicklook
Outline
Introduction
Hardware Software Results Summary
pMatlab architecture GMTI SAR
Slide-13 Quicklook
Input
Analysis
Output
Vector/Matrix
Comp
Conduit
Task
Math (Matlab)
Can build applications with a few parallel structures and functions pMatlab provides parallel arrays and functions
X = ones(n,mapX); Y = zeros(n,mapY); Y(:,:) = fft(X); MIT Lincoln Laboratory
LiMIT GMTI
GMTI Block Diagram
DWELL
Input SIG Data Input Aux Data EQC EQ Subband SIG LOD LOD Crab Correct Doppler Process Adaptive Beamform STAP Input LOD Data Deal to Nodes EQC EQ Subband LOD SIG AUX LOD
Parallel Implementation
Approach Deal out CPIs to different CPUs Performance TIME/NODE/CPI TIME FOR ALL 28 CPIS Speedup
Recon
Display
Slide-15 Quicklook
Slide-16 Quicklook
LiMIT SAR
SAR Block Diagram
Collect Pulse Collect Cube
Chirp Coefficients
A/D
FFT
Buffer
Equalization Coefficients
Buffer
Display
Selection of submatrices
Will be a key enabler for parallel linear algebra (LU, QR, )
Slide-17 Quicklook
Slide-18 Quicklook
Outline
Introduction
Implementation Results Summary
Scaling Results Mission Results Future Work
Slide-19 Quicklook
Parallel Performance
Parallel Speedup
30
GMTI (1 per node) GMTI (2 per node) SAR (1 per node) Linear
20
10
0 0 10 20 30
Number of Processors
Slide-20 Quicklook
Application memory requirements too large for 1 CPU pMatlab a requirement for this application Corner Turn performance is limiting factor Optimization efforts have improved time by 30% Believe additional improvement is possible
MIT Lincoln Laboratory
Slide-21 Quicklook
Flight Plan
Two data collection flights Flew a 50 km diameter box Six GPS-instrumented vehicles
Two 2.5T trucks Two CUCV's Two M577's
Slide-22 Quicklook
Plans to use QuickLook for streaming processing in October mission MIT Lincoln Laboratory
Slide-24 Quicklook
Octave
Slide-25 Quicklook
Petascale pMatlab
pMapper: automatically finds best parallel mapping
A Parallel Computer
FFT
FFT
MULT
Optimal Mapping
Performance (MFlops)
100
in core FFT out of core FFT
75
pMatlab (N x GByte)
~1 GByte RAM ~1 GByte RAM
50
~1 GByte RAM
~1 TByte RAID disk
25
Matlab*P Client/Server
Parallel Libraries:
PVL, ||VSIPL++, ScaLapack
Slide-26 Quicklook
pMatlab Toolbox
Matlab math libraries
MIT Lincoln Laboratory
Summary
Airborne research platforms typically collect and process
data later