Heidi Ziegler, Mary Hall, Byoungro So (auth.), Lawrence Rauchwerger (eds.)3540211993, 9783540211990, 9783540246442
This book constitutes the thoroughly refereed post-proceedings of the 16th International Workshop on Languages and Compilers for Parallel Computing, LCPC 2003, held in College Station, Texas, USA, in October 2003.
The 35 revised full papers presented were selected from 48 submissions during two rounds of reviewing and improvement upon presentation at the workshop. The papers are organized in topical sections on adaptive optimization, data locality, parallel languages, high-level transformations, embedded systems, distributed systems software, low-level transformations, compiling for novel architectures, and optimization infrastructure.
Table of contents :
Front Matter….Pages –
Search Space Properties for Mapping Coarse-Grain Pipelined FPGA Applications….Pages 1-16
Adapting Convergent Scheduling Using Machine-Learning….Pages 17-31
TFP: Time-Sensitive, Flow-Specific Profiling at Runtime….Pages 32-47
A Hierarchical Model of Reference Affinity….Pages 48-63
Cache Optimization for Coarse Grain Task Parallel Processing Using Inter-Array Padding….Pages 64-76
Compiler-Assisted Cache Replacement: Problem Formulation and Performance Evaluation….Pages 77-92
Memory-Constrained Data Locality Optimization for Tensor Contractions….Pages 93-108
Compositional Development of Parallel Programs….Pages 109-126
Supporting High-Level Abstractions through XML Technology….Pages 127-146
Applications of HPJava….Pages 147-161
Programming for Locality and Parallelism with Hierarchically Tiled Arrays….Pages 162-176
Co-array Fortran Performance and Potential: An NPB Experimental Study….Pages 177-193
Evaluating the Impact of Programming Language Features on the Performance of Parallel Applications on Cluster Architectures….Pages 194-208
Putting Polyhedral Loop Transformations to Work….Pages 209-225
Index-Association Based Dependence Analysis and its Application in Automatic Parallelization….Pages 226-240
Improving the Performance of Morton Layout by Array Alignment and Loop Unrolling….Pages 241-257
Spatial Views: Space-Aware Programming for Networks of Embedded Systems….Pages 258-272
Operation Reuse on Handheld Devices….Pages 273-287
Memory Redundancy Elimination to Improve Application Energy Efficiency….Pages 288-305
Adaptive MPI….Pages 306-322
MPJava: High-Performance Message Passing in Java Using Java.nio….Pages 323-339
Polynomial-Time Algorithms for Enforcing Sequential Consistency in SPMD Programs with Arrays….Pages 340-356
C 3 : A System for Automating Application-Level Checkpointing of MPI Programs….Pages 357-373
The Power of Belady’s Algorithm in Register Allocation for Long Basic Blocks….Pages 374-389
Load Elimination in the Presence of Side Effects, Concurrency and Precise Exceptions….Pages 390-404
To Inline or Not to Inline? Enhanced Inlining Decisions….Pages 405-419
A Preliminary Study on the Vectorization of Multimedia Applications for Multimedia Extensions….Pages 420-435
A Data Cache with Dynamic Mapping….Pages 436-450
Compiler-Based Code Partitioning for Intelligent Embedded Disk Processing….Pages 451-465
Much Ado about Almost Nothing: Compilation for Nanocontrollers….Pages 466-480
Increasing the Accuracy of Shape and Safety Analysis of Pointer-Based Codes….Pages 481-494
Slice-Hoisting for Array-Size Inference in MATLAB….Pages 495-508
Efficient Execution of Multi-query Data Analysis Batches Using Compiler Optimization Strategies….Pages 509-523
Semantic-Driven Parallelization of Loops Operating on User-Defined Containers….Pages 524-538
Cetus – An Extensible Compiler Infrastructure for Source-to-Source Transformation….Pages 539-553
Back Matter….Pages –
Reviews
There are no reviews yet.