E-Books durchsuchen

OpenMP in a Heterogeneous World [2012]

1
Specification and Performance Evaluation of Parallel I/O Interfaces for OpenMP
15
The Design of OpenMP Thread Affinity
29
Auto-scoping for OpenMP Tasks
44
A Case for Including Transactions in OpenMP II: Hardware Transactional Memory
59
Extending OpenMP* with Vector Constructs for Modern Multicore SIMD Architectures
73
Introducing Task Cancellation to OpenMP
88
Automatic OpenMP Loop Scheduling: A Combined Compiler and Runtime Approach
102
<Emphasis Type="SmallCaps">libKOMP</Emphasis>, an Efficient OpenMP Runtime System for Both Fork-Join and Data Flow Paradigms
116
A Compiler-Assisted Runtime-Prefetching Scheme for Heterogeneous Platforms
130
Experiments with WRF on Intel® Many Integrated Core (Intel MIC) Architecture
140
Optimizing the Advanced Accelerator Simulation Framework Synergia Using OpenMP
154
Using Compiler Directives for Accelerating CFD Applications on GPUs
169
Effects of Compiler Optimizations in OpenMP to CUDA Translation
182
Assessing OpenMP Tasking Implementations on NUMA Architectures
196
Performance Analysis Techniques for Task-Based OpenMP Applications
210
Task-Based Execution of Nested OpenMP Loops
223
SPEC OMP2012 — An Application Benchmark Suite for Parallel Systems Using OpenMP
237
An OpenMP 3.1 Validation Testsuite
250
Performance Analysis of an Hybrid MPI/OpenMP ALM Software for Life Insurance Policies on Multi-core Architectures
254
Adaptive OpenMP for Large NUMA Nodes
258
A Generalized Directive-Based Approach for Accelerating PDE Solvers
262
Design of a Shared-Memory Model for CAPE
267
Overlapping Computations with Communications and I/O Explicitly Using OpenMP Based Heterogeneous Threading Models
271
A Microbenchmark Suite for OpenMP Tasks
275
Support for Thread-Level Speculation into OpenMP
Feedback