Toward OpenCL Automatic Multi-Device Support (English)
- New search for: Henry, Sylvain
- New search for: Denis, Alexandre
- New search for: Barthou, Denis
- New search for: Counilh, Marie-Christine
- New search for: Namyst, Raymond
- New search for: Silva, Fernando
- New search for: Dutra, Inês
- New search for: Santos Costa, Vítor
- New search for: Henry, Sylvain
- New search for: Denis, Alexandre
- New search for: Barthou, Denis
- New search for: Counilh, Marie-Christine
- New search for: Namyst, Raymond
In:
Euro-Par 2014 Parallel Processing
: 20th International Conference, Porto, Portugal, August 25-29, 2014. Proceedings
;
Chapter: 65
;
776-787
;
2014
- Article/Chapter (Book) / Electronic Resource
-
Title:Toward OpenCL Automatic Multi-Device Support
-
Contributors:Silva, Fernando ( editor ) / Dutra, Inês ( editor ) / Santos Costa, Vítor ( editor ) / Henry, Sylvain ( author ) / Denis, Alexandre ( author ) / Barthou, Denis ( author ) / Counilh, Marie-Christine ( author ) / Namyst, Raymond ( author )
-
Conference:European Conference on Parallel Processing ; 2014 ; Porto, Portugal
-
Published in:Euro-Par 2014 Parallel Processing : 20th International Conference, Porto, Portugal, August 25-29, 2014. Proceedings ; Chapter: 65 ; 776-787Lecture Notes in Computer Science ; 8632 ; 776-787
-
Publisher:
- New search for: Springer International Publishing
-
Place of publication:Cham
-
Publication date:2014-01-01
-
Size:12 pages
-
ISBN:
-
ISSN:
-
DOI:
-
Type of media:Article/Chapter (Book)
-
Type of material:Electronic Resource
-
Language:English
-
Keywords:
-
Source:
Table of contents eBook
The tables of contents are generated automatically and are based on the data records of the individual contributions available in the index of the TIB portal. The display of the Tables of Contents may therefore be incomplete.
- 1
-
MPI Trace Compression Using Event Flow GraphsAguilar, Xavier / Fürlinger, Karl / Laure, Erwin et al. | 2014
- 2
-
ScalaJack: Customized Scalable Tracing with In-situ Data AnalysisAnanthakrishnan, Srinath Krishna / Mueller, Frank et al. | 2014
- 3
-
Performance Measurement and Analysis of Transactional Memory and Speculative Execution on IBM Blue Gene/QJiang, Jie / Philippen, Peter / Knobloch, Michael / Mohr, Bernd et al. | 2014
- 4
-
c-Eclipse: An Open-Source Management Framework for Cloud ApplicationsSofokleous, Chrystalla / Loulloudes, Nicholas / Trihinas, Demetris / Pallis, George / Dikaiakos, Marios D. et al. | 2014
- 5
-
Modeling and Simulation of a Dynamic Task-Based Runtime System for Heterogeneous Multi-core ArchitecturesStanisic, Luka / Thibault, Samuel / Legrand, Arnaud / Videau, Brice / Méhaut, Jean-François et al. | 2014
- 6
-
Modeling the Impact of Reduced Memory Bandwidth on HPC ApplicationsTiwari, Ananta / Gamst, Anthony / Laurenzano, Michael A. / Schulz, Martin / Carrington, Laura et al. | 2014
- 7
-
ParaShares: Finding the Important Basic Blocks in Multithreaded ProgramsKambadur, Melanie / Tang, Kui / Kim, Martha A. et al. | 2014
- 8
-
Multi-Objective Auto-Tuning with Insieme: Optimization and Trade-Off Analysis for Time, Energy and Resource UsageGschwandtner, Philipp / Durillo, Juan J. / Fahringer, Thomas et al. | 2014
- 9
-
Performance Prediction and Evaluation of Parallel Applications in KVM, Xen, and VMwareHong, Cheol-Ho / Kim, Beom-Joon / Kim, Young-Pil / Park, Hyunchan / Yoo, Chuck et al. | 2014
- 10
-
DReAM: Per-Task DRAM Energy Metering in Multicore SystemsLiu, Qixiao / Moreto, Miquel / Abella, Jaume / Cazorla, Francisco J. / Valero, Mateo et al. | 2014
- 11
-
Characterizing the Performance-Energy Tradeoff of Small ARM Cores in HPC ComputationLaurenzano, Michael A. / Tiwari, Ananta / Jundt, Adam / Peraza, Joshua / Ward, William A. / Campbell, Roy / Carrington, Laura et al. | 2014
- 12
-
On Interactions among Scheduling Policies: Finding Efficient Queue Setup Using High-Resolution SimulationsKlusáček, Dalibor / Tóth, Šimon et al. | 2014
- 13
-
ProPS: A Progressively Pessimistic Scheduler for Software Transactional MemoryRito, Hugo / Cachopo, João et al. | 2014
- 14
-
A Queueing Theory Approach to Pareto Optimal Bags-of-Tasks Scheduling on CloudsDumitru, Cosmin / Oprescu, Ana-Maria / Živković, Miroslav / van der Mei, Rob / Grosso, Paola / de Laat, Cees et al. | 2014
- 15
-
SPAGHETtI: Scheduling/Placement Approach for Task-Graphs on HETerogeneous archItectureBarthou, Denis / Jeannot, Emmanuel et al. | 2014
- 16
-
Energy-Aware Multi-Organization Scheduling ProblemCohen, Johanne / Cordeiro, Daniel / Raphael, Pedro Luis F. et al. | 2014
- 17
-
Energy Efficient Scheduling of MapReduce JobsBampis, Evripidis / Chau, Vincent / Letsios, Dimitrios / Lucarelli, Giorgio / Milis, Ioannis / Zois, Georgios et al. | 2014
- 18
-
Automated Transformation of GPU-Specific OpenCL Kernels Targeting Performance Portability on Multi-Core/Many-Core CPUsHuang, Dafei / Wen, Mei / Xun, Changqing / Chen, Dong / Cai, Xing / Qiao, Yuran / Wu, Nan / Zhang, Chunyuan et al. | 2014
- 19
-
Switchable Scheduling for Runtime Adaptation of OptimizationBagnères, Lénaïc / Bastoul, Cédric et al. | 2014
- 20
-
A New GCC Plugin-Based Compiler Pass to Add Support for Thread-Level Speculation into OpenMPAldea, Sergio / Estebanez, Alvaro / Llanos, Diego R. / Gonzalez-Escribano, Arturo et al. | 2014
- 21
-
Improving Read Performance with Online Access Pattern Analysis and PrefetchingTang, Houjun / Zou, Xiaocheng / Jenkins, John / Boyuka, David A. / Ranshous, Stephen / Kimpe, Dries / Klasky, Scott / Samatova, Nagiza F. et al. | 2014
- 22
-
Robust and Efficient Large-Large Table Outer Joins on Distributed InfrastructuresCheng, Long / Kotoulas, Spyros / Ward, Tomas E / Theodoropoulos, Georgios et al. | 2014
- 23
-
Top-k Item Identification on Dynamic and Distributed DatasetsGuerrieri, Alessio / Montresor, Alberto / Velegrakis, Yannis et al. | 2014
- 24
-
Applying Selectively Parallel I/O Compression to Parallel Storage SystemsFilgueira, Rosa / Atkinson, Malcolm / Tanimura, Yusuke / Kojima, Isao et al. | 2014
- 25
-
Ultra-Fast Load Balancing of Distributed Key-Value Stores through Network-Assisted LookupsDe Cesaris, Davide / Katrinis, Kostas / Kotoulas, Spyros / Corradi, Antonio et al. | 2014
- 26
-
Virtual Machine Consolidation in Cloud Data Centers Using ACO MetaheuristicFerdaus, Md Hasanul / Murshed, Manzur / Calheiros, Rodrigo N. / Buyya, Rajkumar et al. | 2014
- 27
-
Workflow Scheduling on Federated CloudsDurillo, Juan J. / Prodan, Radu et al. | 2014
- 28
-
Locality-Aware Cooperation for VM Scheduling in Distributed CloudsPastor, Jonathan / Bertier, Marin / Desprez, Frédéric / Lebre, Adrien / Quesnel, Flavien / Tedeschi, Cédric et al. | 2014
- 29
-
Can Inter-VM Shmem Benefit MPI Applications on SR-IOV Based Virtualized Infiniband Clusters?Zhang, Jie / Lu, Xiaoyi / Jose, Jithin / Shi, Rong / Panda, Dhabaleswar K. (DK) et al. | 2014
- 30
-
Power-Aware L1 and L2 Caches for GPGPUsAtoofian, Ehsan / Manzak, Ali et al. | 2014
- 31
-
Power Consumption Due to Data Movement in Distributed Programming ModelsJana, Siddhartha / Hernandez, Oscar / Poole, Stephen / Chapman, Barbara et al. | 2014
- 32
-
Spanning Tree or Gossip for Aggregation: A Comparative StudyNyers, Lehel / Jelasity, Márk et al. | 2014
- 33
-
Shades: Expediting Kademlia’s Lookup ProcessEinziger, Gil / Friedman, Roy / Kantor, Yoav et al. | 2014
- 34
-
Analysis and Comparison of Truly Distributed Solvers for Linear Least Squares Problems on Wireless Sensor NetworksPrikopa, Karl E. / Straková, Hana / Gansterer, Wilfried N. et al. | 2014
- 35
-
High-Performance Computer Algebra: A Hecke Algebra Case StudyMaier, Patrick / Livesey, Daria / Loidl, Hans-Wolfgang / Trinder, Phil et al. | 2014
- 36
-
Generic Deterministic Random Number Generation in Dynamic-Multithreaded PlatformsMor, Stefano / Roch, Jean-Louis / Maillard, Nicolas et al. | 2014
- 37
-
Implementation and Performance Analysis of SkelGIS for Network Mesh-Based SimulationsCoullon, Hélène / Limet, Sébastien et al. | 2014
- 38
-
GoFFish: A Sub-graph Centric Framework for Large-Scale Graph AnalyticsSimmhan, Yogesh / Kumbhare, Alok / Wickramaarachchi, Charith / Nagarkar, Soonil / Ravi, Santosh / Raghavendra, Cauligi / Prasanna, Viktor et al. | 2014
- 39
-
Resolving Semantic Conflicts in Word Based Software Transactional MemorySharp, Craig / Blewitt, William / Morgan, Graham et al. | 2014
- 40
-
Automatic Tuning of the Parallelism Degree in Hardware Transactional MemoryRughetti, Diego / Romano, Paolo / Quaglia, Francesco / Ciciani, Bruno et al. | 2014
- 41
-
A Distributed CPU-GPU Sparse Direct SolverSao, Piyush / Vuduc, Richard / Li, Xiaoye Sherry et al. | 2014
- 42
-
Parallel Computation of Echelon FormsDumas, Jean-Guillaume / Gautier, Thierry / Pernet, Clément / Sultan, Ziad et al. | 2014
- 43
-
Time-Domain BEM for the Wave Equation: Optimization and Hybrid ParallelizationBramas, Berenger / Coulaud, Olivier / Sylvand, Guillaume et al. | 2014
- 44
-
Structured Orthogonal Inversion of Block p-Cyclic Matrices on Multicores with GPU AcceleratorsGogolenko, Sergiy / Bai, Zhaojun / Scalettar, Richard et al. | 2014
- 45
-
High-Throughput Maps on Message-Passing Manycore Architectures: Partitioning versus ReplicationShahmirzadi, Omid / Ropars, Thomas / Schiper, André et al. | 2014
- 46
-
A Fast Sparse Block Circulant Matrix Vector ProductRomero, Eloy / Tomás, Andrés / Soriano, Antonio / Blanquer, Ignacio et al. | 2014
- 47
-
Scheduling Data Flow Program in XKaapi: A New Affinity Based Algorithm for Heterogeneous ArchitecturesBleuse, Raphaël / Gautier, Thierry / Lima, João V. F. / Mounié, Grégory / Trystram, Denis et al. | 2014
- 48
-
Delegation Locking Libraries for Improved Performance of Multithreaded ProgramsKlaftenegger, David / Sagonas, Konstantinos / Winblad, Kjell et al. | 2014
- 49
-
A Generic Strategy for Multi-stage StencilsBianco, Mauro / Cumming, Benjamin et al. | 2014
- 50
-
Evaluation of OpenMP Task Scheduling Algorithms for Large NUMA ArchitecturesClet-Ortega, Jérôme / Carribault, Patrick / Pérache, Marc et al. | 2014
- 51
-
Power-Aware Replica Placement in Tree Networks with Multiple Servers per ClientAupy, Guillaume / Benoit, Anne / Journault, Matthieu / Robert, Yves et al. | 2014
- 52
-
On Constructing DAG-Schedules with Large AREAsRoche, Scott T. / Rosenberg, Arnold L. / Rajaraman, Rajmohan et al. | 2014
- 53
-
Software Defined Multicasting for MPI Collective Operation Offloading with the NetFPGAArap, Omer / Brown, Geoffrey / Himebaugh, Bryce / Swany, Martin et al. | 2014
- 54
-
MapReduce over Lustre: Can RDMA-Based Approach Benefit?Rahman, Md. Wasi-ur / Lu, Xiaoyi / Islam, Nusrat Sharmin / Rajachandrasekar, Raghunath / Panda, Dhabaleswar K. (DK) et al. | 2014
- 55
-
Random Fields Generation on the GPU with the Spectral Turning Bands MethodHunger, Lars / Cosenza, Biagio / Kimeswenger, Stefan / Fahringer, Thomas et al. | 2014
- 56
-
Fast Set Intersection through Run-Time Bitmap Construction over PForDelta-Compressed IndexesZou, Xiaocheng / Lakshminarasimhan, Sriram / Boyuka, David A. / Ranshous, Stephen / Tang, Houjun / Klasky, Scott / Samatova, Nagiza F. et al. | 2014
- 57
-
Hybrid CPU/GPU Acceleration of Detection of 2-SNP Epistatic Interactions in GWASGonzález-Domínguez, Jorge / Schmidt, Bertil / Kässens, Jan Christian / Wienbrandt, Lars et al. | 2014
- 58
-
IFM: A Scalable High Resolution Flood Modeling FrameworkSinghal, Swati / Aneja, Sandhya / Liu, Frank / Real, Lucas Villa / George, Thomas et al. | 2014
- 59
-
High Performance Pseudo-analytical Simulation of Multi-Object Adaptive Optics over Multi-GPU SystemsAbdelfattah, Ahmad / Gendron, Eric / Gratadour, Damien / Keyes, David / Ltaief, Hatem / Sevin, Arnaud / Vidal, Fabrice et al. | 2014
- 60
-
Parallel Dual Tree Traversal on Multi-core and Many-core Architectures for Astrophysical N-body SimulationsLange, Benoit / Fortin, Pierre et al. | 2014
- 61
-
Customizing Driving Directions with GPUsDelling, Daniel / Kobitzsch, Moritz / Werneck, Renato F. et al. | 2014
- 62
-
GPU Accelerated Range Trees with ApplicationsMaramreddy, Manoj Kumar / Kothapalli, Kishore et al. | 2014
- 63
-
Scalable On-Board Multi-GPU Simulation of Long-Range Molecular DynamicsNovalbos, Marcos / González, Jaime / Otaduy, Miguel A. / Martinez-Benito, Roberto / Sanchez, Alberto et al. | 2014
- 64
-
Resolution of Linear Algebra for the Discrete Logarithm Problem Using GPU and Multi-core ArchitecturesJeljeli, Hamza et al. | 2014
- 65
-
Toward OpenCL Automatic Multi-Device SupportHenry, Sylvain / Denis, Alexandre / Barthou, Denis / Counilh, Marie-Christine / Namyst, Raymond et al. | 2014
- 66
-
Concurrent Kernel Execution on Xeon Phi within Parallel Heterogeneous WorkloadsWende, Florian / Steinke, Thomas / Cordes, Frank et al. | 2014
- 67
-
Writing Self-adaptive Codes for Heterogeneous SystemsFabeiro, Jorge F. / Andrade, Diego / Fraguela, Basilio B. / Doallo, Ramón et al. | 2014
- 68
-
A Pattern-Based Comparison of OpenACC and OpenMP for Accelerator ComputingWienke, Sandra / Terboven, Christian / Beyer, James C. / Müller, Matthias S. et al. | 2014