POSTER - hVISC: A portable abstraction for heterogeneous parallel systems (English)
- New search for: Srivastava, Prakalp
- New search for: Kotsifakou, Maria
- New search for: Sinclair, Matthew D.
- New search for: Komuravelli, Rakesh
- New search for: Adve, Vikram
- New search for: Adve, Sarita
- New search for: Srivastava, Prakalp
- New search for: Kotsifakou, Maria
- New search for: Sinclair, Matthew D.
- New search for: Komuravelli, Rakesh
- New search for: Adve, Vikram
- New search for: Adve, Sarita
In:
2016 International Conference on Parallel Architecture and Compilation Techniques (PACT)
;
443-445
;
2016
-
ISBN:
- Conference paper / Electronic Resource
-
Title:POSTER - hVISC: A portable abstraction for heterogeneous parallel systems
-
Contributors:Srivastava, Prakalp ( author ) / Kotsifakou, Maria ( author ) / Sinclair, Matthew D. ( author ) / Komuravelli, Rakesh ( author ) / Adve, Vikram ( author ) / Adve, Sarita ( author )
-
Published in:
-
Publisher:
- New search for: IEEE
-
Publication date:2016-09-01
-
Size:1128845 byte
-
ISBN:
-
DOI:
-
Type of media:Conference paper
-
Type of material:Electronic Resource
-
Language:English
-
Source:
Table of contents conference proceedings
The tables of contents are generated automatically and are based on the data records of the individual contributions available in the index of the TIB portal. The display of the Tables of Contents may therefore be incomplete.
- 1
-
Big data analytics on flash storage with acceleratorsArvind, et al. | 2016
- 3
-
Combating the reliability challenge of GPU register file at low supply voltageTan, Jingweijia / Song, Shuaiwen Leon / Yan, Kaige / Fu, Xin / Marquez, Andres / Kerbyson, Darren et al. | 2016
- 17
-
μC-States: Fine-grained GPU datapath power managementKayiran, Onur / Jog, Adwait / Pattnaik, Ashutosh / Ausavarungnirun, Rachata / Tang, Xulong / Kandemir, Mahmut T. / Loh, Gabriel H. / Mutlu, Onur / Das, Chita R. et al. | 2016
- 31
-
Scheduling techniques for GPU architectures with processing-in-memory capabilitiesPattnaik, Ashutosh / Tang, Xulong / Jog, Adwait / Kayiran, Onur / Mishra, Asit K. / Kandemir, Mahmut T. / Mutlu, Onur / Das, Chita R. et al. | 2016
- 45
-
OAWS: Memory Occlusion Aware Warp SchedulingWang, Bin / Zhu, Yue / Yu, Weikuan et al. | 2016
- 57
-
Integrating algorithmic parameters into benchmarking and design space exploration in 3D scene understandingBodin, Bruno / Nardi, Luigi / Zia, M. Zeeshan / Wagstaff, Harry / Shenoy, Govind Sreekar / Emani, Murali / Mawer, John / Kotselidis, Christos / Nisbet, Andy / Lujan, Mikel et al. | 2016
- 71
-
Fusion of parallel array operationsKristensen, Mads R. B. / Lund, Simon A. F. / Blum, Troels / Avery, James et al. | 2016
- 87
-
Reduction drawing: Language constructs and polyhedral compilation for reductions on GPUsReddy, Chandan / Kruse, Michael / Cohen, Albert et al. | 2016
- 99
-
Resource conscious reuse-driven tiling for GPUsRawat, Prashant Singh / Hong, Changwan / Ravishankar, Mahesh / Grover, Vinod / Pouchet, Louis-Noel / Rountev, Atanas / Sadayappan, P. et al. | 2016
- 113
-
Accelerating linked-list traversal through near-data processingHong, Byungchul / Kim, Gwangsun / Ahn, Jung Ho / Kwon, Yongkee / Kim, Hongsik / Kim, John et al. | 2016
- 125
-
Scalable task parallelism for NUMA: A uniform abstraction for coordinated scheduling and memory managementDrebes, Andi / Pop, Antoniu / Heydemann, Karine / Cohen, Albert / Drach, Nathalie et al. | 2016
- 139
-
A static cut-off for task parallel programsIwasaki, Shintaro / Taura, Kenjiro et al. | 2016
- 151
-
Greater performance and better efficiency: Predicated execution has shown us the wayPatt, Yale N. et al. | 2016
- 153
-
WearCore: A core for wearable workloads?Mehta, Sanyam / Torrellas, Josep et al. | 2016
- 165
-
Energy aware persistence: Reducing energy overheads of memory-based persistence in NVMsKannan, Sudarsun / Qureshi, Moinuddin / Gavrilovska, Ada / Schwan, Karsten et al. | 2016
- 179
-
Power tuning HPC jobs on power-constrained systemsGholkar, Neha / Mueller, Frank / Rountree, Barry et al. | 2016
- 191
-
Online scalability characterization of data-parallel programs on many coresCho, Younghyun / Oh, Surim / Egger, Bernhard et al. | 2016
- 207
-
Speculatively exploiting cross-invocation parallelismHuang, Jialu / Prabhu, Prakash / Jablin, Thomas B. / Ghosh, Soumyadeep / Apostolakis, Sotiris / Lee, Jae W. / August, David I. et al. | 2016
- 221
-
MicroSpec: Speculation-centric fine-grained parallelization for FSM computationsQiu, Junqiao / Zhao, Zhijia / Ren, Bin et al. | 2016
- 235
-
Hash Map InliningGope, Dibakar / Lipasti, Mikko H. et al. | 2016
- 247
-
Sparso: Context-driven optimizations of sparse linear algebraRong, Hongbo / Park, Jongsoo / Xiang, Lingxiang / Anderson, Todd A. / Smelyanskiy, Mikhail et al. | 2016
- 261
-
Tardis 2.0: Optimized time traveling coherence for relaxed consistency modelsYu, Xiangyao / Liu, Hongzhe / Zou, Ethan / Devadas, Srinivas et al. | 2016
- 275
-
Reducing cache coherence traffic with hierarchical directory cache and NUMA-aware runtime schedulingCaheny, Paul / Casas, Marc / Moreto, Miquel / Gloaguen, Herve / Saintes, Maxime / Ayguade, Eduard / Labarta, Jesus / Valero, Mateo et al. | 2016
- 287
-
Characterizing and optimizing the performance of multithreaded programs under interferenceZhao, Yong / Rao, Jia / Yi, Qing et al. | 2016
- 299
-
Optimizing indirect memory references with milkKiriansky, Vladimir / Zhang, Yunming / Amarasinghe, Saman et al. | 2016
- 313
-
Scaling data analytics with moore's lawOlukotun, Kunle et al. | 2016
- 315
-
Bridging the semantic gaps of GPU acceleration for scale-out CNN-based big data processing: Think big, see smallSong, Mingcong / Hu, Yang / Xu, Yunlong / Li, Chao / Chen, Huixiang / Yuan, Jingling / Li, Tao et al. | 2016
- 327
-
A DSL compiler for accelerating image processing pipelines on FPGAsChugh, Nitin / Vasista, Vinay / Purini, Suresh / Bondhugula, Uday et al. | 2016
- 339
-
Automatically exploiting implicit Pipeline Parallelism from multiple dependent kernels for GPUsKim, Gwangsun / Jeong, Jiyun / Kim, John / Stephenson, Mark et al. | 2016
- 351
-
CAF: Core to core Communication Acceleration FrameworkWang, Yipeng / Wang, Ren / Herdrich, Andrew / Tsai, James / Solihin, Yan et al. | 2016
- 363
-
Vectorization of multibyte floating point data formatsAnderson, Andrew / Gregg, David et al. | 2016
- 373
-
Rinnegan: Efficient resource use in heterogeneous architecturesPanneerselvam, Sankaralingam / Swift, Michael et al. | 2016
- 387
-
Auto-tuning Spark big data workloads on POWER8: Prediction-based dynamic SMT threadingJia, Zhen / Xue, Chao / Chen, Guancheng / Zhan, Jianfeng / Zhang, Lixin / Lin, Yonghua / Hofstee, Peter et al. | 2016
- 401
-
EXCITE-VM: Extending the virtual memory system to support snapshot isolation transactionsLitz, Heiner / Braun, Benjamin / Cheriton, David et al. | 2016
- 413
-
POSTER: Fly-Over: A light-weight distributed power-gating mechanism for energy-efficient networks-on-chipBoyapati, Rahul / Huang, Jiayi / Wang, Ningyuan / Kim, Kyung Hoon / Yum, Ki Hwan / Kim, Eun Jung et al. | 2016
- 415
-
POSTER: Exploiting asymmetric multi-core processors with flexible system softwareChronaki, Kallia / Moreto, Miquel / Casas, Marc / Rico, Alejandro / Badia, Rosa M. / Ayguade, Eduard / Labarta, Jesus / Valero, Mateo et al. | 2016
- 419
-
Poster: Easy PRAM-based high-performance parallel programming with ICEGhanim, Fady / Barua, Rajeev / Vishkin, Uzi et al. | 2016
- 421
-
POSTER: Fault-tolerant execution on COTS multi-core processors with hardware transactional memory supportHaas, Florian / Weis, Sebastian / Ungerer, Theo / Pokam, Gilles / Wu, Youfeng et al. | 2016
- 423
-
POSTER - collective dynamic parallelism for directive based GPU programming languages and compilersOzen, Guray / Ayguade, Eduard / Labarta, Jesus et al. | 2016
- 425
-
POSTER - Firestorm: Operating systems for power-constrained architecturesPanneerselvam, Sankaralingam / Swift, Michael et al. | 2016
- 429
-
POSTER: ξ-TAO: A cache-centric execution model and runtime for deep parallel multicore topologiesPericas, Miquel et al. | 2016
- 433
-
POSTER: Efficient self-invalidation/self-downgrade for critical sections with relaxed semanticsRos, Alberto / Leonardsson, Carl / Sakalis, Christos / Kaxiras, Stefanos et al. | 2016
- 435
-
POSTER: SILC-FM: Subblocked interleaved Cache-Like Flat Memory OrganizationRyoo, Jee Ho / Meswani, Mitesh R. / Panda, Reena / John, Lizy K. et al. | 2016
- 439
-
Hybrid data dependence analysis for loop transformationsSampaio, Diogo / Ketterlin, Alain / Pouchet, Louis-Noel / Rastello, Fabrice et al. | 2016
- 441
-
POSTER: An optimization of dataflow architectures for scientific applicationsShen, Xiaowei / Ye, Xiaochun / Tan, Xu / Wang, Da / Zhang, Zhimin / Fan, Dongrui / Tang, Zhimin et al. | 2016
- 443
-
POSTER - hVISC: A portable abstraction for heterogeneous parallel systemsSrivastava, Prakalp / Kotsifakou, Maria / Sinclair, Matthew D. / Komuravelli, Rakesh / Adve, Vikram / Adve, Sarita et al. | 2016
- 447
-
POSTER: An integrated vector-scalar design on an in-order ARM coreStanic, Milan / Palomar, Oscar / Hayes, Timothy / Ratkovic, Ivan / Unsal, Osman / Cristal, Adrian / Valero, Mateo et al. | 2016
- 449
-
POSTER: Pagoda: A runtime system to maximize GPU utilization in data parallel tasks with limited parallelismYeh, Tsung Tai / Sabne, Amit / Sakdhnagool, Putt / Eigenmann, Rudolf / Rogers, Timothy G. et al. | 2016
- 451
-
Student research poster: Slack-aware shared bandwidth management in GPUsDublish, Saumay et al. | 2016
- 453
-
Student research poster - from processing-in-Memory to Processing-in-StorageKaplan, Roman et al. | 2016
- 454
-
Student research poster: Network controller emulation on a sidecore for unmodified virtual machinesKiyanovski, Arthur et al. | 2016
- 455
-
Student research poster: A low complexity cache sharing mechanism to address system fairnessSelfa, Vicent / Sahuquillo, Julio / Petit, Salvador / Gomez, Maria E. et al. | 2016
- 456
-
Student research poster: A scalable general purpose system for large-scale graph processingSun, Jiawen / Vandierendonck, Hans / Nikolopoulos, Dimitrios S. et al. | 2016
- 457
-
Student research poster: Compiling Boolean circuits to non-deterministic branching programs to be implemented by light switching circuitsVladislav, Tartakovsky et al. | 2016
- 458
-
Student research poster: Software out-of-order execution for in-order architecturesTran, Kim-Anh et al. | 2016
- 459
-
Author index| 2016
- i
-
[Front matter]| 2016