Prosper: Program Stack Persistence in Hybrid Memory Systems (Englisch)
- Neue Suche nach: KP, Arun
- Neue Suche nach: Mishra, Debadatta
- Neue Suche nach: Panda, Biswabandan
- Neue Suche nach: KP, Arun
- Neue Suche nach: Mishra, Debadatta
- Neue Suche nach: Panda, Biswabandan
In:
2024 IEEE International Symposium on High-Performance Computer Architecture (HPCA)
;
1168-1183
;
2024
-
ISBN:
-
ISSN:
- Aufsatz (Konferenz) / Elektronische Ressource
-
Titel:Prosper: Program Stack Persistence in Hybrid Memory Systems
-
Beteiligte:
-
Erschienen in:
-
Verlag:
- Neue Suche nach: IEEE
-
Erscheinungsdatum:02.03.2024
-
Format / Umfang:403437 byte
-
ISBN:
-
ISSN:
-
DOI:
-
Medientyp:Aufsatz (Konferenz)
-
Format:Elektronische Ressource
-
Sprache:Englisch
-
Datenquelle:
Inhaltsverzeichnis Konferenzband
Die Inhaltsverzeichnisse werden automatisch erzeugt und basieren auf den im Index des TIB-Portals verfügbaren Einzelnachweisen der enthaltenen Beiträge. Die Anzeige der Inhaltsverzeichnisse kann daher unvollständig oder lückenhaft sein.
- 1
-
Salus: Efficient Security Support for CXL-Expanded GPU MemoryAbdullah, Rahaf / Lee, Hyokeun / Zhou, Huiyang / Awad, Amro et al. | 2024
- 1
-
Enhancing Collective Communication in MCM Accelerators for Deep Learning TrainingLaskar, Sabuj / Majhi, Pranati / Kim, Sungkeun / Mahmud, Farabi / Muzahid, Abdullah / Kim, Eun Jung et al. | 2024
- 1
-
Revet: A Language and Compiler for Dataflow ThreadsRucker, Alexander C. / Sundram, Shiv / Smith, Coleman / Vilim, Matthew / Prabhakar, Raghu / Kjolstad, Fredrik / Olukotun, Kunle et al. | 2024
- 1
-
2024 IEEE International Symposium on High-Performance Computer Architecture HPCA 2024| 2024
- 1
-
MINOS: Distributed Consistency and Persistency Protocol Implementation & Offloading to SmartNICsPsistakis, Antonis / Chaix, Fabien / Torrellas, Josep et al. | 2024
- 1
-
Exploitation of Security Vulnerability on RetirementXu, Ke / Tang, Ming / Wang, Quancheng / Wang, Han et al. | 2024
- 1
-
WASP: Exploiting GPU Pipeline Parallelism with Hardware-Accelerated Automatic Warp SpecializationCrago, Neal C. / Damani, Sana / Sankaralingam, Karthikeyan / Keckler, Stephen W. et al. | 2024
- 3
-
Title Page iii| 2024
- 4
-
Copyright Page| 2024
- 15
-
GADGETSPINNER: A New Transient Execution Primitive Using the Loop Stream DetectorChen, Yun / Hajiabadi, Ali / Carlson, Trevor E. et al. | 2024
- 31
-
Uncovering and Exploiting AMD Speculative Memory Access Predictors for Fun and ProfitLiu, Chang / Wang, Dongsheng / Lyu, Yongqiang / Qiu, Pengfei / Jin, Yu / Lu, Zhuoyuan / Zhang, Yinqian / Qu, Gang et al. | 2024
- 46
-
E2EMap: End-to-End Reinforcement Learning for CGRA Compilation via Reverse MappingLiu, Dajiang / Xia, Yuxin / Shang, Jiaxing / Zhong, Jiang / Ouyang, Peng / Yin, Shouyi et al. | 2024
- 75
-
An Optimizing Framework on MLIR for Efficient FPGA-based Accelerator GenerationZhang, Weichuang / Zhao, Jieru / Shen, Guan / Chen, Quan / Chen, Chen / Guo, Minyi et al. | 2024
- 91
-
TALCO: Tiling Genome Sequence Alignment Using Convergence of Traceback PointersWalia, Sumit / Ye, Cheng / Bera, Arkid / Lodhavia, Dhruvi / Turakhia, Yatish et al. | 2024
- 91
-
Celeritas: Out-of-Core Based Unsupervised Graph Neural Network via Cross-Layer Computing 2024Li, Yi / Yang, Tsun-Yu / Yang, Ming-Chang / Shen, Zhaoyan / Li, Bingzhe et al. | 2024
- 108
-
PruneGNN: Algorithm-Architecture Pruning Framework for Graph Neural Network AccelerationGurevin, Deniz / Shan, Mohsin / Huang, Shaoyi / Hasan, MD Amit / Ding, Caiwen / Khan, Omer et al. | 2024
- 124
-
MEGA: A Memory-Efficient GNN Accelerator Exploiting Degree-Aware Mixed-Precision QuantizationZhu, Zeyu / Li, Fanrong / Li, Gang / Liu, Zejian / Mo, Zitao / Hu, Qinghao / Liang, Xiaoyao / Cheng, Jian et al. | 2024
- 139
-
Bandwidth-Effective DRAM Cache for GPU s with Storage-Class MemoryHong, Jeongmin / Cho, Sungjun / Park, Geonwoo / Yang, Wonhyuk / Gong, Young-Ho / Kim, Gwangsun et al. | 2024
- 156
-
Gemini: Mapping and Architecture Co-exploration for Large-scale DNN Chiplet AcceleratorsCai, Jingwei / Wu, Zuotong / Peng, Sen / Wei, Yuchen / Tan, Zhanhong / Shi, Guiming / Gao, Mingyu / Ma, Kaisheng et al. | 2024
- 172
-
Stellar: Energy-Efficient and Low-Latency SNN Algorithm and Hardware Co-Design with Spatiotemporal ComputationMao, Ruixin / Tang, Lin / Yuan, Xingyu / Liu, Ye / Zhou, Jun et al. | 2024
- 186
-
MIMDRAM: An End-to-End Processing-Using-DRAM System for High-Throughput, Energy-Efficient and Programmer-Transparent Multiple-Instruction Multiple-Data ComputingOliveira, Geraldo F. / Olgun, Ataberk / Yaglikci, Abdullah Giray / Bostanci, F. Nisa / Gomez-Luna, Juan / Ghose, Saugata / Mutlu, Onur et al. | 2024
- 204
-
Supporting Secure Multi-GPU Computing with Dynamic and Batched Metadata ManagementNa, Seonjin / Kim, Jungwoo / Lee, Sunho / Huh, Jaehyuk et al. | 2024
- 218
-
Data Enclave: A Data-Centric Trusted Execution EnvironmentXu, Yuanchao / Pangia, James / Ye, Chencheng / Solihin, Yan / Shen, Xipeng et al. | 2024
- 249
-
Morphling: A Throughput-Maximized TFHE-based Accelerator using Transform-domain ReusePrasetiyo / Putra, Adiwena / Kim, Joo-Young et al. | 2024
- 263
-
Pathfinding Future PIM Architectures by Demystifying a Commercial PIM TechnologyHyun, Bongjoon / Kim, Taehun / Lee, Dongjae / Rhu, Minsoo et al. | 2024
- 280
-
Functionally-Complete Boolean Logic in Real DRAM Chips: Experimental Characterization and AnalysisYuksel, Ismail Emir / Tugrul, Yahya Can / Olgun, Ataberk / Bostanci, F. Nisa / Yaglikci, A. Giray / Oliveira, Geraldo F. / Luo, Haocong / Gomez-Luna, Juan / Sadrosadati, Mohammad / Mutlu, Onur et al. | 2024
- 297
-
StreamPIM: Streaming Matrix Computation in Racetrack MemoryAn, Yuda / Tang, Yunxiao / Yi, Shushu / Peng, Li / Pan, Xiurui / Sun, Guangyu / Luo, Zhaochu / Li, Qiao / Zhang, Jie et al. | 2024
- 312
-
SmartDIMM: In-Memory Acceleration of Upper Layer ProtocolsPatel, Neel / Mamandipoor, Amin / Nouri, Mohammad / Alian, Mohammad et al. | 2024
- 330
-
BeaconGNN: Large-Scale GNN Acceleration with Out-of-Order Streaming In-Storage ComputingWang, Yuyue / Pan, Xiurui / An, Yuda / Zhang, Jie / Reinman, Glenn et al. | 2024
- 345
-
Smart-Infinity: Fast Large Language Model Training using Near-Storage Processing on a Real SystemJang, Hongsun / Song, Jaeyong / Jung, Jaewon / Park, Jaeyoung / Kim, Youngsok / Lee, Jinho et al. | 2024
- 361
-
FlashGNN: An In-SSD Accelerator for GNN TrainingNiu, Fuping / Yue, Jianhui / Shen, Jiangqiu / Liao, Xiaofei / Jin, Hai et al. | 2024
- 379
-
DockerSSD: Containerized In-Storage Processing and Hardware Acceleration for Computational SSDsGouk, Donghyun / Kwon, Miryeong / Bae, Hanyeoreum / Jung, Myoungsoo et al. | 2024
- 395
-
PREFETCHX: Cross-Core Cache-Agnostic Prefetcher-based Side-Channel AttacksChen, Yun / Hajiabadi, Ali / Pei, Lingfeng / Carlson, Trevor E. et al. | 2024
- 409
-
Modeling, Derivation, and Automated Analysis of Branch Predictor Security VulnerabilitiesWang, Quancheng / Tang, Ming / Xu, Ke / Wang, Han et al. | 2024
- 424
-
SegScope: Probing Fine-grained Interrupts via Architectural FootprintsZhang, Xin / Zhang, Zhi / Shen, Qingni / Wang, Wenhao / Gao, Yansong / Yang, Zhuoxi / Zhang, Jiliang et al. | 2024
- 439
-
Differential-Matching Prefetcher for Indirect Memory AccessFu, Gelin / Xia, Tian / Luo, Zhongpei / Chen, Ruiyang / Zhao, Wenzhe / Ren, Pengju et al. | 2024
- 454
-
SPADE: Sparse Pillar-based 3D Object Detection Accelerator for Autonomous DrivingLee, Minjae / Park, Seongmin / Kim, Hyungmin / Yoon, Minyong / Lee, Janghwan / Choi, Jun Won / Kim, Nam Sung / Kang, Mingu / Choi, Jungwook et al. | 2024
- 468
-
Rapper: A Parameter-Aware Repair-in-Memory Accelerator for Blockchain Storage PlatformMa, Chenlin / Wang, Yingping / Chen, Fuwen / Liao, Jing / Wang, Yi / Mao, Rui et al. | 2024
- 483
-
MOPED: Efficient Motion Planning Engine with Flexible Dimension SupportHuang, Lingyi / Gong, Yu / Sui, Yang / Zang, Xiao / Yuan, Bo et al. | 2024
- 515
-
Effective Context-Sensitive Memory Dependence PredictionKim, Sebastian S. / Ros, Alberto et al. | 2024
- 528
-
A Two Level Neural Approach Combining Off-Chip Prediction with Adaptive Prefetch FilteringJamet, Alexandre Valentin / Vavouliotis, Georgios / Jimenez, Daniel A. / Alvarez, Lluc / Casas, Marc et al. | 2024
- 543
-
Gem5-MARVEL: Microarchitecture-Level Resilience Analysis of Heterogeneous SoC ArchitecturesChatzopoulos, Odysseas / Papadimitriou, George / Karakostas, Vasileios / Gizopoulos, Dimitris et al. | 2024
- 560
-
Spatial Variation-Aware Read Disturbance Defenses: Experimental Analysis of Real DRAM Chips and Implications on Future SolutionsYaglikci, Abdullah Giray / Tugrul, Yahya Can / Oliveira, Geraldo F. / Yuksel, Ismail Emir / Olgun, Ataberk / Luo, Haocong / Mutlu, Onur et al. | 2024
- 578
-
START: Scalable Tracking for any Rowhammer ThresholdSaxena, Anish / Qureshi, Moinuddin et al. | 2024
- 593
-
CoMeT: Count-Min-Sketch-based Row Tracking to Mitigate RowHammer at Low CostBostanci, F. Nisa / Yuksel, Ismail Emir / Olgun, Ataberk / Kanellopoulos, Konstantinos / Tugrul, Yahya Can / Yaglici, A. Giray / Sadrosadati, Mohammad / Mutlu, Onur et al. | 2024
- 613
-
A Quantum Computer Trusted Execution EnvironmentTrochatos, Theodoros / Xu, Chuanqi / Deshpande, Sanjay / Lu, Yao / Ding, Yongshan / Szefer, Jakub et al. | 2024
- 614
-
Unleashing the Potential of PIM: Accelerating Large Batched Inference of Transformer-Based Generative ModelsChoi, Jaewan / Park, Jaehyun / Kyung, Kwanhee / Kim, Nam Sung / Ho Ahn, Jung et al. | 2024
- 615
-
Computational CXL-Memory Solution for Accelerating Memory-Intensive ApplicationsSim, Joonseop / Ahn, Soohong / Ahn, Taeyoung / Lee, Seungyong / Rhee, Myunghyun / Kim, Jooyoung / Shin, Kwangsik / Moon, Donguk / Kim, Euiseok / Park, Kyoung et al. | 2024
- 616
-
LearnedFTL: A Learning-Based Page-Level FTL for Reducing Double Reads in Flash-Based SSDsWang, Shengzhe / Lin, Zihang / Wu, Suzhen / Jiang, Hong / Zhang, Jie / Mao, Bo et al. | 2024
- 630
-
Are Superpages Super-fast? Distilling Flash Blocks to Unify Flash Pages of a Superpage in an SSDTseng, Shih-Hung / Chen, Tseng-Yi / Yang, Ming-Chang et al. | 2024
- 643
-
RiF: Improving Read Performance of Modern SSDs Using an On-Die Early-Retry EngineChun, Myoungjun / Lee, Jaeyong / Kim, Myungsuk / Park, Jisung / Kim, Jihong et al. | 2024
- 657
-
Midas Touch: Invalid-Data Assisted Reliability and Performance Boost for 3d High-Density FlashLi, Qiao / Dang, Hongyang / Wan, Zheng / Gao, Congming / Ye, Min / Zhang, Jie / Kuo, Tei-Wei / Xue, Chun Jason et al. | 2024
- 671
-
ECO-CHIP: Estimation of Carbon Footprint of Chiplet-based Architectures for Sustainable VLSISudarshan, Chetan Choppali / Matkar, Nikhil / Vrudhula, Sarma / Sapatnekar, Sachin S. / Chhabria, Vidya A. et al. | 2024
- 686
-
Lightening-Transformer: A Dynamically-Operated Optically-Interconnected Photonic Transformer AcceleratorZhu, Hanqing / Gu, Jiaqi / Wang, Hanrui / Jiang, Zixuan / Zhang, Zhekai / Tang, Rongxing / Feng, Chenghao / Han, Song / Chen, Ray T. / Pan, David Z. et al. | 2024
- 704
-
MIRAGE: Quantum Circuit Decomposition and Routing Collaborative Design Using Mirror GatesMcKinney, Evan / Hatridge, Michael / Jones, Alex K. et al. | 2024
- 719
-
SACHI: A Stationarity-Aware, All-Digital, Near-Memory, Ising ArchitectureSundara Raman, Siddhartha Raman / John, Lizy K. / Kulkarni, Jaydeep P. et al. | 2024
- 732
-
BitWave: Exploiting Column-Based Bit-Level Sparsity for Deep Learning AccelerationShi, Man / Jain, Vikram / Joseph, Antony / Meijer, Maurice / Verhelst, Marian et al. | 2024
- 747
-
LUTein: Dense-Sparse Bit-Slice Architecture With Radix-4 LUT-Based Slice-Tensor Processing UnitsIm, Dongseok / Yoo, Hoi-Jun et al. | 2024
- 760
-
FIGNA: Integer Unit-Based Accelerator Design for FP-INT GEMM Preserving Numerical AccuracyJang, Jaeyong / Kim, Yulhwa / Lee, Juheun / Kim, Jae-Joon et al. | 2024
- 774
-
ASADI: Accelerating Sparse Attention Using Diagonal-based In-Situ ComputingLi, Huize / Li, Zhaoying / Bai, Zhenyu / Mitra, Tulika et al. | 2024
- 788
-
Enabling Large Dynamic Neural Network Training with Learning-based Memory ManagementRen, Jie / Xu, Dong / Yang, Shuangyan / Zhao, Jiacheng / Li, Zhicheng / Navasca, Christian / Wang, Chenxi / Xu, Harry / Li, Dong et al. | 2024
- 803
-
Tessel: Boosting Distributed Execution of Large DNN Models via Flexible Schedule SearchLin, Zhiqi / Miao, Youshan / Xu, Guanbin / Li, Cheng / Saarikivi, Olli / Maleki, Saeed / Yang, Fan et al. | 2024
- 817
-
SpecFL: An Efficient Speculative Federated Learning System for Tree-based Model TrainingZhang, Yuhui / Zhao, Lutan / Che, Cheng / Wang, XiaoFeng / Meng, Dan / Hou, Rui et al. | 2024
- 848
-
TinyTS: Memory-Efficient TinyML Model Compiler Framework on MicrocontrollersLiu, Yu-Yuan / Zheng, Hong-Sheng / Fang Hu, Yu / Hsu, Chen-Fong / Yeh, Tsung Tai et al. | 2024
- 861
-
CAMEL: Co-Designing AI Models and eDRAMs for Efficient On-Device LearningZhang, Sai Qian / Tambe, Thierry / Cuevas, Nestor / Wei, Gu-Yeon / Brooks, David et al. | 2024
- 876
-
FlipBit: Approximate Flash Memory for IoT DevicesBuck, Alexander / Ganesan, Karthik / Jerger, Natalie Enright et al. | 2024
- 891
-
Usas: A Sustainable Continuous-Learning´ Framework for Edge ServersMishra, Cyan Subhra / Sampson, Jack / Kandemir, Mahmut Taylan / Narayanan, Vijaykrishnan / Das, Chita R et al. | 2024
- 908
-
Cepheus: Accelerating Datacenter Applications with High-Performance RoCE-Capable MulticastLi, Wenxue / Zhang, Junyi / Liu, Yufei / Zeng, Gaoxiong / Wang, Zilong / Zeng, Chaoliang / Zhou, Pengpeng / Wang, Qiaoling / Chen, Kai et al. | 2024
- 922
-
LibPreemptible: Enabling Fast, Adaptive, and Hardware-Assisted User-Space SchedulingLi, Yueying / Lazarev, Nikita / Koufaty, David / Yin, Tenny / Anderson, Andy / Zhang, Zhiru / Suh, G. Edward / Kaffes, Kostis / Delimitrou, Christina et al. | 2024
- 954
-
Ursa: Lightweight Resource Management for Cloud-Native MicroservicesZhang, Yanqi / Zhou, Zhuangzhuang / Elnikety, Sameh / Delimitrou, Christina et al. | 2024
- 970
-
An LPDDR-based CXL-PNM Platform for TCO-efficient Inference of Transformer-based Large Language ModelsPark, Sang-Soo / Kim, KyungSoo / So, Jinin / Jung, Jin / Lee, Jonggeon / Woo, Kyoungwan / Kim, Nayeon / Lee, Younghyun / Kim, Hyungyo / Kwon, Yongsuk et al. | 2024
- 983
-
LightPool: A NVMe-oF-based High-performance and Lightweight Storage Pool Architecture for Cloud-Native Distributed DatabaseXu, Jiexiong / Chen, Yiquan / Wang, Yijing / Shi, Wenhui / Fang, Guoju / Chen, Yi / Liao, Huasheng / Wang, Yang / Lin, Hai / Jin, Zhen et al. | 2024
- 996
-
Enterprise-Class Cache Compression DesignBuyuktosunoglu, Alper / Trilla, David / Abali, Bulent / Berger, Deanna / Walters, Craig / Lee, Jang-Soo et al. | 2024
- 1012
-
HotTiles: Accelerating SpMM with Heterogeneous Accelerator ArchitecturesGerogiannis, Gerasimos / Aananthakrishnan, Sriram / Torrellas, Josep / Hur, Ibrahim et al. | 2024
- 1029
-
SPARK: Scalable and Precision-Aware Acceleration of Neural Networks via Efficient EncodingLiu, Fangxin / Yang, Ning / Li, Haomin / Wang, Zongwu / Song, Zhuoran / Pei, Songwen / Jiang, Li et al. | 2024
- 1043
-
Data Motion Acceleration: Chaining Cross-Domain Multi AcceleratorsWang, Shu-Ting / Xu, Hanyang / Mamandipoor, Amin / Mahapatra, Rohan / Ahn, Byung Hoon / Ghodrati, Soroush / Kailas, Krishnan / Alian, Mohammad / Esmaeilzadeh, Hadi et al. | 2024
- 1063
-
RELIEF: Relieving Memory Pressure In SoCs Via Data Movement-Aware Accelerator SchedulingGupta, Sudhanshu / Dwarkadas, Sandhya et al. | 2024
- 1080
-
GRIT: Enhancing Multi-GPU Performance with Fine-Grained Dynamic Page PlacementWang, Yueqi / Li, Bingyao / Jaleel, Aamer / Yang, Jun / Tang, Xulong et al. | 2024
- 1111
-
Guser: A GPGPU Power Stressmark GeneratorShan, Yalong / Yang, Yongkui / Qian, Xuehai / Yu, Zhibin et al. | 2024
- 1125
-
GPU Scale-Model SimulationSeyyedAghaei, Hossein / Naderan-Tahan, Mahmood / Eeckhout, Lieven et al. | 2024
- 1141
-
Agile-DRAM: Agile Trade-Offs in Memory Capacity, Latency, and Energy for Data CentersLee, Jaeyoon / Jung, Wonyeong / Kim, Dongwhee / Kim, Daero / Lee, Junseung / Kim, Jungrae et al. | 2024
- 1154
-
CHROME: Concurrency-Aware Holistic Cache Management Framework with Online Reinforcement LearningLu, Xiaoyang / Najafi, Hamed / Liu, Jason / Sun, Xian-He et al. | 2024
- 1168
-
Prosper: Program Stack Persistence in Hybrid Memory SystemsKP, Arun / Mishra, Debadatta / Panda, Biswabandan et al. | 2024
- 1184
-
Mitigating Write Disturbance in Non-Volatile Memory via Coupling Machine Learning with Out-of-Place UpdatesWu, Ronglong / Shen, Zhirong / Yang, Zhiwei / Shu, Jiwu et al. | 2024
- 1199
-
Author Index| 2024
- v
-
Table of Contents| 2024
- xix
-
Message from Program Chair HPCA 2024| 2024
- xviii
-
Message from General Chair HPCA 2024| 2024
- xxii
-
Message from Industry Track Chair HPCA 2024| 2024
- xxiii
-
Organizing Committee| 2024
- xxiv
-
Program Committee| 2024
- xxix
-
Sponsors| 2024
- xxviii
-
Industry Track Program Committee| 2024