


ASPLOS 2025: Rotterdam, The Netherlands
- Lieven Eeckhout, Georgios Smaragdakis, Kaitai Liang, Adrian Sampson, Martha A. Kim, Christopher J. Rossbach:
Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 1, ASPLOS 2025, Rotterdam, The Netherlands, 30 March 2025 - 3 April 2025. ACM 2025, ISBN 979-8-4007-0698-1
- Zhuoran Ji, Jianyu Zhao, Peimin Gao, Xiangkai Yin, Lei Ju:
Accelerating Number Theoretic Transform with Multi-GPU Systems for Efficient Zero Knowledge Proof. 1-14
- Derrick Quinn, Mohammad Nouri, Neel Patel, John Salihu, Alireza Salemi, Sukhan Lee, Hamed Zamani, Mohammad Alian:
Accelerating Retrieval-Augmented Generation. 15-32
- Wonkyo Choe, Rongxiang Wang, Felix Xiaozhu Lin:
AnA: An Attentive Autonomous Driving System. 33-46
- Chanyoung Park, Jungho Lee, Chun-Yi Liu, Kyungtae Kang, Mahmut Taylan Kandemir, Wonil Choi:
AnyKey: A Key-Value SSD for All Workload Types. 47-63
- Sankeerth Durvasula, Adrian Zhao, Fan Chen, Ruofan Liang, Pawan Kumar Sanjaya, Yushi Guan, Christina Giannoula, Nandita Vijaykumar:
ARC: Warp-level Adaptive Atomic Reduction in GPUs to Accelerate Differentiable Rendering. 64-83
- Rohan Yadav, Michael Bauer, David Broman, Michael Garland, Alex Aiken, Fredrik Kjolstad:
Automatic Tracing in Task-Based Runtime Systems. 84-99
- Tao Lu, Yuxun Chen, Zonghui Wang, Xiaohang Wang, Wenzhi Chen, Jiaheng Zhang:
BatchZK: A Fully Pipelined GPU-Accelerated System for Batch Generation of Zero-Knowledge Proofs. 100-115
- Shaobo Li, Yirui Eric Zhou, Hao Ren, Jian Huang:
ByteFS: System Support for (CXL-based) Memory-Semantic Solid-State Drives. 116-132
- Siddharth Jayashankar, Edward Chen, Tom Tang, Wenting Zheng, Dimitrios Skarlatos:
Cinnamon: A Framework for Scale-Out Encrypted AI. 133-150
- Rishi Ranjan, Ian Paterson, Matthew Hicks:
ClosureX: Compiler Support for Correct Persistent Fuzzing. 151-163
- Benjamin Reidys, Pantea Zardoshti, Íñigo Goiri, Celine Irvene, Daniel S. Berger, Haoran Ma, Kapil Arya, Eli Cortez, Taylor Stark, Eugene Bak, Mehmet Iyigun, Stanko Novakovic, Lisa Hsu, Karel Trueba, Abhisek Pan, Chetan Bansal, Saravan Rajmohan, Jian Huang, Ricardo Bianchini:
Coach: Exploiting Temporal Patterns for All-Resource Oversubscription in Cloud Platforms. 164-181
- Rohan Yadav, Shiv Sundram, Wonchan Lee, Michael Garland, Michael Bauer, Alex Aiken, Fredrik Kjolstad:
Composing Distributed Computations Through Task and Kernel Fusion. 182-197
- Shenggan Cheng, Shengjie Lin, Lansong Diao, Hao Wu, Siyu Wang, Chang Si, Ziming Liu, Xuanlei Zhao, Jiangsu Du, Wei Lin, Yang You:
Concerto: Automatic Communication Optimization and Scheduling for Large-Scale Deep Learning. 198-213
- Kapil Agrawal, Sangeetha Abdu Jyothi:
Cooperative Graceful Degradation in Containerized Clouds. 214-232
- Divyanshu Saxena, William Zhang, Shankara Pailoor, Isil Dillig, Aditya Akella:
Copper and Wire: Bridging Expressiveness and Performance for Service Mesh Policies. 233-248
- Jiahui Xu, Lana Josipovic:
CRUSH: A Credit-Based Approach for Functional Unit Sharing in Dynamically Scheduled HLS. 249-263
- Rohan Basu Roy, Vijay Gadepally, Devesh Tiwari:
DarwinGame: Playing Tournaments for Tuning Applications in Noisy Cloud Environments. 264-279
- Yibiao Yang, Maolin Sun, Jiangchang Wu, Qingyang Li, Yuming Zhou:
Debugger Toolchain Validation via Cross-Level Debugging. 280-294
- Kaiqiang Xu, Decang Sun, Hao Wang, Zhenghang Ren, Xinchen Wan, Xudong Liao, Zilong Wang, Junxue Zhang, Kai Chen:
Design and Operation of Shared Machine Learning Clusters on Campus. 295-310
- Cunchi Lv, Xiao Shi, Zhengyu Lei, Jinyue Huang, Wenting Tan, Xiaohui Zheng, Xiaofang Zhao:
Dilu: Enabling GPU Resourcing-on-Demand for Serverless DL Serving via Introspective Elasticity. 311-325
- Yuanpei Wu, Dong Du, Chao Xu, Yubin Xia, Ming Fu, Binyu Zang, Haibo Chen:
D-VSync: Decoupled Rendering and Displaying for Smartphone Graphics. 326-341
- Pu (Luke) Yi, Yifan Yang, Chae Young Lee, Sara Achour:
Early Termination for Hyperdimensional Computing Using Inferential Statistics. 342-360
- Kuntai Du, Yihua Cheng, Peder A. Olsen, Shadi A. Noghabi, Junchen Jiang:
Earth+: On-Board Satellite Imagery Compression Leveraging Historical Earth Observations. 361-376
- Weigao Su, Vishal Shrivastav:
EDM: An Ultra-Low Latency Ethernet Fabric for Memory Disaggregation. 377-394
- Noushin Azami, Alex Fallin, Martin Burtscher:
Efficient Lossless Compression of Scientific Floating-Point Data on CPUs and GPUs. 395-409
- Zhaoying Li, Pranav Dangi, Chenyang Yin, Thilini Kaushalya Bandara, Rohan Juneja, Cheng Tan, Zhenyu Bai, Tulika Mitra:
Enhancing CGRA Efficiency Through Aligned Compute and Communication Provisioning. 410-425
- Yuka Ikarashi, Kevin Qian, Samir Droubi, Alex Reinking, Gilbert Louis Bernstein, Jonathan Ragan-Kelley:
Exo 2: Growing a Scheduling Language. 426-444
- Daliang Xu, Hao Zhang, Liming Yang, Ruiqi Liu, Gang Huang, Mengwei Xu, Xuanzhe Liu:
Fast On-device LLM Inference with NPUs. 445-462
- Xuran Cai, Amir Kafshdar Goharshady, S. Hitarth, Chun Kit Lam:
Faster Chaitin-like Register Allocation via Grammatical Decompositions of Control-Flow Graphs. 463-477
- Jinghan Sun, Benjamin Reidys, Daixuan Li, Jichuan Chang, Marc Snir, Jian Huang:
FleetIO: Managing Multi-Tenant Cloud Storage with Multi-Agent Reinforcement Learning. 478-492
- Seonho Lee, Amar Phanishayee, Divya Mahajan:
Forecasting GPU Performance for Deep Learning Training and Inference. 493-508
- Minhui Xie, Shaoxun Zeng, Hao Guo, Shiwei Gao, Youyou Lu:
Frugal: Efficient and Economic Embedding Model Training with Commodity GPUs. 509-523
- Xinglin Pan, Wenxiang Lin, Lin Zhang, Shaohuai Shi, Zhenheng Tang, Rui Wang, Bo Li, Xiaowen Chu:
FSMoE: A Flexible and Scalable Training System for Sparse Mixture-of-Experts Models. 524-539
- Jianan Lu, Ashwini Raina, Asaf Cidon, Michael J. Freedman:
Fusion: An Analytics Object Store Optimized for Query Pushdown. 540-556
- Byungsoo Jeon, Mengdi Wu, Shiyi Cao, Sunghyun Kim, Sunghyun Park, Neeraj Aggarwal, Colin Unger, Daiyaan Arfeen, Peiyuan Liao, Xupeng Miao, Mohammad Alizadeh, Gregory R. Ganger, Tianqi Chen, Zhihao Jia:
GraphPipe: Improving Performance and Scalability of DNN Training with Graph Pipeline Parallelism. 557-571
- Seonyoung Cheon, Yongwoo Lee, Hoyun Youm, Dongkwan Kim, Sungwoo Yun, Kunmo Jeong, Dongyoon Lee, Hanjun Kim:
HALO: Loop-aware Bootstrapping Management for Fully Homomorphic Encryption. 572-585
- Yixuan Mei, Yonghao Zhuang, Xupeng Miao, Juncheng Yang, Zhihao Jia, Rashmi Vinayak:
Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow. 586-602
- Sushant Dinesh, Yongye Zhu, Christopher W. Fletcher:
H-Houdini: Scalable Invariant Learning. 603-618
- Dimitrios Chasapis, Georgios Vavouliotis, Daniel A. Jiménez, Marc Casas:
Instruction-Aware Cooperative TLB and Cache Replacement Policies. 619-636
- Seungmin Baek, Minbok Wi, Seonyong Park, Hwayong Nam, Michael Jaemin Kim, Nam Sung Kim, Jung Ho Ahn:
Marionette: A RowHammer Attack via Row Coupling. 637-652
- Shaoxun Zeng, Minhui Xie, Shiwei Gao, Youmin Chen, Youyou Lu:
Medusa: Accelerating Serverless LLM Inference with Materialization. 653-668
- Weikai Lin, Yu Feng, Yuhao Zhu:
MetaSapiens: Real-Time Neural Rendering with Efficiency-Aware Pruning and Accelerated Foveated Rendering. 669-682
- Haiyu Huang, Cheng Chen, Kunyi Chen, Pengfei Chen, Guangba Yu, Zilong He, Yilun Wang, Huxing Zhang, Qi Zhou:
Mint: Cost-Efficient Tracing with All Requests Collection via Commonality and Variability Analysis. 683-697
- Moinuddin Qureshi, Salman Qazi:
MOAT: Securely Mitigating Rowhammer with Per-Row Activation Counters. 698-714
- Shiyi Cao, Shu Liu, Tyler Griggs, Peter Schafhalter, Xiaoxuan Liu, Ying Sheng, Joseph E. Gonzalez, Matei Zaharia, Ion Stoica:
MoE-Lightning: High-Throughput MoE Inference on Memory-constrained GPUs. 715-730
- Shuaiting Li, Chengxuan Wang, Juncan Deng, Zeyu Wang, Zewen Ye, Zongsheng Wang, Haibin Shen, Kejie Huang:
MVQ: Towards Efficient DNN Compression and Acceleration with Masked Vector Quantization. 731-745
- Wei Hao, Zixi Wang, Lauren Hong, Lingxiao Li, Nader Karayanni, AnMei Dasbach-Prisk, Chengzhi Mao, Junfeng Yang, Asaf Cidon:
Nazar: Monitoring and Adapting ML Models on Mobile Devices. 746-761
- Yihao Sun, Ahmedur Rahman Shovon, Thomas Gilray, Sidharth Kumar, Kristopher K. Micinski:
Optimizing Datalog for the GPU. 762-776
- Amanda Xu, Abtin Molavi, Swamit Tannu, Aws Albarghouthi:
Optimizing Quantum Circuits, Fast and Slow. 777-793
- Sami Alabed, Daniel Belov, Bart Chrzaszcz, Juliana Franco, Dominik Grewe, Dougal Maclaurin, James Molloy, Tom Natan, Tamara Norman, Xiaoyue Pan, Adam Paszke, Norman A. Rink, Michael Schaarschmidt, Timur Sitdikov, Agnieszka Swietlik, Dimitrios Vytiniotis, Joel Wee:
PartIR: Composing SPMD Partitioning Strategies for Machine Learning. 794-810
- Foteini Strati, Michal Friedman, Ana Klimovic:
PCcheck: Persistent Concurrent Checkpointing for ML. 811-827
- Shaofeng Wu, Qiang Su, Zhixiong Niu, Hong Xu:
Performance Prediction of On-NIC Network Functions with Multi-Resource Contention and Traffic Awareness. 828-842
- Yifan Tan, Cheng Tan, Zeyu Mi, Haibo Chen:
PipeLLM: Fast and Confidential Large Language Model Services with Speculative Pipelined Encryption. 843-857
- Yupeng Tang, Seung-Seob Lee, Abhishek Bhattacharjee, Anurag Khandelwal:
pulse: Accelerating Distributed Pointer-Traversals on Disaggregated Memory. 858-875
- Keyi Yin, Hezi Zhang, Xiang Fang, Yunong Shi, Travis S. Humble, Ang Li, Yufei Ding:
QECC-Synth: A Layout Synthesizer for Quantum Error Correction Codes on Sparse Architectures. 876-890
- Anagha Molakalmur Anil Kumar, Aditya Prasanna, Arrvindh Shriraman:
RANGE-BLOCKS: A Synchronization Facility for Domain-Specific Architectures. 891-906
- Anirudh Jain, Pulkit Gupta, Thomas M. Conte:
RASSM: Residue-based Acceleration of Single Sparse Matrix Computation via Adaptive Tiling. 907-923
- Yan Liu, Jianxin Lai, Long Li, Tianxiang Sui, Linjie Xiao, Peng Yuan, Xiaojing Zhang, Qing Zhu, Wenguang Chen, Jingling Xue:
ReSBM: Region-based Scale and Minimal-Level Bootstrapping Management for FHE via Min-Cut. 924-939
- Stephen M. Blackburn, Zixian Cai, Rui Chen, Xi Yang, John Zhang, John N. Zigman:
Rethinking Java Performance Analysis. 940-954
- Zhilei Han, Fei He:
Robustness Verification for Checking Crash Consistency of Non-volatile Memory. 955-969
- Qinhan Tan, Yuheng Yang, Thomas Bourgeat, Sharad Malik, Mengjia Yan:
RTL Verification for Secure Speculation Using Contract Shadow Logic. 970-986
- Shravan Narayan, Tal Garfinkel, Evan Johnson, Zachary Yedidia, Yingchen Wang, Andrew Brown, Anjo Vahldiek-Oberwagner, Michael LeMay, Wenyong Huang, Xin Wang, Mingqiu Sun, Dean M. Tullsen, Deian Stefan:
Segue & ColorGuard: Optimizing SFI Performance and Scalability on Modern Architectures. 987-1002
- Huan Zhao, Dylan Wolff, Umang Mathur, Abhik Roychoudhury:
Selectively Uniform Concurrency Testing. 1003-1019
- Yaohui Cai, Kaixin Yang, Chenhui Deng, Cunxi Yu, Zhiru Zhang:
SmoothE: Differentiable E-Graph Extraction. 1020-1034
- Seah Kim, Roger Hsiao, Borivoje Nikolic, James Demmel, Yakun Sophia Shao:
SuperNoVA: Algorithm-Hardware Co-Design for Resource-Aware SLAM. 1035-1051
- Wei Zhao, Anand Jayarajan, Gennady Pekhimenko:
Tally: Non-Intrusive Performance Isolation for Concurrent Deep Learning Workloads. 1052-1068
- Brett Saiki, Jackson Brough, Jonas Regehr, Jesús Ponce, Varun Pradeep, Aditya Akhileshwaran, Zachary Tatlock, Pavel Panchekha:
Target-Aware Implementation of Real Expressions. 1069-1083
- Difan Tan, Jiawei Li, Hua Wang, Xiaoxiao Li, Wenbo Liu, Zijin Qin, Ke Zhou, Ming Xie, Mengling Tao:
Tela: A Temporal Load-Aware Cloud Virtual Disk Placement Scheme. 1084-1100
- Cheng Wang, Mingyu Gao:
UniZK: Accelerating Zero-Knowledge Proof with Unified Hardware and Flexible Kernel Mapping. 1101-1117
- Zibo Wang, Yijia Zhang, Fuchun Wei, Bingqiang Wang, Yanlin Liu, Zhiheng Hu, Jingyi Zhang, Xiaoxin Xu, Jian He, Xiaoliang Wang, Wanchun Dou, Guihai Chen, Chen Tian:
Using Analytical Performance/Power Model and Fine-Grained DVFS to Enhance AI Accelerator Energy Efficiency. 1118-1132
- Ramya Prabhu, Ajay Nayak, Jayashree Mohan, Ramachandran Ramjee, Ashish Panwar:
vAttention: Dynamic Memory Management for Serving LLMs without PagedAttention. 1133-1150
- Minwook Kim, Seongyeop Jeong, Jin-Soo Kim:
ZRAID: Leveraging Zone Random Write Area (ZRWA) for Alleviating Partial Parity Tax in ZNS RAID. 1151-1165
