


default search action
IISWC 2024: Vancouver, BC, Canada
- IEEE International Symposium on Workload Characterization, IISWC 2024, Vancouver, BC, Canada, September 15-17, 2024. IEEE 2024, ISBN 979-8-3503-5603-8
- Junrui Pan, Timothy G. Rogers:
CRISP: Concurrent Rendering and Compute Simulation Platform for GPUs. 1-14 - Jaehong Cho, Minsu Kim, Hyunmin Choi, Guseul Heo, Jongse Park:
LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at Scale. 15-29 - Rajveer Bachkaniwala, Harshith Lanka, Kexin Rong
, Ada Gavrilovska:
Lotus: Characterization of Machine Learning Preprocessing Pipelines via Framework and Hardware Profiling. 30-43 - Seung Hun Choi, Myung Jae Chung, Young Geun Kim, Sung Woo Chung:
Mediator: Characterizing and Optimizing Multi-DNN Inference for Energy Efficient Edge Intelligence. 44-56 - Joyjit Kundu, Wenzhe Guo, Ali BanaGozar, Udari De Alwis, Sourav Sengupta, Puneet Gupta, Arindam Mallik:
Performance Modeling and Workload Analysis of Distributed Large Language Model Training and Inference. 57-67 - José A. Morgado, Leonel Sousa, Aleksandar Ilic
:
CARM Tool: Cache-Aware Roofline Model Automatic Benchmarking and Application Analysis. 68-81 - Viyom Mittal, Pedro Bruel, Michalis Faloutsos, Dejan S. Milojicic, Eitan Frachtenberg:
SHARP: A Distribution-Based Framework for Reproducible Performance Evaluation. 82-93 - Georgia Antoniou, Haris Volos, Yiannakis Sazeides:
Taming Performance Variability caused by Client-Side Hardware Configuration. 94-107 - Xinquan Lin, Haobo Xu, Yinhe Han, Yiming Gan:
HEX-SIM: Evaluating Multi-modal Large Language Models on Multi-chiplet NPUs. 108-120 - Shmeelok Chakraborty, Yuewen Hou, Ang Chen, Gokul Subramanian Ravi:
Empowering the Quantum Cloud User with QRIO. 121-131 - Tersiteab Adem, Andrew McCrabb, Vidushi Goyal, Valeria Bertacco:
Evergreen: Comprehensive Carbon Model for Performance-Emission Tradeoffs. 132-143 - Saichand Samudrala, Jiawen Wu, Chen Chen, Haoxuan Shan, Jonathan Ku, Yiran Chen, Jeyavijayan Rajendran:
Performance Analysis of Zero-Knowledge Proofs. 144-155 - Alexander Hankin, Abdulrahman Mahmoud, Mark Hempstead, David Brooks, Gu-Yeon Wei:
VelociTI: An Architecture-level Performance Modeling Framework for Trapped Ion Quantum Computers. 156-168 - Seonjin Na, Geonhwa Jeong, Byung Hoon Ahn, Jeffrey Young, Tushar Krishna, Hyesoon Kim:
Understanding Performance Implications of LLM Inference on CPUs. 169-180 - Cheng Chen, Christina Giannoula, Andreas Moshovos:
Low-Bitwidth Floating Point Quantization for Efficient High-Quality Diffusion Models. 181-193 - Chakshu Moar, Faraz Tahmasebi, Michael Pellauer, Hyoukjun Kwon:
Characterizing the Accuracy-Efficiency Trade-off of Low-rank Decomposition in Language Models. 194-209 - Yuchen Xia, Jiho Kim, Yuhan Chen, Haojie Ye, Souvik Kundu, Cong Callie Hao, Nishil Talati:
Understanding the Performance and Estimating the Cost of LLM Fine-Tuning. 210-223 - Kailash Gogineni, Yongsheng Mei, Karthikeya Gogineni, Peng Wei, Tian Lan, Guru Venkataramani:
Characterizing and Optimizing the End-to-End Performance of Multi-Agent Reinforcement Learning Systems. 224-235 - Nick Lindsay, Abhishek Bhattacharjee:
Understanding Address Translation Scaling Behaviours Using Hardware Performance Counters. 236-246 - Farzana Ahmed Siddique, Deyuan Guo, Zhenxing Fan, MohammadHosein Gholamrezaei, Morteza Baradaran, Alif Ahmed, Hugo Abbot
, Kyle Durrer, Kumaresh Nandagopal, Ethan Ermovick, Khyati Kiyawat, Beenish Gul, Abdullah T. Mughrabi, Ashish Venkat, Kevin Skadron:
Architectural Modeling and Benchmarking for Digital DRAM PIM. 247-261 - K. P. Arun, Debadatta Mishra:
Kindle: A Comprehensive Framework for Exploring OS-Architecture Interplay in Hybrid Memory Systems. 262-272 - Anoop Mysore Nataraja, Ricardo Fernández Pascual, Alberto Ros:
Enhanced System-Level Coherence for Heterogeneous Unified Memory Architectures. 273-283 - Michael Wu, Sibren Isaacman, Abhishek Bhattacharjee:
Characterizing Emerging Page Replacement Policies for Memory-Intensive Applications. 284-294 - Brandon Alexander Burtchell, Martin Burtscher:
Characterizing CUDA and OpenMP Synchronization Primitives. 295-308 - Demirhan Sevim
, Baturalp Bilgin, Ismail Akturk:
Evaluating Performance and Energy Efficiency of Parallel Programming Models in Heterogeneous Computing Systems. 309-319 - Yiqian Liu, Avery Vanausdal, Martin Burtscher:
Performance Impact of Removing Data Races from GPU Graph Analytics Programs. 320-331

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.