default search action
PPoPP 2022: Seoul, Republic of Korea
- Jaejin Lee, Kunal Agrawal, Michael F. Spear:
PPoPP '22: 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, Seoul, Republic of Korea, April 2 - 6, 2022. ACM 2022, ISBN 978-1-4503-9204-4 - Konstantinos Kallas, Filip Niksic, Caleb Stanford, Rajeev Alur:
Stream processing with dependency-guided synchronization. 1-16 - Chao Chen, Chris Porter, Santosh Pande:
CASE: a compiler-assisted SchEduling framework for multi-GPU systems. 17-31 - Younghyun Cho, Jiyeon Park, Florian Negele, Changyeon Jo, Thomas R. Gross, Bernhard Egger:
Dopia: online parallelism management for integrated CPU/GPU architectures. 32-45 - Rohan Basu Roy, Tirthak Patel, Vijay Gadepally, Devesh Tiwari:
Mashup: making serverless computing useful for HPC workflows via hybrid execution. 46-60 - Sam Westrick, Mike Rainey, Daniel Anderson, Guy E. Blelloch:
Parallel block-delayed sequences. 61-75 - Yuhao Zhu:
RTNN: accelerating neighbor search using hardware ray tracing. 76-89 - Yuyao Niu, Zhengyang Lu, Haonan Ji, Shuhui Song, Zhou Jin, Weifeng Liu:
TileSpGEMM: a tiled algorithm for parallel sparse general matrix-matrix multiplication on GPUs. 90-106 - Yuke Wang, Boyuan Feng, Yufei Ding:
QGTC: accelerating quantized graph neural networks via GPU tensor core. 107-119 - Jiaao He, Jidong Zhai, Tiago Antunes, Haojie Wang, Fuwen Luo, Shangfeng Shi, Qin Li:
FasterMoE: modeling and optimizing training of large-scale dynamic pre-trained models. 120-134 - Shigang Li, Torsten Hoefler:
Near-optimal sparse allreduce for distributed deep learning. 135-149 - Liyan Zheng, Jidong Zhai, Xiongchao Tang, Haojie Wang, Teng Yu, Yuyang Jin, Shuaiwen Leon Song, Wenguang Chen:
Vapro: performance variance detection and diagnosis for production-run parallel applications. 150-162 - Hongyu Fan, Weiting Liu, Fei He:
Interference relation-guided SMT solving for multi-threaded program verification. 163-176 - Yuyang Jin, Haojie Wang, Runxin Zhong, Chen Zhang, Jidong Zhai:
PerFlow: a domain specific framework for automatic performance analysis of parallel applications. 177-191 - Zixuan Ma, Jiaao He, Jiezhong Qiu, Huanqi Cao, Yuanwei Wang, Zhenbo Sun, Liyan Zheng, Haojie Wang, Shizhi Tang, Tianyu Zheng, Junyang Lin, Guanyu Feng, Zeqiang Huang, Jie Gao, Aohan Zeng, Jianwei Zhang, Runxin Zhong, Tianhui Shi, Sha Liu, Weimin Zheng, Jie Tang, Hongxia Yang, Xin Liu, Jidong Zhai, Wenguang Chen:
BaGuaLu: targeting brain scale pretrained models with over 37 million cores. 192-204 - Zhuoqiang Guo, Denghui Lu, Yujin Yan, Siyu Hu, Rongrong Liu, Guangming Tan, Ninghui Sun, Wanrun Jiang, Lijun Liu, Yixiao Chen, Linfeng Zhang, Mohan Chen, Han Wang, Weile Jia:
Extending the limit of molecular dynamics with ab initio accuracy to 10 billion atoms. 205-218 - Mohsen Koohi Esfahani, Peter Kilpatrick, Hans Vandierendonck:
LOTUS: locality optimizing triangle counting. 219-233 - Huanqi Cao, Yuanwei Wang, Haojie Wang, Heng Lin, Zixuan Ma, Wanwang Yin, Wenguang Chen:
Scaling graph traversal to 281 trillion edges with 40 million cores. 234-245 - Zak Cutner, Nobuko Yoshida, Martin Vassor:
Deadlock-free asynchronous message reordering in rust with multiparty session types. 246-261 - Hagit Attiya, Ohad Ben-Baruch, Panagiota Fatourou, Danny Hendler, Eleftherios Kosmas:
Detectable recovery of lock-free data structures. 262-277 - Naama Ben-David, Guy E. Blelloch, Yuanhao Wei:
Lock-free locks revisited. 278-293 - Nian Liu, Jinyu Gu, Dahai Tang, Kenli Li, Binyu Zang, Haibo Chen:
Asymmetry-aware scalable locking. 294-308 - Yuanhao Wei, Naama Ben-David, Michal Friedman, Guy E. Blelloch, Erez Petrank:
FliT: a library for simple and efficient persistent algorithms. 309-321 - Benjamin Reidys, Jian Huang:
Understanding and detecting deep memory persistency bugs in NVM programs with DeepMC. 322-336 - Panagiota Fatourou, Nikolaos D. Kallimanis, Eleftherios Kosmas:
The performance power of software combining in persistence. 337-352 - Anastasiia Postnikova, Nikita Koval, Giorgi Nadiradze, Dan Alistarh:
Multi-queues can be state-of-the-art priority schedulers. 353-367 - Jacob Nelson-Slivon, Ahmed Hassan, Roberto Palmieri:
Bundling linked data structures for linearizable range queries. 368-384 - Trevor Brown, William Sigouin, Dan Alistarh:
PathCAS: an efficient middle ground for concurrent search data structures. 385-399 - Tadeusz Kobus, Maciej Kokocinski, Pawel T. Wojciechowski:
Jiffy: a lock-free skip list with batch updates and snapshots. 400-415 - Anubhav Srivastava, Trevor Brown:
Elimination (a, b)-trees with fast, durable updates. 416-430 - Jiasi Shen, Martin C. Rinard, Nikos Vasilakis:
Automatic synthesis of parallel unix commands and pipelines with KumQuat. 431-432 - Orestis Korakitis, Simon Garcia De Gonzalo, Nicolas L. Guidotti, João Pedro Barreto, José C. Monteiro, Antonio J. Peña:
Towards OmpSs-2 and OpenACC interoperation. 433-434 - Zhen Xie, Jie Liu, Sam Ma, Jiajia Li, Dong Li:
LB-HM: load balance-aware data placement on heterogeneous memory for task-parallel HPC applications. 435-436 - Yafan Huang, Shengjian Guo, Sheng Di, Guanpeng Li, Franck Cappello:
Hardening selective protection across multiple program inputs for HPC applications. 437-438 - Taspon Gonggiatgul, Ghassan Shobaki, Pinar Muyan-Özçelik:
A parallel branch-and-bound algorithm with history-based domination. 439-440 - Atmn Patel, Johannes Doerfert:
Remote OpenMP offloading. 441-442 - Weihua Zhang, Chuanlei Zhao, Lu Peng, Yuzhe Lin, Fengzhe Zhang, Jinhu Jiang:
High performance GPU concurrent B+tree. 443-444 - Daniel Anderson, Guy E. Blelloch, Laxman Dhulipala, Magdalen Dobson, Yihan Sun:
The problem-based benchmark suite (PBBS), V2. 445-447 - Da Yan, Wei Wang, Xiaowen Chu:
An LLVM-based open-source compiler for NVIDIA GPUs. 448-449 - Yiqiu Wang, Shangdi Yu, Laxman Dhulipala, Yan Gu, Julian Shun:
ParGeo: a library for parallel computational geometry. 450-452 - Srdan Milakovic, Oguz Selvitopi, Israt Nisa, Zoran Budimlic, Aydin Buluç:
Parallel algorithms for masked sparse matrix-matrix products. 453-454 - Shihui Song, Peng Jiang:
Rethinking graph data placement for graph neural network training on multiple GPUs. 455-456 - Ivan Kuraj, Armando Solar-Lezama, Nadia Polikarpova:
Optimizing consistency for partially replicated data stores. 457-458 - Kazem Cheshmi, Michelle Mills Strout, Maryam Mehri Dehnavi:
Optimizing sparse computations jointly. 459-460 - Ruslan Nikolaev, Binoy Ravindran:
wCQ: a fast wait-free queue with bounded memory usage. 461-462 - Jan Hückelheim, Laurent Hascoët:
Automatic differentiation of parallel loops with formal methods. 463-464 - Junmin Xiao, Qing Xue, Hui Ma, Xiaoyang Zhang, Guangming Tan:
A W-cycle algorithm for efficient batched SVD on GPUs. 465-466
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.