


default search action
20th CGO 2022: Seoul, South Korea
- Jae W. Lee, Sebastian Hack, Tatiana Shpeisman:

IEEE/ACM International Symposium on Code Generation and Optimization, CGO 2022, Seoul, Korea, Republic of, April 2-6, 2022. IEEE 2022, ISBN 978-1-6654-0584-3 - Siddharth Bhat, Tobias Grosser

:
Lambda the Ultimate SSA: Optimizing Functional Programs in SSA. 1-11 - Lukas Sommer, Cristian Axenie

, Andreas Koch:
SPNC: An Open-Source MLIR-Based Compiler for Fast Sum-Product Network Inference on CPUs and GPUs. 1-11 - Minsu Kim, Jeong-Keun Park, Soo-Mook Moon:

Solving PBQP-Based Register Allocation using Deep Reinforcement Learning. 1-12 - Mhd Ghaith Olabi, Juan Gómez-Luna, Onur Mutlu

, Wen-Mei Hwu, Izzat El Hajj:
A Compiler Framework for Optimizing Dynamic Parallelism on GPUs. 1-13 - Charitha Saumya

, Kirshanthan Sundararajah
, Milind Kulkarni:
DARM: Control-Flow Melding for SIMT Thread Divergence Reduction. 1-13 - Ao Li

, Bojian Zheng, Gennady Pekhimenko, Fan Long:
Automatic Horizontal Fusion for GPU Kernels. 14-27 - Joseph Huber, Melanie Cornelius, Giorgis Georgakoudis

, Shilei Tian
, Jose Manuel Monsalve Diaz
, Kuter Dinel, Barbara M. Chapman, Johannes Doerfert:
Efficient Execution of OpenMP on GPUs. 41-52 - Ajay Brahmakshatriya, Saman P. Amarasinghe:

GraphIt to CUDA Compiler in 2021 LOC: A Case for High-Performance DSL Implementation via Staging with BuilDSL. 53-65 - Joao Rivera, Franz Franchetti, Markus Püschel:

A Compiler for Sound Floating-Point Computations using Affine Arithmetic. 66-78 - Hannes Kallwies

, Martin Leucker, Torben Scheffel, Malte Schmitz, Daniel Thoma:
Aggregate Update Problem for Multi-clocked Dataflow Languages. 79-91 - Chris Cummins, Bram Wasti, Jiadong Guo, Brandon Cui, Jason Ansel, Sahir Gomez, Somya Jain, Jia Liu, Olivier Teytaud, Benoit Steiner, Yuandong Tian, Hugh Leather:

CompilerGym: Robust, Performant Compiler Optimization Environments for AI Research. 92-105 - Nicolas Derumigny, Théophile Bastian, Fabian Gruber, Guillaume Iooss, Christophe Guillon, Louis-Noël Pouchet, Fabrice Rastello:

PALMED: Throughput Characterization for Superscalar Architectures. 106-117 - Sunghyun Park, Salar Latifi, Yongjun Park, Armand Behroozi, Byungsoo Jeon, Scott A. Mahlke:

SRTuner: Effective Compiler Optimization Customization by Exposing Synergistic Relations. 118-130 - Xudong Wang, Xuezheng Xu, Qingan Li, Mengting Yuan, Jingling Xue:

Recovering Container Class Types in C++ Binaries. 131-143 - Vaibhav Kiran Kurhe, Pratik Karia, Shubhani Gupta, Abhishek Rose, Sorav Bansal:

Automatic Generation of Debug Headers through BlackBox Equivalence Checking. 144-154 - Linan Tian, Yangyang Shi, Liwei Chen, Yanqi Yang, Gang Shi:

Gadgets Splicing: Dynamic Binary Transformation for Precise Rewriting. 155-167 - Angelo Matni, Enrico Armenio Deiana, Yian Su, Lukas Gross, Souradip Ghosh, Sotiris Apostolakis, Ziyang Xu, Zujun Tan, Ishita Chaturvedi, Brian Homerding, Tommy McMichen, David I. August, Simone Campanoni:

NOELLE Offers Empowering LLVM Extensions. 179-192 - Yongwoo Lee, Seonyeong Heo, Seonyoung Cheon, Shinnung Jeong, Changsu Kim, Eunkyung Kim, Dongyoon Lee, Hanjun Kim:

HECATE: Performance-Aware Scale Optimization for Homomorphic Encryption Compiler. 193-204 - Daniel Donenfeld

, Stephen Chou, Saman P. Amarasinghe:
Unified Compilation for Lossless Compression and Sparse Computing. 205-216 - Rodrigo C. O. Rocha, Pavlos Petoumenos

, Björn Franke, Pramod Bhatotia, Michael F. P. O'Boyle:
Loop Rolling for Code Size Reduction. 217-229 - Sean Stirling, Rodrigo C. O. Rocha, Kim M. Hazelwood, Hugh Leather, Michael F. P. O'Boyle, Pavlos Petoumenos

:
F3M: Fast Focused Function Merging. 242-253 - Harishankar Vishwanathan

, Matan Shachnai
, Srinivas Narayana, Santosh Nagarakatte
:
Sound, Precise, and Fast Abstract Interpretation with Tristate Numbers. 254-265 - Xuezheng Xu, Xudong Wang, Jingling Xue:

M3V: Multi-modal Multi-view Context Embedding for Repair Operator Prediction. 266-277 - Zifan Nan, Xipeng Shen, Hui Guan:

Enabling Near Real-Time NLU-Driven Natural Language Programming through Dynamic Grammar Graph-Based Translation. 278-289 - Ján Veselý, Raghavendra Pradyumna Pothukuchi

, Ketaki Joshi, Samyak Gupta, Jonathan D. Cohen, Abhishek Bhattacharjee:
Distill: Domain-Specific Compilation for Cognitive Models. 301-312 - Cédric Bastoul, Zhen Zhang, Harenome Razanajato, Nelson Lossing, Adilla Susungi, Javier de Juan, Etienne Filhol, Baptiste Jarry, Gianpietro Consolaro, Renwei Zhang:

Optimizing GPU Deep Learning Operators with Polyhedral Scheduling Constraint Injection. 313-324 - Miheer Vaidya, Aravind Sukumaran-Rajam, Atanas Rountev

, P. Sadayappan:
Comprehensive Accelerator-Dataflow Co-design Optimization for Convolutional Neural Networks. 325-335

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














