default search action
SC 2022: Dallas, TX, USA
- Felix Wolf, Sameer Shende, Candace Culhane, Sadaf R. Alam, Heike Jagode:
SC22: International Conference for High Performance Computing, Networking, Storage and Analysis, Dallas, TX, USA, November 13-18, 2022. IEEE 2022, ISBN 978-1-6654-5444-5 - Oguz Selvitopi, Saliya Ekanayake, Giulia Guidi, Muaaz G. Awan, Georgios A. Pavlopoulos, Ariful Azad, Nikos Kyrpides, Leonid Oliker, Katherine A. Yelick, Aydin Buluç:
Extreme-Scale Many-against-Many Protein Similarity Search. 1:1-1:12 - Qinglei Cao, Sameh Abdulah, Rabab Alomairy, Yu Pei, Pratik Nag, George Bosilca, Jack J. Dongarra, Marc G. Genton, David E. Keyes, Hatem Ltaief, Ying Sun:
Reshaping Geostatistical Modeling and Prediction for Extreme-Scale Environmental Applications. 2:1-2:12 - Luca Fedeli, Axel Huebl, France Boillod-Cerneux, Thomas Clark, Kevin Gott, Conrad Hillairet, Stephan Jaure, Adrien Leblanc, Rémi Lehe, Andrew Myers, Christelle Piechurski, Mitsuhisa Sato, Neïl Zaïm, Weiqun Zhang, Jean-Luc Vay, Henri Vincenti:
Pushing the Frontier in the Design of Laser-Based Electron Accelerators with Groundbreaking Mesh-Refined Particle-In-Cell Simulations on Exascale-Class Supercomputers. 3:1-3:12 - Tsuyoshi Ichimura, Kohei Fujita, Ryota Kusakabe, Kentaro Koyama, Sota Murakami, Yuma Kikuchi, Takane Hori, Muneo Hori, Hikaru Inoue, Takafumi Nose, Takahiro Kawashima, Maddegedara Lalith:
Extreme Scale Earthquake Simulation with Uncertainty Quantification. 4:1-4:11 - Wei Hu, Hong An, Zhuoqiang Guo, Qingcai Jiang, Xinming Qin, Junshi Chen, Weile Jia, Chao Yang, Zhaolong Luo, Jielan Li, Wentiao Wu, Guangming Tan, Dongning Jia, Qinglin Lu, Fangfang Liu, Min Tian, Fang Li, Yeqi Huang, Liyi Wang, Sha Liu, Jinlong Yang:
2.5 Million-Atom Ab Initio Electronic-Structure Simulation of Complex Metallic Heterostructures with DGDFT. 5:1-5:13 - Ramakrishnan Kannan, Piyush Sao, Hao Lu, Jakub Kurzak, Gundolf Schenk, Yongmei Shi, Seung-Hwan Lim, Sharat Israni, Vijay Thakkar, Guojing Cong, Robert M. Patton, Sergio E. Baranzini, Richard W. Vuduc, Thomas E. Potok:
Exaflops Biomedical Knowledge Graph Analytics. 6:1-6:11 - Giuseppe M. J. Barca, Calum Snowdon, Jorge L. Galvez Vallejo, Fazeleh S. Kazemian, Alistair P. Rendell, Mark S. Gordon:
Scaling Correlated Fragment Molecular Orbital Calculations on Summit. 7:1-7:14 - Xiao Wang, Aristeidis Tsaris, Debangshu Mukherjee, Mohamed Wahib, Peng Chen, Mark Oxley, Olga Ovchinnikova, Jacob D. Hinkle:
Image Gradient Decomposition for Parallel and Memory-Efficient Ptychographic Reconstruction. 8:1-8:13 - Narangerelt Batsoyol, Benjamin S. Pullman, Mingxun Wang, Nuno Bandeira, Steven Swanson:
P-Massive: A Real-Time Search Engine for a Multi-Terabyte Mass Spectrometry Database. 9:1-9:15 - Salvatore Di Girolamo, Daniele De Sensi, Konstantin Taranov, Milos Malesevic, Maciej Besta, Timo Schneider, Severin Kistler, Torsten Hoefler:
Building Blocks for Network-Accelerated Distributed File Systems. 10:1-10:14 - Torsten Hoefler, Tommaso Bonato, Daniele De Sensi, Salvatore Di Girolamo, Shigang Li, Marco Heddes, Jon Belk, Deepak Goel, Miguel Castro, Steve Scott:
HammingMesh: A Network Topology for Large-Scale Deep Learning. 11:1-11:18 - Kartik Lakhotia, Maciej Besta, Laura Monroe, Kelly Isham, Patrick Iff, Torsten Hoefler, Fabrizio Petrini:
PolarFly: A Cost-Effective and Flexible Low-Diameter Topology. 12:1-12:15 - Ellis Wilson, Frank Mueller, Scott Pakin:
Combining Hard and Soft Constraints in Quantum Constraint-Satisfaction Systems. 13:1-13:14 - Honghui Shang, Li Shen, Yi Fan, Zhiqian Xu, Chu Guo, Jie Liu, Wenhao Zhou, Huan Ma, Rongfen Lin, Yuling Yang, Fang Li, Zhuoya Wang, Yunquan Zhang, Zhenyu Li:
Large-Scale Simulation of Quantum Computational Chemistry on a New Sunway Supercomputer. 14:1-14:14 - Tirthak Patel, Daniel Silver, Devesh Tiwari:
Charter: Identifying the Most-Critical Gate Operations in Quantum Circuits via Amplified Gate Reversibility. 15:1-15:16 - Mihailo Isakov, Mikaela Currier, Eliakin Del Rosario, Sandeep Madireddy, Prasanna Balaprakash, Philip H. Carns, Robert B. Ross, Glenn K. Lockwood, Michel A. Kinsy:
A Taxonomy of Error Sources in HPC I/O Machine Learning Models. 16:1-16:14 - Yafan Huang, Shengjian Guo, Sheng Di, Guanpeng Li, Franck Cappello:
Mitigating Silent Data Corruptions in HPC Applications across Multiple Program Inputs. 17:1-17:14 - Feng Zhang, Yihua Hu, Haipeng Ding, Zhiming Yao, Zhewei Wei, Xiao Zhang, Xiaoyong Du:
Optimizing Random Access to Hierarchically-Compressed Data on GPU. 18:1-18:15 - Yuanwei Wang, Huanqi Cao, Zixuan Ma, Wanwang Yin, Wenguang Chen:
Scaling Graph 500 SSSP to 140 Trillion Edges with over 40 Million Cores. 19:1-19:15 - Yao Kang, Xin Wang, Zhiling Lan:
Study of Workload Interference with Intelligent Routing on Dragonfly. 20:1-20:14 - Srinivasan Ramesh, Hank Childs, Allen D. Malony:
SERVIZ: A Shared In Situ Visualization Service. 21:1-21:14 - Rohan Basu Roy, Tirthak Patel, Devesh Tiwari:
DayDream: Executing Dynamic Scientific Workflows on Serverless Platforms with Hot Starts. 22:1-22:18 - Luke Logan, Jaime Cernuda Garcia, Jay F. Lofstead, Xian-He Sun, Anthony Kougkas:
LabStor: A Modular and Extensible Platform for Developing High-Performance, Customized I/O Stacks in Userspace. 23:1-23:15 - Yiqin Dai, Yong Dong, Kai Lu, Ruibo Wang, Wei Zhang, Juan Chen, Mingtian Shao, Zheng Wang:
Towards Scalable Resource Management for Supercomputers. 24:1-24:15 - Alexandros Nikolaos Ziogas, Grzegorz Kwasniewski, Tal Ben-Nun, Timo Schneider, Torsten Hoefler:
Deinsum: Practically I/O Optimal Multi-Linear Algebra. 25:1-25:15 - Ahmad Abdelfattah, Pieter Ghysels, Wajih Boukaram, Stanimire Tomov, Xiaoye Sherry Li, Jack J. Dongarra:
Addressing Irregular Patterns of Matrix Computations on GPUs and Their Impact on Applications Powered by Sparse Direct Solvers. 26:1-26:14 - Zonghao Feng, Qipeng Xie, Qiong Luo, Yujie Chen, Haoxuan Li, Huizhong Li, Qiang Yan:
Accelerating Elliptic Curve Digital Signature Algorithms on GPUs. 27:1-27:13 - Hua Huang, Edmond Chow:
CA3DMM: A New Algorithm Based on a Unified View of Parallel Matrix Multiplication. 28:1-28:15 - Olivier Beaumont, Philippe Duchon, Lionel Eyraud-Dubois, Julien Langou, Mathieu Vérité:
Symmetric Block-Cyclic Distribution: Fewer Communications Leads to Faster Dense Cholesky Factorization. 29:1-29:15 - Mathias Jacquelin, Mauricio Araya-Polo, Jie Meng:
Scalable Distributed High-Order Stencil Computations. 30:1-30:13 - Philip Munksgaard, Troels Henriksen, Ponnuswamy Sadayappan, Cosmin E. Oancea:
Memory Optimizations in an Array Language. 31:1-31:15 - Kazem Cheshmi, Zachary Cetinic, Maryam Mehri Dehnavi:
Vectorizing Sparse Matrix Computations with Partially-Strided Codelets. 32:1-32:15 - Ignacio Laguna, Ganesh Gopalakrishnan:
Finding Inputs that Trigger Floating-Point Exceptions in GPUs via Bayesian Optimization. 33:1-33:14 - Farid Zakaria, Thomas R. W. Scogland, Todd Gamblin, Carlos Maltzahn:
Mapping Out the HPC Dependency Chaos. 34:1-34:12 - Todd Gamblin, Massimiliano Culpo, Gregory Becker, Sergei Shudler:
Using Answer Set Programming for HPC Dependency Solving. 35:1-35:15 - Sixing Yu, Phuong Nguyen, Waqwoya Abebe, Wei Qian, Ali Anwar, Ali Jannesari:
SPATL: Salient Parameter Aggregation and Transfer Learning for Heterogeneous Federated Learning. 36:1-36:14 - Shigang Li, Kazuki Osawa, Torsten Hoefler:
Efficient Quantized Sparse Matrix Operations on Tensor Cores. 37:1-37:15 - Xiaohui Wang, Yang Wei, Ying Xiong, Guyue Huang, Xian Qian, Yufei Ding, Mingxuan Wang, Lei Li:
LightSeq2: Accelerated Training for Transformer-Based Models on GPUs. 38:1-38:14 - Qingxiao Sun, Yi Liu, Hailong Yang, Ruizhe Zhang, Ming Dun, Mingzhen Li, Xiaoyan Liu, Wencong Xiao, Yong Li, Zhongzhi Luan, Depei Qian:
CoGNN: Efficient Scheduling for Concurrent GNN Training on GPUs. 39:1-39:15 - Bartlomiej Przybylski, Maciej Pawlik, Pawel Zuk, Bartlomiej Lagosz, Maciej Malawski, Krzysztof Rzadca:
Using Unused: Non-Invasive Dynamic FaaS Infrastructure with HPC-Whisk. 40:1-40:15 - Moiz Arif, Kevin Assogba, M. Mustafa Rafique:
Canary: Fault-Tolerant FaaS for Stateful Time-Sensitive Applications. 41:1-41:16 - Yuqi Fu, Li Liu, Haoliang Wang, Yue Cheng, Songqing Chen:
SFS: Smart OS Scheduling for Serverless Functions. 42:1-42:16 - Maciej Besta, Cesare Miglioli, Paolo Sylos Labini, Jakub Tetek, Patrick Iff, Raghavendra Kanakagiri, Saleh Ashkboos, Kacper Janda, Michal Podstawski, Grzegorz Kwasniewski, Niels Gleinig, Flavio Vella, Onur Mutlu, Torsten Hoefler:
ProbGraph: High-Performance and High-Accuracy Graph Mining with Probabilistic Set Representations. 43:1-43:17 - Juno Kim, Steven Swanson:
Blaze: Fast Graph Processing on Fast SSDs. 44:1-44:15 - Dan Chen, Chuangyi Gui, Yi Zhang, Hai Jin, Long Zheng, Yu Huang, Xiaofei Liao:
GraphFly: Efficient Asynchronous Streaming Graphs Processing via Dependency-Flow. 45:1-45:14 - Reza Yazdani Aminabadi, Samyam Rajbhandari, Ammar Ahmad Awan, Cheng Li, Du Li, Elton Zheng, Olatunji Ruwase, Shaden Smith, Minjia Zhang, Jeff Rasley, Yuxiong He:
DeepSpeed- Inference: Enabling Efficient Inference of Transformer Models at Unprecedented Scale. 46:1-46:15 - Baorun Mu, Saeed Soori, Bugra Can, Mert Gürbüzbalaban, Maryam Mehri Dehnavi:
HyLo: A Hybrid Low-Rank Natural Gradient Descent Method. 47:1-47:16 - Xuncheng Zhao, Mingfan Li, Qian Xiao, Junshi Chen, Fei Wang, Li Shen, Meijia Zhao, Wenhao Wu, Hong An, Lixin He, Xiao Liang:
AI for Quantum Mechanics: High Performance Quantum Many-Body Simulations via Deep Learning. 48:1-48:15 - Chen Zhang, Haojie Wang, Zixuan Ma, Lei Xie, Zeyu Song, Jidong Zhai:
UniQ: A Unified Programming Model for Efficient Quantum Circuit Simulation. 49:1-49:16 - Yuxin Chen, Benjamin Brock, Serban D. Porumbescu, Aydin Buluç, Katherine A. Yelick, John D. Owens:
Scalable Irregular Parallelism with GPUs: Getting CPUs Out of the Way. 50:1-50:16 - Hochan Lee, William Ruys, Ian Henriksen, Arthur Peters, Yineng Yan, Sean Stephens, Bozhi You, Henrique Fingler, Martin Burtscher, Milos Gligoric, Karl Schulz, Keshav Pingali, Christopher J. Rossbach, Mattan Erez, George Biros:
Parla: A Python Orchestration System for Heterogeneous Architectures. 51:1-51:15 - Guanxian Jiang, Qihui Zhou, Tatiana Jin, Boyang Li, Yunjian Zhao, Yichao Li, James Cheng:
VSGM: View-Based GPU-Accelerated Subgraph Matching on Large Graphs. 52:1-52:15 - Yihua Wei, Peng Jiang:
STMatch: Accelerating Graph Pattern Matching on GPU with Stack-Based Loop Optimizations. 53:1-53:13 - Dongxu Yang, Junhong Liu, Jiaxing Qi, Junjie Lai:
WholeGraph: A Fast Graph Neural Network Training Framework with Multi-GPU Distributed Shared Memory Architecture. 54:1-54:14 - Qi Chen, Shaonan Ma, Kang Chen, Teng Ma, Xin Liu, Dexun Chen, Yongwei Wu, Zuoning Chen:
SeqDLM: A Sequencer-Based Distributed Lock Manager for Efficient Shared File Access in a Parallel File System. 55:1-55:14 - Yingjin Qian, Wen Cheng, Lingfang Zeng, Marc-André Vef, Oleg Drokin, Andreas Dilger, Shuichi Ihara, Wusheng Zhang, Yang Wang, André Brinkmann:
MetaWBC: POSIX-Compliant Metadata Write-Back Caching for Distributed File Systems. 56:1-56:20 - Dominic Manno, Jason Lee, Prajwal Challa, Qing Zheng, David Bonnie, Gary Grider, Bradley W. Settlemyer:
GUFI: Fast, Secure File System Metadata Search for Both Privileged and Unprivileged Users. 57:1-57:14 - Robert Schenck, Ola Rønning, Troels Henriksen, Cosmin E. Oancea:
AD for an Array Language with Nested Parallelism. 58:1-58:15 - Rohan Yadav, Alex Aiken, Fredrik Kjolstad:
SpDISTAL: Compiling Distributed Sparse Tensor Computations. 59:1-59:15 - William S. Moses, Sri Hari Krishna Narayanan, Ludger Paehler, Valentin Churavy, Michel Schanen, Jan Hückelheim, Johannes Doerfert, Paul D. Hovland:
Scalable Automatic Differentiation of Multiple Parallel Paradigms through Compiler Augmentation. 60:1-60:18 - Sian Jin, Dingwen Tao, Houjun Tang, Sheng Di, Suren Byna, Zarija Lukic, Franck Cappello:
Accelerating Parallel Write via Deeply Integrating Predictive Lossy Compression with HDF5. 61:1-61:15 - Jinyang Liu, Sheng Di, Kai Zhao, Xin Liang, Zizhong Chen, Franck Cappello:
Dynamic Quality Metric Oriented Error Bounded Lossy Compression for Scientific Datasets. 62:1-62:15 - Menghan Jia, Yiming Zhang, Xinbiao Gan, Dongsheng Li, Erci Xu, Ruibo Wang, Kai Lu:
vGraph: Memory-Efficient Multicore Graph Processing for Traversal-Centric Algorithms. 63:1-63:14 - Philipp Schaad, Tal Ben-Nun, Torsten Hoefler:
Boosting Performance Optimization with Interactive Data Movement Visualization. 64:1-64:16 - Prasoon Sinha, Akhil Guliani, Rutwik Jain, Brandon Tran, Matthew D. Sinclair, Shivaram Venkataraman:
Not All GPUs Are Created Equal: Characterizing Variability in Large-Scale, Accelerator-Rich Systems. 65:1-65:15 - Zhen Du, Jiajia Li, Yinshan Wang, Xueqi Li, Guangming Tan, Ninghui Sun:
AlphaSparse: Generating High Performance SpMV Codes Directly from Sparse Matrices. 66:1-66:15 - Konstantinos Parasyris, James Diffenderfer, Harshitha Menon, Ignacio Laguna, Jackson Vanover, Ryan Vogt, Daniel Osei-Kuffuor:
Approximate Computing Through the Lens of Uncertainty Quantification. 67:1-67:14 - Jose P. Pinilla, Steven J. E. Wilton:
Positive-Phase Temperature Scaling for Quantum-Assisted Boltzmann Machine Training. 68:1-68:12 - Kaihua Fu, Jiuchen Shi, Quan Chen, Ningxin Zheng, Wei Zhang, Deze Zeng, Minyi Guo:
QoS-Aware Irregular Collaborative Inference for Improving Throughput of DNN Services. 69:1-69:14 - Zheng Wang, Yuke Wang, Boyuan Feng, Dheevatsa Mudigere, Bharath Muthiah, Yufei Ding:
EL-Rec: Efficient Large-Scale Recommendation Model Training via Tensor-Train Embedding Table. 70:1-70:14 - Xiaoyang Sun, Wei Wang, Shenghao Qiu, Renyu Yang, Songfang Huang, Jie Xu, Zheng Wang:
STRONGHOLD: Fast and Affordable Billion-Scale Deep Learning Model Training. 71:1-71:17 - Yuntao Gui, Yidi Wu, Han Yang, Tatiana Jin, Boyang Li, Qihui Zhou, James Cheng, Fan Yu:
HGL: Accelerating Heterogeneous GNN Training with Holistic Representation and Optimization. 72:1-72:15 - Tal Ben-Nun, Linus Groner, Florian Deconinck, Tobias Wicky, Eddie Davis, Johann Dahm, Oliver Elbert, Rhea George, Jeremy McGibbon, Lukas Trümper, Elynn Wu, Oliver Fuhrer, Thomas C. Schulthess, Torsten Hoefler:
Productive Performance Engineering for Weather and Climate Modeling with Python. 73:1-73:14 - Misun Min, Yu-Hsiang Lan, Paul F. Fischer, Elia Merzari, Stefan Kerkemeier, Malachi Phillips, Thilina Rathnayake, April Novak, Derek Gaston, Noel Chalmers, Tim Warburton:
Optimization of Full-Core Reactor Simulations on Summit. 74:1-74:11 - Milinda Fernando, David Neilsen, Eric W. Hirschmann, Yosef Zlochower, Hari Sundar, Omar Ghattas, George Biros:
A GPU-Accelerated AMR Solver for Gravitational Wave Propagation. 75:1-75:15 - Cong Li, Yu Zhang, Jialei Wang, Hang Chen, Xian Liu, Tai Huang, Liang Peng, Shen Zhou, Lixin Wang, Shijian Ge:
From Correctable Memory Errors to Uncorrectable Memory Errors: What Error Bits Tell. 76:1-76:14 - Rohit Zambre, Aparna Chandramowlishwaran:
Lessons Learned on MPI+Threads Communication. 77:1-77:16 - Hao Lu, Michael A. Matheson, Vladyslav Oles, J. Austin Ellis, Wayne Joubert, Feiyi Wang:
Climbing the Summit and Pushing the Frontier of Mixed Precision Benchmarks at Extreme Scale. 78:1-78:15 - Santosh Pandey, Lingda Li, Thomas Flynn, Adolfy Hoisie, Hang Liu:
Scalable Deep Learning-Based Microarchitecture Simulation on GPUs. 79:1-79:15 - Paul Caheny, Lluc Alvarez, Marc Casas, Miquel Moretó:
TD-NUCA: Runtime Driven Management of NUCA Caches in Task Dataflow Programming Models. 80:1-80:15 - Pengmiao Zhang, Rajgopal Kannan, Ajitesh Srivastava, Anant V. Nori, Viktor K. Prasanna:
ReSemble: Reinforced Ensemble Framework for Data Prefetching. 81:1-81:14 - Junmin Xiao, Yunfei Pang, Qing Xue, Chaoyang Shui, Ke Meng, Hui Ma, Mingyi Li, Xiaoyang Zhang, Guangming Tan:
W-Cycle SVD: A Multilevel Algorithm for Batched SVD on GPUs. 82:1-82:16 - Qianxiang Ma, Sameer Deshmukh, Rio Yokota:
Scalable Linear Time Dense Direct Solver for 3-D Problems without Trailing Sub-Matrix Dependencies. 83:1-83:12 - Chao Chen, Per-Gunnar Martinsson:
Solving Linear Systems on a GPU with Hierarchically Off-Diagonal Low-Rank Approximations. 84:1-84:15 - Pengcheng Li, Yixin Guo, Yingwei Luo, Xiaolin Wang, Zhenlin Wang, Xu Liu:
Graph Neural Networks Based Memory Inefficiency Detection Using Selective Sampling. 85:1-85:14 - Pengcheng Li, Yixin Guo, Yongbin Gu:
Predicting Reuse Interval for Optimized Web Caching: An LSTM-Based Machine Learning Approach. 86:1-86:15 - Stella Bitchebe, Alain Tchana:
Out of Hypervisor (OoH): Efficient Dirty Page Tracking in Userspace Using Hardware Virtualization Features. 87:1-87:14
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.