default search action
HPEC 2020: Waltham, MA, USA
- 2020 IEEE High Performance Extreme Computing Conference, HPEC 2020, Waltham, MA, USA, September 22-24, 2020. IEEE 2020, ISBN 978-1-7281-9219-2
- Adam Michaleas, Lars Gjesteby, Michael P. Snyder, David Chavez, Meagan Ash, Matthew A. Melton, Damon G. Lamb, Sara N. Burke, Kevin J. Otto, Lee Kamentsky, Webster Guan, Kwanghun Chung, Laura J. Brattain:
Active Learning Pipeline for Brain Mapping in a High Performance Computing Environment. 1-6 - Tianyu Fu, Ziqian Wan, Guohao Dai, Yu Wang, Huazhong Yang:
LessMine: Reducing Sample Space and Data Access for Dense Pattern Mining. 1-7 - Shihao Zeng, Yihua Huang:
A Hybrid-Pipelined Architecture for FPGA-based Binary Weight DenseNet with High Performance-Efficiency. 1-5 - Zach Fredin, Jiri Zemanek, Camron Blackburn, Erik Strand, Amira Abdel-Rahman, Premila Rowles, Neil Gershenfeld:
Discrete Integrated Circuit Electronics (DICE). 1-8 - Ruizhi Zhang, Sasindu Wijeratne, Yang Yang, Sanmukh R. Kuppannagari, Viktor K. Prasanna:
A High Throughput Parallel Hash Table on FPGA using XOR-based Memory. 1-7 - Lisa J. K. Durbeck, Peter Athanas:
Incremental Streaming Graph Partitioning. 1-8 - Aditya Narayan, Ajay Joshi, Ayse K. Coskun:
Bandwidth Allocation in Silicon-Photonic Networks Using Application Instrumentation. 1-2 - Mark P. Blanco, Scott McMillan, Tze Meng Low:
Towards an Objective Metric for the Performance of Exact Triangle Count. 1-7 - Amir Sabbagh Molahosseini, Hans Vandierendonck:
Half-Precision Floating-Point Formats for PageRank: Opportunities and Challenges. 1-7 - Rushi Patel, Pierre-Francois Wolfe, Robert Munafo, Mayank Varia, Martin C. Herbordt:
Arithmetic and Boolean Secret Sharing MPC on FPGAs in the Data Center. 1-8 - Suha N. Kayum, Hussain J. AlSalem, Thierry-Laurent D. Tonellot, Ali Momin:
A Fault Tolerant Implementation for a Massively Parallel Seismic Framework. 1-8 - Tim Kaler, Brian Wheatman, Sarah Wooders:
High-Throughput Image Alignment for Connectomics using Frugal Snap Judgments. 1-9 - Peter Z. Vaillancourt, J. Eric Coulter, Richard Knepper, Brandon Barker:
Self-Scaling Clusters and Reproducible Containers to Enable Scientific Computing. 1-8 - Piotr Luszczek, Yaohung M. Tsai, Neil Lindquist, Hartwig Anzt, Jack J. Dongarra:
Scalable Data Generation for Evaluating Mixed-Precision Solvers. 1-6 - Haitham Ghalwash, Chun-Hsi Huang:
A congestion control mechanism for SDN-based fat-tree networks. 1-7 - William Butera:
A Dynamically Configurable Network for Software-Defined Hardware. 1-7 - Zhihui Du, Sen Zhang, David A. Bader, Jingkun Hu:
An Efficient LP Rounding Scheme for Replica Placement. 1-7 - Kai Huang, Mehmet Güngör, Stratis Ioannidis, Miriam Leeser:
Optimizing Use of Different Types of Memory for FPGAs in High Performance Computing. 1-7 - Mert Hidayetoglu, Carl Pearson, Vikram Sharma Mailthody, Eiman Ebrahimi, Jinjun Xiong, Rakesh Nagi, Wen-Mei Hwu:
At-Scale Sparse Deep Neural Network Inference With Efficient GPU Implementation. 1-7 - Wu-chun Feng, Da Zhang, Jing Zhang, Kaixi Hou, Sarunya Pumma, Hao Wang:
A Feasibility Study for MPI over HDFS. 1-7 - Anthony M. Cabrera, Roger D. Chamberlain:
Design and Performance Evaluation of Optimizations for OpenCL FPGA Kernels. 1-7 - Teresa M. Ranadive, Muthu Manikandan Baskaran:
Large-scale Sparse Tensor Decomposition Using a Damped Gauss-Newton Method. 1-8 - Chunshu Wu, Tong Geng, Chen Yang, Vipin Sachdeva, Woody Sherman, Martin C. Herbordt:
A Communication-Efficient Multi-Chip Design for Range-Limited Molecular Dynamics. 1-8 - Benjamin W. Priest, Alec Michael Dunton, Geoffrey Sanders:
Scaling Graph Clustering with Distributed Sketches. 1-7 - Suzanne J. Matthews, Aaron St. Leger:
Energy-Efficient Analysis of Synchrophasor Data using the NVIDIA Jetson Nano. 1-7 - John Goodhue, Julie Ma, Adrian Del Maestro, Sia Najafi, Bruce Segee, Scott Valcourt, Ralph Zottola:
Northeast Cyberteam - Building an Environment for Sharing Best Practices and Solutions for Research Computing. 1-5 - Sayan Ghosh, Mahantesh Halappanavar:
TriC: Distributed-memory Triangle Counting by Exploiting the Graph Structure. 1-6 - Richa Singh, Thomas Conroy, Patrick Schaumont:
Variable Precision Multiplication for Software-Based Neural Networks. 1-7 - Jesun Sahariar Firoz, Ang Li, Jiajia Li, Kevin J. Barker:
On the Feasibility of Using Reduced-Precision Tensor Core Operations for Graph Analytics. 1-7 - Albert Reuther, Peter Michaleas, Michael Jones, Vijay Gadepally, Siddharth Samsi, Jeremy Kepner:
Survey of Machine Learning Accelerators. 1-12 - Christopher Rackauckas, Qing Nie:
Stability-Optimized High Order Methods and Stiffness Detection for Pathwise Stiff Stochastic Differential Equations. 1-8 - Paul G. Flikkema, James Palmer, Tolga Yalçin, Bertrand Cambou:
Dynamic Computational Diversity with Multi-Radix Logic and Memory. 1-6 - Balasubramanian Seshasayee, Joshua B. Fryman, Ibrahim Hur:
Hash Table Scalability on Intel PIUMA. 1-2 - Cade Brown, Ahmad Abdelfattah, Stanimire Tomov, Jack J. Dongarra:
Design, Optimization, and Benchmarking of Dense Linear Algebra Algorithms on AMD GPUs. 1-7 - Márton Elekes, Attila Nagy, Dávid Sándor, János Benjamin Antal, Timothy A. Davis, Gábor Szárnyas:
A GraphBLAS solution to the SIGMOD 2014 Programming Contest using multi-source BFS. 1-7 - Safaa Diab, Mhd Ghaith Olabi, Izzat El Hajj:
KTRussExPLORER: Exploring the Design Space of K-truss Decomposition Optimizations on GPUs. 1-8 - Andrew Lumsdaine, Luke Dalessandro, Kevin Deweese, Jesun Firoz, Scott McMillan:
Triangle Counting with Cyclic Distributions. 1-8 - Filip Pawlowski, Rob H. Bisseling, Bora Uçar, A. N. Yzelman:
Combinatorial Tiling for Sparse Neural Networks. 1-7 - Adam Gjersvik:
Enhanced Parallel Simulation for ACAS X Development. 1-7 - Vitaliy Gleyzer, Andrew J. Soszynski, Edward K. Kao:
Leveraging Linear Algebra to Count and Enumerate Simple Subgraphs. 1-8 - Lucia Minah Yang, Alyson Fox:
Analysis of floating-point round-off error in linear algebra routines for graph clustering. 1-7 - Paul Sathre, Atharva Gondhalekar, Mohamed W. Hassan, Wu-Chun Feng:
MetaCL: Automated "Meta" OpenCL Code Generation for High-Level Synthesis on FPGA. 1-8 - Dian-Lun Lin, Tsung-Wei Huang:
A Novel Inference Algorithm for Large Sparse Neural Network using Task Graph Parallelism. 1-7 - Hector A. Li Sanchez, Alan D. George:
Hardware Acceleration of Nonlocal Means-Based Speckle Noise Removal Applied to SAR Imagery. 1-7 - Manoj Kumar, Pratap Pattnaik:
Post Quantum Cryptography(PQC) - An overview: (Invited Paper). 1-9 - Manish Bhattarai, Gopinath Chennupati, Erik Skau, Raviteja Vangara, Hristo N. Djidjev, Boian S. Alexandrov:
Distributed Non-Negative Tensor Train Decomposition. 1-10 - Andrew C. Kirby, Siddharth Samsi, Michael Jones, Albert Reuther, Jeremy Kepner, Vijay Gadepally:
Layer-Parallel Training with GPU Concurrency of Deep Residual Neural Networks via Nonlinear Multigrid. 1-7 - Siddharth Samsi, Michael Jones, Mark M. Veillette:
Compute, Time and Energy Characterization of Encoder-Decoder Networks with Automatic Mixed Precision Training. 1-6 - Justin A. Goodwin, Olivia M. Brown, Victoria Helus:
Fast Training of Deep Neural Networks Robust to Adversarial Perturbations. 1-7 - Cong Wang, George Papadimitriou, Mariam Kiran, Anirban Mandal, Ewa Deelman:
Identifying Execution Anomalies for Data Intensive Workflows Using Lightweight ML Techniques. 1-7 - Xin Wang, Wei Zhang:
Packing Narrow-Width Operands to Improve Energy Efficiency of General-Purpose GPU Computing. 1-7 - Mohammad Hasanzadeh-Mofrad, Rami G. Melhem, Muhammad Yousuf Ahmad, Mohammad Hammoud:
Studying the Effects of Hashing of Sparse Deep Neural Networks on Data and Model Parallelisms. 1-7 - Luke Kljucaric, Alex Johnson, Alan D. George:
Architectural Analysis of Deep Learning on Edge Accelerators. 1-7 - Siddharth Samsi, Jeremy Kepner, Vijay Gadepally, Michael B. Hurley, Michael Jones, Edward K. Kao, Sanjeev Mohindra, Albert Reuther, Steven Thomas Smith, William Song, Diane Staheli, Paul Monticciolo:
GraphChallenge.org Triangle Counting Performance. 1-9 - Steven D. Harris, Roger D. Chamberlain, Christopher D. Gill:
OpenCL Performance on the Intel Heterogeneous Architecture Research Platform. 1-9 - Siddharth Samsi, Andrew Prout, Michael Jones, Andrew C. Kirby, Bill Arcand, Bill Bergeron, David Bestor, Chansup Byun, Vijay Gadepally, Michael Houle, Matthew Hubbell, Anna Klein, Peter Michaleas, Lauren Milechin, Julie Mullen, Antonio Rosa, Charles Yee, Albert Reuther, Jeremy Kepner:
Benchmarking network fabrics for data distributed training of deep neural networks. 1-6 - Stijn Eyerman, Wim Heirman, Yigit Demir, Kristof Du Bois, Ibrahim Hur:
Projecting Performance for PIUMA using Down-Scaled Simulation. 1-7 - Ta-Yang Wang, Ajitesh Srivastava, Viktor K. Prasanna:
A Framework for Task Mapping onto Heterogeneous Platforms. 1-6 - Tianjian Lu, Thibault Marin, Yue Zhuo, Yi-Fan Chen, Chao Ma:
Accelerating MRI Reconstruction on TPUs. 1-9 - Uchenna Chukwu, Raouf Dridi, Jesse Berwald, Michael Booth, John Dawson, DeYung Le, Mark Wainger, Steven P. Reinhardt:
Constrained-optimization Approach Delivers Superior Classical Performance for Graph Partitioning via Quantum-ready Method. 1-6 - Muthu Manikandan Baskaran, Charles Jin, Benoît Meister, Jonathan Springer:
Automatic Mapping and Optimization to Kokkos with Polyhedral Compilation. 1-7 - Sanil Rao, Anurag Kutuluru, Paul Brouwer, Scott McMillan, Franz Franchetti:
GBTLX: A First Look. 1-7 - Ryan S. Luley, Qinru Qiu:
A Deep Q-Learning Approach for GPU Task Scheduling. 1-7 - Keita Teranishi, Daniel M. Dunlavy, Jeremy M. Myers, Richard F. Barrett:
SparTen: Leveraging Kokkos for On-node Parallelism in a Second-Order Method for Fitting Canonical Polyadic Tensor Models to Poisson Data. 1-7 - Maarten Hattink, Giuseppe Di Guglielmo, Luca P. Carloni, Keren Bergman:
A Scalable Architecture for CNN Accelerators Leveraging High-Performance Memories. 1-6 - Wesley Brewer, Greg Behm, Alan L. Scheinine, Ben Parsons, Wesley Emeneker, Robert P. Trevino:
Inference Benchmarking on HPC Systems. 1-9 - Dimitri Leggas, Thomas Henretty, James R. Ezick, Muthu Manikandan Baskaran, Brendan von Hofe, Grace H. Cimaszewski, Harper Langston, Richard Lethin:
Multiscale Data Analysis Using Binning, Tensor Decompositions, and Backtracking. 1-7 - Carl L. Colena, Michael J. Russell, Stephen A. Braun:
Minesweeper: A Novel and Fast Ordered-Statistic CFAR Algorithm. 1-6 - Hao Wen, Wei Zhang:
Denial of Service in CPU-GPU Heterogeneous Architectures. 1-5 - Brian A. Page, Peter M. Kogge:
Scalability of Streaming on Migrating Threads. 1-8 - Vijay Gadepally, Mihailo Isakov, Rashmi S. Agrawal, Jeremy Kepner, Karen Gettings, Michel A. Kinsy:
Homomorphic Encryption Based Secure Sensor Data Processing. 1-7 - Mohammad Hasanzadeh-Mofrad, Rami G. Melhem, Muhammad Yousuf Ahmad, Mohammad Hammoud:
Accelerating Distributed Inference of Sparse Deep Neural Networks via Mitigating the Straggler Effect. 1-7 - Harper Langston, Pierre-David Letourneau, Julia Wei, Larry Weintraub, Mitchell Tong Harris, Richard Lethin, Eric Papenhausen, Meifeng Lin:
Approximate Inverse Chain Preconditioner: Iteration Count Case Study for Spectral Support Solvers. 1-8 - Andrew J. Weinert, Ngaire Underhill, Bilal Gill, Ashley Wicks:
Processing of Crowdsourced Observations of Aircraft in a High Performance Computing Environment. 1-6 - Dimitris Floros, Nikos Pitsianis, Xiaobai Sun:
Fast Graphlet Transform of Sparse Graphs. 1-8 - Pouya Haghi, Anqi Guo, Qingqing Xiong, Rushi Patel, Chen Yang, Tong Geng, Justin T. Broaddus, Ryan J. Marshall, Anthony Skjellum, Martin C. Herbordt:
FPGAs in the Network and Novel Communicator Support Accelerate MPI Collectives. 1-10 - Carlo Pascoe, Lawrence C. Stewart, Brian W. Sherman, Vipin Sachdeva, Martin C. Herbordt:
Execution of Complete Molecular Dynamics Simulations on Multiple FPGAs. 1-2 - Mahsa Bayati, Miriam Leeser, Ningfang Mi:
Exploiting GPU Direct Access to Non-Volatile Memory to Accelerate Big Data Processing. 1-6 - Jianyu Mao, Kiana Harris, Nae-Rong Chang, Caleb Pennell, Yiming Ren:
Train and Deploy an Image Classifier for Disaster Response. 1-5 - Windy S. Slater, Nayana P. Tiwari, Tyler M. Lovelly, Jesse K. Mee:
Total Ionizing Dose Radiation Testing of NVIDIA Jetson Nano GPUs. 1-3 - Mark Barnell, Courtney Raymond, Matthew Wilson, Darrek Isereau, Chris Cicotta:
Target Classification in Synthetic Aperture Radar and Optical Imagery Using Loihi Neuromorphic Hardware. 1-6 - Jialing Zhang, Jiaxi Chen, Aekyeung Moon, Xiaoyan Zhuo, Seung Woo Son:
Bit-Error Aware Quantization for DCT-based Lossy Compression. 1-7 - Yuan Meng, Yang Yang, Sanmukh R. Kuppannagari, Rajgopal Kannan, Viktor K. Prasanna:
How to Efficiently Train Your AI Agent? Characterizing and Evaluating Deep Reinforcement Learning on Heterogeneous Platforms. 1-7 - Yutai Zhou, Shawn Manuel, Peter Morales, Sheng Li, Jaime Peña, Ross E. Allen:
Towards a Distributed Framework for Multi-Agent Reinforcement Learning Research. 1-9 - Jingbo Hu, Guohao Dai, Yu Wang, Huazhong Yang:
GraphSDH: A General Graph Sampling Framework with Distribution and Hierarchy. 1-7 - Todd Hricik, David A. Bader, Oded Green:
Using RAPIDS AI to Accelerate Graph Data Science Workflows. 1-4 - Dimitris Floros, Tiancheng Liu, Nikos Pitsianis, Xiaobai Sun:
Using Graphlet Spectrograms for Temporal Pattern Analysis of Virus-Research Collaboration Networks. 1-7 - Darko Ivanovich, Chenfeng Zhao, Xuan Zhang, Roger D. Chamberlain, Amit Deliwala, Viktor Gruev:
Chip-to-chip Optical Data Communications using Polarization Division Multiplexing. 1-8 - R. Usha, Prachi Pandey, N. Mangala:
A Comprehensive Comparison and Analysis of OpenACC and OpenMP 4.5 for NVIDIA GPUs. 1-6 - Justin Thaler, Woong Shin, Steven Roberts, James H. Rogers, Todd Rosedahl:
Hybrid Approach to HPC Cluster Telemetry and Hardware Log Analytics. 1-7 - Prasanth Chatarasi, Stephen Neuendorffer, Samuel Bayliss, Kees A. Vissers, Vivek Sarkar:
Vyasa: A High-Performance Vectorizing Compiler for Tensor Convolutions on the Xilinx AI Engine. 1-10 - Jeremy M. Myers, Daniel M. Dunlavy, Keita Teranishi, David S. Hollman:
Parameter Sensitivity Analysis of the SparTen High Performance Sparse Tensor Decomposition Software. 1-7 - Jeremy Kepner, Chad R. Meiners, Chansup Byun, Sarah McGuire, Timothy Davis, William Arcand, Jonathan Bernays, David Bestor, William Bergeron, Vijay Gadepally, Raul Harnasch, Matthew Hubbell, Micheal Houle, Michael Jones, Andrew C. Kirby, Anna Klein, Lauren Milechin, Julie Mullen, Andrew Prout, Albert Reuther, Antonio Rosa, Siddharth Samsi, Doug Stetson, Adam Tse, Charles Yee, Peter Michaleas:
Multi-Temporal Analysis and Scaling Relations of 100, 000, 000, 000 Network Packets. 1-6 - Daniel O'Malley, John K. Golden:
Homomorphic Encryption for Quantum Annealing with Spin Reversal Transformations. 1-6 - Shekhar Dwivedi, Andreas Heumann:
Profiling and Optimization of CT Reconstruction on Nvidia Quadro GV100. 1-7 - Jeremy Kepner, Andreas Kipf, Darren Engwirda, Navin Vembar, Michael Jones, Lauren Milechin, Vijay Gadepally, Chris Hill, Tim Kraska, William Arcand, David Bestor, William Bergeron, Chansup Byun, Matthew Hubbell, Michael Houle, Andrew C. Kirby, Anna Klein, Julie Mullen, Andrew Prout, Albert Reuther, Antonio Rosa, Sid Samsi, Charles Yee, Peter Michaleas:
Fast Mapping onto Census Blocks. 1-8 - Wesley Brewer, Greg Behm, Alan L. Scheinine, Ben Parsons, Wesley Emeneker, Robert P. Trevino:
iBench: a Distributed Inference Simulation and Benchmark Suite. 1-6 - Matthew Hutchinson, Siddharth Samsi, William Arcand, David Bestor, Bill Bergeron, Chansup Byun, Micheal Houle, Matthew Hubbell, Michael Jones, Jeremy Kepner, Andrew C. Kirby, Peter Michaleas, Lauren Milechin, Julie Mullen, Andrew Prout, Antonio Rosa, Albert Reuther, Charles Yee, Vijay Gadepally:
Accuracy and Performance Comparison of Video Action Recognition Approaches. 1-8 - Gregory A. Ciccarelli, Michael Nolan, Hrishikesh M. Rao, Tanya Talkar, Anne T. O'Brien, Gloria Vergara-Diaz, Ross Zafonte, Thomas F. Quatieri, Ryan J. McKindles, Paolo Bonato, Adam C. Lammert:
Human balance models optimized using a large-scale, parallel architecture with applications to mild traumatic brain injury. 1-8 - Sean Fraser, Helen Xu, Charles E. Leiserson:
Work-Efficient Parallel Algorithms for Accurate Floating-Point Prefix Sums. 1-7 - Evan T. Kain, Tyler M. Lovelly, Alan D. George:
Evaluating SEU Resilience of CNNs with Fault Injection. 1-5 - Nina Mujkanovic, Karthee Sivalingam, Alfio Lazzaro:
Optimising AI Training Deployments using Graph Compilers and Containers. 1-8 - Seunghwa Kang, Alexandre Fender, Joe Eaton, Brad Rees:
Computing PageRank Scores of Web Crawl Data Using DGX A100 Clusters. 1-4 - Sriram Aananthakrishnan, Robert Pawlowski, Joshua B. Fryman, Ibrahim Hur:
Efficient Sparse Matrix-Vector Multiplication on Intel PIUMA Architecture. 1-2 - Kaushik Velusamy, Thomas B. Rolinger, Janice McMahon:
Performance Strategies for Parallel Bitonic Sort on a Migratory Thread Architecture. 1-7 - Paul L. Springer, Thomas Schibler, Géraud Krawezik, Jack Lightholder, Peter M. Kogge:
Machine Learning Algorithm Performance on the Lucata Computer. 1-7 - Roozbeh Karimi, David M. Koppelman, Chris J. Michael:
Fast GPU Graph Contraction by Combining Efficient Shallow Searches and Post-Culling. 1-7 - Tong Geng, Chunshu Wu, Cheng Tan, Bo Fang, Ang Li, Martin C. Herbordt:
CQNN: a CGRA-based QNN Framework. 1-7 - Tian Ye, Rajgopal Kannan, Viktor K. Prasanna:
Accelerator Design and Performance Modeling for Homomorphic Encrypted CNN Inference. 1-7 - Jeremy Kepner, Simon Alford, Vijay Gadepally, Michael Jones, Lauren Milechin, Albert Reuther, Ryan A. Robinett, Sid Samsi:
GraphChallenge.org Sparse Deep Neural Network Performance. 1-7 - Andrew C. Kirby, Dimitri J. Mavriplis:
GPU-Accelerated Discontinuous Galerkin Methods: 30x Speedup on 345 Billion Unknowns. 1-7 - Géraud Krawezik, Shannon K. Kuntz, Peter M. Kogge:
Implementing Sparse Linear Algebra Kernels on the Lucata Pathfinder-A Computer. 1-6 - David Langerman, Alex Johnson, Kyle Buettner, Alan D. George:
Beyond Floating-Point Ops: CNN Performance Prediction with Critical Datapath Length. 1-9 - Daniel Hawthorne, Michael P. Kapralos, Raymond W. Blaine, Suzanne J. Matthews:
Evaluating Cryptographic Performance of Raspberry Pi Clusters. 1-9 - Chansup Byun, Jeremy Kepner, William Arcand, David Bestor, Bill Bergeron, Vijay Gadepally, Michael Houle, Matthew Hubbell, Michael Jones, Andrew C. Kirby, Anna Klein, Peter Michaleas, Lauren Milechin, Julie Mullen, Andrew Prout, Antonio Rosa, Siddharth Samsi, Charles Yee, Albert Reuther:
Best of Both Worlds: High Performance Interactive and Batch Launching. 1-7 - Alan Ehret, Eliakin Del Rosario, Karen Gettings, Michel A. Kinsy:
A Hardware Root-of-Trust Design for Low-Power SoC Edge Devices. 1-6 - Austin Chase Minor, Zhihui Du, Yankui Sun, David A. Bader, Chao Wu, Jianyan Wei:
GPU Accelerated Anomaly Detection of Large Scale Light Curves. 1-7
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.