


Остановите войну!
for scientists:


default search action
Torsten Hoefler
Torsten Höfler
Person information

- affiliation: ETH Zürich
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j60]Maciej Besta
, Robert Gerstenberger
, Emanuel Peter
, Marc Fischer
, Michal Podstawski
, Claude Barthels
, Gustavo Alonso
, Torsten Hoefler
:
Demystifying Graph Databases: Analysis and Taxonomy of Data Organization, System Designs, and Graph Queries. ACM Comput. Surv. 56(2): 31:1-31:40 (2024) - 2023
- [j59]Torsten Hoefler, Thomas Häner, Matthias Troyer:
Disentangling Hype from Practicality: On Realistically Achieving Quantum Advantage. Commun. ACM 66(5): 82-87 (2023) - [j58]Torsten Hoefler
, Duncan Roweth, Keith D. Underwood, Robert Alverson, Mark Griswold, Vahid Tabatabaee, Mohan Kalkunte, Surendra Anubolu, Siyuan Shen, Moray McLaren, Abdul Kabbani, Steve Scott:
Data Center Ethernet and Remote Direct Memory Access: Issues at Hyperscale. Computer 56(7): 67-77 (2023) - [j57]Torsten Hoefler
, Bjorn Stevens
, Andreas F. Prein
, Johanna Baehr
, Thomas C. Schulthess, Thomas F. Stocker
, John A. Taylor
, Daniel Klocke
, Pekka Manninen, Piers M. Forster, Tobias Kölling, Nicolas Gruber
, Hartwig Anzt, Claudia Frauen
, Florian Ziemen
, Milan Klöwer
, Karthik Kashinath, Christoph M. Schär, Oliver Fuhrer, Bryan N. Lawrence
:
Earth Virtualization Engines: A Technical Perspective. Comput. Sci. Eng. 25(3): 50-59 (2023) - [j56]Satoshi Matsuoka
, Jens Domke
, Mohamed Wahib
, Aleksandr Drozd
, Torsten Hoefler
:
Myths and legends in high-performance computing. Int. J. High Perform. Comput. Appl. 37(3-4): 245-259 (2023) - [j55]Maciej Besta
, Marc Fischer, Vasiliki Kalavri
, Michael Kapralov, Torsten Hoefler
:
Practice of Streaming Processing of Dynamic Graphs: Concepts, Models, and Systems. IEEE Trans. Parallel Distributed Syst. 34(6): 1860-1876 (2023) - [j54]Paul Scheffler, Florian Zaruba, Fabian Schuiki, Torsten Hoefler, Luca Benini:
Sparse Stream Semantic Registers: A Lightweight ISA Extension Accelerating General Sparse Linear Algebra. IEEE Trans. Parallel Distributed Syst. 34(12): 3147-3161 (2023) - [c256]Tal Ben-Nun, Berke Ates
, Alexandru Calotoiu, Torsten Hoefler:
Bridging Control-Centric and Data-Centric Optimization. CGO 2023: 173-185 - [c255]Tal Ben-Nun
, Lukas Gianinazzi
, Torsten Hoefler
, Yishai Oltchik:
Maximum Flows in Parametric Graph Templates. CIAC 2023: 97-111 - [c254]Patrick Iff, Maciej Besta, Matheus A. Cavalcante, Tim Fischer, Luca Benini, Torsten Hoefler:
Sparse Hamming Graph: A Customizable Network-on-Chip Topology. DAC 2023: 1-6 - [c253]Patrick Iff, Maciej Besta, Matheus A. Cavalcante, Tim Fischer, Luca Benini, Torsten Hoefler:
HexaMesh: Scaling to Hundreds of Chiplets with an Optimized Chiplet Arrangement. DAC 2023: 1-6 - [c252]Tiziano De Matteis
, Lukas Gianinazzi
, Johannes de Fine Licht
, Torsten Hoefler
:
Streaming Task Graph Scheduling for Dataflow Architectures. HPDC 2023: 225-237 - [c251]Elias Frantar, Saleh Ashkboos, Torsten Hoefler, Dan Alistarh:
OPTQ: Accurate Quantization for Generative Pre-trained Transformers. ICLR 2023 - [c250]Langwen Huang, Torsten Hoefler:
Compressing multidimensional weather and climate data into neural networks. ICLR 2023 - [c249]Lukas Trümper
, Tal Ben-Nun
, Philipp Schaad
, Alexandru Calotoiu
, Torsten Hoefler
:
Performance Embeddings: A Similarity-Based Transfer Tuning Approach to Performance Optimization. ICS 2023: 50-62 - [c248]Marcin Copik
, Roman Böhringer
, Alexandru Calotoiu
, Torsten Hoefler
:
FMI: Fast and Cheap Message Passing for Serverless Functions. ICS 2023: 373-385 - [c247]Marcin Copik, Konstantin Taranov, Alexandru Calotoiu, Torsten Hoefler:
rFaaS: Enabling High Performance Serverless with RDMA and Leases. IPDPS 2023: 897-907 - [c246]Maciej Besta
, Robert Gerstenberger
, Marc Fischer
, Michal Podstawski
, Nils Blach
, Berke Egeli
, George Mitenkov
, Wojciech Chlapek
, Marek T. Michalewicz
, Hubert Niewiadomski
, Jürgen Müller, Torsten Hoefler
:
The Graph Database Interface: Scaling Online Transactional and Analytical Graph Workloads to Hundreds of Thousands of Cores. SC 2023: 22:1-22:18 - [c245]Marcin Chrapek
, Mikhail Khalilov
, Torsten Hoefler
:
HEAR: Homomorphically Encrypted Allreduce. SC 2023: 36:1-36:17 - [c244]Maciej Besta
, Pawel Renc
, Robert Gerstenberger
, Paolo Sylos Labini
, Alexandros Nikolaos Ziogas
, Tiancheng Chen
, Lukas Gianinazzi
, Florian Scheidl
, Kalman Szenes
, Armon Carigiet
, Patrick Iff
, Grzegorz Kwasniewski
, Raghavendra Kanakagiri
, Chio Ge
, Sammy Jaeger
, Jaroslaw Was
, Flavio Vella
, Torsten Hoefler
:
High-Performance and Programmable Attentional Graph Neural Networks with Global Tensor Formulations. SC 2023: 66:1-66:16 - [c243]Roberto L. Castro
, Andrei Ivanov
, Diego Andrade
, Tal Ben-Nun
, Basilio B. Fraguela
, Torsten Hoefler
:
VENOM: A Vectorized N: M Format for Unleashing the Power of Sparse Tensor Cores. SC 2023: 72:1-72:14 - [c242]Wenqi Jiang
, Shigang Li
, Yu Zhu
, Johannes de Fine Licht
, Zhenhao He
, Runbin Shi
, Cédric Renggli, Shuai Zhang
, Theodoros Rekatsinas
, Torsten Hoefler
, Gustavo Alonso
:
Co-design Hardware and Algorithm for Vector Search. SC 2023: 87:1-87:15 - [c241]Philipp Schaad
, Timo Schneider
, Tal Ben-Nun
, Alexandru Calotoiu
, Alexandros Nikolaos Ziogas
, Torsten Hoefler
:
FuzzyFlow: Leveraging Dataflow To Find and Squash Program Optimization Bugs. SC 2023: 88:1-88:15 - [c240]Yue Shi
, Tommy Nguyen
, Samuel Alexander Stein, Tim Stavenger
, Marvin Warner
, Martin Roetteler
, Torsten Hoefler
, Ang Li
:
A Reference Implementation for a Quantum Message Passing Interface. SC Workshops 2023: 1420-1425 - [c239]Daniele De Sensi
, Tiziano De Matteis
, Konstantin Taranov
, Salvatore Di Girolamo
, Tobias Rahn
, Torsten Hoefler
:
Noise in the Clouds: Influence of Network Performance Variability on Application Scalability. SIGMETRICS (Abstracts) 2023: 17-18 - [c238]Kartik Lakhotia
, Kelly Isham
, Laura Monroe
, Maciej Besta
, Torsten Hoefler
, Fabrizio Petrini
:
In-network Allreduce with Multiple Spanning Trees on PolarFly. SPAA 2023: 165-176 - [c237]Andrei Ivanov, Benjamin Rothenberger, Arnaud Dethise, Marco Canini, Torsten Hoefler, Adrian Perrig:
SAGE: Software-based Attestation for GPU Execution. USENIX Annual Technical Conference 2023: 485-499 - [i155]Niels Gleinig, Tal Ben-Nun, Torsten Hoefler:
A Theory of I/O-Efficient Sparse Neural Network Inference. CoRR abs/2301.01048 (2023) - [i154]Satoshi Matsuoka, Jens Domke, Mohamed Wahib, Aleksandr Drozd, Torsten Hoefler:
Myths and Legends in High-Performance Computing. CoRR abs/2301.02432 (2023) - [i153]Jinfan Chen, Shigang Li, Ran Guo, Jinhui Yuan, Torsten Hoefler:
AutoDDL: Automatic Distributed Deep Learning with Asymptotically Optimal Communication. CoRR abs/2301.06813 (2023) - [i152]Niels Gleinig, Tobias Rohner, Torsten Hoefler:
Approximate Reversible Circuits for NISQ-Era Quantum Computers. CoRR abs/2302.01066 (2023) - [i151]Torsten Hoefler, Duncan Roweth, Keith D. Underwood, Bob Alverson, Mark Griswold, Vahid Tabatabaee, Mohan Kalkunte, Surendra Anubolu, Siyuan Shen, Abdul Kabbani, Moray McLaren, Steve Scott:
Datacenter Ethernet and RDMA: Issues at Hyperscale. CoRR abs/2302.03337 (2023) - [i150]Kartik Lakhotia, Laura Monroe, Kelly Isham, Maciej Besta, Nils Blach, Torsten Hoefler, Fabrizio Petrini:
PolarStar: Expanding the Scalability Horizon of Diameter-3 Networks. CoRR abs/2302.07217 (2023) - [i149]Lukas Trümper, Tal Ben-Nun, Philipp Schaad, Alexandru Calotoiu, Torsten Hoefler:
Performance Embeddings: A Similarity-based Approach to Automatic Performance Optimization. CoRR abs/2303.08142 (2023) - [i148]Andrei Ivanov, Nikoli Dryden, Tal Ben-Nun, Saleh Ashkboos, Torsten Hoefler:
STen: Productive and Efficient Sparsity in PyTorch. CoRR abs/2304.07613 (2023) - [i147]Kazuki Osawa, Satoki Ishikawa, Rio Yokota, Shigang Li, Torsten Hoefler:
ASDL: A Unified Interface for Gradient Preconditioning in PyTorch. CoRR abs/2305.04684 (2023) - [i146]Thomas Benz, Michael Rogenmoser, Paul Scheffler, Samuel Riedel, Alessandro Ottaviano, Andreas Kurth, Torsten Hoefler, Luca Benini:
A High-performance, Energy-efficient Modular DMA Engine Architecture. CoRR abs/2305.05240 (2023) - [i145]Paul Scheffler, Florian Zaruba, Fabian Schuiki, Torsten Hoefler, Luca Benini:
Sparse Stream Semantic Registers: A Lightweight ISA Extension Accelerating General Sparse Linear Algebra. CoRR abs/2305.05559 (2023) - [i144]Marcin Copik, Roman Böhringer, Alexandru Calotoiu, Torsten Hoefler:
FMI: Fast and Cheap Message Passing for Serverless Functions. CoRR abs/2305.08763 (2023) - [i143]Maciej Besta, Robert Gerstenberger, Marc Fischer, Michal Podstawski, Jürgen Müller, Nils Blach, Berke Egeli, George Mitenkov, Wojciech Chlapek, Marek T. Michalewicz, Torsten Hoefler:
High-Performance Graph Databases That Are Portable, Programmable, and Scale to Hundreds of Thousands of Cores. CoRR abs/2305.11162 (2023) - [i142]Tal Ben-Nun, Berke Ates, Alexandru Calotoiu, Torsten Hoefler:
Bridging Control-Centric and Data-Centric Optimization. CoRR abs/2306.00366 (2023) - [i141]Tiziano De Matteis, Lukas Gianinazzi, Johannes de Fine Licht, Torsten Hoefler:
Streaming Task Graph Scheduling for Dataflow Architectures. CoRR abs/2306.02730 (2023) - [i140]Tim Dettmers, Ruslan Svirschevski, Vage Egiazarian, Denis Kuznedelev, Elias Frantar, Saleh Ashkboos, Alexander Borzunov, Torsten Hoefler, Dan Alistarh:
SpQR: A Sparse-Quantized Representation for Near-Lossless LLM Weight Compression. CoRR abs/2306.03078 (2023) - [i139]Wenqi Jiang, Shigang Li, Yu Zhu, Johannes de Fine Licht, Zhenhao He, Runbin Shi, Cédric Renggli, Shuai Zhang, Theodoros Rekatsinas, Torsten Hoefler, Gustavo Alonso:
Co-design Hardware and Algorithm for Vector Search. CoRR abs/2306.11182 (2023) - [i138]Philipp Schaad
, Timo Schneider, Tal Ben-Nun, Alexandru Calotoiu, Alexandros Nikolaos Ziogas, Torsten Hoefler:
FuzzyFlow: Leveraging Dataflow To Find and Squash Program Optimization Bugs. CoRR abs/2306.16178 (2023) - [i137]Torsten Hoefler, Thomas Häner, Matthias Troyer:
Disentangling Hype from Practicality: On Realistically Achieving Quantum Advantage. CoRR abs/2307.00523 (2023) - [i136]Tal Ben-Nun, Lukas Gianinazzi, Torsten Hoefler, Yishai Oltchik:
Maximum Flows in Parametric Graph Templates. CoRR abs/2307.08420 (2023) - [i135]Yunqiang Li, Jan C. van Gemert, Torsten Hoefler, Bert Moons, Evangelos Eleftheriou, Bram-Ernst Verhoef:
Differentiable Transportation Pruning. CoRR abs/2307.08483 (2023) - [i134]Maciej Besta, Nils Blach, Ales Kubicek, Robert Gerstenberger, Lukas Gianinazzi, Joanna Gajda, Tomasz Lehmann, Michal Podstawski, Hubert Niewiadomski, Piotr Nyczyk, Torsten Hoefler:
Graph of Thoughts: Solving Elaborate Problems with Large Language Models. CoRR abs/2308.09687 (2023) - [i133]Julia Bazinska, Andrei Ivanov, Tal Ben-Nun, Nikoli Dryden, Maciej Besta, Siyuan Shen, Torsten Hoefler:
Cached Operator Reordering: A Unified View for Fast GNN Training. CoRR abs/2308.12093 (2023) - [i132]Mikhail Khalilov, Marcin Chrapek, Siyuan Shen, Alessandro Vezzu, Thomas Benz, Salvatore Di Girolamo, Timo Schneider, Daniele De Sensi, Luca Benini, Torsten Hoefler:
OSMOSIS: Enabling Multi-Tenancy in Datacenter SmartNICs. CoRR abs/2309.03628 (2023) - [i131]Torsten Hoefler, Bjorn Stevens, Andreas F. Prein, Johanna Baehr, Thomas C. Schulthess, Thomas F. Stocker, John A. Taylor, Daniel Klocke, Pekka Manninen, Piers M. Forster, Tobias Kölling, Nicolas Gruber, Hartwig Anzt, Claudia Frauen, Florian Ziemen, Milan Klöwer, Karthik Kashinath, Christoph M. Schär, Oliver Fuhrer, Bryan N. Lawrence:
Earth Virtualization Engines - A Technical Perspective. CoRR abs/2309.09002 (2023) - [i130]Daniele De Sensi, Edgar Costa Molero, Salvatore Di Girolamo, Laurent Vanbever, Torsten Hoefler:
Canary: Congestion-Aware In-Network Allreduce Using Dynamic Trees. CoRR abs/2309.16214 (2023) - [i129]Roberto L. Castro, Andrei Ivanov, Diego Andrade, Tal Ben-Nun, Basilio B. Fraguela, Torsten Hoefler:
VENOM: A Vectorized N: M Format for Unleashing the Power of Sparse Tensor Cores. CoRR abs/2310.02065 (2023) - [i128]Nils Blach, Maciej Besta, Daniele De Sensi, Jens Domke, Hussein Harake, Shigang Li, Patrick Iff, Marek Konieczny, Kartik Lakhotia, Ales Kubicek, Marcel Ferrari, Fabrizio Petrini, Torsten Hoefler:
A High-Performance Design, Implementation, Deployment, and Evaluation of The Slim Fly Network. CoRR abs/2310.03742 (2023) - [i127]Saleh Ashkboos, Ilia Markov, Elias Frantar, Tingxuan Zhong, Xincheng Wang, Jie Ren, Torsten Hoefler, Dan Alistarh:
Towards End-to-end 4-Bit Inference on Generative Large Language Models. CoRR abs/2310.09259 (2023) - [i126]Wenqi Jiang, Marco Zeller, Roger Waleffe, Torsten Hoefler, Gustavo Alonso:
Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models. CoRR abs/2310.09949 (2023) - [i125]Patrick Iff, Benigna Bruggmann, Maciej Besta, Luca Benini, Torsten Hoefler:
RapidChiplet: A Toolchain for Rapid Design Space Exploration of Chiplet Architectures. CoRR abs/2311.06081 (2023) - [i124]Wei Qiu, Marcin Copik, Yun Wang, Alexandru Calotoiu, Torsten Hoefler:
User-guided Page Merging for Memory Deduplication in Serverless Systems. CoRR abs/2311.13588 (2023) - [i123]Maciej Besta, Afonso Claudino Catarino, Lukas Gianinazzi, Nils Blach, Piotr Nyczyk, Hubert Niewiadomski, Torsten Hoefler:
HOT: Higher-Order Dynamic Graph Representation Learning with Efficient Transformers. CoRR abs/2311.18526 (2023) - 2022
- [j53]Torsten Hoefler, Ariel Hendel, Duncan Roweth:
The Convergence of Hyperscale Data Center and High-Performance Computing Networks. Computer 55(7): 29-37 (2022) - [j52]Torsten Hoefler:
Benchmarking Data Science: 12 Ways to Lie With Statistics and Performance on Parallel Computers. Computer 55(8): 49-56 (2022) - [j51]Marcin Copik
, Tobias Grosser, Torsten Hoefler, Paolo Bientinesi, Benjamin Berkels
:
Work-Stealing Prefix Scan: Addressing Load Imbalance in Large-Scale Image Registration. IEEE Trans. Parallel Distributed Syst. 33(3): 523-535 (2022) - [c236]Konstantin Taranov, Benjamin Rothenberger, Daniele De Sensi, Adrian Perrig, Torsten Hoefler:
NeVerMore: Exploiting RDMA Mistakes in NVMe-oF Storage Applications. CCS 2022: 2765-2778 - [c235]Andrea Cossettini, Konstantin Taranov, Christian Vogt, Michele Magno
, Torsten Hoefler, Luca Benini:
A RDMA Interface for Ultra-Fast Ultrasound Data-Streaming over an Optical Link. DATE 2022: 80-83 - [c234]Niels Gleinig, Torsten Hoefler:
Circuits for Measurement Based Quantum State Preparation. DATE 2022: 328-333 - [c233]Andrea Biagioni, Paolo Cretaro, Ottorino Frezza, Francesca Lo Cicero, Alessandro Lonardo, Michele Martinelli, Pier Stanislao Paolucci
, Elena Pastorelli, Francesco Simula, Matteo Turisini, Piero Vicini
, Roberto Ammendola, Pascale Bernier-Bruna, Claire Chen, Said Derradji, Stéphane Guez, Pierre-Axel Lagadec, Gregoire Pichon, Etienne Walter, Gaetan De Gassowski, Matthieu Hautreaux, Stephane Mathieu, Gilles Moreau, Marc Pérache
, Hugo Taboada, Torsten Hoefler, Timo Schneider, Matteo Barnaba, Giuseppe Piero Brandino, Francesco De Giorgi, Matteo Poggi, Iakovos Mavroidis, Yannis Papaefstathiou, Nikolaos Tampouratzis, Benjamin Kalisch, Ulrich Krackhardt, Mondrian Nuessle, Pantelis Xirouchakis, Vangelis Mageiropoulos, Michalis Gianioudis, Harisis Loukas, Aggelos Ioannou
, Nikos Kallimanis, Nikos Chrysos, Manolis Katevenis, Wolfang Frings, Dominik Gottwald, Felime Guimaraes, Max Holicki, Volker Marx, Yannik Muller, Carsten Clauss, Hugo Falter, Xu Huang, Jennifer Lopez Barillao, Thomas Moschny, Simon Pickartz, Francisco J. Alfaro, Jesús Escudero-Sahuquillo
, Pedro Javier García, Francisco J. Quiles, José L. Sánchez, Adrián Castelló
, Jose Duro, María Engracia Gómez, Enrique S. Quintana-Ortí, Julio Sahuquillo, Eugenio Stabile:
RED-SEA: Network Solution for Exascale Architectures. DSD 2022: 712-719 - [c232]Shiyi Cao, Salvatore Di Girolamo, Torsten Hoefler:
Accelerating Data Serialization/Deserialization Protocols with In-Network Compute. ExaMPI@SC 2022: 22-30 - [c231]Johannes de Fine Licht, Christopher A. Pattison, Alexandros Nikolaos Ziogas, David Simmons-Duffin, Torsten Hoefler:
Fast Arbitrary Precision Floating Point on FPGA. FCCM 2022: 1-9 - [c230]Carl-Johannes Johnsen
, Tiziano De Matteis
, Tal Ben-Nun, Johannes de Fine Licht, Torsten Hoefler:
Temporal Vectorization: A Compiler Approach to Automatic Multi-Pumping. ICCAD 2022: 85:1-85:9 - [c229]Bryan A. Plummer, Nikoli Dryden, Julius Frost, Torsten Hoefler, Kate Saenko:
Neural Parameter Allocation Search. ICLR 2022 - [c228]Larissa Schmid
, Marcin Copik, Alexandru Calotoiu, Dominik Werle, Andreas Reiter, Michael Selzer, Anne Koziolek, Torsten Hoefler:
Performance-detective: automatic deduction of cheap and accurate performance models. ICS 2022: 3:1-3:13 - [c227]Alexandru Calotoiu, Tal Ben-Nun, Grzegorz Kwasniewski, Johannes de Fine Licht, Timo Schneider, Philipp Schaad
, Torsten Hoefler:
Lifting C semantics for dataflow optimization. ICS 2022: 17:1-17:13 - [c226]Oliver Rausch, Tal Ben-Nun, Nikoli Dryden, Andrei Ivanov, Shigang Li, Torsten Hoefler:
A data-centric optimization framework for machine learning. ICS 2022: 36:1-36:13 - [c225]Andrei Lascu, Alastair F. Donaldson, Tobias Grosser, Torsten Hoefler:
Metamorphic Fuzzing of C++ Libraries. ICST 2022: 35-46 - [c224]Niels Gleinig, Maciej Besta, Torsten Hoefler:
I/O-Optimal Cache-Oblivious Sparse Matrix-Sparse Matrix Multiplication. IPDPS 2022: 36-46 - [c223]András Strausz, Flavio Vella
, Salvatore Di Girolamo, Maciej Besta, Torsten Hoefler:
Asynchronous Distributed-Memory Triangle Counting and LCC with RMA Caching. IPDPS 2022: 291-301 - [c222]Maciej Besta, Raphael Grob, Cesare Miglioli
, Nicola Bernold, Grzegorz Kwasniewski, Gabriel Gjini, Raghavendra Kanakagiri, Saleh Ashkboos, Lukas Gianinazzi, Nikoli Dryden, Torsten Hoefler:
Motif Prediction with Graph Neural Networks. KDD 2022: 35-45 - [c221]Maciej Besta, Patrick Iff, Florian Scheidl, Kazuki Osawa, Nikoli Dryden, Michal Podstawski, Tiancheng Chen, Torsten Hoefler:
Neural Graph Databases. LoG 2022: 31 - [c220]Saleh Ashkboos, Langwen Huang, Nikoli Dryden, Tal Ben-Nun, Peter Dueben, Lukas Gianinazzi, Luca Kummer, Torsten Hoefler:
ENS-10: A Dataset For Post-Processing Ensemble Weather Forecasts. NeurIPS 2022 - [c219]Nikoli Dryden, Torsten Hoefler:
Spatial Mixture-of-Experts. NeurIPS 2022 - [c218]Shigang Li, Torsten Hoefler:
Near-optimal sparse allreduce for distributed deep learning. PPoPP 2022: 135-149 - [c217]Salvatore Di Girolamo, Daniele De Sensi, Konstantin Taranov, Milos Malesevic, Maciej Besta, Timo Schneider, Severin Kistler, Torsten Hoefler:
Building Blocks for Network-Accelerated Distributed File Systems. SC 2022: 10:1-10:14 - [c216]Torsten Hoefler, Tommaso Bonato
, Daniele De Sensi, Salvatore Di Girolamo, Shigang Li, Marco Heddes, Jon Belk, Deepak Goel, Miguel Castro, Steve Scott:
HammingMesh: A Network Topology for Large-Scale Deep Learning. SC 2022: 11:1-11:18 - [c215]Kartik Lakhotia, Maciej Besta, Laura Monroe, Kelly Isham, Patrick Iff, Torsten Hoefler, Fabrizio Petrini:
PolarFly: A Cost-Effective and Flexible Low-Diameter Topology. SC 2022: 12:1-12:15 - [c214]Alexandros Nikolaos Ziogas, Grzegorz Kwasniewski, Tal Ben-Nun, Timo Schneider, Torsten Hoefler:
Deinsum: Practically I/O Optimal Multi-Linear Algebra. SC 2022: 25:1-25:15 - [c213]Shigang Li
, Kazuki Osawa, Torsten Hoefler:
Efficient Quantized Sparse Matrix Operations on Tensor Cores. SC 2022: 37:1-37:15 - [c212]Maciej Besta, Cesare Miglioli
, Paolo Sylos Labini, Jakub Tetek, Patrick Iff, Raghavendra Kanakagiri, Saleh Ashkboos, Kacper Janda, Michal Podstawski, Grzegorz Kwasniewski, Niels Gleinig, Flavio Vella
, Onur Mutlu, Torsten Hoefler:
ProbGraph: High-Performance and High-Accuracy Graph Mining with Probabilistic Set Representations. SC 2022: 43:1-43:17 - [c211]Philipp Schaad
, Tal Ben-Nun, Torsten Hoefler:
Boosting Performance Optimization with Interactive Data Movement Visualization. SC 2022: 64:1-64:16 - [c210]Tal Ben-Nun, Linus Groner, Florian Deconinck
, Tobias Wicky, Eddie Davis, Johann Dahm, Oliver Elbert, Rhea George, Jeremy McGibbon, Lukas Trümper, Elynn Wu, Oliver Fuhrer, Thomas C. Schulthess, Torsten Hoefler:
Productive Performance Engineering for Weather and Climate Modeling with Python. SC 2022: 73:1-73:14 - [c209]Konstantin Taranov, Steve Byan, Virendra J. Marathe, Torsten Hoefler:
KafkaDirect: Zero-copy Data Access for Apache Kafka over RDMA Networks. SIGMOD Conference 2022: 2191-2204 - [c208]Niels Gleinig, Torsten Hoefler:
The Red-Blue Pebble Game on Trees and DAGs with Large Input. SIROCCO 2022: 135-153 - [i122]Shigang Li, Torsten Hoefler:
Near-Optimal Sparse Allreduce for Distributed Deep Learning. CoRR abs/2201.07598 (2022) - [i121]Konstantin Taranov, Benjamin Rothenberger, Daniele De Sensi, Adrian Perrig, Torsten Hoefler:
NeVerMore: Exploiting RDMA Mistakes in NVMe-oF Storage Applications. CoRR abs/2202.08080 (2022) - [i120]András Strausz, Flavio Vella, Salvatore Di Girolamo, Maciej Besta, Torsten Hoefler:
Asynchronous Distributed-Memory Triangle Counting and LCC with RMA Caching. CoRR abs/2202.13976 (2022) - [i119]Marcin Copik, Alexandru Calotoiu, Konstantin Taranov, Torsten Hoefler:
FaasKeeper: a Blueprint for Serverless Services. CoRR abs/2203.14859 (2022) - [i118]Johannes de Fine Licht, Christopher A. Pattison, Alexandros Nikolaos Ziogas, David Simmons-Duffin, Torsten Hoefler:
Fast Arbitrary Precision Floating Point on FPGA. CoRR abs/2204.06256 (2022) - [i117]Tal Ben-Nun, Linus Groner, Florian Deconinck, Tobias Wicky, Eddie Davis, Johann Dahm, Oliver Elbert, Rhea George, Jeremy McGibbon, Lukas Trümper, Elynn Wu, Oliver Fuhrer, Thomas C. Schulthess, Torsten Hoefler:
Productive Performance Engineering for Weather and Climate Modeling with Python. CoRR abs/2205.04148 (2022) - [i116]Lukas Gianinazzi, Tal Ben-Nun, Saleh Ashkboos, Yves Baumann, Piotr Luczynski, Torsten Hoefler:
The spatial computer: A model for energy-efficient parallel computation. CoRR abs/2205.04934 (2022) - [i115]Maciej Besta, Torsten Hoefler:
Parallel and Distributed Graph Neural Networks: An In-Depth Concurrency Analysis. CoRR abs/2205.09702 (2022) - [i114]Alexandros Nikolaos Ziogas, Grzegorz Kwasniewski, Tal Ben-Nun, Timo Schneider, Torsten Hoefler:
Deinsum: Practically I/O Optimal Multilinear Algebra. CoRR abs/2206.08301 (2022) - [i113]Salvatore Di Girolamo, Daniele De Sensi, Konstantin Taranov, Milos Malesevic, Maciej Besta, Timo Schneider, Severin Kistler, Torsten Hoefler:
Building Blocks for Network-Accelerated Distributed File Systems. CoRR abs/2206.10007 (2022) - [i112]Saleh Ashkboos, Langwen Huang, Nikoli Dryden, Tal Ben-Nun, Peter Dueben, Lukas Gianinazzi, Luca Kummer, Torsten Hoefler:
ENS-10: A Dataset For Post-Processing Ensemble Weather Forecast. CoRR abs/2206.14786 (2022) - [i111]Philipp Schaad
, Tal Ben-Nun, Torsten Hoefler:
Boosting Performance Optimization with Interactive Data Movement Visualization. CoRR abs/2207.07433 (2022) - [i110]Kartik Lakhotia, Maciej Besta, Laura Monroe, Kelly Isham, Patrick Iff, Torsten Hoefler, Fabrizio Petrini:
PolarFly: A Cost-Effective and Flexible Low-Diameter Topology. CoRR abs/2208.01695 (2022) - [i109]Maciej Besta, Cesare Miglioli, Paolo Sylos Labini, Jakub Tetek, Patrick Iff, Raghavendra Kanakagiri, Saleh Ashkboos, Kacper Janda, Michal Podstawski, Grzegorz Kwasniewski, Niels Gleinig, Flavio Vella, Onur Mutlu, Torsten Hoefler:
ProbGraph: High-Performance and High-Accuracy Graph Mining with Probabilistic Set Representations. CoRR abs/2208.11469 (2022) - [i108]Torsten Hoefler, Tommaso Bonato, Daniele De Sensi, Salvatore Di Girolamo, Shigang Li, Marco Heddes, Jon Belk, Deepak Goel, Miguel Castro, Steve Scott:
HammingMesh: A Network Topology for Large-Scale Deep Learning. CoRR abs/2209.01346 (2022) - [i107]Andrei Ivanov, Benjamin Rothenberger, Arnaud Dethise, Marco Canini, Torsten Hoefler, Adrian Perrig
:
SAGE: Software-based Attestation for GPU Execution. CoRR abs/2209.03125 (2022) - [i106]Shigang Li, Kazuki Osawa, Torsten Hoefler:
Efficient Quantized Sparse Matrix Operations on Tensor Cores. CoRR abs/2209.06979 (2022) - [i105]Maciej Besta, Patrick Iff, Florian Scheidl, Kazuki Osawa, Nikoli Dryden, Michal Podstawski, Tiancheng Chen, Torsten Hoefler:
Neural Graph Databases. CoRR abs/2209.09732 (2022) - [i104]