


default search action
25th CLUSTER 2023: Santa Fe, NM, USA
- IEEE International Conference on Cluster Computing, CLUSTER 2023, Santa Fe, NM, USA, October 31 - Nov. 3, 2023. IEEE 2023, ISBN 979-8-3503-0792-4

- Sahil Tyagi

, Martin Swany
:
Accelerating Distributed ML Training via Selective Synchronization. 1-12 - Kevin Assogba

, Eduardo Lima, M. Mustafa Rafique, Minseok Kwon:
PredictDDL: Reusable Workload Performance Prediction for Distributed Deep Learning. 13-24 - Frank Wanye, Vitaliy Gleyzer, Edward K. Kao, Wu-Chun Feng:

Exact Distributed Stochastic Block Partitioning. 25-36 - Taehoon Kim

, Kwangwon Koh, Changdae Kim
, Eunji Pak, Yeonjeong Jeong, Sang-Hoon Kim:
DEHype: Retrofitting Hypervisors for a Resource-Disaggregated Environment. 37-48 - Xinying Wang, Lipeng Wan, Scott Klasky, Dongfang Zhao, Feng Yan:

SciLance: Mitigate Load Imbalance for Parallel Scientific Applications in Cloud Environments. 49-59 - Michael Wilkins, Hanming Wang, Peizhi Liu, Bangyen Pham, Yanfei Guo, Rajeev Thakur

, Peter A. Dinda, Nikos Hardavellas
:
Generalized Collective Algorithms for the Exascale Era. 60-71 - Melvin Chelli, Cèdric Prigent

, René Schubotz, Alexandru Costan, Gabriel Antoniu, Loïc Cudennec, Philipp Slusallek:
FedGuard: Selective Parameter Aggregation for Poisoning Attack Mitigation in Federated Learning. 72-81 - Wei Wang, Zhiquan Lai, Shengwei Li, Weijie Liu, Keshi Ge, Yujie Liu, Ao Shen, Dongsheng Li:

Prophet: Fine-grained Load Balancing for Parallel Training of Large-scale MoE Models. 82-94 - Turja Kundu, Tong Shu:

HIOS: Hierarchical Inter-Operator Scheduler for Real-Time Inference of DAG-Structured Deep Learning Models on Multiple GPUs. 95-106 - Yuzuo Zhang, Xinyuan Tu, Lin Wang, Yuchong Hu, Fang Wang, Ye Wang:

FullRepair: Towards Optimal Repair Pipelining in Erasure-Coded Clustered Storage Systems. 107-117 - Krijn Doekemeijer, Nick Tehrany, Balakrishnan Chandrasekaran, Matias Bjørling, Animesh Trivedi:

Performance Characterization of NVMe Flash Devices with Zoned Namespaces (ZNS). 118-131 - Inhyuk Park, Qing Zheng

, Dominic Manno, Soonyeal Yang, Jason Lee, David Bonnie, Bradley W. Settlemyer, Youngjae Kim, Woosuk Chung, Gary Grider:
KV-CSD: A Hardware-Accelerated Key-Value Store for Data-Intensive Applications. 132-144 - Xingguo Jia, Xingzi Yu

, Yun Wang
, Senhao Yu, Zhengwei Qi:
Rethinking Virtual Machines Live Migration for Memory Disaggregation. 145-157 - George Michelogiannakis

, Yehia Arafa, Brandon Cook, Liang Yuan Dai, Abdel-Hameed A. Badawy, Madeleine Glick, Yuyang Wang
, Keren Bergman, John Shalf
:
Efficient Intra-Rack Resource Disaggregation for HPC Using Co-Packaged DWDM Photonics. 158-172 - Hongliang Li, Hairui Zhao

, Zhewen Xu
, Xiang Li, Haixiao Xu:
ExplSched: Maximizing Deep Learning Cluster Efficiency for Exploratory Jobs. 173-184 - Urvij Saroliya

, Eishi Arima, Dai Liu, Martin Schulz
:
Hierarchical Resource Partitioning on Modern GPUs: A Reinforcement Learning Approach. 185-196 - Yihao Sun, Sidharth Kumar, Thomas Gilray, Kristopher K. Micinski:

Communication-Avoiding Recursive Aggregation. 197-208 - Wenxuan Li, Helin Cheng, Zhengyang Lu, Yuechen Lu, Weifeng Liu

:
HASpMV: Heterogeneity-Aware Sparse Matrix-Vector Multiplication on Modern Asymmetric Multicore Processors. 209-220 - Daniel Rosendo

, Marta Mattoso
, Alexandru Costan, Renan Souza
, Débora B. Pina
, Patrick Valduriez, Gabriel Antoniu:
ProvLight: Efficient Workflow Provenance Capture on the Edge-to-Cloud Continuum. 221-233 - Zhangyu Liu, Cheng Zhang, Huijun Wu, Jianbin Fang, Lin Peng, Guixin Ye, Zhanyong Tang:

Optimizing HPC I/O Performance with Regression Analysis and Ensemble Learning. 234-246 - Arkaprabha Ganguli, Robert Underwood

, Julie Bessac, David Krasowska, Jon C. Calhoun, Sheng Di, Franck Cappello:
A Lightweight, Effective Compressibility Estimation Method for Error-bounded Lossy Compression. 247-258 - Yiltan Hassan Temuçin, Scott Levy, Whit Schonbein

, Ryan E. Grant, Ahmad Afsahi:
A Dynamic Network-Native MPI Partitioned Aggregation Over InfiniBand Verbs. 259-270 - Yiwen Zhang, Guokuan Li, Jiguang Wan, Junyue Wang, Jun Li, Ting Yao, Huatao Wu, Daohui Wang:

DoW-KV: A DPU-offloaded and Write-optimized Key-Value Store on Disaggregated Persistent Memory. 271-283 - Jesper Larsson Träff, Sascha Hunold, Ioannis Vardas, Nikolaus Manes Funk:

Uniform Algorithms for Reduce-scatter and (most) other Collectives for MPI. 284-294 - Wenhai Lin, Jingchang Qin, Yiquan Chen

, Zhen Jin, Jiexiong Xu
, Yuzhong Zhang, Shishun Cai, Lirong Fu, Yi Chen, Wenzhi Chen:
JACO: JAva Code Layout Optimizer Enabling Continuous Optimization without Pausing Application Services. 295-306 - Zhenyu Xu, Miaoxiang Yu, Jillian Cai, Qing Yang, Tao Wei:

A Finite-Difference Time-Domain (FDTD) solver with linearly scalable performance in an FPGA cluster. 307-317 - Hengquan Mei, Huaizhi Qu

, Jingwei Sun, Yanjie Gao, Haoxiang Lin
, Guangzhong Sun:
GPU Occupancy Prediction of Deep Learning Models Using Graph Neural Network. 318-329 - Qinglei Cao

, Sameh Abdulah, Hatem Ltaief
, Marc G. Genton
, David E. Keyes
, George Bosilca:
Reducing Data Motion and Energy Consumption of Geospatial Modeling Applications Using Automated Precision Conversion. 330-342 - Zixuan Chen

, Zhigao Zhao, Zijian Li
, Jiang Shao, Sen Liu, Yang Xu:
SDT: A Low-cost and Topology-reconfigurable Testbed for Network Research. 343-353 - Jiajun Huang, Kaiming Ouyang, Yujia Zhai, Jinyang Liu

, Min Si, Ken Raffenetti, Hui Zhou, Atsushi Hori, Zizhong Chen, Yanfei Guo, Rajeev Thakur:
PiP-MColl: Process-in-Process-based Multi-object MPI Collectives. 354-364 - Olamide Timothy Tawose, Lei Yang, Dongfang Zhao:

TopoCommit: A Topological Commit Protocol for Cross-Ledger Transactions in Scientific Computing. 365-375

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














