


default search action
26th CLUSTER 2024: Kobe, Japan
- IEEE International Conference on Cluster Computing, CLUSTER 2024, Kobe, Japan, September 24-27, 2024. IEEE 2024, ISBN 979-8-3503-5871-1

- Yutong Lu, Wuchun Feng, Mohamed Wahib:

Welcome Message from the IEEE Cluster 2024 Program Chairs. xiii - Lishan Yang

, George Papadimitriou, Dimitris Sartzetakis, Adwait Jog, Evgenia Smirni
, Dimitris Gizopoulos:
GPU Reliability Assessment: Insights Across the Abstraction Layers. 1-13 - Jiyu Luo, Tao Yan, Qingguo Xu, Jingwei Sun, Guangzhong Sun:

Siesta: Synthesizing Proxy Applications for MPI Programs. 14-26 - Xiang Fu, Shiman Meng, Weiping Zhang, Luanzheng Guo, Kento Sato, Dong H. Ahn, Ignacio Laguna, Gregory L. Lee, Martin Schulz:

Distributed Order Recording Techniques for Efficient Record-and-Replay of Multi - Threaded Programs. 27-38 - Gan Sun, Jiang Zhou, Bo Li, Xiaoyan Gu, Weiping Wang

, Shuibing He:
FTGraph: A Flexible Tree-Based Graph Store on Persistent Memory for Large-Scale Dynamic Graphs. 39-50 - Xiaohui Wei, Weikai Tang, Hao Qi

, Hengshan Yue:
PGSampler: Accelerating GPU-Based Graph Sampling in GNN Systems via Workload Fusion. 51-61 - Aishwarya Sarkar

, Sayan Ghosh, Nathan R. Tallent
, Ali Jannesari:
MassiveGNN: Efficient Training via Prefetching for Massively Connected Distributed Graphs. 62-73 - Emile Cadorel, Dimitri Saingre:

A Protocol to Assess the Accuracy of Process-Level Power Models. 74-84 - Omri Mor

, George Bosilca, Marc Snir:
Holistic Performance Analysis for Asynchronous Many-Task Runtimes. 85-96 - Tomé Maseda

, Jonatan Enes, Roberto R. Expósito, Juan Touriño:
Automated Approach for Accurate CPU Power Modelling. 97-107 - Majid Salimi Beni, Biagio Cosenza

, Sascha Hunold
:
MPI Collective Algorithm Selection in the Presence of Process Arrival Patterns. 108-119 - Gerald Collom, Derek Schafer, Amanda Bienz, Patrick G. Bridges

, Galen M. Shipman:
Optimizing Neighbor Collectives with Topology Objects. 120-130 - Hamed Sharifian, Amir Hossein Sojoodi, Ahmad Afsahi:

A Topology- and Load-Aware Design for Neighborhood Allgather. 131-142 - Petros Anastasiadis, Nikela Papadopoulou, Nectarios Koziris, Georgios I. Goumas:

Uncut-GEMMs: Communication-Aware Matrix Multiplication on Multi-GPU Nodes. 143-154 - Yifei He, Stefano Markidis:

High-Performance FFT Code Generation via MLIR Linalg Dialect and SIMD Micro-Kernels. 155-165 - Bo Fang, Xinyi Li, Harvey Dam, Cheng Tan, Siva Kumar Sastry Hari, Timothy Tsai, Ignacio Laguna, Dingwen Tao, Ganesh Gopalakrishnan, Prashant J. Nair, Kevin J. Barker, Ang Li:

Understanding Mixed Precision GEMM with MPGemmFI: Insights into Fault Resilience. 166-178 - Yang Zhou, Fang Wang, Zhan Shi, Dan Feng:

Parallelism or Fairness? How to Be Friendly for SSDs in Cloud Environments. 179-189 - Pierre Jacquet

, Thomas Ledoux
, Romain Rouvoy:
SlackVM: Packing Virtual Machines in Oversubscribed Cloud Infrastructures. 190-201 - Ranhao Jia, Zixiao Chen, Chentao Wu, Jie Li, Minyi Guo, Hongwen Huang:

RL-Cache: An Efficient Reinforcement Learning Based Cache Partitioning Approach for Multi-Tenant CDN Services. 202-213 - Hongjian Zhang, Akira Nukada, Qiucheng Liao:

FCUFS: Core-Level Frequency Tuning for Energy Optimization on Intel Processors. 214-225 - Sejeong Oh, Gordon Euhyun Moon, Sungyong Park:

ML-Based Dynamic Operator-Level Query Mapping for Stream Processing Systems in Heterogeneous Computing Environments. 226-237 - Yao Xu, Gene Cooperman:

Enabling Practical Transparent Checkpointing for MPI: A Topological Sort Approach. 238-249 - Md Rajib Hossen, Vanessa V. Sochat, Abhik Sarkar, Mohammad A. Islam, Daniel J. Milroy:

Enabling Workload-Driven Elasticity in MPI-based Ensembles. 250-262 - Mohammad Reza Hoseiny Farahabady, Albert Y. Zomaya:

Geo-Distributed Analytical Streaming Architecture for IoT Platforms. 263-274 - Jingwen Du, Fang Wang, Dan Feng, Dexin Zeng, Sheng Yi:

Seastar: A Cache-Efficient and Load-Balanced Key-Value Store on Disaggregated Memory. 275-285 - Reza Farahani, Narges Mehran, Sashko Ristov, Radu Prodan:

HEFTLess: A Bi-Objective Serverless Workflow Batch Orchestration on the Computing Continuum. 286-296 - Jie Li

, George Michelogiannakis
, Samuel A. Maloney
, Brandon Cook, Estela Suarez
, John Shalf
, Yong Chen
:
Job Scheduling in High Performance Computing Systems with Disaggregated Memory Resources. 297-309 - Mingtian Shao, Wenzhe Zhang, Ruibo Wang, Huijun Wu, Yiqin Dai, Kai Lu:

Fully Decentralized Data Distribution for Exascale-HPC: End of the Provider-Demander Matching Puzzle. 310-321 - Shixun Wu, Yitong Ding, Yujia Zhai, Jinyang Liu, Jiajun Huang, Zizhe Jian, Huangliang Dai

, Sheng Di, Bryan M. Wong, Zizhong Chen, Franck Cappello:
FT K-Means: A High-Performance K-Means on GPU with Fault Tolerance. 322-334 - Wenqing Lin, Hemeng Wang, Haodong Deng, Qingxiao Sun:

ScalFrag: Efficient Tiled-MTTKRP with Adaptive Launching on GPUs. 335-345 - Scott Levy, Whit Schonbein

, Craig D. Ulmer:
Leveraging High-Performance Data Transfer to Offload Data Management Tasks to SmartNICs. 346-356 - Meng Tang, Jaime Cernuda, Jie Ye, Luanzheng Guo, Nathan R. Tallent

, Anthony Kougkas, Xian-He Sun:
DaYu: Optimizing Distributed Scientific Workflows by Decoding Dataflow Semantics and Dynamics. 357-369 - Jonathan Bader, Fabian Skalski, Fabian Lehmann, Dominik Scheinert, Jonathan Will, Lauritz Thamsen, Odej Kao:

Sizey: Memory-Efficient Execution of Scientific Workflow Tasks. 370-381 - Jannis Klinkenberg, Clément Foyer

, Pierre Clouzet, Brice Goglin, Emmanuel Jeannot, Christian Terboven, Anara Kozhokanova:
Phase-Based Data Placement Optimization in Heterogeneous Memory. 382-393 - Wenyang Zhao, Osamu Miyashita, Miki Nakano, Florence Tama

:
Xphase3d: Memory-Distributed Phase Retrieval for Reconstructing Large-Scale 3D Density Maps of Biological Macromolecules. 394-402 - Chunhong Du, Shanjiang Tang, Song Meng, Jiekai Gou, Ce Yu, Yusen Li, Hao Fu, Ye Tian, Ding Yuan:

Accuracy-Efficiency Optimization for Multi-Stage Small Object Detection in Surveillance Video with Collaborative Frame Sampling. 403-413 - Keichi Takahashi, Takashi Abe, Akihiro Musa, Yoshihiko Sato, Yoichi Shimomura, Hiroyuki Takizawa, Shunichi Koshimura:

Modernizing an Operational Real-Time Tsunami Simulator to Support Diverse Hardware Platforms. 414-425 - Ahmad Tarraf, Javier Fernández Muñoz, David E. Singh, Taylan Özden

, Jesús Carretero, Felix Wolf:
I/O Behind the Scenes: Bandwidth Requirements of HPC Applications with Asynchronous I/O. 426-439 - Sohei Koyama, Kohei Hiraga, Osamu Tatebe

:
FINCHFS: Design of Ad-Hoc File System for I/O Heavy HPC Workloads. 440-450 - Yujie Shi, Yu Hua, Jianming Huang:

A High-Performance and Fast-Recovery Scheme for Secure Non-Volatile Memory Systems. 451-463

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














