


default search action
18th IEEE CLOUD 2025: Helsinki, Finland
- Rong N. Chang, Carl K. Chang, Jingwei Yang, Nimanthi Atukorala, Dan Chen, Sumi Helal, Sasu Tarkoma, Qiang He, Tevfik Kosar, Claudio A. Ardagna, Feras Awaysheh, Volker Hilt, Yogesh Simmhan:

18th IEEE International Conference on Cloud Computing, CLOUD 2025, Helsinki, Finland, July 7-12, 2025. IEEE 2025, ISBN 979-8-3315-5557-3 - Lingfei Wang, Maria A. Rodriguez, Nir Lipovetzky:

Accelerating RL-Based Scheduler Adaptation with Transfer Learning in Evolving HPC Architectures. 1-11 - Dalal Alharthi, Rozhin Yasaei:

LLM-Powered Automated Cloud Forensics: From Log Analysis to Investigation. 12-22 - Hyunseung Jung, HyungJun Kim, Heonchang Yu:

Korel: Mitigating Stragglers via Real-Time Automatic Mixed Precision in Distributed Deep Learning Environments. 23-31 - Jovan Prodanov, Blaz Bertalanic, Carolina Fortuna, Shih-Kai Chou, Matjaz B. Juric, Ramon Sanchez-Iborra, Jernej Hribar:

Multi-Agent Reinforcement Learning-Based In-Place Scaling Engine for Edge-Cloud Systems. 32-42 - Julien Soulé, Jean-Paul Jamont, Michel Occello, Louis-Marie Traonouez, Paul Théron:

Streamlining Resilient Kubernetes Autoscaling with Multi-Agent Systems via an Automated Online Design Framework. 43-53 - Ewan Warburton, Abdessalam Elhabbash, Saad Ezzini, Yehia Elkhatib:

The IoT Whisperer: A Framework for Intelligent IoT Service Composition Through LLMs. 54-64 - Peini Liu, Jordi Guitart:

Dynamic In-node Group-Aware Scheduling for Multi-Tenant Machine Learning Services on Kubernetes. 65-74 - Oliver Larsson, Thijs Metsch, Cristian Klein, Erik Elmroth:

ESTHER: Application-First Hardware-Level QoS-Enforcement for Cloud Native Environments. 75-85 - Mostafa Anouar Ghorab, Mohamed Aymen Saied:

Towards Secure Cloud-Native Computing: Unveiling Kubernetes Misconfigurations with Large Language Models. 86-96 - Jiawen Liu, Yuehao Xu, Zhijun Ding:

Is Your Cluster Truly Fully Loaded? Exploring Shadow Resources in Host State Synchronization. 97-108 - Jacopo Bufalino, Jose Luiz Martin Navarro, Aleksi Peltonen, Tuomas Aura:

Helm-ET: Reducing Exposure to Lateral Movement in Kubernetes Artifacts. 109-120 - Seokwon Choi, Hyeonsang Eom:

HeteroScheduler: Dynamic Task Scheduling for CPU-GPU Optimization and Contention Mitigation in Cloud Data Centers. 121-131 - Minjae Kang, Heonchang Yu:

MOBOS: Co-Optimizing Cost and Execution Time in Serverless Workflow with Multi-Objective Bayesian Optimization. 132-142 - Christopher Lohse, Diego Tsutsumi, Amadou Ba, Pavithra Harsha, Chitra Subramanian, Martin Straesser, Marco Ruffini:

Causal Latency Modelling for Cloud Microservices. 143-151 - Rui Li, Devesh Tiwari, Gene Cooperman:

HotSwap: Enabling Live Dependency Sharing in Serverless Computing. 152-162 - Takeshi Yoshimura, Tatsuhiro Chiba, Manish Sethi, Daniel G. Waddington, Swaminathan Sundararaman:

Speeding up Model Loading with Fastsafetensors. 163-174 - Kihyun Kim, Jinwoo Kim, Hyunsun Chung, Myung-Hoon Cha, Hong-Yeon Kim, Youngjae Kim:

Cost-Efficient VM Selection for Cloud-Based LLM Inference with KV Cache Offloading. 175-185 - Moshik Hershcovitch, Andrew Wood, Leshem Choshen, Guy Girmonsky, Roy Leibovitz, Or Ozeri, Ilias Ennmouri, Michal Malka, Sang (Peter) Chin, Swaminathan Sundararaman, Danny Harnik:

ZipNN: Lossless Compression for AI Models. 186-198 - Hyungwoo Lee, Kihyun Kim, Jinwoo Kim, Jungmin So, Myung-Hoon Cha, Hong-Yeon Kim, James J. Kim, Youngjae Kim:

Disk-Based Shared KV Cache Management for Fast Inference in Multi-Instance LLM RAG Systems. 199-209 - Kfir Toledo, Pravein Govindan Kannan, Michal Malka, Etai Lev-Ran, Or Ozeri, Vita Bortnikov, Ziv Nevo, Kathy Barabash:

ClusterLink: Redefining Application Connectivity for the Multi-cloud Era. 210-222 - Haida Zhang, Lin Sun, Zhengtong Zhang, Jiayang Xia, Ziang Huang, Jiansi Wang, Haopeng Chen, Yan Jiao, Yongming Xu:

Precomputation-Optimized Lakehouse Architecture for Online Analytical Processing Tasks. 223-232 - Mathew Falloon, Hui Ma, Gang Chen:

Energy-Aware Resource Allocation and Container Migration in Distributed Data Centers Under Variable Energy Pricing: A Genetic Programming Hyper-Heuristic Approach. 233-242 - Reza Farahani, Radu Prodan:

EnergyLess: An Energy-Aware Serverless Workflow Batch Orchestration on the Computing Continuum. 243-254 - Elvis Rodrigues, Jacob Goldverg, Tevfik Kosar:

Carbon-Aware Temporal Data Transfer Scheduling Across Cloud Datacenters. 255-264 - Kuangyuan Li, Jingrun Zhang, Pengfei Chen, Hongyang Chen, Ruipeng Hong, Wanqi Yang, Chen Sun:

TraceWizard: End-to-End Distributed Tracing Across Host and Network Devices in Cloud. 265-276 - Pol G. Recasens, Ferran Agullo, Yue Zhu, Chen Wang, Eun Kyung Lee, Olivier Tardieu, Jordi Torres, Josep Lluís Berral:

Mind the Memory Gap: Unveiling GPU Bottlenecks in Large-Batch LLM Inference. 277-287 - Marianna Ojanen, Maryam Sabzevari, Sándor Szedmák:

Efficient Microservice Monitoring Via Kernel Transformation and FFT Forecasting. 288-295 - Gaulthier Gain, Benoit Knott, Laurent Mathy:

Efficient Versioning for Unikernels. 296-307 - MohammadReza HoseinyFarahabady, Albert Y. Zomaya:

Real-Time Interference-Aware CPU and I/O Capping Mechanism for Multi-Tenant Containers. 308-317 - Angelo Marchese, Orazio Tomarchio:

SLO-Aware Container Orchestration on Kubernetes Clusters. 318-327 - Myeongjun Kim, Heonchang Yu:

ReSACO: A Meta Reinforcement Learning Method for Fast Offloading in Mobile Edge Computing. 328-338 - Sharmen Akhter, Eui-Nam Huh:

MSTH-Former: Optimizing Workload Prediction in Edge-Cloud Continuum with Multi-Scale Temporal and Hierarchical Knowledge Convergence and Distillation. 339-350 - Manish Pandey, Byungchul Tak, Young-Woo Kwon:

PROBA: Enhancing Serverless Edge Computing via Adaptive Task Scheduling and Probabilistic Resource Sharing. 351-361 - Pasindu Tennage, Antoine Desjardins, Lefteris Kokoris-Kogias:

RACS-SADL: Robust and Understandable Randomized Consensus in the Cloud. 362-373 - Robin Lichtenthäler, Guido Wirtz:

An Experimental Validation of Architectural Measures for Cloud-Native Quality Evaluations. 374-384 - Abdul Alim, Ali Sydney, Liran Schour, Abdullah Kayi, Laurent Schares, Pavlos Maniotis, Anand Singh, Bengi Karacali:

Routing Strategies for RoCE Networks in AI Clouds. 385-396 - Dongzhao Song, Jingfan Meng, Qianru Yu, Jun Jim Xu:

QPS- Fit: An Efficient and Performant Parallel Algorithm for Hybrid Optical and Packet Switching. 397-408 - Hokun Park, Donggyun Kim, HyungJun Kim, Gyujeong Lim, Heonchang Yu:

HEART: Heterogeneous-Aware Traffic Allocation in Multi-Replica Deployments on Kubernetes. 409-419 - Junseo Jang, Jaehyun Hwang:

Optimizing Receive Flow Steering for Mixed Traffic in High-Performance Cloud Datacenters. 420-429 - Seungmin Shin, Leeiu Kim, Wookyung Lee, Eyee Hyun Nam, Seungmin Kim, Bryan S. Kim, Sungjin Lee, Eunji Lee:

Avoiding Pitfalls in Networked Key-Value Store for Tiered Memory. 430-441 - Saman Akbari, Manfred Hauswirth:

Universal Workers: A Vision for Eliminating Cold Starts in Serverless Computing. 442-444 - Yue Zhu, Hao Yu, Chen Wang, Zhuoran Liu, Eun Kyung Lee:

Towards Efficient Key-Value Cache Management for Prefix Prefilling in LLM Inference. 445-447 - Ansar Rafique, Brian D. Marsden:

Automated LLM Deployment and Evaluation: A Cloud-Native Approach Using LLM-as-a-Judge. 448-450 - Milind Varma, Sai Venkat Malreddy, Liting Hu:

DNN-Adapt: Reinforcement Learning-Based Hybrid Batching for Efficient DNN Serving. 451-453 - Emanuele Carlini, Patrizio Dazzi, Matteo Mordacchini:

Game-Theoretic Reinforcement Learning for Task Optimization Under Time-Sensitive Constraints. 454-456 - Yewon Shin, Jonghyeok Park:

Revisiting SQL Statement Logging for SQLite on AWS S3. 457-459 - Germán T. Eizaguirre, Marc Hostau, Marc Sáanchez-Artigas:

Serverless Data Analytics (Finally) Bridging the Gap: Introducing the Ortzi DataFrame. 460-467 - Kemalcan Bora, Elli Kartsakli, Eduardo Quiñones Moreno:

Temporal Fusion Transformer Based Vertical Scaling Management for Kubernetes. 468-473

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














