


default search action
ACM SIGMOD Conference 2025: Berlin, Germany - Companion Volume
- Volker Markl, Joseph M. Hellerstein, Azza Abouzied:
Companion of the 2025 International Conference on Management of Data, SIGMOD/PODS 2025, Berlin, Germany, June 22-27, 2025. ACM 2025, ISBN 979-8-4007-1564-8
Keynote Talk Abstracts
- Philip A. Bernstein:
Fifty Years of Transaction Processing Research. 1-2 - Christos H. Papadimitriou:
How to Build A Brain. 3-4 - Margo I. Seltzer:
The Case for Collaboration. 5-6
Demo Short Papers
- Pratyush Agnihotri, Carsten Binnig:
Demonstrating PDSP-Bench: A Benchmarking System for Parallel and Distributed Stream Processing. 7-10 - Ashwin Alaparthi, Paul Loh, Ryan Marcus:
ScaleLLM: A Technique for Scalable LLM-augmented Data Systems. 11-14 - Angelos-Christos G. Anadiotis, Muhammad Ghufran Khan, Ioana Manolescu:
Catching up with Disorder: Dynamic Graphs with Out-of-Order Updates. 15-18 - Arman Ashkari, El Kindi Rezig:
CausalExplain: Causal Explanations of Black-box Models with Training Data Subsets. 19-22 - Teona Bagashvili, Tarikul Islam Papon, Manos Athanassoulis:
ACE-in-Action: A Smart DBMS Bufferpool for SSDs. 23-26 - Wenchao Bai, Wenfei Fan, Jiahui Jin, Daji Li, Jian Li, Shuhao Liu, Mingliang Ouyang, Qiang Yuan:
MiniClean: A Single-Machine System for Cleaning Big Graphs. 27-30 - Björn Bamberg, Denis Hirn, Torsten Grust:
How DuckDB is USING KEY to Unlock Recursive Query Performance. 31-34 - Kaustubh Beedkar, Aurélien Bertrand, Haralampos Gavriilidis, Augusto José Fonseca, Zoi Kaoudi, Mingxi Liu, Volker Markl, Juri Petersen, Fábio Porto, Víctor Ribeiro, Mads Sejer Pedersen, Lucas Giusti Tavares, Michalis Vargiamis, Chen Xu:
Apache Wayang in Action: Enabling Data Systems Integration via a Unified Data Analytics Framework. 35-38 - Lennart Behme, Leonard Geißler, Pratham Agrawal, Emil Badura, Benjamin Ueber, Kaustubh Beedkar, Volker Markl:
Finding What You're Looking For: A Distribution-Aware Dataset Search Engine in Action. 39-42 - Hadar Ben-Efraim, Susan B. Davidson, Amit Somech:
PY-SHARQ: A Holistic Python Library for Explaining Association Rules on Relational Data. 43-46 - Kyle Bossonney, Nicolás Buzeta, Vicente Calisto, Juan-Eduardo López, Cristian Riveros, Stijn Vansummeren:
CORE+: A Complex Event Recognition Engine in C++. 47-50 - Mohamed Bouadi, Arta Alavi, Salima Benbernou, Mourad Ouziri:
DANTE: Hybrid AI System for Context-Aware Interpretable Feature Engineering. 51-54 - Felix S. Campbell, Yuval Moskovitch:
Locator: Local Stability for Rankings. 55-58 - Jeffery Cao, Lampros Flokas, Yujian Xu, Eugene Wu, Xu Chu, Cong Yu:
Prompt Editor: A Taxonomy-driven System for Guided LLM Prompt Development in Enterprise Settings. 59-62 - Tsz Nam Chan, Bojian Zhu, Dingming Wu, Yun Peng, Leong Hou U, Wei Tu, Ruisheng Wang:
A Fast Line Density Visualization Plugin for Geographic Information Systems. 63-66 - Kasidis Chanthatrojwong, Sourav S. Bhowmick, Byron Choi:
PASCAL: A Theory-Informed Visual Interface for Property Graph Schema Visualization. 67-70 - Kaiwen Chen, Yueting Chen, Nick Koudas, Xiaohui Yu:
RTS+: Reliable Text to SQL. 71-74 - Noam Chen, Anna Zeng, Michael J. Cafarella, Batya Kenig, Markos Markakis, Oren Mishali, Brit Youngmann, Babak Salimi:
CausaLens: A System for Summarizing Causal DAGs. 75-78 - Mariana M. Garcez Duarte, Dwi P. A. Nugroho, Georges Tod, Evert Bevernage, Pieter Moelans, Emine Tas, Esteban Zimányi, Mahmoud Sakr, Steffen Zeuch, Volker Markl:
Mobility Stream Processing on NebulaStream and MEOS. 79-82 - Yael Einy, Guy Dar, Slava Novgorodov, Tova Milo:
Sentence to Model: Cost-Effective Data Collection LLM Agent. 83-86 - Saeed Fathollahzadeh, Essam Mansour, Matthias Boehm:
Demonstrating CatDB: LLM-based Generation of Data-centric ML Pipelines. 87-90 - Yannis Foufoulas, Theoni Palaiologou, Alkis Simitsis:
UDFBench: A Tool for Benchmarking UDF Queries on SQL Engines. 91-94 - Victor Giannakouris, Immanuel Trummer:
SwellDB: Dynamic Query-Driven Table Generation with Large Language Models. 95-98 - Amir Gilad, Tova Milo, Kathy Razmadze, Ron Zadicario:
Demonstration of DPClustX: Differentially Private Explanations for Clusters. 99-102 - Justin Breese, Vijayan Prabhakaran, Martin Grund, Stefania Leone, Amit Shukla, Michael Armbrust, Reynold Xin, Matei Zaharia, Lennart Kats, Sung Chiu, Tatiana Romanova, Philip Nord, Mitchell Webster, Chris Munson, Bo Pang, David Ma:
Blink Twice - Automatic Workload Pinning and Regression Detection for Versionless Apache Spark using Retries. 103-106 - Suchit Gupte, John Paparrizos:
ShapX Engine: A Demonstration of Shapley Value Approximations. 107-110 - Eldar Hacohen, Yuval Moskovitch, Amit Somech:
OmniTune: A Universal Framework for Query Refinement via LLMs. 111-114 - Yuto Hayamizu, Ryoji Kawamichi, Tsuyoshi Ozawa, Masaru Kitsuregawa, Kazuo Goda:
anagodb: Offering Massive Parallelism for Database Engine. 115-118 - Shiyi He, Alexandra Meliou, Anna Fariha:
ChARLES: Change-Aware Recovery of Latent Evolution Semantics in Relational Data. 119-122 - Jeffrey Heer, Dominik Moritz, Ron Pechunk:
Mosaic: An Architecture for Linking Databases and Scalable Interactive Visualizations. 123-126 - Kaiyuan Hu, Jiongli Zhu, Boris Glavic, Babak Salimi:
Zorro: Quantifying Uncertainty in Models & Predictions Arising from Dirty Data. 127-130 - Xuhua Huang, Zirui Hu, Siyang Weng, Rong Zhang, Chengcheng Yang, Xuan Zhou, Weining Qian, Chuanhui Yang, Quanqing Xu:
A Query-Aware Enormous Database Generator For System Performance Evaluation. 131-134 - Tharushi Jayasekara, Immanuel Trummer:
Demonstrating CEDAR: A System for Cost-Efficient Data-Driven Claim Verification. 135-138 - Michael Jungmair:
LingoDB-CT: Understanding LingoDB's Inner Workings. 139-142 - Eugenie Y. Lai, Inbal Croitoru, Noam Bitton, Ariel Shalem, Brit Youngmann, Sainyam Galhotra, El Kindi Rezig, Michael J. Cafarella:
SeerCuts: Explainable Attribute Discretization. 143-146 - Longbin Lai, Changwei Luo, Yunkai Lou, Mingchen Ju, Zhengyi Yang:
Graphy'our Data: Towards End-to-End Modeling, Exploring and Generating Report from Raw Data. 147-150 - Jiale Lao, Immanuel Trummer:
Demonstrating SQLBarber: Leveraging Large Language Models to Generate Customized and Realistic SQL Workloads. 151-154 - Yu Lei, Xinle Jiang, Hua Lu, Christian S. Jensen, Bo Tang, Huan Li:
TEQ: An Open and Developer-friendly Testbed for Edge-based Query Processing Algorithms. 155-158 - Nativ Levy, Michael J. Cafarella, Amir Gilad, Sudeepa Roy, Brit Youngmann:
CauSumX: Summarized Causal Explanations For Group-By-Average Queries. 159-162 - Peizheng Li, Chaoyi Chen, Hao Yuan, Zhenbo Fu, Hang Shen, Xinbo Yang, Qiange Wang, Xin Ai, Yanfeng Zhang, Yingyou Wen, Ge Yu:
NeutronRAG: Towards Understanding the Effectiveness of RAG from a Data Retrieval Perspective. 163-166 - Zhaoheng Li, Supawit Chockchowwat, Hanxi Fang, Yongjoo Park:
Demo of Kishu: Time-Traveling for Computational Notebooks. 167-170 - Jiangneng Li, Haitao Yuan, Jie Wang, Ziting Wang, Han Mao Kiah, Gao Cong:
Demonstrating MAST: An Efficient System for Point Cloud Data Analytics. 171-174 - Zhiyu Liang, Dongrui Cai, Chenyuan Zhang, Zheng Liang, Chen Liang, Bo Zheng, Shi Qiu, Jin Wang, Hongzhi Wang:
KDSelector: A Knowledge-Enhanced and Data-Efficient Model Selector Learning Framework for Time Series Anomaly Detection. 175-178 - Tim Littau, Rihan Hai:
Qymera: Simulating Quantum Circuits using RDBMS. 179-182 - Chunwei Liu, Gerardo Vitagliano, Brandon Rose, Matthew Printz, David Andrew Samson, Michael J. Cafarella:
PalimpChat: Declarative and Interactive AI analytics. 183-186 - Christoph Mayer, Haozhe Zhang, Mahmoud Abo Khamis, Dan Olteanu, Dan Suciu:
LpBound in Action: Cardinality Estimation with One-Sided Guarantees. 187-190 - Amin Meghrazi, Pranav Maneriker, Swati Padhee, Srinivasan Parthasarathy:
Interactive Fairness Auditing: Leveraging AVOIR for Dynamic Evaluation and Mitigation. 191-194 - Adrian Michalke, Aljoscha P. Lepping, Volker Markl, Ricardo Martinez, Nils L. Schubert, Lukas Schwerdtfeger, Taha Tekdogan, Steffen Zeuch, Ariane Ziehn, Christoph Falkensteiner, Kyle Krüger, Alexander Meyer, Tobias Röschl, Svea Wilkending:
NebulaStream: An Extensible, High-Performance Streaming Engine for Multi-Modal Edge Applications. 195-198 - Amedeo Pachera, Angela Bonifati, Andrea Mauri:
Grafixer: Enabling User-Centric Repairs for Property Graphs. 199-202 - Marcel Parciak, Brecht Vandevoort, Frank Neven, Liesbet M. Peeters, Stijn Vansummeren:
LLM-Matcher: A Name-Based Schema Matching Tool using Large Language Models. 203-206 - Alok Pareek, Bhushan Khaladkar, Sanket Malde, Vamshi Saggurthi:
Real Time Sentinel: An LLM Based PII Detector - A Streaming Integration and Intelligence Platform. 207-210 - Sophie Pfister, Alberto Lerner, Abishek Ramdas, Philippe Cudré-Mauroux:
Alpha Demo: A Hardware-Accelerated Data Model for Ad-Hoc Manipulation of Point Clouds. 211-214 - Shaikh Quader, Ghadeer Abuoda, Yonis Abokar, Marin Litoiu, Manos Papagelis:
Demo of LearnedWMP: Workload Memory Prediction Using Deep Query Template Representations. 215-218 - Florens Rohde, Victor Christen, Erhard Rahm:
SecUREmatch: Integrating Clerical Review in Privacy-Preserving Record Linkage. 219-222 - Gianluca Rossi, Riccardo Tommasini, Angela Bonifati:
TD-Join: Leveraging Temporal Dependencies in Time Series Joins. 223-226 - Diandre Miguel Sabale, Wolfgang Gatterbauer:
PatternVis: A Tool for Relational Pattern Visualization. 227-230 - Wenbo Sun, Ziyu Li, Rihan Hai:
Database as Runtime: Compiling LLMs to SQL for In-database Model Serving. 231-234 - Zhaoyan Sun, Xuanhe Zhou, Jianming Wu, Wei Zhou, Guoliang Li:
D-Bot: An LLM-Powered DBA Copilot. 235-238 - Govind Venkatraman Krishnan, Eduardo Ramirez, Drew Koszewnik, Yujia (Cynthia) Xie, Tej Vepa, Bernardo Gomez Palacio:
Introducing RAW Hollow: An In-Memory, Co-Located, Compressed Object Store with Opt-In Strong Consistency. 239-242 - Pengyi Wang, Sibei Chen, Ju Fan, Bin Wu, Nan Tang, Jian Tan:
Andromeda: Debugging Database Performance Issues with Retrieval-Augmented Large Language Models. 243-246 - Patrick Wang, Wan Shen Lim, William Zhang, Samuel Arch, Andrew Pavlo:
Automated Database Tuning vs. Human-Based Tuning in a Simulated Stressful Work Environment: A Demonstration of the Database Gym. 247-250 - Haixin Wang, Cheng Xu, Ce Zhang, Haibo Hu, Shikun Tian, Shenglong Chen, Ying Yan, Jianliang Xu:
Authenticating Multi-Chain Queries: Verifiable Virtual Filesystem Is All You Need. 251-254 - Zixin Wei, Jun Han, Xiaolin Han, Chenhao Ma:
SemExplorer: A User Interface for Semantic Approach to Customized Dataset Search. 255-258 - Jingzhe Xu, Yuhao Deng, Chengliang Chai, Zequn Li, Yuping Wang, Lei Cao:
OIE: An Interpretable System for Outlier Explanation and Summarization. 259-262 - Mike Xydas, Anna Mitsopoulou, George Katsogiannis-Meimarakis, Christos Tsapelas, Stavroula Eleftherakis, Antonis Mandamadiotis, Georgia Koutrika:
DataDazzle: Intelligent Data Exploration through Natural Language. 263-266 - Yansha Jia, Zhengxin You, Yujie Wang, Qiaomu Shen, Bo Tang:
VQLens: A Demonstration of Vector Query Execution Analysis. 267-270 - Geoffrey X. Yu, Ziniu Wu, Ferdi Kossmann, Tianyu Li, Markos Markakis, Amadou Ngom, Sophie Zhang, Tim Kraska, Samuel Madden:
Virtualizing Cloud Data Infrastructures with BRAD. 271-274 - Yuanhao Zhong, Yuhao Deng, Chengliang Chai, Ruixin Gu, Ye Yuan, Guoren Wang, Lei Cao:
Doctopus: A System for Budget-aware Structural Data Extraction from Unstructured Documents. 275-278 - Jun-Peng Zhu, Peng Cai, Kai Xu, Li Li, Yishen Sun, Shuai Zhou, Haihuang Su, Liu Tang, Qi Liu:
UNITQA: A Unified Automated Tabular Question Answering System with Multi-Agent Large Language Models. 279-282
Industry Papers
- Molham Aref, Paolo Guagliardo, George Kastrinis, Leonid Libkin, Victor Marsault, Wim Martens, Mary McGrath, Filip Murlak, Nathaniel Nystrom, Liat Peterfreund, Allison Rogers, Cristina Sirangelo, Domagoj Vrgoc, David Zhao, Abdul Zreika:
Rel: A Programming Language for Relational Data. 283-296 - Nicolas Bruno, César A. Galindo-Legaria, Milind Joshi:
Query Decorrelation in the Fabric Data Warehouse. 297-309 - Ramesh Chandra, Haogang Chen, Ray Matharu, Sarah Cai, Jeff Chen, Priyam Dutta, Bogdan Ghita, Todd Greenstein, Gopal Holla, Peng Huang, Yuchen Huo, Adrian Ionescu, Adriana Ispas, Tim Januschowski, Vihang Karajgaonkar, Stefania Leone, David Lewis, Andrew Li, Nong Li, Cheng Lian, Stephen Link, Qing Lu, Yesheng Ma, Chris Pettitt, Vijayan Prabhakaran, Bogdan Raducanu, Kyle Rong, Paul Roome, Samarth Shetty, Sean Smith, Xiaotong Sun, Yuyuan Tang, Weitao Wen, Lei Xia, Junlin Zeng, Ben Zhang, Reynold Xin, Matei Zaharia:
Unity Catalog: Open and Universal Governance for the Lakehouse and Beyond. 310-322 - Zihao Chen, Jiazhi Jiang, Jiangang Liu, Chao Zhang, Yuqi Diao, Yang Li, Hanmei Luo, Peng Chen:
Oceanus: Enable SLO-Aware Vertical Autoscaling for Cloud-Native Streaming Services in Tencent. 323-335 - Zongzhi Chen, Xinjun Yang, Mo Sha, Feifei Li, Kang Wang, Zheyu Miao, Jie Xu, Jianfeng Wang, Sheng Wang:
CloudJump II: Optimizing Cloud Databases for Shared Storage. 336-349 - Zihao Chen, Chenyang Zhang, Chen Xu, Zhao Zhang, Jiaqiang Wang, Weining Qian, Aoying Zhou:
Scheduling Data Processing Pipelines for Incremental Training on MLP-based Recommendation Models. 350-363 - Yangshen Deng, Zhengxin You, Long Xiang, Qilong Li, Peiqi Yuan, Zhaoyang Hong, Yitao Zheng, Wanting Li, Runzhong Li, Haotian Liu, Kyriakos Mouratidis, Man Lung Yiu, Huan Li, Qiaomu Shen, Rui Mao, Bo Tang:
AlayaDB: The Data Foundation for Efficient and Effective Long-context LLM Inference. 364-377 - Chenguang Fang, Chen Qian, Qi Yang, Zeyu Wang, Zhenkun Yang, Fanyu Kong, Quanqing Xu, Hui Cao, Fusheng Han, Chuanhui Yang:
MaLT: A Framework for Managing Large Transactions in OceanBase. 378-390 - Sen Gao, Jianwen Zhao, Hao Zhang, Shixuan Sun, Chen Liang, Gongye Chen, Wenliang Zhang, Bo Ren, Chao Liu, Chenyi Zhang, Quan Chen, Chao Li, Jingwen Leng, Minyi Guo:
GES: High-Performance Graph Processing Engine and Service in Huawei. 391-403 - Anja Gruenheid, Jesús Camacho-Rodríguez, Carlo Curino, Raghu Ramakrishnan, Stanislav Pak, Sumedh Sakdeo, Lenisha Gandhi, Sandeep K. Singhal, Pooja Nilangekar, Daniel J. Abadi:
AutoComp: Automated Data Compaction for Log-Structured Tables in Data Lakes. 404-417 - Martin Grund, Stefania Leone, Herman van Hövell, Sven Wagner-Boysen, Sebastian Hillig, Hyukjin Kwon, David Lewis, Jakob Mund, Polo-Francois Poli, Lionel Montrieux, Othon Crelier, Xiao Li, Reynold Xin, Matei Zaharia, Michalis Petropoulos, Thanos Papathanasiou:
Databricks Lakeguard: Supporting Fine-grained Access Control and Multi-user Capabilities for Apache Spark Workloads. 418-430 - Shashank Gugnani, Zhen Hua Liu, Hui J. Chang, Beda Christoph Hammerschmidt, Srinivas Kareenhalli, Kishy Kumar, Tirthankar Lahiri, Ying Lu, Douglas McMahon, Ajit Mylavarapu, Sukhada Pendse, Ananth Raghavan:
JSON Relational Duality: A Revolutionary Combination of Document, Object, and Relational Models. 431-443 - Benjamin Hilprecht, Nico Mürdter, Arthur Arnold, Kristijan Ziza, Franz Färber, Wolfgang Lehner:
Scalable Execution of Application Logic within Everest BusinessStore. 444-456 - Gabriela Jacques-Silva, Evangelia Kalyvianaki, Katriel Cohn-Gordon, Adham Meguid, Huy Nguyen, Danny Ben-David, Carl Nayak, Varun Saravagi, George Stasa, Ioannis Papagiannis, David Taïeb, Kalkidan Tamirat, Haiyang Wu, Bo Xi, Taining Zhang, Qi Zhou:
Unified Lineage System: Tracking Data Provenance at Scale. 457-470 - Rong Kang, Yanbin Chen, Ye Liu, Fuxin Jiang, Qingshuo Li, Miao Ma, Jian Liu, Guangliang Zhao, Tieying Zhang, Jianjun Chen, Lei Zhang:
ABase: the Multi-Tenant NoSQL Serverless Database for Diverse and Dynamic Workloads in Large-scale Cloud Environments. 471-484 - Kihong Kim, Hyunwook Kim, Jinsu Lee, Taehyung Lee, Alexander Böhm, Norman May, Guido Moerkotte, Daniel Ritter, Ralf Dentzer, Heiko Gerwens, Irena Kofman, Mihnea Andrei:
Enterprise Application-Database Co-Innovation for Hybrid Transactional/Analytical Processing: A Virtual Data Model and Its Query Optimization Needs. 485-498 - Lukas Landgraf, Florian Wolf, Wolfgang Lehner:
Experimental Evaluation of Optimizing Memory Consumption in SAP HANA using PEOopt. 499-511 - Zeyan Li, Jie Song, Tieying Zhang, Tao Yang, Xiongjun Ou, Yingjie Ye, Pengfei Duan, Muchen Lin, Jianjun Chen:
Adaptive and Efficient Log Parsing as a Cloud Service. 512-524 - Ji You Li, Jiachi Zhang, Yuhang Liu, Wenchao Zhou, Xin Zhou, Fangyuan Zhou, Feifei Li:
Eigen+: Memory Over-Subscription for Alibaba Cloud Databases. 525-538 - Wei Li, Jiachi Zhang, Ye Yin, Yan Li, Zhanyang Zhu, Yuhao Li, Zhencan Peng, Lan Lu, Wenchao Zhou, Liang Lin, Feifei Li:
Flux: Unifying Heterogeneous Infrastructure for Alibaba AnalyticDB. 539-552 - Shige Liu, Zhifang Zeng, Li Chen, Adil Ainihaer, Arun Ramasami, Songting Chen, Yu Xu, Mingxi Wu, Jianguo Wang:
TigerVector: Supporting Vector Search in Graph Databases for Advanced RAGs. 553-565 - Bingqing Lyu, Xiaoli Zhou, Longbin Lai, Yufan Yang, Yunkai Lou, Wenyuan Yu, Ying Zhang, Jingren Zhou:
A Modular Graph-Native Query Optimization Framework. 566-579 - Norman May, Alexander Böhm, Daniel Ritter, Frank Renkes, Mihnea Andrei, Wolfgang Lehner:
SAP HANA Cloud: Data Management for Modern Enterprise Applications. 580-592 - Norifumi Nishikawa, Akira Shimizu, Akira Ito, Shinji Fujiwara, Yuto Hayamizu, Masaru Kitsuregawa, Kazuo Goda:
Dynamic Pruning for Recursive Joins. 593-607 - Jeffrey Pound, Floris Chabert, Arjun Bhushan, Ankur Goswami, Anil Pacaci, Shihabur Rahman Chowdhury:
MicroNN: An On-device Disk-resident Updatable Vector Database. 608-621 - Daniel Sotolongo, Daniel Mills, Tyler Akidau, Anirudh Santhiar, Attila-Péter Tóth, Botong Huang, Boyuan Zhang, Igor Belianski, Ling Geng, Matt Uhlar, Nikhil Shah, Olivia Zhou, Saras Nowak, Sasha Lionheart, Vlad Lifliand, Wendy Grus, Yiwen Zhu, Ankur Sharma, Dzmitry Pauliukevich, Enrico Sartorello, Ilaria Battiston, Ivan Kalev, Lawrence Benson, Leon Papke, Niklas Semmler, Till Merker, Yi Huang:
Streaming Democratized: Ease Across the Latency Spectrum with Delayed View Semantics and Snowflake Dynamic Tables. 622-634 - V. Srinivasan, Andrew Gooding, Sunil Sayyaparaju, Thomas Lopatic, Kevin Porter, Ashish Krishnadeo Shinde, Sri Varun Poluri, B. Narendran, Daudkhan Pathan, Srinivasan Seshadri:
Asynchronous Replication Strategies for a Real-Time DBMS. 635-647 - Jeff Swenson, Andy Kimball, Raphael 'kena' Poss, Rebecca Taft, Jay Lim, Adam Storm, Sumeer Bhola, Paul Bulkley-Logston, Pj Tatlow, Rachael Harding, Rafi Shamim, Aditya Maru, Irfan Sharif:
CockroachDB Serverless: Sub-second Scaling from Zero with Multi-region Cluster Virtualization. 648-661 - Vishal Vyas, Andrei Paduroiu, Srikanth Kandula, Hari Ohm Prasath Rajagopal, Mukesh Punhani, Marco Manzo, Ankur Goyal, Santosh Chandrachood, Rick Sears, Joseph Marques, Sushant Majithia:
Managed Resource Scaling in Amazon EMR. 662-674 - Donghui Wang, Yuxing Chen, Chengyao Jiang, Anqun Pan, Wei Jiang, Songli Wang, Hailin Lei, Chong Zhu, Lixiong Zheng, Wei Lu, Yunpeng Chai, Feng Zhang, Xiaoyong Du:
TXSQL: Lock Optimizations Towards High Contented Workloads. 675-688 - Xinjun Yang, Yingqiang Zhang, Hao Chen, Feifei Li, Gerry Fan, Yang Kong, Bo Wang, Jing Fang, Yuhui Wang, Tao Huang, Wenpu Hu, Jim Kao, Jianping Jiang:
Unlocking the Potential of CXL for Disaggregated Memory in Cloud-Native Databases. 689-702 - Tim Zeyl, Qi Cheng, Reza Pournaghi, Jason Lam, Weicheng Wang, Calvin Wong, Chong Chen, Per-Åke Larson:
Including Bloom Filters in Bottom-up Optimization. 703-715 - Shihao Zhou, Qi Mao, Yi Cheng, Hongcheng Qi, Yilun Huang, Peng Cai, Jun-Peng Zhu:
RedTAO: A Trillion-edge High-throughput Graph Store. 716-728 - Xuanhe Zhou, Wei Zhou, Liguo Qi, Hao Zhang, Dihao Chen, Bingsheng He, Mian Lu, Guoliang Li, Fan Wu, Yuqiang Chen:
OpenMLDB: A Real-Time Relational Data Feature Computation System for Online ML. 729-742 - Yiwen Zhu, Rathijit Sen, Brian Kroth, Sergiy Matusevych, Andreas C. Mueller, Tengfei Huang, Rahul Challapalli, Weihan Tang, Xin He, Mo Liu, Estera Kot, Sule Kahraman, Arshdeep Sekhon, Dario Bernal, Aditya Lakra, Shaily Fozdar, Dhruv Relwani, Rui Fang, Long Tian, Karuna Sagar Krishna, Ashit Gosalia, Carlo Curino, Subru Krishnan:
Rockhopper: A Robust Optimizer for Spark Configuration Tuning in Production Environment. 743-756 - Andreas Zimmerer, Damien Dam, Jan Kossmann, Juliane Waack, Ismail Oukid, Andreas Kipf:
Pruning in Snowflake: Working Smarter, Not Harder. 757-770
Panel Summaries
- Carsten Binnig, Danica Porobic:
Panel on AI for Future Databases: A New Beginning or a Boulevard of Broken Dreams? 771 - Eugene Wu, Raul Castro Fernandez:
Where Does Academic Database Research Go From Here? 772-774
Tutorial Papers
- Daniel Alabi, Sainyam Galhotra, Shagufta Mehnaz, Zeyu Song, Eugene Wu:
Privacy and Security in Distributed Data Markets. 775-787 - Abdullah Al-Mamun, Jianguo Wang, Walid G. Aref:
Learned Indexes From the One-dimensional to the Multi-dimensional Spaces: Challenges, Techniques, and Opportunities. 788-796 - Rico Bergmann, Dirk Habich:
Reproducible Prototyping of Query Optimizer Components. 797-804 - Daokun Hu, Quanqing Xu, Chuanghui Yang:
OLTP Engines on Modern Storage Architectures. 805-812 - Bojan Karlas, Babak Salimi, Sebastian Schelter:
Navigating Data Errors in Machine Learning Pipelines: Identify, Debug, and Learn. 813-820 - Brian Kroth, Sergiy Matusevych, Yiwen Zhu:
Autotuning Systems: Techniques, Challenges, and Opportunities. 821-828 - Rodrigo Laigner, George Christodoulou, Kyriakos Psarakis, Asterios Katsifodimos, Yongluan Zhou:
Transactional Cloud Applications: Status Quo, Challenges, and Opportunities. 829-836 - Guoliang Li, Jiayi Wang, Chenyang Zhang, Jiannan Wang:
Data+AI: LLM4Data and Data4LLM. 837-843 - Ningyi Liao, Siqiang Luo, Xiaokui Xiao, Reynold Cheng:
Advances in Designing Scalable Graph Neural Networks: The Perspective of Graph Data Management. 844-850 - Vidya Setlur:
Supporting Human-Centric Data Exploration Through Semantics and Natural Language Interaction. 851-854 - Utku Sirin, Stratos Idreos:
Data Storage and Management for Image AI Pipelines. 855-863
Workshop Summaries
- Akhil Arora, Stefania Dumbrava:
Eighth Joint Workshop on Graph Data Management Experiences & Systems (GRADES) and Network Data Analytics (NDA). 864-865 - Tanja Auge, Seokki Lee:
ProvenanceWeek2025. 866-867 - Carsten Binnig, Eric Sedlar:
21st International Workshop on Data Management on New Hardware (DaMoN). 868-869 - Renata Borovica-Gajic, Manisha Luthra, Ryan Marcus, Rajesh Bordawekar, Oded Shmueli:
Eighth International Workshop on Exploiting Artificial Intelligence Techniques for Data Management (aiDM). 870-871 - Faiza Allah Bukhsh, Paolo Ceravolo, Xu Chu, Samira Maghool, Eugene Wu, Cong Yu:
LLM-DPM - Workshop on Large Language Models for Data Process Management. 872-873 - Remco Chang, Kexin Rong, Roee Shraga:
Ninth Workshop on Human-In-the-Loop Data Analytics (HILDA). 874-875 - Avrilia Floratou, Jignesh M. Patel, Subru Krishnan:
First Workshop Connecting Academia and Industry on Modern Integrated Database and AI Systems (MIDAS). 876-877 - Stefan Grafberger, Madelon Hulsebos, Matteo Interlandi, Shreya Shankar:
Ninth Workshop on Data Management for End-to-End Machine Learning (DEEM). 878-879 - Michael Liut, Sourav S. Bhowmick, Abdussalam Alawini:
Fourth International Workshop on Data Systems Education (DataEd'25). 880-881 - Ibrahim Sabek, Immanuel Trummer:
Second Workshop on Quantum Computing and Quantum-Inspired Technology for Data-Intensive Systems and Applications (Q-Data). 882-883 - Amir Shaikhha, Torsten Grust:
The 19th International Symposium on Database Programming Languages (DBPL). 884-885 - Gerardo Vitagliano, Chunwei Liu, Lei Cao, Huan Sun, Paolo Papotti:
First Workshop on Novel Optimizations for Visionary AI Systems (NOVAS). 886-887

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.