


default search action
28th EDBT 2025: Barcelona, Spain
- Alkis Simitsis, Bettina Kemme, Anna Queralt, Oscar Romero, Petar Jovanovic:

Proceedings 28th International Conference on Extending Database Technology, EDBT 2025, Barcelona, Spain, March 25-28, 2025. OpenProceedings.org 2024
Volume 1
Research Track
- Fernando de Meer Pardo, Claude Lehmann, Dennis Gehrig, Andrea Nagy, Stefano Nicoli, Branka Hadji Misheva, Martin Braschler, Kurt Stockinger

:
GraLMatch: Matching Groups of Entities with Graphs and Language Models. 1-12 - Trung-Hoang Le

, Hady W. Lauw:
Selecting Comparative Sets of Reviews Across Multiple Items. 13-24 - Panagiotis Bouros, Theodoros Chondrogiannis, Daniel Kowalski:

Fast Geosocial Reachability Queries. 25-38 - Ioannis Xarchakos, Nick Koudas:

Coping With Data Drift in Online Video Analytics. 39-52 - Qihao Cheng

, Da Yan, Tianhao Wu
, Lyuheng Yuan, Ji Cheng, Zhongyi Huang, Yang Zhou:
Efficient Enumeration of Large Maximal k-Plexes. 53-65 - Daren Chao, Nick Koudas, Xiaohui Yu, Yueting Chen:

Ensembling Object Detectors for Effective Video Query Processing. 66-79 - Yingjun Dai, Ahmed El-Roby, Elmira Adeeb, Vivek Thaker:

OmniMatch: Overcoming the Cold-Start Problem in Cross-Domain Recommendations using Auxiliary Reviews. 80-91 - Gyanendra Shrestha, Chutian Jiang, Sai Akula, Vivek Yannam, Anna Pyayt, Michael N. Gubanov:

Tabular Embeddings for Tables with Bi-Dimensional Hierarchical Metadata and Nesting. 92-105 - Angela Bonifati, Stefania Dumbrava, Haridimos Kondylakis, Georgia Troullinou, Giannis Vassiliou:

Progressive Querying on Knowledge Graphs. 106-118 - Giorgos Alexiou, George Papastefanatos, Vassilis Stamatopoulos, Georgia Koutrika, Nectarios Koziris:

QueryER: A Framework for Fast Analysis-Aware Deduplication over Dirty Data. 119-131 - Ala Eddine Laouir, Abdessamad Imine:

Private Approximate Query over Horizontal Data Federation. 132-144 - Adeel Aslam, Kaustubh Beedkar, Giovanni Simonini:

SPO-Join: Efficient Stream Inequality Join. 145-157
Experiments & Analyses Track
- Jonathan Fürst, Catherine Kosten, Farhad Nooralahzadeh, Yi Zhang, Kurt Stockinger

:
Evaluating the Data Model Robustness of Text-to-SQL Systems Based on Real User Queries. 158-170 - Nikolai Merkel, Daniel Stoll, Ruben Mayer, Hans-Arno Jacobsen:

An Experimental Comparison of Partitioning Strategies for Distributed Graph Neural Network Training. 171-184 - Sana Ebrahimi, Rishi Advani, Abolfazl Asudeh

:
Evaluating the Feasibility of Sampling-Based Techniques for Training Multilayer Perceptrons. 185-198 - Anna Mitsopoulou, Georgia Koutrika:

Analysis of Text-to-SQL Benchmarks: Limitations, Challenges and Opportunities. 199-212
Volume 2
Research Track
- Sina Shaham, Gabriel Ghinita, Bhaskar Krishnamachari, Cyrus Shahabi:

Differentially Private Publication of Smart Electricity Grid Data. 213-225 - Naiqing Guan, Kaiwen Chen, Nick Koudas:

DataSculpt: Cost-Efficient Label Function Design via Prompting Large Language Models. 226-232 - Hyunjin Choo, Minho Eom, Gyuri Kim, Young-Gyu Yoon, Kijung Shin:

RASP: Robust Mining of Frequent Temporal Sequential Patterns under Temporal Variations. 233-245 - Goetz Graefe, Marius Kuhrt, Bernhard Seeger:

Modifying an existing sort order with offset-value codes. 246-254 - Arnab Phani, Matthias Boehm:

MEMPHIS: Holistic Lineage-based Reuse and Memory Management for Multi-backend ML Systems. 255-269 - Tavor Lipman, Tova Milo, Amit Somech

, Tomer Wolfson, Oz Zafar:
LINX: A Language Driven Generative System for Goal-Oriented Automated Data Exploration. 270-283 - Jacco Johannes Egbert Kiezebrink, Wieger R. Punter, Odysseas Papapetrou

, Kevin Verbeek:
Synopses for Summarizing Spatial Data Streams. 284-296 - Jan-Eric Hellenberg, Fabian Mahling, Lukas Laskowski, Felix Naumann, Matteo Paganelli, Fabian Panse:

PRISMA: A Privacy-Preserving Schema Matcher using Functional Dependencies. 297-309 - Panos Vassiliadis, Alexandros Karakasidis:

Time-Related Patterns Of Schema Evolution. 310-323 - Tao Li, Feng Liang

, Jinqi Quan, Huang Chuang, Teng Wang, Runhuai Huang, Jie Wu, Xiping Hu:
Taste: Towards Practical Deep Learning-based Approaches for Semantic Type Detection in the Cloud. 324-336
Experiments & Analyses Track
- Angelo Mozzillo

, Luca Zecchini, Luca Gagliardelli, Adeel Aslam, Sonia Bergamaschi, Giovanni Simonini:
Evaluation of Dataframe Libraries for Data Preparation on a Single Machine. 337-349 - Felix Neutatz, Marius Lindauer

, Ziawasch Abedjan:
How Green is AutoML for Tabular Data? 350-363
Research Track
- Fatemeh Ahmadi

, Marc Speckmann, Malte F. Kuhlmann
, Ziawasch Abedjan:
MaTElDa: Multi-Table Error Detection. 364-376 - Christina Christodoulakis, Moshe Gabel, Angela Demke Brown:

Metadata Unification in Open Data with Gnomon. 377-383 - Akshay A. Bapat, Saravanan Thirumuruganathan, Nick Koudas:

Pythia: A Neural Model for Data Prefetching. 384-396 - Martin Pekár Christensen

, Aristotelis Leventidis, Matteo Lissandrini, Laura Di Rocco, Renée J. Miller, Katja Hose
:
Fantastic Tables and Where to Find Them: Table Search in Semantic Data Lakes. 397-410 - Michail Theologitis

, Georgios Frangias, Georgios Anestis, Vasilis Samoladas, Antonios Deligiannakis:
Communication-Efficient Distributed Deep Learning via Federated Dynamic Averaging. 411-424
Experiments & Analyses Track
- Ran Wei, Zichen Zhu, Andrew Kryczka, Jay Zhuang, Manos Athanassoulis:

Benchmarking, Analyzing, and Optimizing WA of Partial Compaction in RocksDB. 425-437
Research Track
- Reza Salkhordeh, Felix Martin Schuhknecht, Hossein Asadi, Steffen Eiden, André Brinkmann:

No Time to Halt: In-Situ Analysis for Large-Scale Data Processing via Virtual Snapshotting. 438-450 - Aneesh Raman, Konstantinos Karatsenidis, Shaolin Xie, Matthaios Olma, Subhadeep Sarkar

, Manos Athanassoulis:
QuIT your B+-tree for the Quick Insertion Tree. 451-463 - Nikolaos Koutroumanis, Christos Doulkeridis, Akrivi Vlachou:

Parallel Spatial Join Processing with Adaptive Replication. 464-476 - Henning Koehler, Muhammad Farhan, Qing Wang:

Stable Tree Labelling for Accelerating Distance Queries on Dynamic Road Networks. 477-489 - André L. C. Mendonça, Felipe T. Brito

, Javam C. Machado:
PEG: Local Differential Privacy for Edge-Labeled Graphs. 490-502 - Andrea Colombo, Teodoro Baldazzi, Luigi Bellomarini, Emanuel Sallinger, Stefano Ceri:

Template-based Explainable Inference over High-Stakes Financial Knowledge Graphs. 503-515
Experiments & Analyses Track
- Adrian Lutsch

, Muhammad El-Hindi, Matthias Heinrich, Daniel Ritter, Zsolt István, Carsten Binnig:
Benchmarking Analytical Query Processing in Intel SGXv2. 516-528 - Ralph Peeters

, Aaron Steiner
, Christian Bizer
:
Entity Matching using Large Language Models. 529-541
Volume 3
Research Track
- Sedir Mohammed

, Felix Naumann, Hazar Harmouch
:
Step-by-Step Data Cleaning Recommendations to Improve ML Prediction Accuracy. 542-554 - Mahesh Dananjaya, Vasilis Gavrielatos, Antonios Katsarakis, Nikos Ntarmos, Vijay Nagarajan:

Fast, Highly Available, and Recoverable Transactions on Disaggregated Data Stores. 555-568 - Stefano Calzavara, Lorenzo Cazzaro

, Donald Gera, Salvatore Orlando:
Watermarking Decision Tree Ensembles. 569-575 - Xinglin Du, Peng Tang, Rui Chen, Ning Wang, Chengyu Hu, Shanqing Guo:

Query Rewriting-Based View Generation for Efficient Multi-Relation Multi-Query with Differential Privacy. 576-588 - Wang Yue, Martin Boissier

, Manisha Luthra, Tilmann Rabl:
Dema: Efficient Decentralized Aggregation for Non-Decomposable Quantile Functions. 589-595 - Landy Andriamampianina, Franck Ravat

, Jiefu Song, Nathalie Vallès-Parlangeau, Yanpei Wang:
Selective Evolving Centrality in Temporal Heterogeneous Graphs. 596-608 - Eugenie Y. Lai, Yuze Lou, Brit Youngmann, Michael J. Cafarella:

Toward Standardized Data Preparation: A Bottom-Up Approach. 609-622 - Tanmay Surve, Romila Pradhan

:
Explaining Fairness Violations using Machine Unlearning. 623-635 - Minglang Xie, Jianye Yang, Wenjie Zhang, Shiyu Yang, Xuemin Lin:

Deep Skyline Community Search. 636-648 - Enas Khwaileh, Yannis Velegrakis

:
Dataset Discovery using Semantic Matching. 649-660 - Josef Schmeißer, Clemens Lutz, Volker Markl:

Efficiently Indexing Large Data on GPUs with Fast Interconnects. 661-667 - Kasun Amarasinghe, Farhana Choudhury, Jianzhong Qi

, James Bailey:
Learned Indexes with Distribution Smoothing via Virtual Points. 668-680 - Parisa Esmaeilian Ghahroudi, Sean Chester

, Alex Thomo:
Efficient Multicore Discovery of Small, High-Quality k-Plex Teams in Multi-attributed Networks. 681-693 - Camilla Birch Okkels

, Martin Aumüller, Viktor Bello Thomsen, Arthur Zimek
:
High-dimensional density-based clustering using locality-sensitive hashing. 694-706 - Tianshu Wang

, Xiaoyang Chen, Hongyu Lin, Xianpei Han, Le Sun, Hao Wang, Zhenyu Zeng:
DBCopilot: Natural Language Querying over Massive Databases via Schema Routing. 707-721 - Yu Liu, Qi Luo, Yanwei Zheng, Wenjie Zhang, Xuemin Lin, Dongxiao Yu:

Effective and Efficient Community Search over Large-Scale Hypergraphs. 722-734 - Hafiz Tayyab Rauf, Alex Teodor Bogatu, Norman W. Paton, André Freitas:

Gem: Gaussian Mixture Model Embeddings for Numerical Feature Distributions. 735-747 - Hoa Thi Le, Angela Bonifati, Andrea Mauri:

Graph Consistency Rule Mining with LLMs: an Exploratory Study. 748-754 - Bole Chang, Linxin Xie, Wei Li, Meng Qin, Jianfeng Hou:

Z-Shadow: An Efficient Method for Estimating Bicliques in Massive Graphs Using Füredi's Theorem. 755-768 - Christian Knödler, Naeem Ramzan, Ilia Petrov:

hybridNDP: Dynamic Operation Offloading and Cooperative Query Execution in Smart Storage Settings. 769-782 - Renzo Angles, Angela Bonifati, Roberto García

, Domagoj Vrgoc:
Path-based Algebraic Foundations of Graph Query Languages. 783-795 - Christoph Schinninger, Fabian Panse, Constantin Kühne, Lisa Ehrlinger:

Icewafl: A Configurable Data Stream Polluter. 796-802 - Mengying Wang

, Hanchao Ma, Yiyang Bian, Yangxin Fan, Yinghui Wu:
Generating Skyline Datasets for Data Science Models. 803-815 - Loredana Caruccio, Stefano Cirillo

, Giuseppe Polese, Roberto Stanzione:
An RFD-based approach for concept drift detection in Machine Learning Systems. 816-828 - Otmar Ertl:

ExaLogLog: Space-Efficient and Practical Approximate Distinct Counting up to the Exa-Scale. 829-841 - Nripsuta Ani Saxena, Ronit Mathur, Cyrus Shahabi:

Legally-Compliant Spatial Fairness Framework: Advancing Beyond Spatial Fairness. 842-854 - Nodirbek Korchiev, Akash Pateria, Vodelina Samatova, Sogolsadat Mansouri, Kemafor Anyanwu:

Taming the Beast of User-Programmed Transactions on Blockchains: A Declarative Transaction Approach. 855-866 - Mohamed Maher, Osama Fayez Oun, Mahmoud Saeed Mesmeh, Radwa El Shawi:

FedForecaster: An Automated Federated Learning Approach for Time-series Forecasting. 867-873 - Sijie Dong, Soror Sahri, Themis Palpanas, Qitong Wang:

Automated Data Quality Validation in an End-to-End GNN Framework. 874-880
Experiments & Analyses Track
- Peichen Xie, Zhigao Zheng, Yongluan Zhou

, Yang Xiu, Hao Liu
, Zhixiang Yang, Yu Zhang, Bo Du
:
GPU Architectures in Graph Analytics: A Comparative Experimental Study. 881-893 - Ling Zhang, Shaleen Deep, Joyce Cahoon, Jignesh M. Patel, Anja Gruenheid:

From Feature Selection to Resource Prediction: An Analysis of Commonly Applied Workflows and Techniques. 894-908 - Ananya Rahaman, Anny Zheng, Mostafa Milani, Fei Chiang, Rachel Pottinger:

Evaluating SQL Understanding in Large Language Models. 909-921 - Zeyu Zhang, Paul Groth, Iacer Calixto

, Sebastian Schelter:
A Deep Dive Into Cross-Dataset Entity Matching with Large and Small Language Models. 922-934 - Thomas Bodner, Theo Radig, David Justen, Daniel Ritter, Tilmann Rabl:

An Empirical Evaluation of Serverless Cloud Infrastructure for Large-Scale Data Processing. 935-948 - Mark Dodds, Khuzaima Daudjee:

Apache Ignite + Calcite Composable Database System: Experimental Evaluation and Analysis. 949-961
Vision Track
- Sihem Amer-Yahia, Jasmina Bogojeska

, Roberta Facchinetti, Valeria Franceschi, Aristides Gionis, Katja Hose, Georgia Koutrika, Roger D. Kouyos
, Matteo Lissandrini, Silviu Maniu, Katsiaryna Mirylenka, Davide Mottin, Themis Palpanas, Mattia Rigotti, Yannis Velegrakis
:
Towards Reliable Conversational Data Analytics. 962-969 - Mouna Ammar, Christopher Rost, Riccardo Tommasini, Shubhangi Agarwal

, Angela Bonifati, Petra Selmer, Evgeny Kharlamov, Erhard Rahm:
Towards Hybrid Graphs: Unifying Property Graphs and Time Series. 970-977 - Sepehr Sadoughi

, Nikolay Yakovets
, George Fletcher
:
Breaking Down the Data-metadata Barrier for Effective Property Graph Data Management. 978-984 - Koyena Pal

, David Bau, Renée J. Miller:
Model Lakes. 985-995
Industrial & Applications Track
- Boge Liu, Chunling Wang, Xiaoshuang Chen, Yu Hao, Zhengyi Yang

, Yi Jin, Yixing Yang, Wenke Yang, Wanchuan Zhang, Wenjie Zhang:
PhoebeDB: A Disk-Based RDBMS Kernel for High-Performance and Cost-Effective OLTP. 996-1004 - Andreas Kouvaras, Periklis Mantenoglou

, Alexander Artikis:
Generating Activity Definitions with Large Language Models. 1005-1013 - Gerald White, Deep Mistry, Kevin Chhoa, Senjuti Basu Roy, Lingyi Zhang, Adam Bienkowski, Krishna R. Pattipati:

A Computational Framework for Estimating Days of Maintenance Delay of Naval Ships. 1014-1022 - Zhijia Chen, Weiyi Meng, Eduard C. Dragut:

ComCrawler: General Crawling Solution for Aticle Comments. 1023-1031 - Rakesh Menon, Kun Qian, Liqun Chen, Ishika Joshi, Daniel Pandyan, Shashank Srivastava, Yunyao Li:

FISQL: Enhancing Text-to-SQL Systems with Rich Interactive Feedback. 1032-1038 - Chanuk Lim, Kyong-Ha Lee, Hyun Ji Jeong, Sungsu Lim:

GRAIL: Graph Retrieval-Augmented In-Context Learning for Node Classification in Real-World Textual-Attributed Graphs. 1039-1047 - Liat Antwarg Friedman, Gal Lavee, Bracha Shapira, Dorin Shmaryahu:

Data Completion In E-commerce. 1048-1056 - Ilaria Bordino, Francesco Di Iorio, Andrea Galliani, Alessio Rosatelli, Lorenzo Severini

:
UniAsk: AI-powered search for banking knowledge bases. 1057-1065
Demonstration Track
- Mihail Stoian, Alexander van Renen, Jan Kobiolka, Ping-Lin Kuo, Andreas Zimmerer, Josif Grabocka, Andreas Kipf:

Virtual: Compressing Data Lake Files. 1066-1069 - Georgios Grigoropoulos, Alexandros Troupiotis-Kapeliaris, Ilias Chamatidis, Evangelia Filippou, Konstantina Bereta:

Transforming Maritime Safety: Data-driven Applications for the Real-Time Detection and Mitigation of Maritime Incidents. 1070-1073 - Panagiotis Gidarakos, Nikolaos Theologitis, Stavros Maroulis, Loukas Kavouras, Giorgos Giannopoulos, George Papastefanatos:

GLOVES: Global Counterfactual-based Visual Explanations. 1074-1077 - Chiara Forresi

, Matteo Francia, Enrico Gallinucci, Matteo Golfarelli:
ASSO: the Automated Schemaless Stream Overseer. 1078-1081 - Joel Rorseth, Parke Godfrey, Lukasz Golab, Divesh Srivastava, Jarek Szlichta:

LADYBUG: an LLM Agent DeBUGger for data-driven applications. 1082-1085 - Evgeny S. Skvortsov, Shayan Mirjafari, Ojaswa Garg, Yilin Xia, Shawn Bowers, Bertram Ludäscher:

LogicLM: Robust Application of Large Language Models with Logic Programming for Data Analytics. 1086-1089 - Mohamed Abdelaal, Samuel Lokadjaja, Arne Kreuz, Harald Schöning:

DataLens: ML-Oriented Interactive Tabular Data Quality Dashboard. 1090-1093 - Haralampos Gavriilidis, Lennart Behme, Christian Munz, Varun Pandey, Volker Markl:

CompoDB: A Demonstration of Modular Data Systems in Practice. 1094-1097 - Anastasiia Avksientieva, Parke Godfrey, Lukasz Golab, Divesh Srivastava, Jarek Szlichta:

REACT: REcourse Analysis with Counterfactuals and Explanation Tables. 1098-1101 - Zesong Zhang, Jianzhong Qi

, Xin Cao, Christian S. Jensen
:
SemaSK: Answering Semantics-aware Spatial Keyword Queries with Large Language Models. 1102-1105 - Fajrian Yunus, Pratik Karmakar, Pierre Senellart, Talel Abdessalem, Stéphane Bressan:

Using A Probabilistic Database in an Image Retrieval Application. 1106-1109 - Marc Maynou, Sergi Nadal:

Supporting Data Discovery Tasks at Scale with FREYJA. 1110-1113 - Francesco Invernici, Anna Bernasconi

, Francesca Curati, Jelena Jakimov, Amirhossein Samavi:
TETYS: Configurable Topic Modeling Exploration for Big Corpora of Text Documents. 1114-1117 - Wenbo Sun, Ziyu Li, Vaishnav Srinidhi, Rihan Hai:

Database is All You Need: Serving LLMs with Relational Queries. 1118-1121 - Justus Henneberg, Felix Schuhknecht:

Do Research, not Data Visualization! How to Create More Consistent Plots for Experimental Research Papers in Less Time. 1122-1125 - Sven Rasmusen, Konstantina Pityanou

, Dimitra Papatsaroucha, Sofiane Lagraa, Moussa Ouedraogo, Evangelos K. Markakis:
Secure and Transparent Data Sharing with TrustShare: A GDPR-Compliant Platform. 1126-1129 - Ariane Ziehn, Lily Seidl, Samira Akili, Steffen Zeuch, Volker Markl:

Enabling Complex Event Processing in NebulaStream. 1130-1133 - Antonios Kontaxakis, Dimitris Sacharidis, Alkis Simitsis, Alberto Abelló, Sergi Nadal:

Hyppo: Efficient Discovery and Execution of Data Science Pipelines in Collaborative Environments. 1134-1137 - Pasquale Leonardo Lazzaro

, Marialaura Lazzaro, Paolo Missier, Riccardo Torlone:
PROLIT: Supporting the Transparency of Data Preparation Pipelines through Narratives over Data Provenance. 1138-1141 - Moein Shirdel, Joel Rorseth, Parke Godfrey, Lukasz Golab, Divesh Srivastava, Jarek Szlichta:

AprèsCoT: Explaining LLM Answers with Knowledge Graphs and Chain of Thought. 1142-1145 - Thomas Bodner, Tilmann Rabl:

An Interactive Analysis of Serverless Cloud Infrastructure. 1146-1149 - Jáchym Bártík, Alzbeta Srutková, Irena Holubová:

TransforMMer: A Universal Multi-Model Data Generator. 1150-1153 - Andrea Baraldi, Matteo Brucato, Miroslav Dudík, Francesco Guerra, Matteo Interlandi:

FairnessEval: a Framework for Evaluating Fairness of Machine Learning Models. 1154-1157 - Charlotte Felius, Peter Boncz:

VCrypt: Leveraging Vectorized and Compressed Execution for Client-side Encryption. 1158-1161
Tutorial Track
- Vincent T'kindt, Patrick Marcel:

Can Operations Research bring you to the next level? Basics and application. 1162-1165 - Da Yan, Lyuheng Yuan, Akhlaque Ahmad, Saugat Adhikari:

Systems for Scalable Graph Analytics and Machine Learning: Trends and Methods. 1166-1169 - Mohamed-Amine Baazizi, Dario Colazzo, Giorgio Ghelli, Carlo Sartiani, Stefanie Scherzinger:

Everything You Always Wanted to Know About JSON Schema (But Were Afraid to Ask). 1170-1173 - Chuangtao Ma

, Yongrui Chen, Tianxing Wu, Arijit Khan
, Haofen Wang:
Unifying Large Language Models and Knowledge Graphs for Question Answering: Recent Advances and Opportunities. 1174-1177

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














