default search action
ACM SIGMOD Conference 2022: Philadelphia, PA, USA
- Zachary G. Ives, Angela Bonifati, Amr El Abbadi:
SIGMOD '22: International Conference on Management of Data, Philadelphia, PA, USA, June 12 - 17, 2022. ACM 2022, ISBN 978-1-4503-9249-5
Keynote Talks
- Barbara Liskov:
Reflections on a Career in Computer Science. 1 - Laks V. S. Lakshmanan:
On a Quest for Combating Filter Bubbles and Misinformation. 2 - Christopher Ré:
Is Data Management the Beating Heart of AI Systems? 3
Session 1: Transaction Processing
- Chuzhe Tang, Zhaoguo Wang, Xiaodong Zhang, Qianmian Yu, Binyu Zang, Haibing Guan, Haibo Chen:
Ad Hoc Transactions in Web Applications: The Good, the Bad, and the Ugly. 4-18 - Youmin Chen, Xiangyao Yu, Paraschos Koutris, Andrea C. Arpaci-Dusseau, Remzi H. Arpaci-Dusseau, Jiwu Shu:
Plor: General Transactions with Predictable, Low Tail Latency. 19-33 - Jianqiu Zhang, Kaisong Huang, Tianzheng Wang, King Lv:
Skeena: Efficient and Consistent Cross-Engine Transactions. 34-48 - Jong-Bin Kim, Jaeseon Yu, Jaechan Ahn, Sooyong Kang, Hyungsoo Jung:
Diva: Making MVCC Systems HTAP-Friendly. 49-64 - Yijian Liu, Li Su, Vivek Shah, Yongluan Zhou, Marcos Antonio Vaz Salles:
Hybrid Deterministic and Nondeterministic Execution of Transactions in Actor Systems. 65-78
Session 2: Query Processing and Optimization 1
- Yisu Remy Wang, Mahmoud Abo Khamis, Hung Q. Ngo, Reinhard Pichler, Dan Suciu:
Optimizing Recursive Queries with Progam Synthesis. 79-93 - Zhaoguo Wang, Zhou Zhou, Yicun Yang, Haoran Ding, Gansen Hu, Ding Ding, Chuzhe Tang, Haibo Chen, Jinyang Li:
WeTune: Automatic Discovery and Verification of Query Rewrite Rules. 94-107 - Qichen Wang, Ke Yi:
Conjunctive Queries with Comparisons. 108-121 - Riccardo Mancini, Srinivas Karthik, Bikash Chandra, Vasilis Mageirakos, Anastasia Ailamaki:
Efficient Massively Parallel Join Optimization for Large Queries. 122-135 - Supun Abeysinghe, Qiyang He, Tiark Rompf:
Efficient Incrementialization of Correlated Nested Aggregate Queries using Relative Partial Aggregate Indexes (RPAI). 136-149
Session 3: ML for Data Management 1
- Barrie Kersbergen, Olivier Sprangers, Sebastian Schelter:
Serenade - Low-Latency Session-Based Recommendation in e-Commerce at Scale. 150-159 - Hanchen Wang, Rong Hu, Ying Zhang, Lu Qin, Wei Wang, Wenjie Zhang:
Neural Subgraph Counting with Wasserstein Estimator. 160-175 - Justin Talbot, Daniel Ting:
Statistical Schema Learning with Occam's Razor. 176-189 - Immanuel Trummer:
DB-BERT: A Database Tuning Tool that "Reads the Manual". 190-203 - Xiu Tang, Sai Wu, Mingli Song, Shanshan Ying, Feifei Li, Gang Chen:
PreQR: Pre-training Representation for SQL Understanding. 204-216
Session 4: Responsible Data Management and Fairness
- Sainyam Galhotra, Anna Fariha, Raoni Lourenço, Juliana Freire, Alexandra Meliou, Divesh Srivastava:
DataPrism: Exposing Disconnect between Data and Systems. 217-231 - Maliha Tashfia Islam, Anna Fariha, Alexandra Meliou, Babak Salimi:
Through the Data Management Lens: Experimental Analysis and Evaluation of Fair Classification. 232-246 - Romila Pradhan, Jiongli Zhu, Boris Glavic, Babak Salimi:
Interpretable Data-Based Explanations for Fairness Debugging. 247-261 - Dong Wei, Md Mouinul Islam, Baruch Schieber, Senjuti Basu Roy:
Rank Aggregation with Proportionate Fairness. 262-275 - Sainyam Galhotra, Karthikeyan Shanmugam, Prasanna Sattigeri, Kush R. Varshney:
Causal Feature Selection for Algorithmic Fairness. 276-285
Session 5: Streaming and Sensor Networks 1
- Yunlong Xu, Jinshu Liu, Fatemeh Nargesian:
TSUBASA: Climate Network Construction on Historical and Real-Time Data. 286-295 - Bogyeong Kim, Kyoseung Koo, Undraa Enkhbat, Bongki Moon:
DenForest: Enabling Fast Deletion in Incremental Density-Based Clustering over Sliding Windows. 296-309 - Hadar Sivan, Moshe Gabel, Assaf Schuster:
AutoMon: Automatic Distributed Monitoring for Arbitrary Multivariate Functions. 310-324 - David Tench, Evan West, Victor Zhang, Michael A. Bender, Abiyaz Chowdhury, J. Ahmed Dellas, Martin Farach-Colton, Tyler Seip, Kenny Zhang:
GraphZeppelin: Storage-Friendly Sketching for Connected Components on Dynamic Graph Streams. 325-339 - Adar Amir, Ilya Kolchinsky, Assaf Schuster:
DLACEP: A Deep-Learning Based Framework for Approximate Complex Event Processing. 340-354
Session 6: Data Cleaning and Integration
- Amir Gilad, Zhengjie Miao, Sudeepa Roy, Jun Yang:
Understanding Queries by Conditional Instances. 355-368 - Lampros Flokas, Weiyuan Wu, Yejia Liu, Jiannan Wang, Nakul Verma, Eugene Wu:
Complaint-Driven Training Data Debugging at Interactive Speeds. 369-383 - Wenfei Fan, Ziyan Han, Yaoshu Wang, Min Xie:
Parallel Rule Discovery from Large Datasets by Sampling. 384-398 - Zezhou Huang, Eugene Wu:
Reptile: Aggregation-level Explanations for Hierarchical Data. 399-413 - Sainyam Galhotra, Donatella Firmani, Barna Saha, Divesh Srivastava:
Hierarchical Entity Resolution using an Oracle. 414-428 - Dezhong Yao, Yuhong Gu, Gao Cong, Hai Jin, Xinqiao Lv:
Entity Resolution with Hierarchical Graph Attention Networks. 429-442 - Jianhong Tu, Ju Fan, Nan Tang, Peng Wang, Chengliang Chai, Guoliang Li, Ruixue Fan, Xiaoyong Du:
Domain Adaptation for Deep Entity Resolution. 443-457
Session 7: Data Management for ML 1
- Pei-Yu Hou, Daniel Robert Korn, Cleber C. Melo-Filho, David R. Wright, Alexander Tropsha, Rada Chirkova:
Compact Walks: Taming Knowledge-Graph Embeddings with Domain- and Task-Specific Pathways. 458-469 - Xupeng Miao, Yining Shi, Hailin Zhang, Xin Zhang, Xiaonan Nie, Zhi Yang, Bin Cui:
HET-GMP: A Graph-based System Approach to Scaling Large Embedding Model Training. 470-480 - Alexander Renz-Wieland, Rainer Gemulla, Zoi Kaoudi, Volker Markl:
NuPS: A Parameter Server for Machine Learning with Non-Uniform Parameter Access. 481-495 - Daniel Kang, Nikos Aréchiga, Sudeep Pillai, Peter D. Bailis, Matei Zaharia:
Finding Label and Model Errors in Perception Data With Learned Observation Assertions. 496-505 - Supun Nakandala, Arun Kumar:
Nautilus: An Optimized System for Deep Transfer Learning over Evolving Training Datasets. 506-520 - Chaoji Zuo, Sepehr Assadi, Dong Deng:
Spine: Scaling up Programming-by-Negative-Example for String Filtering and Transformation. 521-530 - Jinglin Peng, Bolin Ding, Jiannan Wang, Kai Zeng, Jingren Zhou:
One Size Does Not Fit All: A Bandit-Based Sampler Combination Framework with Theoretical Guarantees. 531-544
Session 8: Query Processing and Data Management for ML
- Pramod Chunduri, Jaeho Bang, Yao Lu, Joy Arulraj:
Zeus: Efficiently Localizing Actions in Videos using Reinforcement Learning. 545-558 - Jiashen Cao, Karan Sarkar, Ramyad Hadidi, Joy Arulraj, Hyesoon Kim:
FiGO: Fine-Grained Query Optimization in Video Analytics. 559-572 - Zihao Chen, Baokun Han, Chen Xu, Weining Qian, Aoying Zhou:
Redundancy Elimination in Distributed Matrix Computation. 573-586 - Kwanghyun Park, Karla Saur, Dalitso Banda, Rathijit Sen, Matteo Interlandi, Konstantinos Karanasos:
End-to-end Optimization of Machine Learning Prediction Queries. 587-601 - Zhuangdi Xu, Gaurav Tarlok Kakkar, Joy Arulraj, Umakishore Ramachandran:
EVA: A Symbolic Approach to Accelerating Exploratory Video Analytics with Materialized Views. 602-616
Session 9: Database Monitoring and Tuning
- Matthew Butrovich, Wan Shen Lim, Lin Ma, John Rollinson, William Zhang, Yu Xia, Andrew Pavlo:
Tastes Great! Less Filling! High Performance and Accurate Training Data Collection for Self-Driving Database Management Systems. 617-630 - Xinyi Zhang, Hong Wu, Yang Li, Jian Tan, Feifei Li, Bin Cui:
Towards Dynamic and Safe Configuration Tuning for Cloud Databases. 631-645 - Baoqing Cai, Yu Liu, Ce Zhang, Guangyu Zhang, Ke Zhou, Li Liu, Chunhua Li, Bin Cheng, Jie Yang, Jiashu Xing:
HUNTER: An Online Cloud Database Hybrid Tuning System for Personalized Requirements. 646-659 - Tarique Siddiqui, Saehan Jo, Wentao Wu, Chi Wang, Vivek R. Narasayya, Surajit Chaudhuri:
ISUM: Efficiently Compressing Large and Complex Workloads for Scalable Index Tuning. 660-673 - Jinhan Xin, Kai Hwang, Zhibin Yu:
LOCAT: Low-Overhead Online Configuration Auto-Tuning of Spark SQL Applications. 674-684
Session 10: Distributed and Parallel Databases
- Tobias Ziegler, Carsten Binnig, Viktor Leis:
ScaleStore: A Fast and Cost-Efficient Storage Engine using DRAM, NVMe, and RDMA. 685-699 - Michael Abebe, Horatiu Lazu, Khuzaima Daudjee:
Proteus: Autonomous Adaptive Storage for Mixed Workloads. 700-714 - Linguan Yang, Xinan Yan, Bernard Wong:
Natto: Providing Distributed Transaction Prioritization for High-Contention Workloads. 715-729 - Yu Sun, Zheng Zheng, Shaoxu Song, Fei Chiang:
Confidence Bounded Replica Currency Estimation. 730-743 - Yikai Zhao, Yinda Zhang, Yuanpeng Li, Yi Zhou, Chunhui Chen, Tong Yang, Bin Cui:
MinMax Sampling: A Near-optimal Global Summary for Aggregation in the Wide Area. 744-758
Session 11: Database Security, Privacy and Control
- Wei Dong, Juanru Fang, Ke Yi, Yuchao Tao, Ashwin Machanavajjhala:
R2T: Instance-optimal Truncation for Differentially Private Query Evaluation with Foreign Keys. 759-772 - Seng Pei Liew, Tsubasa Takahashi, Shun Takagi, Fumiyuki Kato, Yang Cao, Masatoshi Yoshikawa:
Network Shuffling: Privacy Amplification via Random Walks. 773-787 - Sainan Li, Qilei Yin, Guoliang Li, Qi Li, Zhuotao Liu, Jinwei Zhu:
Unsupervised Contextual Anomaly Detection for Database Systems. 788-802 - Zhao Chang, Dong Xie, Sheng Wang, Feifei Li:
Towards Practical Oblivious Join. 803-817 - Chenghong Wang, Johes Bater, Kartik Nayak, Ashwin Machanavajjhala:
IncShrink: Architecting Efficient Outsourced Databases using Incremental MPC and Differential Privacy. 818-832
Session 12: Graph Data Management and Mining
- Chenhao Ma, Yixiang Fang, Reynold Cheng, Laks V. S. Lakshmanan, Xiaolin Han:
A Convex-Programming Approach for Efficient Directed Densest Subgraph Discovery. 845-859 - Kaiqiang Yu, Cheng Long, Shengxin Liu, Da Yan:
Efficient Algorithms for Maximal k-Biplex Enumeration. 860-873 - Yahui Sun, Shuai Ma, Bin Cui:
Hunting Temporal Bumps in Graphs with Dynamic Vertex Properties. 874-888 - Junghoon Kim, Siqiang Luo, Gao Cong, Wenyuan Yu:
DMCS : Density Modularity based Community Search. 889-903 - Wentao Li, Miao Qiao, Lu Qin, Lijun Chang, Ying Zhang, Xuemin Lin:
On Scalable Computation of Graph Eccentricities. 904-916
Session 13: ML for Data Management and Query Processing
- Qiyu Liu, Yanyan Shen, Lei Chen:
HAP: An Efficient Hamming Space Index Based on Augmented Pigeonhole Principle. 917-930 - Zongheng Yang, Wei-Lin Chiang, Sifei Luan, Gautam Mittal, Michael Luo, Ion Stoica:
Balsa: Learning a Query Optimizer Without Expert Demonstrations. 931-944 - Lixi Zhang, Chengliang Chai, Xuanhe Zhou, Guoliang Li:
LearnedSQLGen: Constraint-aware SQL Generation using Reinforcement Learning. 945-958 - Xiao Hu, Yuxi Liu, Haibo Xiu, Pankaj K. Agarwal, Debmalya Panigrahi, Sudeepa Roy, Jun Yang:
Selectivity Functions of Range Queries are Learnable. 959-972 - Kangfei Zhao, Jeffrey Xu Yu, Zongyan He, Rui Li, Hao Zhang:
Lightweight and Accurate Cardinality Estimation by Neural Network Gaussian Process. 973-987
Session 14: Modern Hardware and In-memory DBMS
- Sangjin Lee, Alberto Lerner, André Ryser, Kibin Park, Chanyoung Jeon, Jinsub Park, Yong Ho Song, Philippe Cudré-Mauroux:
X-SSD: A Storage System with Native Support for Database Logging and Replication. 988-1002 - Nils Boeschen, Carsten Binnig:
GaccO - A GPU-accelerated OLTP DBMS. 1003-1016 - Clemens Lutz, Sebastian Breß, Steffen Zeuch, Tilmann Rabl, Volker Markl:
Triton Join: Efficiently Scaling to a Large Join State on GPUs with Fast Interconnects. 1017-1032 - Qing Wang, Youyou Lu, Jiwu Shu:
Sherman: A Write-Optimized Distributed B+Tree Index on Disaggregated Memory. 1033-1048 - Daokun Hu, Zhiwen Chen, Wenkui Che, Jianhua Sun, Hao Chen:
Halo: A Hybrid PMem-DRAM Persistent Hash Index with Fast Recovery. 1049-1063
Session 15: Streaming and Sensor Networks 2
- Xuebin Ren, Liang Shi, Weiren Yu, Shusen Yang, Cong Zhao, Zongben Xu:
LDP-IDS: Local Differential Privacy for Infinite Data Streams. 1064-1077 - Bonaventura Del Monte, Steffen Zeuch, Tilmann Rabl, Volker Markl:
Rethinking Stateful Stream Processing with RDMA. 1078-1092 - Maor Yankovitch, Ilya Kolchinsky, Assaf Schuster:
HYPERSONIC: A Hybrid Parallelization Approach for Scalable Complex Event Processing. 1093-1107 - Zhuo Zhang, Junhao Gan, Zhifeng Bao, Seyed Mohammad Hussein Kazemi, Guangyong Chen, Fengyuan Zhu:
Approximate Range Thresholding. 1108-1121 - Lei Ma, Chuan Lei, Olga Poppe, Elke A. Rundensteiner:
Gloria: Graph-based Sharing Optimizer for Event Trend Aggregation. 1122-1135
Session 16: Knowledge Discovery and Data Mining
- Martino Ciaperoni, Aristides Gionis, Athanasios Katsamanis, Panagiotis Karras:
SIEVE: A Space-Efficient Algorithm for Viterbi Decoding. 1136-1145 - Zhizhi Wang, Chaoji Zuo, Dong Deng:
TxtAlign: Efficient Near-Duplicate Text Alignment Search via Bottom-k Sketches for Plagiarism Detection. 1146-1159 - Shay Gershtein, Tova Milo, Slava Novgorodov, Kathy Razmadze:
Classifier Construction Under Budget Constraints. 1160-1174 - Paul Boniol, Mohammed Meftah, Emmanuel Remy, Themis Palpanas:
dCAM: Dimension-wise Class Activation Map for Explaining Multivariate Data Series Classification. 1175-1189 - Dmitrii Babaev, Nikita Ovsov, Ivan Kireev, Mariya Ivanova, Gleb Gusev, Ivan Nazarov, Alexander Tuzhilin:
CoLES: Contrastive Learning for Event Sequences with Self-Supervision. 1190-1199
Session 17: Query Processing and Optimization 2
- Yizhou Dai, Miao Qiao, Lijun Chang:
Anchored Densest Subgraph. 1200-1213 - Kyoungmin Kim, Jisung Jung, In Seo, Wook-Shin Han, Kangwoo Choi, Jaehyok Chong:
Learned Cardinality Estimation: An In-depth Study. 1214-1227 - Ibrahim Sabek, Tenzin Samten Ukyab, Tim Kraska:
LSched: A Workload-Aware Learned Query Scheduler for Analytical Database Systems. 1228-1242 - Adrian Vogelsgesang, Thomas Neumann, Viktor Leis, Alfons Kemper:
Efficient Evaluation of Arbitrarily-Framed Holistic SQL Aggregates and Window Functions. 1243-1256 - George Christodoulou, Panagiotis Bouros, Nikos Mamoulis:
HINT: A Hierarchical Index for Intervals in Main Memory. 1257-1270
Session 18: Data Management for ML 2
- Yiming Li, Yanyan Shen, Lei Chen:
Camel: Managing Data for Efficient Stream Learning. 1271-1285 - Lijie Xu, Shuang Qiu, Binhang Yuan, Jiawei Jiang, Cédric Renggli, Shaoduo Gan, Kaan Kara, Guoliang Li, Ji Liu, Wentao Wu, Jieping Ye, Ce Zhang:
In-Database Machine Learning with CorgiPile: Stochastic Gradient Descent without Full Data Shuffle. 1286-1300 - Qiange Wang, Yanfeng Zhang, Hao Wang, Chaoyi Chen, Xiaodong Zhang, Ge Yu:
NeutronStar: Distributed GNN Training with Hybrid Dependency Management. 1301-1315 - Fangcheng Fu, Huanran Xue, Yong Cheng, Yangyu Tao, Bin Cui:
BlindFL: Vertical Federated Machine Learning without Peeking into Your Data. 1316-1330 - Evgenios M. Kornaropoulos, Silei Ren, Roberto Tamassia:
The Price of Tailoring the Index to Your Data: Poisoning Attacks on Learned Index Structures. 1331-1344
Session 19: Databases for Emerging Hardware
- Qizhen Zhang, Xinyi Chen, Sidharth Sankhe, Zhilei Zheng, Ke Zhong, Sebastian Angel, Ang Chen, Vincent Liu, Boon Thau Loo:
Optimizing Data-intensive Systems in Disaggregated Data Centers with TELEPORT. 1345-1359 - Yu-Ching Hu, Yuliang Li, Hung-Wei Tseng:
TCUDB: Accelerating Database with Tensor Processors. 1360-1374 - Matthias Jasny, Lasse Thostrup, Tobias Ziegler, Carsten Binnig:
P4DB - The Case for In-Network OLTP. 1375-1389 - Anil Shanbhag, Bobbi W. Yogatama, Xiangyao Yu, Samuel Madden:
Tile-based Lightweight Integer Compression in GPU. 1390-1403 - Mijin An, In-Yeong Song, Yong Ho Song, Sang-Won Lee:
Avoiding Read Stalls on Flash Storage. 1404-1417
Session 20: Database Security and Distributed Data Management
- Zhiqi Wang, Zili Shao:
TimeUnion: An Efficient Architecture with Unified Data Model for Timeseries Management Systems on Hybrid Cloud Storage. 1418-1432 - Jiacheng Wu, Jin Wang, Carlo Zaniolo:
Optimizing Parallel Recursive Datalog Evaluation on Multicore Machines. 1433-1446 - Hao Zhang, Jeffrey Xu Yu, Yikai Zhang, Kangfei Zhao:
Parallel Query Processing: To Separate Communication from Computation. 1447-1461 - Harshavardhan Unnibhavi, David Cerdeira, Antonio Barbalace, Nuno Santos, Pramod Bhatotia:
Secure and Policy-Compliant Query Processing on Heterogeneous Computational Storage Architectures. 1462-1477 - Yu Xia, Xiangyao Yu, Matthew Butrovich, Andrew Pavlo, Srinivas Devadas:
Litmus: Towards a Practical Database Management System with Verifiable ACID Properties and Transaction Correctness. 1478-1492
Session 21: ML for Data Management 2
- Yoshihiko Suhara, Jinfeng Li, Yuliang Li, Dan Zhang, Çagatay Demiralp, Chen Chen, Wang-Chiew Tan:
Annotating Columns with Pre-trained Language Models. 1493-1503 - Zixuan Zhao, Raul Castro Fernandez:
Leva: Boosting Machine Learning Performance with Relational Embedding Data Augmentation. 1504-1517 - Sepideh Nikookar, Paras Sakharkar, Sathyanarayanan Somasunder, Senjuti Basu Roy, Adam Bienkowski, Matthew Macesker, Krishna R. Pattipati, David Sidoti:
Cooperative Route Planning Framework for Multiple Distributed Assets in Maritime Applications. 1518-1527 - Wentao Wu, Chi Wang, Tarique Siddiqui, Junxiong Wang, Vivek R. Narasayya, Surajit Chaudhuri, Philip A. Bernstein:
Budget-aware Index Tuning with Reinforcement Learning. 1528-1541 - Jingyi Yang, Peizhi Wu, Gao Cong, Tieying Zhang, Xiao He:
SAM: Database Generation from Query Workloads with Supervised Autoregressive Models. 1542-1555
Session 22: Provenance and Uncertainty
- Felix S. Campbell, Bahareh Sadat Arab, Boris Glavic:
Efficient Answering of Historical What-if Queries. 1556-1569 - Daniel Deutch, Nave Frost, Benny Kimelfeld, Mikaël Monet:
Computing the Shapley Value of Facts in Query Answering. 1570-1583 - Thomas Hütter, Nikolaus Augsten, Christoph M. Kirsch, Michael J. Carey, Chen Li:
JEDI: These aren't the JSON documents you're looking for? 1584-1597 - Sainyam Galhotra, Amir Gilad, Sudeepa Roy, Babak Salimi:
HypeR: Hypothetical Reasoning With What-If and How-To Queries Using a Probabilistic Causal Approach. 1598-1611 - Daniel Ting:
Adaptive Threshold Sampling. 1612-1625
Session 23: Storage and Indexing
- Christoph Anneser, Andreas Kipf, Huanchen Zhang, Thomas Neumann, Alfons Kemper:
Adaptive Hybrid Indexes. 1626-1639 - Brian Hentschel, Utku Sirin, Stratos Idreos:
Entropy-Learned Hashing: Constant Time Hashing with Controllable Uniformity. 1640-1654 - Feng Zhang, Weitao Wan, Chenyang Zhang, Jidong Zhai, Yunpeng Chai, Haixiang Li, Xiaoyong Du:
CompressDB: Enabling Efficient Compressed Data Direct Processing for Various Databases. 1655-1669