default search action
Nan Tang 0001
Person information
- unicode name: 汤南
- affiliation: Hong Kong University of Science and Technology Guangzhou (HKUST-GZ), Information Hub, Guangzhou, China
- affiliation (former): Hamad Bin Khalifa University, Qatar Computing Research Institute, Doha, Qatar
- affiliation (former): University of Edinburgh, UK
- affiliation (former): Centrum Wiskunde & Informatica, Amsterdam, The Netherlands
- affiliation (former, PhD 2007): Chinese University of Hong Kong, Hong Kong
Other persons with the same name
- Nan Tang — disambiguation page
- Nan Tang 0002 — Dalian University of Technology, Dalian, China
- Nan Tang 0003 — University of Arkansas, USA
- Nan Tang 0004 — Xi'an Shiyou University, Xian, China
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j68]Tongyu Liu, Ju Fan, Nan Tang, Guoliang Li, Xiaoyong Du:
Controllable Tabular Data Synthesis Using Diffusion Models. Proc. ACM Manag. Data 2(1): 28:1-28:29 (2024) - [j67]Yuhao Deng, Chengliang Chai, Lei Cao, Nan Tang, Jiayi Wang, Ju Fan, Ye Yuan, Guoren Wang:
MisDetect: Iterative Mislabel Detection using Early Loss. Proc. VLDB Endow. 17(6): 1159-1172 (2024) - [j66]Yuhao Deng, Chengliang Chai, Lei Cao, Qin Yuan, Siyuan Chen, Yanrui Yu, Zhaoze Sun, Junyi Wang, Jiajun Li, Ziqi Cao, Kaisen Jin, Chi Zhang, Yuqing Jiang, Yuanfang Zhang, Yuping Wang, Ye Yuan, Guoren Wang, Nan Tang:
LakeBench: A Benchmark for Discovering Joinable and Unionable Tables in Data Lakes. Proc. VLDB Endow. 17(8): 1925-1938 (2024) - [j65]Ju Fan, Zihui Gu, Songyue Zhang, Yuxin Zhang, Zui Chen, Lei Cao, Guoliang Li, Samuel Madden, Xiaoyong Du, Nan Tang:
Combining Small Language Models and Large Language Models for Zero-Shot NL2SQL. Proc. VLDB Endow. 17(11): 2750-2763 (2024) - [j64]Yushi Sun, Xin Hao, Kai Sun, Yifan Xu, Xiao Yang, Xin Luna Dong, Nan Tang, Lei Chen:
Are Large Language Models a Good Replacement of Taxonomies? Proc. VLDB Endow. 17(11): 2919-2932 (2024) - [j63]Yupeng Xie, Yuyu Luo, Guoliang Li, Nan Tang:
HAIChart: Human and AI Paired Visualization System. Proc. VLDB Endow. 17(11): 3178-3191 (2024) - [j62]Boyan Li, Yuyu Luo, Chengliang Chai, Guoliang Li, Nan Tang:
The Dawn of Natural Language to SQL: Are We Fully Ready? [Experiment, Analysis \u0026 Benchmark ]. Proc. VLDB Endow. 17(11): 3318-3331 (2024) - [j61]Chengliang Chai, Yuhao Deng, Yutong Zhan, Ziqi Cao, Yuanfang Zhang, Lei Cao, Yu-Ping Wang, Zhiwei Zhang, Ye Yuan, Guoren Wang, Nan Tang:
LakeCompass: An End-to-End System for Table Maintenance, Search and Analysis in Data Lakes. Proc. VLDB Endow. 17(12): 4381-4384 (2024) - [j60]Mohamed Y. Eltabakh, Zan Ahmad Naeem, Mohammad Shahmeer Ahmad, Mourad Ouzzani, Nan Tang:
RetClean: Retrieval-Based Tabular Data Cleaning Using LLMs and Data Lakes. Proc. VLDB Endow. 17(12): 4421-4424 (2024) - [j59]Ju Fan, Jianhong Tu, Guoliang Li, Peng Wang, Xiaoyong Du, Xiaofeng Jia, Song Gao, Nan Tang:
Unicorn: A Unified Multi-Tasking Matching Model. SIGMOD Rec. 53(1): 44-53 (2024) - [j58]Tongyu Liu, Ju Fan, Guoliang Li, Nan Tang, Xiaoyong Du:
Tabular data synthesis with generative adversarial networks: design space and optimizations. VLDB J. 33(2): 255-280 (2024) - [c72]Nan Tang, Chenyu Yang, Ju Fan, Lei Cao, Yuyu Luo, Alon Y. Halevy:
VerifAI: Verified Generative AI. CIDR 2024 - [c71]Chengliang Chai, Kaisen Jin, Nan Tang, Ju Fan, Lianpeng Qiao, Yuping Wang, Yuyu Luo, Ye Yuan, Guoren Wang:
Mitigating Data Scarcity in Supervised Machine Learning Through Reinforcement Learning Guided Data Generation. ICDE 2024: 3613-3626 - [c70]Meihao Fan, Xiaoyue Han, Ju Fan, Chengliang Chai, Nan Tang, Guoliang Li, Xiaoyong Du:
Cost-Effective In-Context Learning for Entity Resolution: A Design Space Exploration. ICDE 2024: 3696-3709 - [c69]Sibei Chen, Hanbing Liu, Waiting Jin, Xiangyu Sun, Xiaoyao Feng, Ju Fan, Xiaoyong Du, Nan Tang:
ChatPipe: Orchestrating Data Preparation Pipelines by Optimizing Human-ChatGPT Interactions. SIGMOD Conference Companion 2024: 484-487 - [c68]Yuhao Deng, Deng Qiyan, Chengliang Chai, Lei Cao, Nan Tang, Ju Fan, Jiayi Wang, Ye Yuan, Guoren Wang:
IDE: A System for Iterative Mislabel Detection. SIGMOD Conference Companion 2024: 500-503 - [i23]Boyan Li, Yuyu Luo, Chengliang Chai, Guoliang Li, Nan Tang:
The Dawn of Natural Language to SQL: Are We Fully Ready? CoRR abs/2406.01265 (2024) - [i22]Xiao Yang, Kai Sun, Hao Xin, Yushi Sun, Nikita Bhalla, Xiangsen Chen, Sajal Choudhary, Rongze Daniel Gui, Ziran Will Jiang, Ziyu Jiang, Lingkun Kong, Brian Moran, Jiaqi Wang, Yifan Ethan Xu, An Yan, Chenyu Yang, Eting Yuan, Hanwen Zha, Nan Tang, Lei Chen, Nicolas Scheffer, Yue Liu, Nirav Shah, Rakesh Wanga, Anuj Kumar, Wen-tau Yih, Xin Luna Dong:
CRAG - Comprehensive RAG Benchmark. CoRR abs/2406.04744 (2024) - [i21]Yizhang Zhu, Shiyin Du, Boyan Li, Yuyu Luo, Nan Tang:
Are Large Language Models Good Statisticians? CoRR abs/2406.07815 (2024) - [i20]Yupeng Xie, Yuyu Luo, Guoliang Li, Nan Tang:
HAIChart: Human and AI Paired Visualization System. CoRR abs/2406.11033 (2024) - [i19]Yushi Sun, Hao Xin, Kai Sun, Yifan Ethan Xu, Xiao Yang, Xin Luna Dong, Nan Tang, Lei Chen:
Are Large Language Models a Good Replacement of Taxonomies? CoRR abs/2406.11131 (2024) - [i18]Xinyu Liu, Shuyu Shen, Boyan Li, Peixian Ma, Runzhi Jiang, Yuyu Luo, Yuxin Zhang, Ju Fan, Guoliang Li, Nan Tang:
A Survey of NL2SQL with Large Language Models: Where are we, and where are we going? CoRR abs/2408.05109 (2024) - 2023
- [j57]Jianhong Tu, Ju Fan, Nan Tang, Peng Wang, Guoliang Li, Xiaoyong Du, Xiaofeng Jia, Song Gao:
Unicorn: A Unified Multi-tasking Model for Supporting Matching Tasks in Data Integration. Proc. ACM Manag. Data 1(1): 84:1-84:26 (2023) - [j56]Yuyu Luo, Yihui Zhou, Nan Tang, Guoliang Li, Chengliang Chai, Leixian Shen:
Learned Data-aware Image Representations of Line Charts for Similarity Search. Proc. ACM Manag. Data 1(1): 88:1-88:29 (2023) - [j55]Sibei Chen, Nan Tang, Ju Fan, Xuemi Yan, Chengliang Chai, Guoliang Li, Xiaoyong Du:
HAIPipe: Combining Human-generated and Machine-generated Pipelines for Data Preparation. Proc. ACM Manag. Data 1(1): 91:1-91:26 (2023) - [j54]Zihui Gu, Ju Fan, Nan Tang, Lei Cao, Bowen Jia, Sam Madden, Xiaoyong Du:
Few-shot Text-to-SQL Translation using Structure and Content Prompt Learning. Proc. ACM Manag. Data 1(2): 147:1-147:28 (2023) - [j53]Chengliang Chai, Jiabin Liu, Nan Tang, Ju Fan, Dongjing Miao, Jiayi Wang, Yuyu Luo, Guoliang Li:
GoodCore: Data-effective and Data-efficient Machine Learning through Coreset Selection over Incomplete Data. Proc. ACM Manag. Data 1(2): 157:1-157:27 (2023) - [j52]Yong Wang, Kaiyu Li, Guoliang Li, Nan Tang:
Road-Aware Indexing for Trajectory Range Queries. IEEE Trans. Knowl. Data Eng. 35(8): 8476-8489 (2023) - [j51]Shuang Hao, Chengliang Chai, Guoliang Li, Nan Tang, Ning Wang, Xiang Yu:
HOFD: An Outdated Fact Detector for Knowledge Bases. IEEE Trans. Knowl. Data Eng. 35(10): 10775-10789 (2023) - [c67]Zui Chen, Zihui Gu, Lei Cao, Ju Fan, Samuel Madden, Nan Tang:
Symphony: Towards Natural Language Query Answering over Multi-modal Data Lakes. CIDR 2023 - [c66]Chengliang Chai, Jiayi Wang, Nan Tang, Ye Yuan, Jiabin Liu, Yuhao Deng, Guoren Wang:
Efficient Coreset Selection with Cluster-based Methods. KDD 2023: 167-178 - [c65]Chengliang Chai, Nan Tang, Ju Fan, Yuyu Luo:
Demystifying Artificial Intelligence for Data Preparation. SIGMOD Conference Companion 2023: 13-20 - [c64]Chenyu Yang, Ruixue Fan, Nan Tang, Meihui Zhang, Xiaoman Zhao, Ju Fan, Xiaoyong Du:
Pay "Attention" to Chart Images for What You Read on Text. SIGMOD Conference Companion 2023: 111-114 - [i17]Mohammad Shahmeer Ahmad, Zan Ahmad Naeem, Mohamed Y. Eltabakh, Mourad Ouzzani, Nan Tang:
RetClean: Retrieval-Based Data Cleaning Using Foundation Models and Data Lakes. CoRR abs/2303.16909 (2023) - [i16]Sibei Chen, Hanbing Liu, Waiting Jin, Xiangyu Sun, Xiaoyao Feng, Ju Fan, Xiaoyong Du, Nan Tang:
ChatPipe: Orchestrating Data Preparation Program by Optimizing Human-ChatGPT Interactions. CoRR abs/2304.03540 (2023) - [i15]Zihui Gu, Ju Fan, Nan Tang, Songyue Zhang, Yuxin Zhang, Zui Chen, Lei Cao, Guoliang Li, Sam Madden, Xiaoyong Du:
Interleaving Pre-Trained Language Models and Large Language Models for Zero-Shot NL2SQL Generation. CoRR abs/2306.08891 (2023) - [i14]Nan Tang, Chenyu Yang, Ju Fan, Lei Cao:
VerifAI: Verified Generative AI. CoRR abs/2307.02796 (2023) - [i13]Zui Chen, Lei Cao, Sam Madden, Ju Fan, Nan Tang, Zihui Gu, Zeyuan Shang, Chunwei Liu, Michael J. Cafarella, Tim Kraska:
SEED: Simple, Efficient, and Effective Data Management via Large Language Models. CoRR abs/2310.00749 (2023) - [i12]Meihao Fan, Xiaoyue Han, Ju Fan, Chengliang Chai, Nan Tang, Guoliang Li, Xiaoyong Du:
Cost-Effective In-Context Learning for Entity Resolution: A Design Space Exploration. CoRR abs/2312.03987 (2023) - 2022
- [j50]Xiang Yu, Chengliang Chai, Xinning Zhang, Nan Tang, Ji Sun, Guoliang Li:
AlphaQO: Robust Learned Query Optimizer. Int. J. Softw. Informatics 12(1): 7-29 (2022) - [j49]Guoliang Li, Nan Tang, Chengliang Chai:
Preface. J. Comput. Sci. Technol. 37(5): 1003-1004 (2022) - [j48]Chengliang Chai, Jiabin Liu, Nan Tang, Guoliang Li, Yuyu Luo:
Selective Data Acquisition in the Wild for Model Charging. Proc. VLDB Endow. 15(7): 1466-1478 (2022) - [j47]Jianhong Tu, Xiaoyue Han, Ju Fan, Nan Tang, Chengliang Chai, Guoliang Li, Xiaoyong Du:
DADER: Hands-Off Entity Resolution with Domain Adaptation. Proc. VLDB Endow. 15(12): 3666-3669 (2022) - [j46]Jiayi Wang, Chengliang Chai, Nan Tang, Jiabin Liu, Guoliang Li:
Coresets over Multiple Tables for Feature-rich and Data-efficient Machine Learning. Proc. VLDB Endow. 16(1): 64-76 (2022) - [j45]Jinfeng Peng, Derong Shen, Nan Tang, Tieying Liu, Yue Kou, Tiezheng Nie, Hang Cui, Ge Yu:
Self-supervised and Interpretable Data Cleaning with Sequence Generative Adversarial Networks. Proc. VLDB Endow. 16(3): 433-446 (2022) - [j44]Yuyu Luo, Xuedi Qin, Chengliang Chai, Nan Tang, Guoliang Li, Wenbo Li:
Steerable Self-Driving Data Visualization. IEEE Trans. Knowl. Data Eng. 34(1): 475-490 (2022) - [j43]Yuyu Luo, Nan Tang, Guoliang Li, Jiawei Tang, Chengliang Chai, Xuedi Qin:
Natural Language to Visualization by Neural Machine Translation. IEEE Trans. Vis. Comput. Graph. 28(1): 217-226 (2022) - [j42]Xuedi Qin, Chengliang Chai, Yuyu Luo, Tianyu Zhao, Nan Tang, Guoliang Li, Jianhua Feng, Xiang Yu, Mourad Ouzzani:
Interactively discovering and ranking desired tuples by data exploration. VLDB J. 31(4): 753-777 (2022) - [c63]Zihui Gu, Ju Fan, Nan Tang, Preslav Nakov, Xiaoman Zhao, Xiaoyong Du:
PASTA: Table-Operations Aware Fact Verification via Sentence-Table Cloze Pre-training. EMNLP 2022: 4971-4983 - [c62]Xuedi Qin, Chengliang Chai, Nan Tang, Jian Li, Yuyu Luo, Guoliang Li, Yaoyu Zhu:
Synthesizing Privacy Preserving Entity Resolution Datasets. ICDE 2022: 2359-2371 - [c61]Jiabin Liu, Chengliang Chai, Yuyu Luo, Yin Lou, Jianhua Feng, Nan Tang:
Feature Augmentation with Reinforcement Learning. ICDE 2022: 3360-3372 - [c60]Jianhong Tu, Ju Fan, Nan Tang, Peng Wang, Chengliang Chai, Guoliang Li, Ruixue Fan, Xiaoyong Du:
Domain Adaptation for Deep Entity Resolution. SIGMOD Conference 2022: 443-457 - [i11]Zihui Gu, Ju Fan, Nan Tang, Preslav Nakov, Xiaoman Zhao, Xiaoyong Du:
PASTA: Table-Operations Aware Fact Verification via Sentence-Table Cloze Pre-training. CoRR abs/2211.02816 (2022) - 2021
- [j41]Tongyu Liu, Ju Fan, Yinqing Luo, Nan Tang, Guoliang Li, Xiaoyong Du:
Adaptive Data Augmentation for Supervised Learning over Missing Data. Proc. VLDB Endow. 14(7): 1202-1214 (2021) - [j40]Nan Tang, Ju Fan, Fangyi Li, Jianhong Tu, Xiaoyong Du, Guoliang Li, Samuel Madden, Mourad Ouzzani:
RPT: Relational Pre-trained Transformer Is Almost All You Need towards Democratizing Data Preparation. Proc. VLDB Endow. 14(8): 1254-1261 (2021) - [j39]Saravanan Thirumuruganathan, Han Li, Nan Tang, Mourad Ouzzani, Yash Govind, Derek Paulsen, Glenn Fung, AnHai Doan:
Deep Learning for Blocking in Entity Matching: A Design Space Exploration. Proc. VLDB Endow. 14(11): 2459-2472 (2021) - [j38]Jiabin Liu, Fu Zhu, Chengliang Chai, Yuyu Luo, Nan Tang:
Automatic Data Acquisition for Deep Learning. Proc. VLDB Endow. 14(12): 2739-2742 (2021) - [j37]Ji Sun, Jintao Zhang, Zhaoyan Sun, Guoliang Li, Nan Tang:
Learned Cardinality Estimation: A Design Space Exploration and A Comparative Evaluation. Proc. VLDB Endow. 15(1): 85-97 (2021) - [j36]Shuang Hao, Nan Tang, Guoliang Li, Jianhua Feng, Ning Wang:
Mis-categorized entities detection. VLDB J. 30(4): 515-536 (2021) - [c59]Xuedi Qin, Chengliang Chai, Yuyu Luo, Tianyu Zhao, Nan Tang, Guoliang Li, Jianhua Feng, Xiang Yu, Mourad Ouzzani:
Ranking Desired Tuples by Database Exploration. ICDE 2021: 1973-1978 - [c58]Yuyu Luo, Nan Tang, Guoliang Li, Chengliang Chai, Wenbo Li, Xuedi Qin:
Synthesizing Natural Language to Visualization (NL2VIS) Benchmarks from NL2SQL Benchmarks. SIGMOD Conference 2021: 1235-1247 - [c57]Ji Sun, Guoliang Li, Nan Tang:
Learned Cardinality Estimation for Similarity Queries. SIGMOD Conference 2021: 1745-1757 - 2020
- [j35]Yuyu Luo, Nan Tang, Guoliang Li, Wenbo Li, Tianyu Zhao, Xiang Yu:
DeepEye: A Data Science System for Monitoring and Exploring COVID-19 Data. IEEE Data Eng. Bull. 43(2): 121-132 (2020) - [j34]John K. Feser, Sam Madden, Nan Tang, Armando Solar-Lezama:
Deductive optimization of relational data storage. Proc. ACM Program. Lang. 4(OOPSLA): 170:1-170:30 (2020) - [j33]Abdulhakim Ali Qahtan, Nan Tang, Mourad Ouzzani, Yang Cao, Michael Stonebraker:
Pattern Functional Dependencies for Data Cleaning. Proc. VLDB Endow. 13(5): 684-697 (2020) - [j32]Yuyu Luo, Chengliang Chai, Xuedi Qin, Nan Tang, Guoliang Li:
VisClean: Interactive Cleaning for Progressive Visualization. Proc. VLDB Endow. 13(12): 2821-2824 (2020) - [j31]Yuyu Luo, Wenbo Li, Tianyu Zhao, Xiang Yu, Lixi Zhang, Guoliang Li, Nan Tang:
DeepTrack: Monitoring and Exploring Spatio-Temporal Data - A Case of Tracking COVID-19 -. Proc. VLDB Endow. 13(12): 2841-2844 (2020) - [j30]El Kindi Rezig, Ashrita Brahmaroutu, Nesime Tatbul, Mourad Ouzzani, Nan Tang, Timothy G. Mattson, Samuel Madden, Michael Stonebraker:
Debugging Large-Scale Data Science Pipelines using Dagger. Proc. VLDB Endow. 13(12): 2993-2996 (2020) - [j29]Xuedi Qin, Yuyu Luo, Nan Tang, Guoliang Li:
Making data visualization more efficient and effective: a survey. VLDB J. 29(1): 93-117 (2020) - [c56]El Kindi Rezig, Lei Cao, Giovanni Simonini, Maxime Schoemans, Samuel Madden, Nan Tang, Mourad Ouzzani, Michael Stonebraker:
Dagger: A Data (not code) Debugger. CIDR 2020 - [c55]Saravanan Thirumuruganathan, Nan Tang, Mourad Ouzzani, AnHai Doan:
Data Curation with Deep Learning. EDBT 2020: 277-286 - [c54]Yuyu Luo, Chengliang Chai, Xuedi Qin, Nan Tang, Guoliang Li:
Interactive Cleaning for Progressive Visualization through Composite Questions. ICDE 2020: 733-744 - [c53]Xiang Yu, Guoliang Li, Chengliang Chai, Nan Tang:
Reinforcement Learning with Tree-LSTM for Join Order Selection. ICDE 2020: 1297-1308 - [c52]Shuang Hao, Chengliang Chai, Guoliang Li, Nan Tang, Ning Wang, Xiang Yu:
Outdated Fact Detection in Knowledge Bases. ICDE 2020: 1890-1893 - [c51]Xuedi Qin, Chengliang Chai, Yuyu Luo, Nan Tang, Guoliang Li:
Interactively Discovering and Ranking Desired Tuples without Writing SQL Queries. SIGMOD Conference 2020: 2745-2748 - [c50]Mashaal Musleh, Mourad Ouzzani, Nan Tang, AnHai Doan:
CoClean: Collaborative Data Cleaning. SIGMOD Conference 2020: 2757-2760 - [i10]Nan Tang, Ju Fan, Fangyi Li, Jianhong Tu, Xiaoyong Du, Guoliang Li, Sam Madden, Mourad Ouzzani:
Relational Pretrained Transformers towards Democratizing Data Preparation [Vision]. CoRR abs/2012.02469 (2020)
2010 – 2019
- 2019
- [j28]Yong Wang, Guoliang Li, Nan Tang:
Querying Shortest Paths on Time Dependent Road Networks. Proc. VLDB Endow. 12(11): 1249-1261 (2019) - [j27]El Kindi Rezig, Lei Cao, Michael Stonebraker, Giovanni Simonini, Wenbo Tao, Samuel Madden, Mourad Ouzzani, Nan Tang, Ahmed K. Elmagarmid:
Data Civilizer 2.0: A Holistic Framework for Data Preparation and Analytics. Proc. VLDB Endow. 12(12): 1954-1957 (2019) - [j26]Sibo Wang, Renchi Yang, Runhui Wang, Xiaokui Xiao, Zhewei Wei, Wenqing Lin, Yin Yang, Nan Tang:
Efficient Algorithms for Approximate Single-Source Personalized PageRank Queries. ACM Trans. Database Syst. 44(4): 18:1-18:37 (2019) - [c49]Dong Deng, Wenbo Tao, Ziawasch Abedjan, Ahmed K. Elmagarmid, Ihab F. Ilyas, Guoliang Li, Samuel Madden, Mourad Ouzzani, Michael Stonebraker, Nan Tang:
Unsupervised String Transformation Learning for Entity Consolidation. ICDE 2019: 196-207 - [c48]Saravanan Thirumuruganathan, Mourad Ouzzani, Nan Tang:
Explaining Entity Resolution Predictions: Where are we and What needs to be done? HILDA@SIGMOD 2019: 10:1-10:6 - [c47]Mohammad Mahdavi, Ziawasch Abedjan, Raul Castro Fernandez, Samuel Madden, Mourad Ouzzani, Michael Stonebraker, Nan Tang:
Raha: A Configuration-Free Error Detection System. SIGMOD Conference 2019: 865-882 - [c46]Abdulhakim Ali Qahtan, Nan Tang, Mourad Ouzzani, Yang Cao, Michael Stonebraker:
ANMAT: Automatic Knowledge Discovery and Error Detection through Pattern Functional Dependencies. SIGMOD Conference 2019: 1977-1980 - [c45]Nan Tang, Eugene Wu, Guoliang Li:
Towards Democratizing Relational Data Visualization. SIGMOD Conference 2019: 2025-2030 - [p1]Mourad Ouzzani, Nan Tang, Raul Castro Fernandez:
Data civilizer: end-to-end support for data discovery, integration, and cleaning. Making Databases Work 2019: 291-300 - [i9]John K. Feser, Samuel Madden, Nan Tang, Armando Solar-Lezama:
Deductive Optimization of Relational Data Storage. CoRR abs/1903.03229 (2019) - [i8]Ji Sun, Dong Deng, Ihab F. Ilyas, Guoliang Li, Samuel Madden, Mourad Ouzzani, Michael Stonebraker, Nan Tang:
Technical Report: Optimizing Human Involvement for Entity Matching and Consolidation. CoRR abs/1906.06574 (2019) - [i7]Sibo Wang, Renchi Yang, Runhui Wang, Xiaokui Xiao, Zhewei Wei, Wenqing Lin, Yin Yang, Nan Tang:
Efficient Algorithms for Approximate Single-Source Personalized PageRank Queries. CoRR abs/1908.10583 (2019) - [i6]Raul Castro Fernandez, Nan Tang, Mourad Ouzzani, Michael Stonebraker, Samuel Madden:
Dataset-On-Demand: Automatic View Search and Presentation for Data Discovery. CoRR abs/1911.11876 (2019) - 2018
- [j25]Xuedi Qin, Yuyu Luo, Nan Tang, Guoliang Li:
DeepEye: An automatic big data visualization framework. Big Data Min. Anal. 1(1): 75-82 (2018) - [j24]Divy Agrawal, Sanjay Chawla, Bertty Contreras-Rojas, Ahmed K. Elmagarmid, Yasser Idris, Zoi Kaoudi, Sebastian Kruse, Ji Lucas, Essam Mansour, Mourad Ouzzani, Paolo Papotti, Jorge-Arnulfo Quiané-Ruiz, Nan Tang, Saravanan Thirumuruganathan, Anis Troudi:
RHEEM: Enabling Cross-Platform Data Processing - May The Big Data Be With You! -. Proc. VLDB Endow. 11(11): 1414-1427 (2018) - [j23]Muhammad Ebraheem, Saravanan Thirumuruganathan, Shafiq R. Joty, Mourad Ouzzani, Nan Tang:
Distributed Representations of Tuples for Entity Resolution. Proc. VLDB Endow. 11(11): 1454-1467 (2018) - [j22]Shuang Hao, Nan Tang, Guoliang Li, Jian Li, Jianhua Feng:
Distilling relations using knowledge bases. VLDB J. 27(4): 497-519 (2018) - [c44]Xuedi Qin, Yuyu Luo, Nan Tang, Guoliang Li:
DeepEye: Visualizing Your Data by Keyword Search. EDBT 2018: 441-444 - [c43]Yuyu Luo, Xuedi Qin, Nan Tang, Guoliang Li:
DeepEye: Towards Automatic Data Visualization. ICDE 2018: 101-112 - [c42]Shuang Hao, Nan Tang, Guoliang Li, Jianhua Feng:
Discovering Mis-Categorized Entities. ICDE 2018: 413-424 - [c41]Raul Castro Fernandez, Essam Mansour, Abdulhakim Ali Qahtan, Ahmed K. Elmagarmid, Ihab F. Ilyas, Samuel Madden, Mourad Ouzzani, Michael Stonebraker, Nan Tang:
Seeping Semantics: Linking Datasets Using Word Embeddings for Data Discovery. ICDE 2018: 989-1000 - [c40]Essam Mansour, Dong Deng, Raul Castro Fernandez, Abdulhakim Ali Qahtan, Wenbo Tao, Ziawasch Abedjan, Ahmed K. Elmagarmid, Ihab F. Ilyas, Samuel Madden, Mourad Ouzzani, Michael Stonebraker, Nan Tang:
Building Data Civilizer Pipelines with an Advanced Workflow Engine. ICDE 2018: 1593-1596 - [c39]Shuang Hao, Yi Xu, Nan Tang, Guoliang Li, Jianhua Feng:
Cleaning Your Wrong Google Scholar Entries. ICDE 2018: 1597-1600 - [c38]Abdulhakim Ali Qahtan, Ahmed K. Elmagarmid, Mourad Ouzzani, Nan Tang:
FAHES: Detecting Disguised Missing Values. ICDE 2018: 1609-1612 - [c37]Abdulhakim Ali Qahtan, Ahmed K. Elmagarmid, Raul Castro Fernandez, Mourad Ouzzani, Nan Tang:
FAHES: A Robust Disguised Missing Values Detector. KDD 2018: 2100-2109 - [c36]Yuyu Luo, Xuedi Qin, Nan Tang, Guoliang Li, Xinran Wang:
DeepEye: Creating Good Data Visualizations by Keyword Search. SIGMOD Conference 2018: 1733-1736 - [i5]Saravanan Thirumuruganathan, Nan Tang, Mourad Ouzzani:
Data Curation with Deep Learning [Vision]: Towards Self Driving Data Curation. CoRR abs/1803.01384 (2018) - [i4]Saravanan Thirumuruganathan, Shameem Ahamed Puthiya Parambath, Mourad Ouzzani, Nan Tang, Shafiq R. Joty:
Reuse and Adaptation for Entity Resolution through Transfer Learning. CoRR abs/1809.11084 (2018) - 2017
- [j21]Jiannan Wang, Nan Tang:
Dependable Data Repairing with Fixing Rules. ACM J. Data Inf. Qual. 8(3-4): 16:1-16:34 (2017) - [j20]Zuhair Khayyat, William Lucia, Meghna Singh, Mourad Ouzzani, Paolo Papotti, Jorge-Arnulfo Quiané-Ruiz, Nan Tang, Panos Kalnis:
Errata for "Lightning Fast and Space Efficient Inequality Joins" (PVLDB 8(13): 2074-2085). Proc. VLDB Endow. 10(9): 985 (2017) - [j19]Rohit Singh, Venkata Vamsikrishna Meduri, Ahmed K. Elmagarmid, Samuel Madden, Paolo Papotti, Jorge-Arnulfo Quiané-Ruiz, Armando Solar-Lezama, Nan Tang:
Synthesizing Entity Matching Rules by Examples. Proc. VLDB Endow. 11(2): 189-202 (2017) - [j18]Shuang Hao, Nan Tang, Guoliang Li, Jian He, Na Ta, Jianhua Feng:
A Novel Cost-Based Model for Data Repairing. IEEE Trans. Knowl. Data Eng. 29(4): 727-742 (2017) - [j17]Zuhair Khayyat, William Lucia, Meghna Singh, Mourad Ouzzani, Paolo Papotti, Jorge-Arnulfo Quiané-Ruiz, Nan Tang, Panos Kalnis:
Fast and scalable inequality joins. VLDB J. 26(1): 125-150 (2017) - [c35]Dong Deng, Raul Castro Fernandez, Ziawasch Abedjan, Sibo Wang, Michael Stonebraker, Ahmed K. Elmagarmid, Ihab F. Ilyas, Samuel Madden, Mourad Ouzzani, Nan Tang:
The Data Civilizer System. CIDR 2017 - [c34]Shuang Hao, Nan Tang, Guoliang Li, Jian He, Na Ta, Jianhua Feng:
A Novel Cost-Based Model for Data Repairing. ICDE 2017: 49-50 - [c33]Shuang Hao, Nan Tang, Guoliang Li, Jian Li:
Cleaning Relations Using Knowledge Bases. ICDE 2017: 933-944 - [c32]Enzo Veltri, Donatello Santoro, Giansalvatore Mecca, Paolo Papotti, Jian He, Gouliang Li, Nan Tang:
Interactive Data Repairing: the FALCON Dive. SEBD 2017: 267 - [c31]Saravanan Thirumuruganathan, Laure Berti-Équille, Mourad Ouzzani, Jorge-Arnulfo Quiané-Ruiz, Nan Tang:
UGuide: User-Guided Discovery of FD-Detectable Errors. SIGMOD Conference 2017: 1385-1397 - [c30]Rohit Singh, Venkata Vamsikrishna Meduri, Ahmed K. Elmagarmid, Samuel Madden, Paolo Papotti, Jorge-Arnulfo Quiané-Ruiz, Armando Solar-Lezama, Nan Tang:
Generating Concise Entity Matching Rules. SIGMOD Conference 2017: 1635-1638 - [c29]Raul Castro Fernandez, Dong Deng, Essam Mansour, Abdulhakim Ali Qahtan, Wenbo Tao, Ziawasch Abedjan, Ahmed K. Elmagarmid, Ihab F. Ilyas, Samuel Madden, Mourad Ouzzani, Michael Stonebraker, Nan Tang:
A Demo of the Data Civilizer System. SIGMOD Conference 2017: 1639-1642 - [i3]Dong Deng, Wenbo Tao, Ziawasch Abedjan, Ahmed K. Elmagarmid, Ihab F. Ilyas, Samuel Madden, Mourad Ouzzani, Michael Stonebraker, Nan Tang:
Entity Consolidation: The Golden Record Problem. CoRR abs/1709.10436 (2017) - [i2]Muhammad Ebraheem, Saravanan Thirumuruganathan, Shafiq R. Joty, Mourad Ouzzani, Nan Tang:
DeepER - Deep Entity Resolution. CoRR abs/1710.00597 (2017) - 2016
- [j16]Ziawasch Abedjan, Xu Chu, Dong Deng, Raul Castro Fernandez, Ihab F. Ilyas, Mourad Ouzzani, Paolo Papotti, Michael Stonebraker, Nan Tang:
Detecting Data Errors: Where are we and what needs to be done? Proc. VLDB Endow. 9(12): 993-1004 (2016) - [c28]Divy Agrawal, Sanjay Chawla, Ahmed K. Elmagarmid, Zoi Kaoudi, Mourad Ouzzani, Paolo Papotti, Jorge-Arnulfo Quiané-Ruiz, Nan Tang, Mohammed J. Zaki:
Road to Freedom in Big Data Analytics. EDBT 2016: 479-484 - [c27]Jian He, Enzo Veltri, Donatello Santoro, Guoliang Li, Giansalvatore Mecca, Paolo Papotti, Nan Tang:
Interactive and Deterministic Data Cleaning. SIGMOD Conference 2016: 893-907 - [c26]Nan Tang, Qing Chen, Prasenjit Mitra:
Graph Stream Summarization: From Big Bang to Big Crunch. SIGMOD Conference 2016: 1481-1496 - [c25]Divy Agrawal, Mouhamadou Lamine Ba, Laure Berti-Équille, Sanjay Chawla, Ahmed K. Elmagarmid, Hossam Hammady, Yasser Idris, Zoi Kaoudi, Zuhair Khayyat, Sebastian Kruse, Mourad Ouzzani, Paolo Papotti, Jorge-Arnulfo Quiané-Ruiz, Nan Tang, Mohammed J. Zaki:
Rheem: Enabling Multi-Platform Task Execution. SIGMOD Conference 2016: 2069-2072 - 2015
- [j15]Xu Chu, Mourad Ouzzani, John Morcos, Ihab F. Ilyas, Paolo Papotti, Nan Tang, Yin Ye:
KATARA: Reliable Data Cleaning with Knowledge Bases and Crowdsourcing. Proc. VLDB Endow. 8(12): 1952-1955 (2015) - [j14]Zuhair Khayyat, William Lucia, Meghna Singh, Mourad Ouzzani, Paolo Papotti, Jorge-Arnulfo Quiané-Ruiz, Nan Tang, Panos Kalnis:
Lightning Fast and Space Efficient Inequality Joins. Proc. VLDB Endow. 8(13): 2074-2085 (2015) - [c24]Matteo Interlandi, Nan Tang:
Proof positive and negative in data cleaning. ICDE 2015: 18-29 - [c23]Nan Tang:
Big RDF data cleaning. ICDE Workshops 2015: 77-79 - [c22]Zuhair Khayyat, Ihab F. Ilyas, Alekh Jindal, Samuel Madden, Mourad Ouzzani, Paolo Papotti, Jorge-Arnulfo Quiané-Ruiz, Nan Tang, Si Yin:
BigDansing: A System for Big Data Cleansing. SIGMOD Conference 2015: 1215-1230 - [c21]Xu Chu, John Morcos, Ihab F. Ilyas, Mourad Ouzzani, Paolo Papotti, Nan Tang, Yin Ye:
KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing. SIGMOD Conference 2015: 1247-1261 - [i1]Nan Tang, Qing Chen, Prasenjit Mitra:
On Summarizing Graph Streams. CoRR abs/1510.02219 (2015) - 2014
- [j13]Wenfei Fan, Shuai Ma, Nan Tang, Wenyuan Yu:
Interaction between Record Matching and Data Repairing. ACM J. Data Inf. Qual. 4(4): 16:1-16:38 (2014) - [j12]Wenfei Fan, Floris Geerts, Nan Tang, Wenyuan Yu:
Conflict resolution with data currency and consistency. ACM J. Data Inf. Qual. 5(1-2): 6:1-6:37 (2014) - [j11]Wenfei Fan, Jianzhong Li, Nan Tang, Wenyuan Yu:
Incremental Detection of Inconsistencies in Distributed Data. IEEE Trans. Knowl. Data Eng. 26(6): 1367-1383 (2014) - [c20]Nan Tang:
Big Data Cleaning. APWeb 2014: 13-24 - [c19]Jiannan Wang, Nan Tang:
Towards dependable data repairing with fixing rules. SIGMOD Conference 2014: 457-468 - [c18]Ahmed K. Elmagarmid, Ihab F. Ilyas, Mourad Ouzzani, Jorge-Arnulfo Quiané-Ruiz, Nan Tang, Si Yin:
NADEEF/ER: generic and interactive entity resolution. SIGMOD Conference 2014: 1071-1074 - 2013
- [j10]Amr Ebaid, Ahmed K. Elmagarmid, Ihab F. Ilyas, Mourad Ouzzani, Jorge-Arnulfo Quiané-Ruiz, Nan Tang, Si Yin:
NADEEF: A Generalized Data Cleaning System. Proc. VLDB Endow. 6(12): 1218-1221 (2013) - [c17]Wenfei Fan, Floris Geerts, Shuai Ma, Nan Tang, Wenyuan Yu:
Data Quality Problems beyond Consistency and Deduplication. In Search of Elegance in the Theory and Practice of Computation 2013: 237-249 - [c16]Wenfei Fan, Floris Geerts, Nan Tang, Wenyuan Yu:
Inferring data currency and consistency for conflict resolution. ICDE 2013: 470-481 - [c15]Michele Dallachiesa, Amr Ebaid, Ahmed Eldawy, Ahmed K. Elmagarmid, Ihab F. Ilyas, Mourad Ouzzani, Nan Tang:
NADEEF: a commodity data cleaning system. SIGMOD Conference 2013: 541-552 - 2012
- [j9]Wenfei Fan, Jianzhong Li, Shuai Ma, Nan Tang, Yinghui Wu:
Adding regular expressions to graph reachability and pattern queries. Frontiers Comput. Sci. 6(3): 313-338 (2012) - [j8]George Beskales, Gautam Das, Ahmed K. Elmagarmid, Ihab F. Ilyas, Felix Naumann, Mourad Ouzzani, Paolo Papotti, Jorge-Arnulfo Quiané-Ruiz, Nan Tang:
The data analytics group at the qatar computing research institute. SIGMOD Rec. 41(4): 33-38 (2012) - [j7]Wenfei Fan, Jianzhong Li, Shuai Ma, Nan Tang, Wenyuan Yu:
Towards certain fixes with editing rules and master data. VLDB J. 21(2): 213-238 (2012) - [c14]Wenfei Fan, Jianzhong Li, Nan Tang, Wenyuan Yu:
Incremental Detection of Inconsistencies in Distributed Data. ICDE 2012: 318-329 - 2011
- [j6]Wenfei Fan, Jianzhong Li, Shuai Ma, Nan Tang, Wenyuan Yu:
CerFix: A System for Cleaning Data with Certain Fixes. Proc. VLDB Endow. 4(12): 1375-1378 (2011) - [c13]Wenfei Fan, Jianzhong Li, Shuai Ma, Nan Tang, Yinghui Wu:
Adding regular expressions to graph reachability and pattern queries. ICDE 2011: 39-50 - [c12]Wenfei Fan, Jianzhong Li, Shuai Ma, Nan Tang, Wenyuan Yu:
Interaction between record matching and data repairing. SIGMOD Conference 2011: 469-480 - 2010
- [j5]Wenfei Fan, Jianzhong Li, Shuai Ma, Nan Tang, Wenyuan Yu:
Towards Certain Fixes with Editing Rules and Master Data. Proc. VLDB Endow. 3(1): 173-184 (2010) - [j4]Wenfei Fan, Jianzhong Li, Shuai Ma, Nan Tang, Yinghui Wu, Yunpeng Wu:
Graph Pattern Matching: From Intractable to Polynomial Time. Proc. VLDB Endow. 3(1): 264-275 (2010) - [j3]Ying Zhang, Nan Tang, Peter A. Boncz:
Projective Distribution of XQuery with Updates. IEEE Trans. Knowl. Data Eng. 22(8): 1059-1076 (2010)
2000 – 2009
- 2009
- [c11]Nan Tang, Lefteris Sidirourgos, Peter A. Boncz:
Space-economical partial gram indices for exact substring matching. CIKM 2009: 285-294 - [c10]Nan Tang, Jeffrey Xu Yu, Hao Tang, M. Tamer Özsu, Peter A. Boncz:
Materialized View Selection in XML Databases. DASFAA 2009: 616-630 - [c9]Ying Zhang, Nan Tang, Peter A. Boncz:
Efficient Distribution of Full-Fledged XQuery. ICDE 2009: 565-576 - 2008
- [j2]Nan Tang, Jeffrey Xu Yu, Kam-Fai Wong, Jianxin Li:
Fast XML Structural Join Algorithms by Partitioning. J. Res. Pract. Inf. Technol. 40(1): 33-54 (2008) - [c8]Nan Tang, Jeffrey Xu Yu, M. Tamer Özsu, Byron Choi, Kam-Fai Wong:
Multiple Materialized View Selection for XPath Query Rewriting. ICDE 2008: 873-882 - [c7]Nan Tang, Jeffrey Xu Yu, M. Tamer Özsu, Kam-Fai Wong:
Hierarchical Indexing Approach to Support XPath Queries. ICDE 2008: 1510-1512 - 2007
- [b1]Nan Tang:
Efficient Xpath query processing in native XML databases. Chinese University of Hong Kong, Hong Kong, 2007 - 2006
- [j1]Kam-Fai Wong, Jeffrey Xu Yu, Nan Tang:
Answering XML Queries Using Path-Based Indexes: A Survey. World Wide Web 9(3): 277-299 (2006) - [c6]Jiefeng Cheng, Jeffrey Xu Yu, Nan Tang:
Fast Reachability Query Processing. DASFAA 2006: 674-688 - [c5]Nan Tang, Jeffrey Xu Yu, Kam-Fai Wong, Haifeng Jiang:
Fast Structural Join with a Location Function. DASFAA 2006: 777-786 - 2005
- [c4]Nan Tang, Jeffrey Xu Yu, Kam-Fai Wong, Kevin Lü, Jianxin Li:
Accelerating XML Structural Join by Partitioning. DEXA 2005: 280-289 - [c3]Nan Tang, Guoren Wang, Jeffrey Xu Yu, Kam-Fai Wong, Ge Yu:
WIN: An Effcient Data Placement Strategy for Parallel XML Databases. ICPADS (1) 2005: 349-355 - 2004
- [c2]Bing Sun, Bo Zhou, Nan Tang, Guoren Wang, Ge Yu, Fulin Jia:
Answering XML Twig Queries with Automata. APWeb 2004: 170-179 - 2003
- [c1]Yaxin Yu, Guoren Wang, Ge Yu, Gang Wu, Junan Hu, Nan Tang:
Data Placement and Query Processing Based on RPE Parallelisms. COMPSAC 2003: 151-
Coauthor Index
aka: Sam Madden
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-07 21:26 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint