Nan Tang 0001
汤南
Person information
- unicode name: 汤南
- affiliation: Hong Kong University of Science and Technology Guangzhou (HKUST-GZ), Information Hub, Guangzhou, China
- affiliation (former): Hamad Bin Khalifa University, Qatar Computing Research Institute, Doha, Qatar
- affiliation (former): University of Edinburgh, UK
- affiliation (former): Centrum Wiskunde & Informatica, Amsterdam, The Netherlands
- affiliation (former, PhD 2007): Chinese University of Hong Kong, Hong Kong
Other persons with the same name
- Nan Tang — disambiguation page
- Nan Tang 0002 — Dalian University of Technology, Dalian, China
- Nan Tang 0003 — University of Arkansas, USA
- Nan Tang 0004 — Xi'an Shiyou University, Xian, China
2020 – today
- 2024
- [j68]Tongyu Liu, Ju Fan, Nan Tang, Guoliang Li, Xiaoyong Du:
Controllable Tabular Data Synthesis Using Diffusion Models. Proc. ACM Manag. Data 2(1): 28:1-28:29 (2024) - [j67]Yuhao Deng, Chengliang Chai, Lei Cao, Nan Tang, Jiayi Wang, Ju Fan, Ye Yuan, Guoren Wang:
MisDetect: Iterative Mislabel Detection using Early Loss. Proc. VLDB Endow. 17(6): 1159-1172 (2024) - [j66]Yuhao Deng, Chengliang Chai, Lei Cao, Qin Yuan, Siyuan Chen, Yanrui Yu, Zhaoze Sun, Junyi Wang, Jiajun Li, Ziqi Cao, Kaisen Jin, Chi Zhang, Yuqing Jiang, Yuanfang Zhang, Yuping Wang, Ye Yuan, Guoren Wang, Nan Tang:
LakeBench: A Benchmark for Discovering Joinable and Unionable Tables in Data Lakes. Proc. VLDB Endow. 17(8): 1925-1938 (2024) - [j65]Ju Fan, Zihui Gu, Songyue Zhang, Yuxin Zhang, Zui Chen, Lei Cao, Guoliang Li, Samuel Madden, Xiaoyong Du, Nan Tang:
Combining Small Language Models and Large Language Models for Zero-Shot NL2SQL. Proc. VLDB Endow. 17(11): 2750-2763 (2024) - [j64]Yushi Sun, Xin Hao, Kai Sun, Yifan Xu, Xiao Yang, Xin Luna Dong, Nan Tang, Lei Chen:
Are Large Language Models a Good Replacement of Taxonomies? Proc. VLDB Endow. 17(11): 2919-2932 (2024) - [j63]Yupeng Xie, Yuyu Luo, Guoliang Li, Nan Tang:
HAIChart: Human and AI Paired Visualization System. Proc. VLDB Endow. 17(11): 3178-3191 (2024) - [j62]Boyan Li, Yuyu Luo, Chengliang Chai, Guoliang Li, Nan Tang:
The Dawn of Natural Language to SQL: Are We Fully Ready? [Experiment, Analysis & Benchmark]. Proc. VLDB Endow. 17(11): 3318-3331 (2024) - [j61]Chengliang Chai, Yuhao Deng, Yutong Zhan, Ziqi Cao, Yuanfang Zhang, Lei Cao, Yu-Ping Wang, Zhiwei Zhang, Ye Yuan, Guoren Wang, Nan Tang:
LakeCompass: An End-to-End System for Table Maintenance, Search and Analysis in Data Lakes. Proc. VLDB Endow. 17(12): 4381-4384 (2024) - [j60]Mohamed Y. Eltabakh, Zan Ahmad Naeem, Mohammad Shahmeer Ahmad, Mourad Ouzzani, Nan Tang:
RetClean: Retrieval-Based Tabular Data Cleaning Using LLMs and Data Lakes. Proc. VLDB Endow. 17(12): 4421-4424 (2024) - [j59]Ju Fan, Jianhong Tu, Guoliang Li, Peng Wang, Xiaoyong Du, Xiaofeng Jia, Song Gao, Nan Tang:
Unicorn: A Unified Multi-Tasking Matching Model. SIGMOD Rec. 53(1): 44-53 (2024) - [j58]Tongyu Liu, Ju Fan, Guoliang Li, Nan Tang, Xiaoyong Du:
Tabular data synthesis with generative adversarial networks: design space and optimizations. VLDB J. 33(2): 255-280 (2024) - [c72]Nan Tang, Chenyu Yang, Ju Fan, Lei Cao, Yuyu Luo, Alon Y. Halevy:
VerifAI: Verified Generative AI. CIDR 2024 - [c71]Chengliang Chai, Kaisen Jin, Nan Tang, Ju Fan, Lianpeng Qiao, Yuping Wang, Yuyu Luo, Ye Yuan, Guoren Wang:
Mitigating Data Scarcity in Supervised Machine Learning Through Reinforcement Learning Guided Data Generation. ICDE 2024: 3613-3626 - [c70]Meihao Fan, Xiaoyue Han, Ju Fan, Chengliang Chai, Nan Tang, Guoliang Li, Xiaoyong Du:
Cost-Effective In-Context Learning for Entity Resolution: A Design Space Exploration. ICDE 2024: 3696-3709 - [c69]Sibei Chen, Hanbing Liu, Waiting Jin, Xiangyu Sun, Xiaoyao Feng, Ju Fan, Xiaoyong Du, Nan Tang:
ChatPipe: Orchestrating Data Preparation Pipelines by Optimizing Human-ChatGPT Interactions. SIGMOD Conference Companion 2024: 484-487 - [c68]Yuhao Deng, Deng Qiyan, Chengliang Chai, Lei Cao, Nan Tang, Ju Fan, Jiayi Wang, Ye Yuan, Guoren Wang:
IDE: A System for Iterative Mislabel Detection. SIGMOD Conference Companion 2024: 500-503 - [i23]Boyan Li, Yuyu Luo, Chengliang Chai, Guoliang Li, Nan Tang:
The Dawn of Natural Language to SQL: Are We Fully Ready? CoRR abs/2406.01265 (2024) - [i22]Xiao Yang, Kai Sun, Hao Xin, Yushi Sun, Nikita Bhalla, Xiangsen Chen, Sajal Choudhary, Rongze Daniel Gui, Ziran Will Jiang, Ziyu Jiang, Lingkun Kong, Brian Moran, Jiaqi Wang, Yifan Ethan Xu, An Yan, Chenyu Yang, Eting Yuan, Hanwen Zha, Nan Tang, Lei Chen, Nicolas Scheffer, Yue Liu, Nirav Shah, Rakesh Wanga, Anuj Kumar, Wen-tau Yih, Xin Luna Dong:
CRAG - Comprehensive RAG Benchmark. CoRR abs/2406.04744 (2024) - [i21]Yizhang Zhu, Shiyin Du, Boyan Li, Yuyu Luo, Nan Tang:
Are Large Language Models Good Statisticians? CoRR abs/2406.07815 (2024) - [i20]Yupeng Xie, Yuyu Luo, Guoliang Li, Nan Tang:
HAIChart: Human and AI Paired Visualization System. CoRR abs/2406.11033 (2024) - [i19]Yushi Sun, Hao Xin, Kai Sun, Yifan Ethan Xu, Xiao Yang, Xin Luna Dong, Nan Tang, Lei Chen:
Are Large Language Models a Good Replacement of Taxonomies? CoRR abs/2406.11131 (2024) - [i18]Xinyu Liu, Shuyu Shen, Boyan Li, Peixian Ma, Runzhi Jiang, Yuyu Luo, Yuxin Zhang, Ju Fan, Guoliang Li, Nan Tang:
A Survey of NL2SQL with Large Language Models: Where are we, and where are we going? CoRR abs/2408.05109 (2024) - 2023
- [j57]Jianhong Tu, Ju Fan, Nan Tang, Peng Wang, Guoliang Li, Xiaoyong Du, Xiaofeng Jia, Song Gao:
Unicorn: A Unified Multi-tasking Model for Supporting Matching Tasks in Data Integration. Proc. ACM Manag. Data 1(1): 84:1-84:26 (2023) - [j56]Yuyu Luo, Yihui Zhou, Nan Tang, Guoliang Li, Chengliang Chai, Leixian Shen:
Learned Data-aware Image Representations of Line Charts for Similarity Search. Proc. ACM Manag. Data 1(1): 88:1-88:29 (2023) - [j55]Sibei Chen, Nan Tang, Ju Fan, Xuemi Yan, Chengliang Chai, Guoliang Li, Xiaoyong Du:
HAIPipe: Combining Human-generated and Machine-generated Pipelines for Data Preparation. Proc. ACM Manag. Data 1(1): 91:1-91:26 (2023) - [j54]Zihui Gu, Ju Fan, Nan Tang, Lei Cao, Bowen Jia, Sam Madden, Xiaoyong Du:
Few-shot Text-to-SQL Translation using Structure and Content Prompt Learning. Proc. ACM Manag. Data 1(2): 147:1-147:28 (2023) - [j53]Chengliang Chai, Jiabin Liu, Nan Tang, Ju Fan, Dongjing Miao, Jiayi Wang, Yuyu Luo, Guoliang Li:
GoodCore: Data-effective and Data-efficient Machine Learning through Coreset Selection over Incomplete Data. Proc. ACM Manag. Data 1(2): 157:1-157:27 (2023) - [j52]Yong Wang, Kaiyu Li, Guoliang Li, Nan Tang:
Road-Aware Indexing for Trajectory Range Queries. IEEE Trans. Knowl. Data Eng. 35(8): 8476-8489 (2023) - [j51]Shuang Hao, Chengliang Chai, Guoliang Li, Nan Tang, Ning Wang, Xiang Yu:
HOFD: An Outdated Fact Detector for Knowledge Bases. IEEE Trans. Knowl. Data Eng. 35(10): 10775-10789 (2023) - [c67]Zui Chen, Zihui Gu, Lei Cao, Ju Fan, Samuel Madden, Nan Tang:
Symphony: Towards Natural Language Query Answering over Multi-modal Data Lakes. CIDR 2023 - [c66]Chengliang Chai, Jiayi Wang, Nan Tang, Ye Yuan, Jiabin Liu, Yuhao Deng, Guoren Wang:
Efficient Coreset Selection with Cluster-based Methods. KDD 2023: 167-178 - [c65]Chengliang Chai, Nan Tang, Ju Fan, Yuyu Luo:
Demystifying Artificial Intelligence for Data Preparation. SIGMOD Conference Companion 2023: 13-20 - [c64]Chenyu Yang, Ruixue Fan, Nan Tang, Meihui Zhang, Xiaoman Zhao, Ju Fan, Xiaoyong Du:
Pay "Attention" to Chart Images for What You Read on Text. SIGMOD Conference Companion 2023: 111-114 - [i17]Mohammad Shahmeer Ahmad, Zan Ahmad Naeem, Mohamed Y. Eltabakh, Mourad Ouzzani, Nan Tang:
RetClean: Retrieval-Based Data Cleaning Using Foundation Models and Data Lakes. CoRR abs/2303.16909 (2023) - [i16]Sibei Chen, Hanbing Liu, Waiting Jin, Xiangyu Sun, Xiaoyao Feng, Ju Fan, Xiaoyong Du, Nan Tang:
ChatPipe: Orchestrating Data Preparation Program by Optimizing Human-ChatGPT Interactions. CoRR abs/2304.03540 (2023) - [i15]Zihui Gu, Ju Fan, Nan Tang, Songyue Zhang, Yuxin Zhang, Zui Chen, Lei Cao, Guoliang Li, Sam Madden, Xiaoyong Du:
Interleaving Pre-Trained Language Models and Large Language Models for Zero-Shot NL2SQL Generation. CoRR abs/2306.08891 (2023) - [i14]Nan Tang, Chenyu Yang, Ju Fan, Lei Cao:
VerifAI: Verified Generative AI. CoRR abs/2307.02796 (2023) - [i13]Zui Chen, Lei Cao, Sam Madden, Ju Fan, Nan Tang, Zihui Gu, Zeyuan Shang, Chunwei Liu, Michael J. Cafarella, Tim Kraska:
SEED: Simple, Efficient, and Effective Data Management via Large Language Models. CoRR abs/2310.00749 (2023) - [i12]Meihao Fan, Xiaoyue Han, Ju Fan, Chengliang Chai, Nan Tang, Guoliang Li, Xiaoyong Du:
Cost-Effective In-Context Learning for Entity Resolution: A Design Space Exploration. CoRR abs/2312.03987 (2023) - 2022
- [j50]Xiang Yu, Chengliang Chai, Xinning Zhang, Nan Tang, Ji Sun, Guoliang Li:
AlphaQO: Robust Learned Query Optimizer. Int. J. Softw. Informatics 12(1): 7-29 (2022) - [j49]Guoliang Li, Nan Tang, Chengliang Chai:
Preface. J. Comput. Sci. Technol. 37(5): 1003-1004 (2022) - [j48]Chengliang Chai, Jiabin Liu, Nan Tang, Guoliang Li, Yuyu Luo:
Selective Data Acquisition in the Wild for Model Charging. Proc. VLDB Endow. 15(7): 1466-1478 (2022) - [j47]Jianhong Tu, Xiaoyue Han, Ju Fan, Nan Tang, Chengliang Chai, Guoliang Li, Xiaoyong Du:
DADER: Hands-Off Entity Resolution with Domain Adaptation. Proc. VLDB Endow. 15(12): 3666-3669 (2022) - [j46]Jiayi Wang, Chengliang Chai, Nan Tang, Jiabin Liu, Guoliang Li:
Coresets over Multiple Tables for Feature-rich and Data-efficient Machine Learning. Proc. VLDB Endow. 16(1): 64-76 (2022) - [j45]Jinfeng Peng, Derong Shen, Nan Tang, Tieying Liu, Yue Kou, Tiezheng Nie, Hang Cui, Ge Yu:
Self-supervised and Interpretable Data Cleaning with Sequence Generative Adversarial Networks. Proc. VLDB Endow. 16(3): 433-446 (2022) - [j44]Yuyu Luo, Xuedi Qin, Chengliang Chai, Nan Tang, Guoliang Li, Wenbo Li:
Steerable Self-Driving Data Visualization. IEEE Trans. Knowl. Data Eng. 34(1): 475-490 (2022) - [j43]Yuyu Luo, Nan Tang, Guoliang Li, Jiawei Tang, Chengliang Chai, Xuedi Qin:
Natural Language to Visualization by Neural Machine Translation. IEEE Trans. Vis. Comput. Graph. 28(1): 217-226 (2022) - [j42]Xuedi Qin, Chengliang Chai, Yuyu Luo, Tianyu Zhao, Nan Tang, Guoliang Li, Jianhua Feng, Xiang Yu, Mourad Ouzzani:
Interactively discovering and ranking desired tuples by data exploration. VLDB J. 31(4): 753-777 (2022) - [c63]Zihui Gu, Ju Fan, Nan Tang, Preslav Nakov, Xiaoman Zhao, Xiaoyong Du:
PASTA: Table-Operations Aware Fact Verification via Sentence-Table Cloze Pre-training. EMNLP 2022: 4971-4983 - [c62]Xuedi Qin, Chengliang Chai, Nan Tang, Jian Li, Yuyu Luo, Guoliang Li, Yaoyu Zhu:
Synthesizing Privacy Preserving Entity Resolution Datasets. ICDE 2022: 2359-2371 - [c61]Jiabin Liu, Chengliang Chai, Yuyu Luo, Yin Lou, Jianhua Feng, Nan Tang:
Feature Augmentation with Reinforcement Learning. ICDE 2022: 3360-3372 - [c60]Jianhong Tu, Ju Fan, Nan Tang, Peng Wang, Chengliang Chai, Guoliang Li, Ruixue Fan, Xiaoyong Du:
Domain Adaptation for Deep Entity Resolution. SIGMOD Conference 2022: 443-457 - [i11]Zihui Gu, Ju Fan, Nan Tang, Preslav Nakov, Xiaoman Zhao, Xiaoyong Du:
PASTA: Table-Operations Aware Fact Verification via Sentence-Table Cloze Pre-training. CoRR abs/2211.02816 (2022) - 2021
- [j41]Tongyu Liu, Ju Fan, Yinqing Luo, Nan Tang, Guoliang Li, Xiaoyong Du:
Adaptive Data Augmentation for Supervised Learning over Missing Data. Proc. VLDB Endow. 14(7): 1202-1214 (2021) - [j40]Nan Tang, Ju Fan, Fangyi Li, Jianhong Tu, Xiaoyong Du, Guoliang Li, Samuel Madden, Mourad Ouzzani:
RPT: Relational Pre-trained Transformer Is Almost All You Need towards Democratizing Data Preparation. Proc. VLDB Endow. 14(8): 1254-1261 (2021) - [j39]Saravanan Thirumuruganathan, Han Li, Nan Tang, Mourad Ouzzani, Yash Govind, Derek Paulsen, Glenn Fung, AnHai Doan:
Deep Learning for Blocking in Entity Matching: A Design Space Exploration. Proc. VLDB Endow. 14(11): 2459-2472 (2021) - [j38]Jiabin Liu, Fu Zhu, Chengliang Chai, Yuyu Luo, Nan Tang:
Automatic Data Acquisition for Deep Learning. Proc. VLDB Endow. 14(12): 2739-2742 (2021) - [j37]Ji Sun, Jintao Zhang, Zhaoyan Sun, Guoliang Li, Nan Tang:
Learned Cardinality Estimation: A Design Space Exploration and A Comparative Evaluation. Proc. VLDB Endow. 15(1): 85-97 (2021) - [j36]Shuang Hao, Nan Tang, Guoliang Li, Jianhua Feng, Ning Wang:
Mis-categorized entities detection. VLDB J. 30(4): 515-536 (2021) - [c59]Xuedi Qin, Chengliang Chai, Yuyu Luo, Tianyu Zhao, Nan Tang, Guoliang Li, Jianhua Feng, Xiang Yu, Mourad Ouzzani:
Ranking Desired Tuples by Database Exploration. ICDE 2021: 1973-1978 - [c58]Yuyu Luo, Nan Tang, Guoliang Li, Chengliang Chai, Wenbo Li, Xuedi Qin:
Synthesizing Natural Language to Visualization (NL2VIS) Benchmarks from NL2SQL Benchmarks. SIGMOD Conference 2021: 1235-1247 - [c57]Ji Sun, Guoliang Li, Nan Tang:
Learned Cardinality Estimation for Similarity Queries. SIGMOD Conference 2021: 1745-1757 - 2020
- [j35]Yuyu Luo, Nan Tang, Guoliang Li, Wenbo Li, Tianyu Zhao, Xiang Yu:
DeepEye: A Data Science System for Monitoring and Exploring COVID-19 Data. IEEE Data Eng. Bull. 43(2): 121-132 (2020) - [j34]John K. Feser, Sam Madden, Nan Tang, Armando Solar-Lezama:
Deductive optimization of relational data storage. Proc. ACM Program. Lang. 4(OOPSLA): 170:1-170:30 (2020) - [j33]Abdulhakim Ali Qahtan, Nan Tang, Mourad Ouzzani, Yang Cao, Michael Stonebraker:
Pattern Functional Dependencies for Data Cleaning. Proc. VLDB Endow. 13(5): 684-697 (2020) - [j32]Yuyu Luo, Chengliang Chai, Xuedi Qin, Nan Tang, Guoliang Li:
VisClean: Interactive Cleaning for Progressive Visualization. Proc. VLDB Endow. 13(12): 2821-2824 (2020) - [j31]Yuyu Luo, Wenbo Li, Tianyu Zhao, Xiang Yu, Lixi Zhang, Guoliang Li, Nan Tang:
DeepTrack: Monitoring and Exploring Spatio-Temporal Data - A Case of Tracking COVID-19 -. Proc. VLDB Endow. 13(12): 2841-2844 (2020) - [j30]El Kindi Rezig, Ashrita Brahmaroutu, Nesime Tatbul, Mourad Ouzzani, Nan Tang, Timothy G. Mattson, Samuel Madden, Michael Stonebraker:
Debugging Large-Scale Data Science Pipelines using Dagger. Proc. VLDB Endow. 13(12): 2993-2996 (2020) - [j29]Xuedi Qin, Yuyu Luo, Nan Tang, Guoliang Li:
Making data visualization more efficient and effective: a survey. VLDB J. 29(1): 93-117 (2020) - [c56]El Kindi Rezig, Lei Cao, Giovanni Simonini, Maxime Schoemans, Samuel Madden, Nan Tang, Mourad Ouzzani, Michael Stonebraker:
Dagger: A Data (not code) Debugger. CIDR 2020 - [c55]Saravanan Thirumuruganathan, Nan Tang, Mourad Ouzzani, AnHai Doan:
Data Curation with Deep Learning. EDBT 2020: 277-286 - [c54]Yuyu Luo, Chengliang Chai, Xuedi Qin, Nan Tang, Guoliang Li:
Interactive Cleaning for Progressive Visualization through Composite Questions. ICDE 2020: 733-744 - [c53]Xiang Yu, Guoliang Li, Chengliang Chai, Nan Tang:
Reinforcement Learning with Tree-LSTM for Join Order Selection. ICDE 2020: 1297-1308 - [c52]Shuang Hao, Chengliang Chai, Guoliang Li, Nan Tang, Ning Wang, Xiang Yu:
Outdated Fact Detection in Knowledge Bases. ICDE 2020: 1890-1893 - [c51]Xuedi Qin, Chengliang Chai, Yuyu Luo, Nan Tang, Guoliang Li:
Interactively Discovering and Ranking Desired Tuples without Writing SQL Queries. SIGMOD Conference 2020: 2745-2748 - [c50]Mashaal Musleh, Mourad Ouzzani, Nan Tang, AnHai Doan:
CoClean: Collaborative Data Cleaning. SIGMOD Conference 2020: 2757-2760 - [i10]Nan Tang, Ju Fan, Fangyi Li, Jianhong Tu, Xiaoyong Du, Guoliang Li, Sam Madden, Mourad Ouzzani:
Relational Pretrained Transformers towards Democratizing Data Preparation [Vision]. CoRR abs/2012.02469 (2020)
2010 – 2019
- 2019
- [j28]Yong Wang, Guoliang Li, Nan Tang:
Querying Shortest Paths on Time Dependent Road Networks. Proc. VLDB Endow. 12(11): 1249-1261 (2019) - [j27]El Kindi Rezig, Lei Cao, Michael Stonebraker, Giovanni Simonini, Wenbo Tao, Samuel Madden, Mourad Ouzzani, Nan Tang, Ahmed K. Elmagarmid:
Data Civilizer 2.0: A Holistic Framework for Data Preparation and Analytics. Proc. VLDB Endow. 12(12): 1954-1957 (2019) - [j26]Sibo Wang, Renchi Yang, Runhui Wang, Xiaokui Xiao, Zhewei Wei, Wenqing Lin, Yin Yang, Nan Tang:
Efficient Algorithms for Approximate Single-Source Personalized PageRank Queries. ACM Trans. Database Syst. 44(4): 18:1-18:37 (2019) - [c49]Dong Deng, Wenbo Tao, Ziawasch Abedjan, Ahmed K. Elmagarmid, Ihab F. Ilyas, Guoliang Li, Samuel Madden, Mourad Ouzzani, Michael Stonebraker, Nan Tang:
Unsupervised String Transformation Learning for Entity Consolidation. ICDE 2019: 196-207 - [c48]Saravanan Thirumuruganathan, Mourad Ouzzani, Nan Tang:
Explaining Entity Resolution Predictions: Where are we and What needs to be done? HILDA@SIGMOD 2019: 10:1-10:6 - [c47]Mohammad Mahdavi, Ziawasch Abedjan, Raul Castro Fernandez, Samuel Madden, Mourad Ouzzani, Michael Stonebraker, Nan Tang:
Raha: A Configuration-Free Error Detection System. SIGMOD Conference 2019: 865-882 - [c46]Abdulhakim Ali Qahtan, Nan Tang, Mourad Ouzzani, Yang Cao, Michael Stonebraker:
ANMAT: Automatic Knowledge Discovery and Error Detection through Pattern Functional Dependencies. SIGMOD Conference 2019: 1977-1980 - [c45]Nan Tang, Eugene Wu, Guoliang Li:
Towards Democratizing Relational Data Visualization. SIGMOD Conference 2019: 2025-2030 - [p1]Mourad Ouzzani, Nan Tang, Raul Castro Fernandez:
Data civilizer: end-to-end support for data discovery, integration, and cleaning. Making Databases Work 2019: 291-300 - [i9]John K. Feser, Samuel Madden, Nan Tang, Armando Solar-Lezama:
Deductive Optimization of Relational Data Storage. CoRR abs/1903.03229 (2019) - [i8]Ji Sun, Dong Deng, Ihab F. Ilyas, Guoliang Li, Samuel Madden, Mourad Ouzzani, Michael Stonebraker, Nan Tang:
Technical Report: Optimizing Human Involvement for Entity Matching and Consolidation. CoRR abs/1906.06574 (2019) - [i7]Sibo Wang, Renchi Yang, Runhui Wang, Xiaokui Xiao, Zhewei Wei, Wenqing Lin, Yin Yang, Nan Tang:
Efficient Algorithms for Approximate Single-Source Personalized PageRank Queries. CoRR abs/1908.10583 (2019) - [i6]Raul Castro Fernandez, Nan Tang, Mourad Ouzzani, Michael Stonebraker, Samuel Madden:
Dataset-On-Demand: Automatic View Search and Presentation for Data Discovery. CoRR abs/1911.11876 (2019) - 2018
- [j25]Xuedi Qin, Yuyu Luo, Nan Tang, Guoliang Li:
DeepEye: An automatic big data visualization framework. Big Data Min. Anal. 1(1): 75-82 (2018) - [j24]Divy Agrawal, Sanjay Chawla, Bertty Contreras-Rojas, Ahmed K. Elmagarmid, Yasser Idris, Zoi Kaoudi, Sebastian Kruse, Ji Lucas, Essam Mansour, Mourad Ouzzani, Paolo Papotti, Jorge-Arnulfo Quiané-Ruiz, Nan Tang, Saravanan Thirumuruganathan, Anis Troudi:
RHEEM: Enabling Cross-Platform Data Processing - May The Big Data Be With You! -. Proc. VLDB Endow. 11(11): 1414-1427 (2018) - [j23]Muhammad Ebraheem, Saravanan Thirumuruganathan, Shafiq R. Joty, Mourad Ouzzani, Nan Tang:
Distributed Representations of Tuples for Entity Resolution. Proc. VLDB Endow. 11(11): 1454-1467 (2018) - [j22]Shuang Hao, Nan Tang, Guoliang Li, Jian Li, Jianhua Feng:
Distilling relations using knowledge bases. VLDB J. 27(4): 497-519 (2018) - [c44]Xuedi Qin, Yuyu Luo, Nan Tang, Guoliang Li:
DeepEye: Visualizing Your Data by Keyword Search. EDBT 2018: 441-444 - [c43]Yuyu Luo, Xuedi Qin, Nan Tang, Guoliang Li:
DeepEye: Towards Automatic Data Visualization. ICDE 2018: 101-112 - [c42]Shuang Hao, Nan Tang, Guoliang Li, Jianhua Feng:
Discovering Mis-Categorized Entities. ICDE 2018: 413-424 - [c41]Raul Castro Fernandez, Essam Mansour, Abdulhakim Ali Qahtan, Ahmed K. Elmagarmid, Ihab F. Ilyas, Samuel Madden, Mourad Ouzzani, Michael Stonebraker, Nan Tang:
Seeping Semantics: Linking Datasets Using Word Embeddings for Data Discovery. ICDE 2018: 989-1000 - [c40]Essam Mansour, Dong Deng, Raul Castro Fernandez, Abdulhakim Ali Qahtan, Wenbo Tao, Ziawasch Abedjan, Ahmed K. Elmagarmid, Ihab F. Ilyas, Samuel Madden, Mourad Ouzzani, Michael Stonebraker, Nan Tang:
Building Data Civilizer Pipelines with an Advanced Workflow Engine. ICDE 2018: 1593-1596 - [c39]Shuang Hao, Yi Xu, Nan Tang, Guoliang Li, Jianhua Feng:
Cleaning Your Wrong Google Scholar Entries. ICDE 2018: 1597-1600 - [c38]Abdulhakim Ali Qahtan, Ahmed K. Elmagarmid, Mourad Ouzzani, Nan Tang:
FAHES: Detecting Disguised Missing Values. ICDE 2018: 1609-1612 - [c37]Abdulhakim Ali Qahtan, Ahmed K. Elmagarmid, Raul Castro Fernandez, Mourad Ouzzani, Nan Tang:
FAHES: A Robust Disguised Missing Values Detector. KDD 2018: 2100-2109 - [c36]Yuyu Luo, Xuedi Qin, Nan Tang, Guoliang Li, Xinran Wang:
DeepEye: Creating Good Data Visualizations by Keyword Search. SIGMOD Conference 2018: 1733-1736 - [i5]Saravanan Thirumuruganathan, Nan Tang, Mourad Ouzzani:
Data Curation with Deep Learning [Vision]: Towards Self Driving Data Curation. CoRR abs/1803.01384 (2018) - [i4]Saravanan Thirumuruganathan, Shameem Ahamed Puthiya Parambath, Mourad Ouzzani, Nan Tang, Shafiq R. Joty:
Reuse and Adaptation for Entity Resolution through Transfer Learning. CoRR abs/1809.11084 (2018) - 2017
- [j21]Jiannan Wang, Nan Tang:
Dependable Data Repairing with Fixing Rules. ACM J. Data Inf. Qual. 8(3-4): 16:1-16:34 (2017) - [j20]Zuhair Khayyat, William Lucia, Meghna Singh, Mourad Ouzzani, Paolo Papotti, Jorge-Arnulfo Quiané-Ruiz, Nan Tang, Panos Kalnis:
Errata for "Lightning Fast and Space Efficient Inequality Joins" (PVLDB 8(13): 2074-2085). Proc. VLDB Endow. 10(9): 985 (2017) - [j19]Rohit Singh, Venkata Vamsikrishna Meduri, Ahmed K. Elmagarmid, Samuel Madden, Paolo Papotti, Jorge-Arnulfo Quiané-Ruiz, Armando Solar-Lezama, Nan Tang:
Synthesizing Entity Matching Rules by Examples. Proc. VLDB Endow. 11(2): 189-202 (2017) - [j18]Shuang Hao, Nan Tang, Guoliang Li, Jian He, Na Ta, Jianhua Feng:
A Novel Cost-Based Model for Data Repairing. IEEE Trans. Knowl. Data Eng. 29(4): 727-742 (2017) - [j17]Zuhair Khayyat, William Lucia, Meghna Singh, Mourad Ouzzani, Paolo Papotti, Jorge-Arnulfo Quiané-Ruiz, Nan Tang, Panos Kalnis:
Fast and scalable inequality joins. VLDB J. 26(1): 125-150 (2017) - [c35]Dong Deng, Raul Castro Fernandez, Ziawasch Abedjan, Sibo Wang, Michael Stonebraker, Ahmed K. Elmagarmid, Ihab F. Ilyas, Samuel Madden, Mourad Ouzzani, Nan Tang:
The Data Civilizer System. CIDR 2017 - [c34]Shuang Hao, Nan Tang, Guoliang Li, Jian He, Na Ta, Jianhua Feng:
A Novel Cost-Based Model for Data Repairing. ICDE 2017: 49-50 - [c33]