Yeye He
2020 – today
- 2024
- [j19] Sibei Chen, Yeye He, Weiwei Cui, Ju Fan, Song Ge, Haidong Zhang, Dongmei Zhang, Surajit Chaudhuri:
Auto-Formula: Recommend Formulas in Spreadsheets using Contrastive Learning for Table Representations. Proc. ACM Manag. Data 2(3): 122 (2024)
- [j18] Peng Li, Yeye He, Dror Yashar, Weiwei Cui, Song Ge, Haidong Zhang, Danielle Rifinski Fainman, Dongmei Zhang, Surajit Chaudhuri:
Table-GPT: Table Fine-tuned GPT for Diverse Table Tasks. Proc. ACM Manag. Data 2(3): 176 (2024)
- [j17] Peng Li, Yeye He, Cong Yan, Yue Wang, Surajit Chaudhuri:
Auto-Tables: Relationalize Tables without Using Examples. SIGMOD Rec. 53(1): 76-85 (2024)
- [c22] Haoyu Dong, Jianbo Zhao, Yuzhang Tian, Junyu Xiong, Mengyu Zhou, Yun Lin, José Cambronero, Yeye He, Shi Han, Dongmei Zhang:
Encoding Spreadsheets for Large Language Models. EMNLP 2024: 20728-20748
- [i15] Sibei Chen, Yeye He, Weiwei Cui, Ju Fan, Song Ge, Haidong Zhang, Dongmei Zhang, Surajit Chaudhuri:
Auto-Formula: Recommend Formulas in Spreadsheets using Contrastive Learning for Table Representations. CoRR abs/2404.12608 (2024)
- [i14] Shiyu Xia, Junyu Xiong, Haoyu Dong, Jianbo Zhao, Yuzhang Tian, Mengyu Zhou, Yeye He, Shi Han, Dongmei Zhang:
Vision Language Models for Spreadsheet Understanding: Challenges and Opportunities. CoRR abs/2405.16234 (2024)
- [i13] Yuzhang Tian, Jianbo Zhao, Haoyu Dong, Junyu Xiong, Shiyu Xia, Mengyu Zhou, Yun Lin, José Cambronero, Yeye He, Shi Han, Dongmei Zhang:
SpreadsheetLLM: Encoding Spreadsheets for Large Language Models. CoRR abs/2407.09025 (2024)
- 2023
- [j16] Renzhi Wu, Alexander Bendeck, Xu Chu, Yeye He:
Ground Truth Inference for Weakly Supervised Entity Matching. Proc. ACM Manag. Data 1(1): 32:1-32:28 (2023)
- [j15] Cong Yan, Yin Lin, Yeye He:
Predicate Pushdown for Data Science Pipelines. Proc. ACM Manag. Data 1(2): 136:1-136:28 (2023)
- [j14] Yiming Lin, Yeye He, Surajit Chaudhuri:
Auto-BI: Automatically Build BI-Models Leveraging Local Join Prediction and Global Schema Graph. Proc. VLDB Endow. 16(10): 2578-2590 (2023)
- [j13] Peng Li, Yeye He, Cong Yan, Yue Wang, Surajit Chaudhuri:
Auto-Tables: Synthesizing Multi-Step Transformations to Relationalize Tables without Using Examples. Proc. VLDB Endow. 16(11): 3391-3403 (2023)
- [c21] Dezhan Tu, Yeye He, Weiwei Cui, Song Ge, Haidong Zhang, Shi Han, Dongmei Zhang, Surajit Chaudhuri:
Auto-Validate by-History: Auto-Program Data Quality Constraints to Validate Recurring Data Pipelines. KDD 2023: 4991-5003
- [i12] Dezhan Tu, Yeye He, Weiwei Cui, Song Ge, Haidong Zhang, Shi Han, Dongmei Zhang, Surajit Chaudhuri:
Auto-Validate by-History: Auto-Program Data Quality Constraints to Validate Recurring Data Pipelines. CoRR abs/2306.02421 (2023)
- [i11] Yiming Lin, Yeye He, Surajit Chaudhuri:
Auto-BI: Automatically Build BI-Models Leveraging Local Join Prediction and Global Schema Graph. CoRR abs/2306.12515 (2023)
- [i10] Peng Li, Yeye He, Cong Yan, Yue Wang, Surajit Chaudhuri:
Auto-Tables: Synthesizing Multi-Step Transformations to Relationalize Tables without Using Examples. CoRR abs/2307.14565 (2023)
- [i9] Peng Li, Yeye He, Dror Yashar, Weiwei Cui, Song Ge, Haidong Zhang, Danielle Rifinski Fainman, Dongmei Zhang, Surajit Chaudhuri:
Table-GPT: Table-tuned GPT for Diverse Table Tasks. CoRR abs/2310.09263 (2023)
- 2022
- [j12] Yue Wang, Vivek R. Narasayya, Yeye He, Surajit Chaudhuri:
PACk: An Efficient Partition-based Distributed Agglomerative Hierarchical Clustering Algorithm for Deduplication. Proc. VLDB Endow. 15(6): 1132-1145 (2022)
- [i8] Renzhi Wu, Alexander Bendeck, Xu Chu, Yeye He:
Ground Truth Inference for Weakly Supervised Entity Matching. CoRR abs/2211.06975 (2022)
- 2021
- [j11] Junwen Yang, Yeye He, Surajit Chaudhuri:
Auto-Pipeline: Synthesize Data Pipelines By-Target Using Reinforcement Learning and Search. Proc. VLDB Endow. 14(11): 2563-2575 (2021)
- [j10] Renzhi Wu, Prem Sakala, Peng Li, Xu Chu, Yeye He:
Demonstration of Panda: A Weakly Supervised Entity Matching System. Proc. VLDB Endow. 14(12): 2735-2738 (2021)
- [c20] Peng Li, Xiang Cheng, Xu Chu, Yeye He, Surajit Chaudhuri:
Auto-FuzzyJoin: Auto-Program Fuzzy Similarity Joins Without Labeled Examples. SIGMOD Conference 2021: 1064-1076
- [c19] Jie Song, Yeye He:
Auto-Validate: Unsupervised Data Validation Using Data-Domain Patterns Inferred from Data Lakes. SIGMOD Conference 2021: 1678-1691
- [i7] Peng Li, Xiang Cheng, Xu Chu, Yeye He, Surajit Chaudhuri:
Auto-FuzzyJoin: Auto-Program Fuzzy Similarity Joins Without Labeled Examples. CoRR abs/2103.04489 (2021)
- [i6] Jie Song, Yeye He:
Auto-Validate: Unsupervised Data Validation Using Data-Domain Patterns Inferred from Data Lakes. CoRR abs/2104.04659 (2021)
- [i5] Renzhi Wu, Prem Sakala, Peng Li, Xu Chu, Yeye He:
Demonstration of Panda: A Weakly Supervised Entity Matching System. CoRR abs/2106.10821 (2021)
- [i4] Junwen Yang, Yeye He, Surajit Chaudhuri:
AutoPipeline: Synthesize Data Pipelines By-Target Using Reinforcement Learning and Search. CoRR abs/2106.13861 (2021)
- [i3] Yeye He, Jie Song, Yue Wang, Surajit Chaudhuri, Vishal Anil, Blake Lassiter, Yaron Goland, Gaurav Malhotra:
Auto-Tag: Tagging-Data-By-Example in Data Lakes. CoRR abs/2112.06049 (2021)
- 2020
- [j9] Yeye He, Zhongjun Jin, Surajit Chaudhuri:
Auto-Transform: Learning-to-Transform by Patterns. Proc. VLDB Endow. 13(11): 2368-2381 (2020)
- [c18] Cong Yan, Yeye He:
Auto-Suggest: Learning-to-Recommend Data Preparation Steps Using Data Science Notebooks. SIGMOD Conference 2020: 1539-1554
2010 – 2019
- 2019
- [c17] Pei Wang, Yeye He:
Uni-Detect: A Unified Approach to Automated Error Detection in Tables. SIGMOD Conference 2019: 811-828
- [c16] Chen Zhao, Yeye He:
Auto-EM: End-to-end Fuzzy Entity-Matching using Pre-trained Deep Models and Transfer Learning. WWW 2019: 2413-2424
- 2018
- [j8] Yeye He, Xu Chu, Kris Ganjam, Yudian Zheng, Vivek R. Narasayya, Surajit Chaudhuri:
Transform-Data-by-Example (TDE): An Extensible Search Engine for Data Transformations. Proc. VLDB Endow. 11(10): 1165-1177 (2018)
- [c15] Cong Yan, Yeye He:
Synthesizing Type-Detection Logic for Rich Semantic Data Types using Open-source Code. SIGMOD Conference 2018: 35-50
- [c14] Zhipeng Huang, Yeye He:
Auto-Detect: Data-Driven Error Detection in Tables. SIGMOD Conference 2018: 1377-1392
- [c13] Yeye He, Kris Ganjam, Kukjin Lee, Yue Wang, Vivek R. Narasayya, Surajit Chaudhuri, Xu Chu, Yudian Zheng:
Transform-Data-by-Example (TDE): Extensible Data Transformation in Excel. SIGMOD Conference 2018: 1785-1788
- 2017
- [j7] Erkang Zhu, Yeye He, Surajit Chaudhuri:
Auto-Join: Joining Tables by Leveraging Transformations. Proc. VLDB Endow. 10(10): 1034-1045 (2017)
- [c12] Keqian Li, Yeye He, Kris Ganjam:
Discovering Enterprise Concepts Using Spreadsheet Tables. KDD 2017: 1873-1882
- [c11] Yue Wang, Yeye He:
Synthesizing Mapping Relationships Using Table Corpus. SIGMOD Conference 2017: 1117-1132
- [i2] Yue Wang, Yeye He:
Synthesizing Mapping Relationships Using Table Corpus. CoRR abs/1705.09276 (2017)
- 2016
- [j6] Kaushik Chakrabarti, Surajit Chaudhuri, Zhimin Chen, Kris Ganjam, Yeye He:
Data services leveraging Bing's data assets. IEEE Data Eng. Bull. 39(3): 15-28 (2016)
- [c10] Yeye He, Kaushik Chakrabarti, Tao Cheng, Tomasz Tylenda:
Automatic Discovery of Attribute Synonyms Using Query Logs and Table Corpora. WWW 2016: 1429-1439
- 2015
- [j5] Yeye He, Kris Ganjam, Xu Chu:
SEMA-JOIN: Joining Semantically-Related Tables Using Big Table Corpora. Proc. VLDB Endow. 8(12): 1358-1369 (2015)
- [j4] Eli Cortez, Philip A. Bernstein, Yeye He, Lev Novik:
Annotating Database Schemas to Help Enterprise Search. Proc. VLDB Endow. 8(12): 1936-1939 (2015)
- [c9] Xu Chu, Yeye He, Kaushik Chakrabarti, Kris Ganjam:
TEGRA: Table Extraction by Global Record Alignment. SIGMOD Conference 2015: 1713-1728
- [c8] Chi Wang, Kaushik Chakrabarti, Yeye He, Kris Ganjam, Zhimin Chen, Philip A. Bernstein:
Concept Expansion Using Web Tables. WWW 2015: 1198-1208
- 2014
- [j3] Akash Das Sarma, Yeye He, Surajit Chaudhuri:
ClusterJoin: A Similarity Joins Framework using Map-Reduce. Proc. VLDB Endow. 7(12): 1059-1070 (2014)
- [c7] Yeye He, Siddharth Barman, Jeffrey F. Naughton:
On Load Shedding in Complex Event Processing. ICDT 2014: 213-224
- 2013
- [c6] Di Wang, Yeye He, Elke A. Rundensteiner, Jeffrey F. Naughton:
Utility-maximizing event stream suppression. SIGMOD Conference 2013: 589-600
- [c5] Yeye He, Dong Xin, Venkatesh Ganti, Sriram Rajaraman, Nirav Shah:
Crawling deep web entity pages. WSDM 2013: 355-364
- [c4] Bilyana Taneva, Tao Cheng, Kaushik Chakrabarti, Yeye He:
Mining acronym expansions and their meanings using query click log. WWW 2013: 1261-1272
- [i1] Yeye He, Siddharth Barman, Jeffrey F. Naughton:
On Load Shedding in Complex Event Processing. CoRR abs/1312.4283 (2013)
- 2011
- [c3] Yeye He, Siddharth Barman, Jeffrey F. Naughton:
Preventing equivalence attacks in updated, anonymized data. ICDE 2011: 529-540
- [c2] Yeye He, Siddharth Barman, Di Wang, Jeffrey F. Naughton:
On the complexity of privacy-preserving complex event processing. PODS 2011: 165-174
- [c1] Yeye He, Dong Xin:
SEISA: set expansion by iterative similarity aggregation. WWW 2011: 427-436
- 2010
- [j2] Dong Xin, Yeye He, Venkatesh Ganti:
Keyword++: A Framework to Improve Keyword Search Over Entity Databases. Proc. VLDB Endow. 3(1): 711-722 (2010)
2000 – 2009
- 2009
- [j1] Yeye He, Jeffrey F. Naughton:
Anonymization of Set-Valued Data via Top-Down, Local Generalization. Proc. VLDB Endow. 2(1): 934-945 (2009)
last updated on 2024-11-15 19:32 CET by the dblp team
all metadata released as open data under CC0 1.0 license