Scott Yih
Scott Wen-tau Yih – Wen-tau Yih – Wen-Tau Yih
Person information
- affiliation: Meta AI
2020 – today
- 2024
- [c130]Zhengbao Jiang, Zhiqing Sun, Weijia Shi, Pedro Rodríguez, Chunting Zhou, Graham Neubig, Xi Victoria Lin, Wen-tau Yih, Srini Iyer:
Instruction-tuned Language Models are Better Knowledge Learners. ACL (1) 2024: 5421-5434 - [c129]Jiawei Ma, Po-Yao Huang, Saining Xie, Shang-Wen Li, Luke Zettlemoyer, Shih-Fu Chang, Wen-Tau Yih, Hu Xu:
MoDE: CLIP Data Experts via Clustering. CVPR 2024: 26344-26353 - [c128]Mingda Chen, Xilun Chen, Wen-tau Yih:
Few-Shot Data Synthesis for Open Domain Multi-Hop Question Answering. EACL (1) 2024: 190-208 - [c127]Hu Xu, Po-Yao Huang, Xiaoqing Ellen Tan, Ching-Feng Yeh, Jacob Kahn, Christine Jou, Gargi Ghosh, Omer Levy, Luke Zettlemoyer, Wen-tau Yih, Shang-Wen Li, Saining Xie, Christoph Feichtenhofer:
Altogether: Image Captioning via Re-aligning Alt-text. EMNLP 2024: 19302-19318 - [c126]Xi Victoria Lin, Xilun Chen, Mingda Chen, Weijia Shi, Maria Lomeli, Richard James, Pedro Rodriguez, Jacob Kahn, Gergely Szilvasy, Mike Lewis, Luke Zettlemoyer, Wen-tau Yih:
RA-DIT: Retrieval-Augmented Dual Instruction Tuning. ICLR 2024 - [c125]Weijia Shi, Sewon Min, Maria Lomeli, Chunting Zhou, Margaret Li, Xi Victoria Lin, Noah A. Smith, Luke Zettlemoyer, Wen-tau Yih, Mike Lewis:
In-Context Pretraining: Language Modeling Beyond Document Boundaries. ICLR 2024 - [c124]Weijia Shi, Xiaochuang Han, Mike Lewis, Yulia Tsvetkov, Luke Zettlemoyer, Wen-tau Yih:
Trusting Your Evidence: Hallucinate Less with Context-aware Decoding. NAACL (Short Papers) 2024: 783-791 - [c123]Weijia Shi, Sewon Min, Michihiro Yasunaga, Minjoon Seo, Richard James, Mike Lewis, Luke Zettlemoyer, Wen-tau Yih:
REPLUG: Retrieval-Augmented Black-Box Language Models. NAACL-HLT 2024: 8371-8384 - [i83]Zhengbao Jiang, Zhiqing Sun, Weijia Shi, Pedro Rodriguez, Chunting Zhou, Graham Neubig, Xi Victoria Lin, Wen-tau Yih, Srinivasan Iyer:
Instruction-tuned Language Models are Better Knowledge Learners. CoRR abs/2402.12847 (2024) - [i82]Akari Asai, Zexuan Zhong, Danqi Chen, Pang Wei Koh, Luke Zettlemoyer, Hannaneh Hajishirzi, Wen-tau Yih:
Reliable, Adaptable, and Attributable Language Models with Retrieval. CoRR abs/2403.03187 (2024) - [i81]Sainbayar Sukhbaatar, Olga Golovneva, Vasu Sharma, Hu Xu, Xi Victoria Lin, Baptiste Rozière, Jacob Kahn, Daniel Li, Wen-tau Yih, Jason Weston, Xian Li:
Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM. CoRR abs/2403.07816 (2024) - [i80]Jiawei Ma, Po-Yao Huang, Saining Xie, Shang-Wen Li, Luke Zettlemoyer, Shih-Fu Chang, Wen-Tau Yih, Hu Xu:
MoDE: CLIP Data Experts via Clustering. CoRR abs/2404.16030 (2024) - [i79]Sheng-Chieh Lin, Luyu Gao, Barlas Oguz, Wenhan Xiong, Jimmy Lin, Wen-tau Yih, Xilun Chen:
FLAME: Factuality-Aware Alignment for Large Language Models. CoRR abs/2405.01525 (2024) - [i78]Minghan Li, Xilun Chen, Ari Holtzman, Beidi Chen, Jimmy Lin, Wen-tau Yih, Xi Victoria Lin:
Nearest Neighbor Speculative Decoding for LLM Generation and Attribution. CoRR abs/2405.19325 (2024) - [i77]Xiao Yang, Kai Sun, Hao Xin, Yushi Sun, Nikita Bhalla, Xiangsen Chen, Sajal Choudhary, Rongze Daniel Gui, Ziran Will Jiang, Ziyu Jiang, Lingkun Kong, Brian Moran, Jiaqi Wang, Yifan Ethan Xu, An Yan, Chenyu Yang, Eting Yuan, Hanwen Zha, Nan Tang, Lei Chen, Nicolas Scheffer, Yue Liu, Nirav Shah, Rakesh Wanga, Anuj Kumar, Wen-tau Yih, Xin Luna Dong:
CRAG - Comprehensive RAG Benchmark. CoRR abs/2406.04744 (2024) - [i76]Yejin Lee, Anna Y. Sun, Basil Hosmer, Bilge Acun, Can Balioglu, Changhan Wang, Charles David Hernandez, Christian Puhrsch, Daniel Haziza, Driss Guessous, Francisco Massa, Jacob Kahn, Jeffrey Wan, Jeremy Reizenstein, Jiaqi Zhai, Joe Isaacson, Joel Schlosser, Juan Pino, Kaushik Ram Sadagopan, Leonid Shamis, Linjian Ma, Min-Jae Hwang, Mingda Chen, Mostafa Elhoushi, Pedro Rodriguez, Ram Pasunuru, Scott Yih, Sravya Popuri, Xing Liu, Carole-Jean Wu:
Characterizing and Efficiently Accelerating Multimodal Generation Model Inference. CoRR abs/2410.00215 (2024) - [i75]Hu Xu, Po-Yao Huang, Xiaoqing Ellen Tan, Ching-Feng Yeh, Jacob Kahn, Christine Jou, Gargi Ghosh, Omer Levy, Luke Zettlemoyer, Wen-tau Yih, Shang-Wen Li, Saining Xie, Christoph Feichtenhofer:
Altogether: Image Captioning via Re-aligning Alt-text. CoRR abs/2410.17251 (2024)
- 2023
- [c122]Hongjin Su, Weijia Shi, Jungo Kasai, Yizhong Wang, Yushi Hu, Mari Ostendorf, Wen-tau Yih, Noah A. Smith, Luke Zettlemoyer, Tao Yu:
One Embedder, Any Task: Instruction-Finetuned Text Embeddings. ACL (Findings) 2023: 1102-1121 - [c121]Sewon Min, Weijia Shi, Mike Lewis, Xilun Chen, Wen-tau Yih, Hannaneh Hajishirzi, Luke Zettlemoyer:
Nonparametric Masked Language Modeling. ACL (Findings) 2023: 2097-2118 - [c120]Hao Yan, Saurabh Srivastava, Yintao Tai, Sida I. Wang, Wen-tau Yih, Ziyu Yao:
Learning to Simulate Natural Language Feedback for Interactive Semantic Parsing. ACL (1) 2023: 3149-3170 - [c119]Akari Asai, Timo Schick, Patrick S. H. Lewis, Xilun Chen, Gautier Izacard, Sebastian Riedel, Hannaneh Hajishirzi, Wen-tau Yih:
Task-aware Retrieval with Instructions. ACL (Findings) 2023: 3650-3675 - [c118]Wang-Chiew Tan, Yuliang Li, Pedro Rodriguez, Richard James, Xi Victoria Lin, Alon Y. Halevy, Wen-tau Yih:
Reimagining Retrieval Augmented Language Models for Answering Queries. ACL (Findings) 2023: 6131-6146 - [c117]Minghan Li, Sheng-Chieh Lin, Barlas Oguz, Asish Ghoshal, Jimmy Lin, Yashar Mehdad, Wen-tau Yih, Xilun Chen:
CITADEL: Conditional Token Interaction via Dynamic Lexical Routing for Efficient and Effective Multi-Vector Retrieval. ACL (1) 2023: 11891-11907 - [c116]Yung-Sung Chuang, Wei Fang, Shang-Wen Li, Wen-tau Yih, James R. Glass:
Expand, Rerank, and Retrieve: Query Reranking for Open-Domain Question Answering. ACL (Findings) 2023: 12131-12147 - [c115]Wenhan Xiong, Anchit Gupta, Shubham Toshniwal, Yashar Mehdad, Scott Yih:
Adapting Pretrained Text-to-Text Models for Long Text Sequences. EMNLP (Findings) 2023: 5566-5578 - [c114]Sheng-Chieh Lin, Akari Asai, Minghan Li, Barlas Oguz, Jimmy Lin, Yashar Mehdad, Wen-tau Yih, Xilun Chen:
How to Train Your Dragon: Diverse Augmentation Towards Generalizable Dense Retrieval. EMNLP (Findings) 2023: 6385-6400 - [c113]Victor Zhong, Weijia Shi, Wen-tau Yih, Luke Zettlemoyer:
RoMQA: A Benchmark for Robust, Multi-evidence, Multi-answer Question Answering. EMNLP (Findings) 2023: 7055-7067 - [c112]Sewon Min, Kalpesh Krishna, Xinxi Lyu, Mike Lewis, Wen-tau Yih, Pang Wei Koh, Mohit Iyyer, Luke Zettlemoyer, Hannaneh Hajishirzi:
FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation. EMNLP 2023: 12076-12100 - [c111]Daniel Fried, Armen Aghajanyan, Jessy Lin, Sida Wang, Eric Wallace, Freda Shi, Ruiqi Zhong, Scott Yih, Luke Zettlemoyer, Mike Lewis:
InCoder: A Generative Model for Code Infilling and Synthesis. ICLR 2023 - [c110]Yuhang Lai, Chengxi Li, Yiming Wang, Tianyi Zhang, Ruiqi Zhong, Luke Zettlemoyer, Wen-Tau Yih, Daniel Fried, Sida I. Wang, Tao Yu:
DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation. ICML 2023: 18319-18345 - [c109]Ansong Ni, Srini Iyer, Dragomir Radev, Veselin Stoyanov, Wen-Tau Yih, Sida I. Wang, Xi Victoria Lin:
LEVER: Learning to Verify Language-to-Code Generation with Execution. ICML 2023: 26106-26128 - [c108]Michihiro Yasunaga, Armen Aghajanyan, Weijia Shi, Richard James, Jure Leskovec, Percy Liang, Mike Lewis, Luke Zettlemoyer, Wen-Tau Yih:
Retrieval-Augmented Multimodal Language Modeling. ICML 2023: 39755-39769 - [c107]Tianyi Zhang, Tao Yu, Tatsunori Hashimoto, Mike Lewis, Wen-Tau Yih, Daniel Fried, Sida Wang:
Coder Reviewer Reranking for Code Generation. ICML 2023: 41832-41846 - [i74]Weijia Shi, Sewon Min, Michihiro Yasunaga, Minjoon Seo, Rich James, Mike Lewis, Luke Zettlemoyer, Wen-tau Yih:
REPLUG: Retrieval-Augmented Black-Box Language Models. CoRR abs/2301.12652 (2023) - [i73]Sheng-Chieh Lin, Akari Asai, Minghan Li, Barlas Oguz, Jimmy Lin, Yashar Mehdad, Wen-tau Yih, Xilun Chen:
How to Train Your DRAGON: Diverse Augmentation Towards Generalizable Dense Retrieval. CoRR abs/2302.07452 (2023) - [i72]Ansong Ni, Srini Iyer, Dragomir Radev, Ves Stoyanov, Wen-tau Yih, Sida I. Wang, Xi Victoria Lin:
LEVER: Learning to Verify Language-to-Code Generation with Execution. CoRR abs/2302.08468 (2023) - [i71]Xilun Chen, Lili Yu, Wenhan Xiong, Barlas Oguz, Yashar Mehdad, Wen-tau Yih:
VideoOFA: Two-Stage Pre-Training for Video-to-Text Generation. CoRR abs/2305.03204 (2023) - [i70]Imanol Schlag, Sainbayar Sukhbaatar, Asli Celikyilmaz, Wen-tau Yih, Jason Weston, Jürgen Schmidhuber, Xian Li:
Large Language Model Programs. CoRR abs/2305.05364 (2023) - [i69]Hao Yan, Saurabh Srivastava, Yintao Tai, Sida I. Wang, Wen-tau Yih, Ziyu Yao:
Learning to Simulate Natural Language Feedback for Interactive Semantic Parsing. CoRR abs/2305.08195 (2023) - [i68]Mingda Chen, Xilun Chen, Wen-tau Yih:
Efficient Open Domain Multi-Hop Question Answering with Few-Shot Data Synthesis. CoRR abs/2305.13691 (2023) - [i67]Sewon Min, Kalpesh Krishna, Xinxi Lyu, Mike Lewis, Wen-tau Yih, Pang Wei Koh, Mohit Iyyer, Luke Zettlemoyer, Hannaneh Hajishirzi:
FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation. CoRR abs/2305.14251 (2023) - [i66]Weijia Shi, Xiaochuang Han, Mike Lewis, Yulia Tsvetkov, Luke Zettlemoyer, Scott Wen-tau Yih:
Trusting Your Evidence: Hallucinate Less with Context-aware Decoding. CoRR abs/2305.14739 (2023) - [i65]Yung-Sung Chuang, Wei Fang, Shang-Wen Li, Wen-tau Yih, James R. Glass:
Expand, Rerank, and Retrieve: Query Reranking for Open-Domain Question Answering. CoRR abs/2305.17080 (2023) - [i64]Wang-Chiew Tan, Yuliang Li, Pedro Rodriguez, Richard James, Xi Victoria Lin, Alon Y. Halevy, Scott Yih:
Reimagining Retrieval Augmented Language Models for Answering Queries. CoRR abs/2306.01061 (2023) - [i63]Xi Victoria Lin, Xilun Chen, Mingda Chen, Weijia Shi, Maria Lomeli, Rich James, Pedro Rodriguez, Jacob Kahn, Gergely Szilvasy, Mike Lewis, Luke Zettlemoyer, Scott Yih:
RA-DIT: Retrieval-Augmented Dual Instruction Tuning. CoRR abs/2310.01352 (2023) - [i62]Weijia Shi, Sewon Min, Maria Lomeli, Chunting Zhou, Margaret Li, Xi Victoria Lin, Noah A. Smith, Luke Zettlemoyer, Scott Yih, Mike Lewis:
In-Context Pretraining: Language Modeling Beyond Document Boundaries. CoRR abs/2310.10638 (2023)
- 2022
- [c106]Bill Yuchen Lin, Sida Wang, Xi Victoria Lin, Robin Jia, Lin Xiao, Xiang Ren, Scott Yih:
On Continual Model Refinement in Out-of-Distribution Data Streams. ACL (1) 2022: 3128-3139 - [c105]Yuning Mao, Lambert Mathias, Rui Hou, Amjad Almahairi, Hao Ma, Jiawei Han, Scott Yih, Madian Khabsa:
UniPELT: A Unified Framework for Parameter-Efficient Language Model Tuning. ACL (1) 2022: 6253-6264 - [c104]Xilun Chen, Kushal Lakhotia, Barlas Oguz, Anchit Gupta, Patrick S. H. Lewis, Stan Peshterliev, Yashar Mehdad, Sonal Gupta, Wen-tau Yih:
Salient Phrase Aware Dense Retrieval: Can a Dense Retriever Imitate a Sparse One? EMNLP (Findings) 2022: 250-262 - [c103]Devendra Singh Sachan, Mike Lewis, Mandar Joshi, Armen Aghajanyan, Wen-tau Yih, Joelle Pineau, Luke Zettlemoyer:
Improving Passage Retrieval with Zero-Shot Question Generation. EMNLP 2022: 3781-3797 - [c102]Barlas Oguz, Kushal Lakhotia, Anchit Gupta, Patrick S. H. Lewis, Vladimir Karpukhin, Aleksandra Piktus, Xilun Chen, Sebastian Riedel, Scott Yih, Sonal Gupta, Yashar Mehdad:
Domain-matched Pre-training Tasks for Dense Retrieval. NAACL-HLT (Findings) 2022: 1524-1534 - [c101]Barlas Oguz, Xilun Chen, Vladimir Karpukhin, Stan Peshterliev, Dmytro Okhonko, Michael Sejr Schlichtkrull, Sonal Gupta, Yashar Mehdad, Scott Yih:
UniK-QA: Unified Representations of Structured and Unstructured Knowledge for Open-Domain Question Answering. NAACL-HLT (Findings) 2022: 1535-1546 - [c100]Wenhan Xiong, Barlas Oguz, Anchit Gupta, Xilun Chen, Diana Liskovich, Omer Levy, Scott Yih, Yashar Mehdad:
Simple Local Attentions Remain Competitive for Long-Context Tasks. NAACL-HLT 2022: 1975-1986 - [c99]Patrick Huber, Armen Aghajanyan, Barlas Oguz, Dmytro Okhonko, Scott Yih, Sonal Gupta, Xilun Chen:
CCQA: A New Web-Scale Question Answering Dataset for Model Pre-Training. NAACL-HLT (Findings) 2022: 2402-2420 - [c98]Patrick S. H. Lewis, Barlas Oguz, Wenhan Xiong, Fabio Petroni, Scott Yih, Sebastian Riedel:
Boosted Dense Retriever. NAACL-HLT 2022: 3102-3117 - [c97]Yung-Sung Chuang, Rumen Dangovski, Hongyin Luo, Yang Zhang, Shiyu Chang, Marin Soljacic, Shang-Wen Li, Scott Yih, Yoon Kim, James R. Glass:
DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings. NAACL-HLT 2022: 4207-4218 - [c96]Michele Bevilacqua, Giuseppe Ottaviano, Patrick S. H. Lewis, Scott Yih, Sebastian Riedel, Fabio Petroni:
Autoregressive Search Engines: Generating Substrings as Document Identifiers. NeurIPS 2022 - [c95]Zechun Liu, Barlas Oguz, Aasish Pappu, Lin Xiao, Scott Yih, Meng Li, Raghuraman Krishnamoorthi, Yashar Mehdad:
BiT: Robustly Binarized Multi-distilled Transformer. NeurIPS 2022 - [c94]Asish Ghoshal, Srinivasan Iyer, Bhargavi Paranjape, Kushal Lakhotia, Scott Wen-tau Yih, Yashar Mehdad:
QUASER: Question Answering with Scalable Extractive Rationalization. SIGIR 2022: 1208-1218 - [i61]Daniel Fried, Armen Aghajanyan, Jessy Lin, Sida Wang, Eric Wallace, Freda Shi, Ruiqi Zhong, Wen-tau Yih, Luke Zettlemoyer, Mike Lewis:
InCoder: A Generative Model for Code Infilling and Synthesis. CoRR abs/2204.05999 (2022) - [i60]Devendra Singh Sachan, Mike Lewis, Mandar Joshi, Armen Aghajanyan, Wen-tau Yih, Joelle Pineau, Luke Zettlemoyer:
Improving Passage Retrieval with Zero-Shot Question Generation. CoRR abs/2204.07496 (2022) - [i59]Yung-Sung Chuang, Rumen Dangovski, Hongyin Luo, Yang Zhang, Shiyu Chang, Marin Soljacic, Shang-Wen Li, Wen-tau Yih, Yoon Kim, James R. Glass:
DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings. CoRR abs/2204.10298 (2022) - [i58]Michele Bevilacqua, Giuseppe Ottaviano, Patrick S. H. Lewis, Wen-tau Yih, Sebastian Riedel, Fabio Petroni:
Autoregressive Search Engines: Generating Substrings as Document Identifiers. CoRR abs/2204.10628 (2022) - [i57]Bill Yuchen Lin, Sida Wang, Xi Victoria Lin, Robin Jia, Lin Xiao, Xiang Ren, Wen-tau Yih:
On Continual Model Refinement in Out-of-Distribution Data Streams. CoRR abs/2205.02014 (2022) - [i56]Chi-Liang Liu, Hung-yi Lee, Wen-tau Yih:
Structured Prompt Tuning. CoRR abs/2205.12309 (2022) - [i55]Zechun Liu, Barlas Oguz, Aasish Pappu, Lin Xiao, Scott Yih, Meng Li, Raghuraman Krishnamoorthi, Yashar Mehdad:
BiT: Robustly Binarized Multi-distilled Transformer. CoRR abs/2205.13016 (2022) - [i54]Wenhan Xiong, Anchit Gupta, Shubham Toshniwal, Yashar Mehdad, Wen-tau Yih:
Adapting Pretrained Text-to-Text Models for Long Text Sequences. CoRR abs/2209.10052 (2022) - [i53]Victor Zhong, Weijia Shi, Wen-tau Yih, Luke Zettlemoyer:
RoMQA: A Benchmark for Robust, Multi-evidence, Multi-answer Question Answering. CoRR abs/2210.14353 (2022) - [i52]Akari Asai, Timo Schick, Patrick S. H. Lewis, Xilun Chen, Gautier Izacard, Sebastian Riedel, Hannaneh Hajishirzi, Wen-tau Yih:
Task-aware Retrieval with Instructions. CoRR abs/2211.09260 (2022) - [i51]Minghan Li, Sheng-Chieh Lin, Barlas Oguz, Asish Ghoshal, Jimmy Lin, Yashar Mehdad, Wen-tau Yih, Xilun Chen:
CITADEL: Conditional Token Interaction via Dynamic Lexical Routing for Efficient and Effective Multi-Vector Retrieval. CoRR abs/2211.10411 (2022) - [i50]Yuhang Lai, Chengxi Li, Yiming Wang, Tianyi Zhang, Ruiqi Zhong, Luke Zettlemoyer, Scott Wen-tau Yih, Daniel Fried, Sida I. Wang, Tao Yu:
DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation. CoRR abs/2211.11501 (2022) - [i49]Michihiro Yasunaga, Armen Aghajanyan, Weijia Shi, Rich James, Jure Leskovec, Percy Liang, Mike Lewis, Luke Zettlemoyer, Wen-tau Yih:
Retrieval-Augmented Multimodal Language Modeling. CoRR abs/2211.12561 (2022) - [i48]Tianyi Zhang, Tao Yu, Tatsunori B. Hashimoto, Mike Lewis, Wen-tau Yih, Daniel Fried, Sida I. Wang:
Coder Reviewer Reranking for Code Generation. CoRR abs/2211.16490 (2022) - [i47]Sewon Min, Weijia Shi, Mike Lewis, Xilun Chen, Wen-tau Yih, Hannaneh Hajishirzi, Luke Zettlemoyer:
Nonparametric Masked Language Modeling. CoRR abs/2212.01349 (2022) - [i46]Asish Ghoshal, Arash Einolghozati, Ankit Arun, Haoran Li, Lili Yu, Yashar Mehdad, Scott Wen-tau Yih, Asli Celikyilmaz:
Improving Faithfulness of Abstractive Summarization by Controlling Confounding Effect of Irrelevant Sentences. CoRR abs/2212.09726 (2022) - [i45]Hongjin Su, Weijia Shi, Jungo Kasai, Yizhong Wang, Yushi Hu, Mari Ostendorf, Wen-tau Yih, Noah A. Smith, Luke Zettlemoyer, Tao Yu:
One Embedder, Any Task: Instruction-Finetuned Text Embeddings. CoRR abs/2212.09741 (2022)
- 2021
- [c93]Jean Maillard, Vladimir Karpukhin, Fabio Petroni, Wen-tau Yih, Barlas Oguz, Veselin Stoyanov, Gargi Ghosh:
Multi-Task Retrieval for Knowledge-Intensive Tasks. ACL/IJCNLP (1) 2021: 1098-1111 - [c92]Divyansh Kaushik, Douwe Kiela, Zachary C. Lipton, Wen-tau Yih:
On the Efficacy of Adversarial Data Collection for Question Answering: Results from a Large-Scale Randomized Study. ACL/IJCNLP (1) 2021: 6618-6633 - [c91]Michael Sejr Schlichtkrull, Vladimir Karpukhin, Barlas Oguz, Mike Lewis, Wen-tau Yih, Sebastian Riedel:
Joint Verification and Reranking for Open Fact Checking Over Tables. ACL/IJCNLP (1) 2021: 6787-6799 - [c90]Kushal Lakhotia, Bhargavi Paranjape, Asish Ghoshal, Scott Yih, Yashar Mehdad, Srini Iyer:
FiD-Ex: Improving Sequence-to-Sequence Models for Extractive Rationale Generation. EMNLP (1) 2021: 3712-3727 - [c89]Qinyuan Ye, Belinda Z. Li, Sinong Wang, Benjamin Bolte, Hao Ma, Wen-tau Yih, Xiang Ren, Madian Khabsa:
On the Influence of Masking Policies in Intermediate Pre-training. EMNLP (1) 2021: 7190-7202 - [c88]Wenhan Xiong, Xiang Lorraine Li, Srini Iyer, Jingfei Du, Patrick S. H. Lewis, William Yang Wang, Yashar Mehdad, Scott Yih, Sebastian Riedel, Douwe Kiela, Barlas Oguz:
Answering Complex Open-Domain Questions with Multi-Hop Dense Retrieval. ICLR 2021 - [c87]Srinivasan Iyer, Sewon Min, Yashar Mehdad, Wen-tau Yih:
RECONSIDER: Improved Re-Ranking using Span-Focused Cross-Attention for Open Domain Question Answering. NAACL-HLT 2021: 1280-1287 - [c86]Nayeon Lee, Belinda Z. Li, Sinong Wang, Pascale Fung, Hao Ma, Wen-tau Yih, Madian Khabsa:
On Unifying Misinformation Detection. NAACL-HLT 2021: 5479-5485 - [e7]Marie-Francine Moens, Xuanjing Huang, Lucia Specia, Scott Wen-tau Yih:
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, EMNLP 2021, Virtual Event / Punta Cana, Dominican Republic, 7-11 November, 2021. Association for Computational Linguistics 2021 [contents] - [e6]Marie-Francine Moens, Xuanjing Huang, Lucia Specia, Scott Wen-tau Yih:
Findings of the Association for Computational Linguistics: EMNLP 2021, Virtual Event / Punta Cana, Dominican Republic, 16-20 November, 2021. Association for Computational Linguistics 2021 [contents] - [i44]Jean Maillard, Vladimir Karpukhin, Fabio Petroni, Wen-tau Yih, Barlas Oguz, Veselin Stoyanov, Gargi Ghosh:
Multi-task Retrieval for Knowledge-Intensive Tasks. CoRR abs/2101.00117 (2021) - [i43]Sewon Min, Jordan L. Boyd-Graber, Chris Alberti, Danqi Chen, Eunsol Choi, Michael Collins, Kelvin Guu, Hannaneh Hajishirzi, Kenton Lee, Jennimaria Palomaki, Colin Raffel, Adam Roberts, Tom Kwiatkowski, Patrick S. H. Lewis, Yuxiang Wu, Heinrich Küttler, Linqing Liu, Pasquale Minervini, Pontus Stenetorp, Sebastian Riedel, Sohee Yang, Minjoon Seo, Gautier Izacard, Fabio Petroni, Lucas Hosseini, Nicola De Cao, Edouard Grave, Ikuya Yamada, Sonse Shimaoka, Masatoshi Suzuki, Shumpei Miyawaki, Shun Sato, Ryo Takahashi, Jun Suzuki, Martin Fajcik, Martin Docekal, Karel Ondrej, Pavel Smrz, Hao Cheng, Yelong Shen, Xiaodong Liu, Pengcheng He, Weizhu Chen, Jianfeng Gao, Barlas Oguz, Xilun Chen, Vladimir Karpukhin, Stan Peshterliev, Dmytro Okhonko, Michael Sejr Schlichtkrull, Sonal Gupta, Yashar Mehdad, Wen-tau Yih:
NeurIPS 2020 EfficientQA Competition: Systems, Analyses and Lessons Learned. CoRR abs/2101.00133 (2021) - [i42]Nayeon Lee, Belinda Z. Li, Sinong Wang, Pascale Fung, Hao Ma, Wen-tau Yih, Madian Khabsa:
On Unifying Misinformation Detection. CoRR abs/2104.05243 (2021) - [i41]Qinyuan Ye, Belinda Z. Li, Sinong Wang, Benjamin Bolte, Hao Ma, Wen-tau Yih, Xiang Ren, Madian Khabsa:
On the Influence of Masking Policies in Intermediate Pre-training. CoRR abs/2104.08840 (2021) - [i40]Divyansh Kaushik, Douwe Kiela, Zachary C. Lipton, Wen-tau Yih:
On the Efficacy of Adversarial Data Collection for Question Answering: Results from a Large-Scale Randomized Study. CoRR abs/2106.00872 (2021) - [i39]Barlas Oguz, Kushal Lakhotia, Anchit Gupta, Patrick S. H. Lewis, Vladimir Karpukhin, Aleksandra Piktus, Xilun Chen, Sebastian Riedel, Wen-tau Yih, Sonal Gupta, Yashar Mehdad:
Domain-matched Pre-training Tasks for Dense Retrieval. CoRR abs/2107.13602 (2021) - [i38]Xilun Chen, Kushal Lakhotia, Barlas Oguz, Anchit Gupta, Patrick S. H. Lewis, Stan Peshterliev, Yashar Mehdad, Sonal Gupta, Wen-tau Yih:
Salient Phrase Aware Dense Retrieval: Can a Dense Retriever Imitate a Sparse One? CoRR abs/2110.06918 (2021) - [i37]Yuning Mao, Lambert Mathias, Rui Hou, Amjad Almahairi, Hao Ma, Jiawei Han, Wen-tau Yih, Madian Khabsa:
UniPELT: A Unified Framework for Parameter-Efficient Language Model Tuning. CoRR abs/2110.07577 (2021) - [i36]Patrick Huber, Armen Aghajanyan, Barlas Oguz, Dmytro Okhonko, Wen-tau Yih, Sonal Gupta, Xilun Chen:
CCQA: A New Web-Scale Question Answering Dataset for Model Pre-Training. CoRR abs/2110.07731 (2021) - [i35]Wenhan Xiong, Barlas Oguz, Anchit Gupta, Xilun Chen, Diana Liskovich, Omer Levy, Wen-tau Yih, Yashar Mehdad:
Simple Local Attentions Remain Competitive for Long-Context Tasks. CoRR abs/2112.07210 (2021) - [i34]Patrick S. H. Lewis, Barlas Oguz, Wenhan Xiong, Fabio Petroni, Wen-tau Yih, Sebastian Riedel:
Boosted Dense Retriever. CoRR abs/2112.07771 (2021) - [i33]Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Dmytro Okhonko, Samuel Broscheit, Gautier Izacard, Patrick S. H. Lewis, Barlas Oguz, Edouard Grave, Wen-tau Yih, Sebastian Riedel:
The Web Is Your Oyster - Knowledge-Intensive NLP against a Very Large Web Corpus. CoRR abs/2112.09924 (2021)
- 2020
- [c85]Danqi Chen, Wen-tau Yih:
Open-Domain Question Answering. ACL (tutorial) 2020: 34-37 - [c84]Pengcheng Yin, Graham Neubig, Wen-tau Yih, Sebastian Riedel:
TaBERT: Pretraining for Joint Understanding of Textual and Tabular Data. ACL 2020: 8413-8426 - [c83]Jiezhong Qiu, Hao Ma, Omer Levy, Wen-tau Yih, Sinong Wang, Jie Tang:
Blockwise Self-Attention for Long Document Understanding. EMNLP (Findings) 2020: 2555-2565 - [c82]Belinda Z. Li, Sewon Min, Srinivasan Iyer, Yashar Mehdad, Wen-tau Yih:
Efficient One-Pass End-to-End Entity Linking for Questions. EMNLP (1) 2020: 6433-6441 - [c81]Vladimir Karpukhin, Barlas Oguz, Sewon Min, Patrick S. H. Lewis, Ledell Wu, Sergey Edunov, Danqi Chen, Wen-tau Yih:
Dense Passage Retrieval for Open-Domain Question Answering. EMNLP (1) 2020: 6769-6781 - [c80]Ziyu Yao, Yiqi Tang, Wen-tau Yih, Huan Sun, Yu Su:
An Imitation Game for Learning Semantic Parsers from User Interaction. EMNLP (1) 2020: 6883-6902 - [c79]Ethan Perez, Patrick S. H. Lewis, Wen-tau Yih, Kyunghyun Cho, Douwe Kiela:
Unsupervised Question Decomposition for Question Answering. EMNLP (1) 2020: 8864-8880 - [c78]Chandra Bhagavatula, Ronan Le Bras, Chaitanya Malaviya, Keisuke Sakaguchi, Ari Holtzman, Hannah Rashkin, Doug Downey, Wen-tau Yih, Yejin Choi:
Abductive Commonsense Reasoning. ICLR 2020 - [c77]Patrick S. H. Lewis, Ethan Perez, Aleksandra Piktus, Fabio Petroni, Vladimir Karpukhin, Naman Goyal, Heinrich Küttler, Mike Lewis, Wen-tau Yih, Tim Rocktäschel, Sebastian Riedel, Douwe Kiela:
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks. NeurIPS 2020 - [c76]Sewon Min, Jordan L. Boyd-Graber, Chris Alberti, Danqi Chen, Eunsol Choi, Michael Collins, Kelvin Guu, Hannaneh Hajishirzi, Kenton Lee, Jennimaria Palomaki, Colin Raffel, Adam Roberts, Tom Kwiatkowski, Patrick S. H. Lewis, Yuxiang Wu, Heinrich Küttler, Linqing Liu, Pasquale Minervini, Pontus Stenetorp, Sebastian Riedel, Sohee Yang, Minjoon Seo, Gautier Izacard, Fabio Petroni, Lucas Hosseini, Nicola De Cao, Edouard Grave, Ikuya Yamada, Sonse Shimaoka, Masatoshi Suzuki, Shumpei Miyawaki, Shun Sato, Ryo Takahashi, Jun Suzuki, Martin Fajcik, Martin Docekal, Karel Ondrej, Pavel Smrz, Hao Cheng, Yelong Shen, Xiaodong Liu, Pengcheng He, Weizhu Chen, Jianfeng Gao, Barlas Oguz, Xilun Chen, Vladimir Karpukhin, Stan Peshterliev, Dmytro Okhonko, Michael Sejr Schlichtkrull, Sonal Gupta, Yashar Mehdad, Wen-tau Yih:
NeurIPS 2020 EfficientQA Competition: Systems, Analyses and Lessons Learned. NeurIPS (Competition and Demos) 2020: 86-111 - [i32]Ethan Perez, Patrick S. H. Lewis, Wen-tau Yih, Kyunghyun Cho, Douwe Kiela:
Unsupervised Question Decomposition for Question Answering. CoRR abs/2002.09758 (2020) - [i31]Vladimir Karpukhin, Barlas Oguz, Sewon Min, Ledell Wu, Sergey Edunov, Danqi Chen, Wen-tau Yih:
Dense Passage Retrieval for Open-Domain Question Answering. CoRR abs/2004.04906 (2020) - [i30]Ziyu Yao, Yiqi Tang, Wen-tau Yih, Huan Sun, Yu Su:
An Imitation Game for Learning Semantic Parsers from User Interaction. CoRR abs/2005.00689 (2020) - [i29]