default search action
Jimmy Lin
Person information
- affiliation: University of Waterloo, David R. Cheriton School of Computer Science
- affiliation: Twitter Inc., San Francisco, USA
- affiliation: University of Maryland, College Park, Institute for Advanced Computer Studies (UMIACS)
- affiliation: Massachusetts Institute of Technology (MIT), Artificial Intelligence Laboratory
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j66]Xinyu Zhang, Kelechi Ogueji, Xueguang Ma, Jimmy Lin:
Toward Best Practices for Training Multilingual Dense Retrieval Models. ACM Trans. Inf. Syst. 42(2): 39:1-39:33 (2024) - [c363]Jimmy Lin, Junkai Li, Jiasi Gao, Weizhi Ma, Yang Liu:
Jointly Modeling Spatio-Temporal Features of Tactile Signals for Action Classification. AAAI 2024: 13817-13825 - [c362]Mofetoluwa Adeyemi, Akintunde Oladipo, Ronak Pradeep, Jimmy Lin:
Zero-Shot Cross-Lingual Reranking with Large Language Models for Low-Resource Languages. ACL (Short Papers) 2024: 650-656 - [c361]Mohammad Dehghan, Mohammad Ali Alomrani, Sunyam Bagga, David Alfonso-Hermelo, Khalil Bibi, Abbas Ghaddar, Yingxue Zhang, Xiaoguang Li, Jianye Hao, Qun Liu, Jimmy Lin, Boxing Chen, Prasanna Parthasarathi, Mahdi Biparva, Mehdi Rezagholizadeh:
EWEK-QA : Enhanced Web and Efficient Knowledge Graph Retrieval for Citation-based Question Answering Systems. ACL (1) 2024: 14169-14187 - [c360]Ronak Pradeep, Jimmy Lin:
Towards Automated End-to-End Health Misinformation Free Search with a Large Language Model. ECIR (4) 2024: 78-86 - [c359]Crystina Zhang, Minghan Li, Jimmy Lin:
CELI: Simple yet Effective Approach to Enhance Out-of-Domain Generalization of Cross-Encoders. NAACL (Short Papers) 2024: 188-196 - [c358]Raphael Tang, Xinyu Crystina Zhang, Xueguang Ma, Jimmy Lin, Ferhan Ture:
Found in the Middle: Permutation Self-Consistency Improves Listwise Ranking in Large Language Models. NAACL-HLT 2024: 2327-2340 - [c357]Nandan Thakur, Jianmo Ni, Gustavo Hernández Ábrego, John Wieting, Jimmy Lin, Daniel Cer:
Leveraging LLMs for Synthesizing Training Data Across Many Languages in Multilingual Dense Retrieval. NAACL-HLT 2024: 7699-7724 - [c356]Mofetoluwa Adeyemi, Akintunde Oladipo, Xinyu Zhang, David Alfonso-Hermelo, Mehdi Rezagholizadeh, Boxing Chen, Abdul-Hakeem Omotayo, Idris Abdulmumin, Naome A. Etori, Toyib Babatunde Musa, Samuel Fanijo, Oluwabusayo Olufunke Awoyomi, Saheed Abdullahi Salahudeen, Labaran Adamu Mohammed, Daud Olamide Abolade, Falalu Ibrahim Lawan, Maryam Sabo Abubakar, Ruqayya Nasir Iro, Amina Abubakar Imam, Shafie Abdi Mohamed, Hanad Mohamud Mohamed, Tunde Oluwaseyi Ajayi, Jimmy Lin:
CIRAL: A Test Collection for CLIR Evaluations in African Languages. SIGIR 2024: 293-302 - [c355]Nandan Thakur, Luiz Bonifacio, Maik Fröbe, Alexander Bondarenko, Ehsan Kamalloo, Martin Potthast, Matthias Hagen, Jimmy Lin:
Systematic Evaluation of Neural Retrieval Models on the Touché 2020 Argument Retrieval Subset of BEIR. SIGIR 2024: 1420-1430 - [c354]Ehsan Kamalloo, Nandan Thakur, Carlos Lassance, Xueguang Ma, Jheng-Hong Yang, Jimmy Lin:
Resources for Brewing BEIR: Reproducible Reference Models and Statistical Analyses. SIGIR 2024: 1431-1440 - [c353]Minghan Li, Honglei Zhuang, Kai Hui, Zhen Qin, Jimmy Lin, Rolf Jagerman, Xuanhui Wang, Michael Bendersky:
Can Query Expansion Improve Generalization of Strong Cross-Encoder Rankers? SIGIR 2024: 2321-2326 - [c352]Xueguang Ma, Liang Wang, Nan Yang, Furu Wei, Jimmy Lin:
Fine-Tuning LLaMA for Multi-Stage Text Retrieval. SIGIR 2024: 2421-2425 - [c351]Akintunde Oladipo, Mofetoluwa Adeyemi, Jimmy Lin:
On Backbones and Training Regimes for Dense Retrieval in African Languages. SIGIR 2024: 2564-2568 - [c350]Ehsan Kamalloo, Shivani Upadhyay, Jimmy Lin:
Towards Robust QA Evaluation via Open LLMs. SIGIR 2024: 2811-2816 - [c349]Shi Zong, Santosh Kolagati, Amit Chaudhary, Josh Seltzer, Jimmy Lin:
Reflections on the Coding Ability of LLMs for Analyzing Market Research Surveys. SIGIR 2024: 2900-2904 - [c348]Jasper Xian, Tommaso Teofili, Ronak Pradeep, Jimmy Lin:
Vector Search with OpenAI Embeddings: Lucene Is All You Need. WSDM 2024: 1090-1093 - [i179]Jimmy Lin, Junkai Li, Jiasi Gao, Weizhi Ma, Yang Liu:
Jointly Modeling Spatio-Temporal Features of Tactile Signals for Action Classification. CoRR abs/2404.15279 (2024) - [i178]Shengyao Zhuang, Xueguang Ma, Bevan Koopman, Jimmy Lin, Guido Zuccon:
PromptReps: Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval. CoRR abs/2404.18424 (2024) - [i177]Sheng-Chieh Lin, Luyu Gao, Barlas Oguz, Wenhan Xiong, Jimmy Lin, Wen-tau Yih, Xilun Chen:
FLAME: Factuality-Aware Alignment for Large Language Models. CoRR abs/2405.01525 (2024) - [i176]Shivani Upadhyay, Ehsan Kamalloo, Jimmy Lin:
LLMs Can Patch Up Missing Relevance Judgments in Evaluation. CoRR abs/2405.04727 (2024) - [i175]Sahel Sharifymoghaddam, Shivani Upadhyay, Wenhu Chen, Jimmy Lin:
UniRAG: Universal Retrieval Augmentation for Multi-Modal Large Language Models. CoRR abs/2405.10311 (2024) - [i174]Minghan Li, Xilun Chen, Ari Holtzman, Beidi Chen, Jimmy Lin, Wen-tau Yih, Xi Victoria Lin:
Nearest Neighbor Speculative Decoding for LLM Generation and Attribution. CoRR abs/2405.19325 (2024) - [i173]Shivani Upadhyay, Ronak Pradeep, Nandan Thakur, Nick Craswell, Jimmy Lin:
UMBRELA: UMbrela is the (Open-Source Reproduction of the) Bing RELevance Assessor. CoRR abs/2406.06519 (2024) - [i172]Raphael Tang, Xinyu Zhang, Lixinyu Xu, Yao Lu, Wenyan Li, Pontus Stenetorp, Jimmy Lin, Ferhan Ture:
Words Worth a Thousand Pictures: Measuring and Understanding Perceptual Variability in Text-to-Image Generation. CoRR abs/2406.08482 (2024) - [i171]Manveer Singh Tamber, Jasper Xian, Jimmy Lin:
Can't Hide Behind the API: Stealing Black-Box Commercial Embedding Models. CoRR abs/2406.09355 (2024) - [i170]Mohammad Dehghan, Mohammad Ali Alomrani, Sunyam Bagga, David Alfonso-Hermelo, Khalil Bibi, Abbas Ghaddar, Yingxue Zhang, Xiaoguang Li, Jianye Hao, Qun Liu, Jimmy Lin, Boxing Chen, Prasanna Parthasarathi, Mahdi Biparva, Mehdi Rezagholizadeh:
EWEK-QA: Enhanced Web and Efficient Knowledge Graph Retrieval for Citation-based Question Answering Systems. CoRR abs/2406.10393 (2024) - [i169]Xueguang Ma, Sheng-Chieh Lin, Minghan Li, Wenhu Chen, Jimmy Lin:
Unifying Multimodal Retrieval via Document Screenshot Embedding. CoRR abs/2406.11251 (2024) - [i168]Ronak Pradeep, Nandan Thakur, Sahel Sharifymoghaddam, Eric Zhang, Ryan Nguyen, Daniel Campos, Nick Craswell, Jimmy Lin:
Ragnarök: A Reusable RAG Framework and Baselines for TREC 2024 Retrieval-Augmented Generation Track. CoRR abs/2406.16828 (2024) - [i167]Shi Zong, Jimmy Lin:
Categorical Syllogisms Revisited: A Review of the Logical Reasoning Abilities of LLMs for Analyzing Categorical Syllogism. CoRR abs/2406.18762 (2024) - [i166]Nandan Thakur, Luiz Bonifacio, Maik Fröbe, Alexander Bondarenko, Ehsan Kamalloo, Martin Potthast, Matthias Hagen, Jimmy Lin:
Systematic Evaluation of Neural Retrieval Models on the Touché 2020 Argument Retrieval Subset of BEIR. CoRR abs/2407.07790 (2024) - [i165]Jheng-Hong Yang, Jimmy Lin:
Toward Automatic Relevance Judgment using Vision-Language Models for Image-Text Retrieval Evaluation. CoRR abs/2408.01363 (2024) - [i164]Ronak Pradeep, Daniel Lee, Ali Mousavi, Jeff Pound, Yisi Sang, Jimmy Lin, Ihab F. Ilyas, Saloni Potdar, Mostafa Arefiyan, Yunyao Li:
ConvKGYarn: Spinning Configurable and Scalable Conversational Knowledge Graph QA datasets with Large Language Models. CoRR abs/2408.05948 (2024) - 2023
- [j65]Sheng-Chieh Lin, Minghan Li, Jimmy Lin:
Aggretriever: A Simple Approach to Aggregate Textual Representations for Robust Dense Passage Retrieval. Trans. Assoc. Comput. Linguistics 11: 436-452 (2023) - [j64]Xinyu Zhang, Nandan Thakur, Odunayo Ogundepo, Ehsan Kamalloo, David Alfonso-Hermelo, Xiaoguang Li, Qun Liu, Mehdi Rezagholizadeh, Jimmy Lin:
MIRACL: A Multilingual Retrieval Dataset Covering 18 Diverse Languages. Trans. Assoc. Comput. Linguistics 11: 1114-1131 (2023) - [j63]Joel Mackenzie, Andrew Trotman, Jimmy Lin:
Efficient Document-at-a-time and Score-at-a-time Query Evaluation for Learned Sparse Representations. ACM Trans. Inf. Syst. 41(4): 96:1-96:28 (2023) - [j62]Sheng-Chieh Lin, Jimmy Lin:
A Dense Representation Framework for Lexical and Semantic Matching. ACM Trans. Inf. Syst. 41(4): 110:1-110:29 (2023) - [c347]Ehsan Kamalloo, Xinyu Zhang, Odunayo Ogundepo, Nandan Thakur, David Alfonso-Hermelo, Mehdi Rezagholizadeh, Jimmy Lin:
Evaluating Embedding APIs for Information Retrieval. ACL (industry) 2023: 518-526 - [c346]Aleksandra Piktus, Odunayo Ogundepo, Christopher Akiki, Akintunde Oladipo, Xinyu Zhang, Hailey Schoelkopf, Stella Biderman, Martin Potthast, Jimmy Lin:
GAIA Search: Hugging Face and Pyserini Interoperability for NLP Training Data Exploration. ACL (demo) 2023: 588-598 - [c345]Luyu Gao, Xueguang Ma, Jimmy Lin, Jamie Callan:
Precise Zero-Shot Dense Retrieval without Relevance Labels. ACL (1) 2023: 1762-1777 - [c344]Ji Xin, Raphael Tang, Zhiying Jiang, Yaoliang Yu, Jimmy Lin:
Operator Selection and Ordering in a Pipeline Approach to Efficiency Optimizations for Transformers. ACL (Findings) 2023: 2870-2882 - [c343]Raphael Tang, Linqing Liu, Akshat Pandey, Zhiying Jiang, Gefei Yang, Karun Kumar, Pontus Stenetorp, Jimmy Lin, Ferhan Ture:
What the DAAM: Interpreting Stable Diffusion Using Cross Attention. ACL (1) 2023: 5644-5659 - [c342]Zhiying Jiang, Matthew Y. R. Yang, Mikhail Tsirlin, Raphael Tang, Yiqin Dai, Jimmy Lin:
"Low-Resource" Text Classification: A Parameter-Free Classification Method with Compressors. ACL (Findings) 2023: 6810-6828 - [c341]Minghan Li, Sheng-Chieh Lin, Barlas Oguz, Asish Ghoshal, Jimmy Lin, Yashar Mehdad, Wen-tau Yih, Xilun Chen:
CITADEL: Conditional Token Interaction via Dynamic Lexical Routing for Efficient and Effective Multi-Vector Retrieval. ACL (1) 2023: 11891-11907 - [c340]Xueguang Ma, Tommaso Teofili, Jimmy Lin:
Anserini Gets Dense Retrieval: Integration of Lucene's HNSW Indexes. CIKM 2023: 5366-5370 - [c339]Wei Zhong, Yuqing Xie, Jimmy Lin:
Answer Retrieval for Math Questions Using Structural and Dense Retrieval. CLEF 2023: 209-223 - [c338]Ronak Pradeep, Haonan Chen, Lingwei Gu, Manveer Singh Tamber, Jimmy Lin:
PyGaggle: A Gaggle of Resources for Open-Domain Question Answering. ECIR (3) 2023: 148-162 - [c337]Manveer Singh Tamber, Ronak Pradeep, Jimmy Lin:
Pre-processing Matters! Improved Wikipedia Corpora for Open-Domain Question Answering. ECIR (3) 2023: 163-176 - [c336]Christopher Akiki, Odunayo Ogundepo, Aleksandra Piktus, Xinyu Zhang, Akintunde Oladipo, Jimmy Lin, Martin Potthast:
Spacerini: Plug-and-play Search Engines with Pyserini and Hugging Face. EMNLP (Demos) 2023: 140-148 - [c335]Akintunde Oladipo, Mofetoluwa Adeyemi, Orevaoghene Ahia, Abraham Toluwase Owodunni, Odunayo Ogundepo, David Ifeoluwa Adelani, Jimmy Lin:
Better Quality Pre-training Data and T5 Models for African Languages. EMNLP 2023: 158-168 - [c334]Ronak Pradeep, Kai Hui, Jai Gupta, Ádám D. Lelkes, Honglei Zhuang, Jimmy Lin, Donald Metzler, Vinh Q. Tran:
How Does Generative Retrieval Scale to Millions of Passages? EMNLP 2023: 1305-1321 - [c333]Sheng-Chieh Lin, Akari Asai, Minghan Li, Barlas Oguz, Jimmy Lin, Yashar Mehdad, Wen-tau Yih, Xilun Chen:
How to Train Your Dragon: Diverse Augmentation Towards Generalizable Dense Retrieval. EMNLP (Findings) 2023: 6385-6400 - [c332]Sheng-Chieh Lin, Amin Ahmad, Jimmy Lin:
mAggretriever: A Simple yet Effective Approach to Zero-Shot Multilingual Dense Retrieval. EMNLP 2023: 11688-11696 - [c331]Mofetoluwa Adeyemi, Akintunde Oladipo, Xinyu Zhang, David Alfonso-Hermelo, Mehdi Rezagholizadeh, Boxing Chen, Jimmy Lin:
CIRAL at FIRE 2023: Cross-Lingual Information Retrieval for African Languages. FIRE 2023: 4-6 - [c330]Mofetoluwa Adeyemi, Akintunde Oladipo, Xinyu Crystina Zhang, David Alfonso-Hermelo, Mehdi Rezagholizadeh, Boxing Chen, Jimmy Lin:
Overview of the CIRAL Track at FIRE 2023: Cross-lingual Information Retrieval for African Languages. FIRE (Working Notes) 2023: 118-136 - [c329]Wei Zhong, Sheng-Chieh Lin, Jheng-Hong Yang, Jimmy Lin:
One Blade for One Purpose: Advancing Math Information Retrieval using Hybrid Search. SIGIR 2023: 141-151 - [c328]Minghan Li, Sheng-Chieh Lin, Xueguang Ma, Jimmy Lin:
SLIM: Sparsified Late Interaction for Multi-Vector Retrieval with Inverted Indexes. SIGIR 2023: 1954-1959 - [c327]Chris Kamphuis, Aileen Lin, Siwen Yang, Jimmy Lin, Arjen P. de Vries, Faegheh Hasibi:
MMEAD: MS MARCO Entity Annotations and Disambiguations. SIGIR 2023: 2817-2825 - [c326]Nandan Thakur, Kexin Wang, Iryna Gurevych, Jimmy Lin:
SPRINT: A Unified Toolkit for Evaluating and Demystifying Zero-shot Neural Sparse Retrieval. SIGIR 2023: 2964-2974 - [c325]Jheng-Hong Yang, Carlos Lassance, Rafael Sampaio de Rezende, Krishna Srinivasan, Miriam Redi, Stéphane Clinchant, Jimmy Lin:
AToMiC: An Image/Text Retrieval Test Collection to Support Multimedia Content Creation. SIGIR 2023: 2975-2984 - [c324]Luyu Gao, Xueguang Ma, Jimmy Lin, Jamie Callan:
Tevatron: An Efficient and Flexible Toolkit for Neural Retrieval. SIGIR 2023: 3120-3124 - [c323]Xueguang Ma, Hengxin Fun, Xusen Yin, Antonio Mallia, Jimmy Lin:
Enhancing Sparse Retrieval via Unsupervised Learning. SIGIR-AP 2023: 150-157 - [i163]Shi Zong, Josh Seltzer, Jiahua Pan, Kathy Cheng, Jimmy Lin:
Which Model Shall I Choose? Cost/Quality Trade-offs for Text Classification Tasks. CoRR abs/2301.07006 (2023) - [i162]Minghan Li, Sheng-Chieh Lin, Xueguang Ma, Jimmy Lin:
SLIM: Sparsified Late Interaction for Multi-Vector Retrieval with Inverted Indexes. CoRR abs/2302.06587 (2023) - [i161]Xinyu Zhang, Minghan Li, Jimmy Lin:
Improving Out-of-Distribution Generalization of Neural Rerankers with Contextualized Late Interaction. CoRR abs/2302.06589 (2023) - [i160]Sheng-Chieh Lin, Akari Asai, Minghan Li, Barlas Oguz, Jimmy Lin, Yashar Mehdad, Wen-tau Yih, Xilun Chen:
How to Train Your DRAGON: Diverse Augmentation Towards Generalizable Dense Retrieval. CoRR abs/2302.07452 (2023) - [i159]Christopher Akiki, Odunayo Ogundepo, Aleksandra Piktus, Xinyu Zhang, Akintunde Oladipo, Jimmy Lin, Martin Potthast:
Spacerini: Plug-and-play Search Engines with Pyserini and Hugging Face. CoRR abs/2302.14534 (2023) - [i158]Jimmy Lin, David Alfonso-Hermelo, Vitor Jeronymo, Ehsan Kamalloo, Carlos Lassance, Rodrigo Frassetto Nogueira, Odunayo Ogundepo, Mehdi Rezagholizadeh, Nandan Thakur, Jheng-Hong Yang, Xinyu Zhang:
Simple Yet Effective Neural Ranking and Reranking Baselines for Cross-Lingual Information Retrieval. CoRR abs/2304.01019 (2023) - [i157]Jheng-Hong Yang, Carlos Lassance, Rafael Sampaio de Rezende, Krishna Srinivasan, Miriam Redi, Stéphane Clinchant, Jimmy Lin:
AToMiC: An Image/Text Retrieval Test Collection to Support Multimedia Content Creation. CoRR abs/2304.01961 (2023) - [i156]Xueguang Ma, Tommaso Teofili, Jimmy Lin:
Anserini Gets Dense Retrieval: Integration of Lucene's HNSW Indexes. CoRR abs/2304.12139 (2023) - [i155]Xueguang Ma, Xinyu Zhang, Ronak Pradeep, Jimmy Lin:
Zero-Shot Listwise Document Reranking with a Large Language Model. CoRR abs/2305.02156 (2023) - [i154]Ehsan Kamalloo, Xinyu Zhang, Odunayo Ogundepo, Nandan Thakur, David Alfonso-Hermelo, Mehdi Rezagholizadeh, Jimmy Lin:
Evaluating Embedding APIs for Information Retrieval. CoRR abs/2305.06300 (2023) - [i153]Josh Seltzer, Jiahua Pan, Kathy Cheng, Yuxiao Sun, Santosh Kolagati, Jimmy Lin, Shi Zong:
SmartProbe: A Virtual Moderator for Market Research Surveys. CoRR abs/2305.08271 (2023) - [i152]Ronak Pradeep, Kai Hui, Jai Gupta, Ádám Dániel Lelkes, Honglei Zhuang, Jimmy Lin, Donald Metzler, Vinh Q. Tran:
How Does Generative Retrieval Scale to Millions of Passages? CoRR abs/2305.11841 (2023) - [i151]Vanessa Liao, Syed Shariyar Murtaza, Yifan Nie, Jimmy Lin:
Regex-augmented Domain Transfer Topic Classification based on a Pre-trained Language Model: An application in Financial Domain. CoRR abs/2305.18324 (2023) - [i150]Aleksandra Piktus, Odunayo Ogundepo, Christopher Akiki, Akintunde Oladipo, Xinyu Zhang, Hailey Schoelkopf, Stella Biderman, Martin Potthast, Jimmy Lin:
GAIA Search: Hugging Face and Pyserini Interoperability for NLP Training Data Exploration. CoRR abs/2306.01481 (2023) - [i149]Ehsan Kamalloo, Nandan Thakur, Carlos Lassance, Xueguang Ma, Jheng-Hong Yang, Jimmy Lin:
Resources for Brewing BEIR: Reproducible Reference Models and an Official Leaderboard. CoRR abs/2306.07471 (2023) - [i148]Nandan Thakur, Kexin Wang, Iryna Gurevych, Jimmy Lin:
SPRINT: A Unified Toolkit for Evaluating and Demystifying Zero-shot Neural Sparse Retrieval. CoRR abs/2307.10488 (2023) - [i147]Ehsan Kamalloo, Aref Jafari, Xinyu Zhang, Nandan Thakur, Jimmy Lin:
HAGRID: A Human-LLM Collaborative Dataset for Generative Information-Seeking with Attribution. CoRR abs/2307.16883 (2023) - [i146]Cynthia Huang, Yuqing Xie, Zhiying Jiang, Jimmy Lin, Ming Li:
Approximating Human-Like Few-shot Learning with GPT-based Compression. CoRR abs/2308.06942 (2023) - [i145]Jimmy Lin, Ronak Pradeep, Tommaso Teofili, Jasper Xian:
Vector Search with OpenAI Embeddings: Lucene Is All You Need. CoRR abs/2308.14963 (2023) - [i144]Zijun Wu, Anup Anand Deshmukh, Yongkang Wu, Jimmy Lin, Lili Mou:
Unsupervised Chunking with Hierarchical RNN. CoRR abs/2309.04919 (2023) - [i143]Chris Kamphuis, Aileen Lin, Siwen Yang, Jimmy Lin, Arjen P. de Vries, Faegheh Hasibi:
MMEAD: MS MARCO Entity Annotations and Disambiguations. CoRR abs/2309.07574 (2023) - [i142]Ronak Pradeep, Sahel Sharifymoghaddam, Jimmy Lin:
RankVicuna: Zero-Shot Listwise Document Reranking with Open-Source Large Language Models. CoRR abs/2309.15088 (2023) - [i141]Raphael Tang, Xinyu Zhang, Xueguang Ma, Jimmy Lin, Ferhan Ture:
Found in the Middle: Permutation Self-Consistency Improves Listwise Ranking in Large Language Models. CoRR abs/2310.07712 (2023) - [i140]Xueguang Ma, Liang Wang, Nan Yang, Furu Wei, Jimmy Lin:
Fine-Tuning LLaMA for Multi-Stage Text Retrieval. CoRR abs/2310.08319 (2023) - [i139]Nandan Thakur, Jianmo Ni, Gustavo Hernández Ábrego, John Wieting, Jimmy Lin, Daniel Cer:
Leveraging LLMs for Synthesizing Training Data Across Many Languages in Multilingual Dense Retrieval. CoRR abs/2311.05800 (2023) - [i138]Minghan Li, Honglei Zhuang, Kai Hui, Zhen Qin, Jimmy Lin, Rolf Jagerman, Xuanhui Wang, Michael Bendersky:
Generate, Filter, and Fuse: Query Expansion via Multi-Step Keyword Generation for Zero-Shot Neural Rankers. CoRR abs/2311.09175 (2023) - [i137]Haonan Chen, Carlos Lassance, Jimmy Lin:
End-to-End Retrieval with Learned Dense and Sparse Representations Using Lucene. CoRR abs/2311.18503 (2023) - [i136]Raphael Tang, Xinyu Zhang, Jimmy Lin, Ferhan Ture:
What Do Llamas Really Think? Revealing Preference Biases in Language Model Representations. CoRR abs/2311.18812 (2023) - [i135]Jimmy Lin, Tommaso Teofili:
Searching Dense Representations with Inverted Indexes. CoRR abs/2312.01556 (2023) - [i134]Ronak Pradeep, Sahel Sharifymoghaddam, Jimmy Lin:
RankZephyr: Effective and Robust Zero-Shot Listwise Reranking is a Breeze! CoRR abs/2312.02724 (2023) - [i133]Xinyu Zhang, Sebastian Hofstätter, Patrick S. H. Lewis, Raphael Tang, Jimmy Lin:
Rank-without-GPT: Building GPT-Independent Listwise Rerankers on Open-Source Large Language Models. CoRR abs/2312.02969 (2023) - [i132]Nandan Thakur, Luiz Bonifacio, Xinyu Zhang, Odunayo Ogundepo, Ehsan Kamalloo, David Alfonso-Hermelo, Xiaoguang Li, Qun Liu, Boxing Chen, Mehdi Rezagholizadeh, Jimmy Lin:
NoMIRACL: Knowing When You Don't Know for Robust Multilingual Retrieval-Augmented Generation. CoRR abs/2312.11361 (2023) - [i131]Manveer Singh Tamber, Ronak Pradeep, Jimmy Lin:
Scaling Down, LiTting Up: Efficient Zero-Shot Listwise Reranking with Seq2seq Encoder-Decoder Models. CoRR abs/2312.16098 (2023) - [i130]Mofetoluwa Adeyemi, Akintunde Oladipo, Ronak Pradeep, Jimmy Lin:
Zero-Shot Cross-Lingual Reranking with Large Language Models for Low-Resource Languages. CoRR abs/2312.16159 (2023) - 2022
- [c322]Sankeerth Durvasula, Raymond Kiguru, Samarth Mathur, Jenny Xu, Jimmy Lin, Nandita Vijaykumar:
VoxelCache: Accelerating Online Mapping in Robotics and 3D Reconstruction Tasks. PACT 2022: 239-251 - [c321]Hang Li, Shengyao Zhuang, Xueguang Ma, Jimmy Lin, Guido Zuccon:
Pseudo-Relevance Feedback with Dense Retrievers in Pyserini. ADCS 2022: 1:1-1:6 - [c320]Wei Zhong, Yuqing Xie, Jimmy Lin:
Applying Structural and Dense Semantic Matching for the ARQMath Lab 2022, CLEF. CLEF (Working Notes) 2022: 147-170 - [c319]Chris Kamphuis, Faegheh Hasibi, Jimmy Lin, Arjen P. de Vries:
REBL: Entity Linking at Scale (prototype). DESIRES 2022: 68-75 - [c318]Hang Li, Shengyao Zhuang, Ahmed Mourad, Xueguang Ma, Jimmy Lin, Guido Zuccon:
Improving Query Representations for Dense Retrieval with Pseudo Relevance Feedback: A Reproducibility Study. ECIR (1) 2022: 599-612 - [c317]Xueguang Ma, Kai Sun, Ronak Pradeep, Minghan Li, Jimmy Lin:
Another Look at DPR: Reproduction of Training and Replication of Retrieval. ECIR (1) 2022: 613-626 - [c316]Ronak Pradeep, Yuqi Liu, Xinyu Zhang, Yilin Li, Andrew Yates, Jimmy Lin:
Squeezing Water from a Stone: A Bag of Tricks for Further Improving Cross-Encoder Effectiveness for Reranking. ECIR (1) 2022: 655-670 - [c315]Raphael Tang, Karun Kumar, Gefei Yang, Akshat Pandey, Yajie Mao, Vladislav Belyaev, Madhuri Emmadi, G. Craig Murray, Ferhan Ture, Jimmy Lin:
SpeechNet: Weakly Supervised, End-to-End Speech Recognition at Industrial Scale. EMNLP (Industry Track) 2022: 285-293 - [c314]Minghan Li, Xinyu Zhang, Ji Xin, Hongyang Zhang, Jimmy Lin:
Certified Error Control of Candidate Set Pruning for Two-Stage Relevance Ranking. EMNLP 2022: 333-345 - [c313]Yizhen Zhong, Jiajie Xiao, Thomas Vetterli, Mahan Matin, Ellen Loo, Jimmy Lin, Richard Bourgon, Ofer Shapira:
Improving Precancerous Case Characterization via Transformer-based Ensemble Learning. EMNLP (Industry Track) 2022: 379-389 - [c312]Wei Zhong, Jheng-Hong Yang, Yuqing Xie, Jimmy Lin:
Evaluating Token-Level and Passage-Level Dense Retrieval Models for Math Information Retrieval. EMNLP (Findings) 2022: 1092-1102 - [c311]Peng Shi, Rui Zhang, He Bai, Jimmy Lin:
XRICL: Cross-lingual Retrieval-Augmented In-Context Learning for Cross-lingual Text-to-SQL Semantic Parsing. EMNLP (Findings) 2022: 5248-5259 - [c310]Peng Shi, Linfeng Song, Lifeng Jin, Haitao Mi, He Bai, Jimmy Lin, Dong Yu:
Cross-lingual Text-to-SQL Semantic Parsing with Representation Mixup. EMNLP (Findings) 2022: 5296-5306 - [c309]Odunayo Ogundepo, Xinyu Zhang, Shuo Sun, Kevin Duh, Jimmy Lin:
AfriCLIRMatrix: Enabling Cross-Lingual Information Retrieval for African Languages. EMNLP 2022: 8721-8728 - [c308]Raphael Tang, Karun Kumar, Ji Xin, Piyush Vyas, Wenyan Li, Gefei Yang, Yajie Mao, G. Craig Murray, Jimmy Lin:
Temporal Early Exiting for Streaming Speech Commands Recognition. ICASSP 2022: 7567-7571 - [c307]Matthew Y. R. Yang, Siwen Yang, Jimmy Lin:
Integration of text and geospatial search for hydrographic datasets using the lucene search library. JCDL 2022: 36 - [c306]Zhiying Jiang, Yiqin Dai, Ji Xin, Ming Li, Jimmy Lin:
Few-Shot Non-Parametric Learning with Deep Latent Variable Model. NeurIPS 2022 - [c305]