


Остановите войну!
for scientists:


default search action
Jimmy Lin
Person information

- affiliation: University of Waterloo, David R. Cheriton School of Computer Science
- affiliation: Twitter Inc., San Francisco, USA
- affiliation: University of Maryland, College Park, Institute for Advanced Computer Studies (UMIACS)
- affiliation: Massachusetts Institute of Technology (MIT), Artificial Intelligence Laboratory
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2023
- [j63]Joel Mackenzie
, Andrew Trotman
, Jimmy Lin
:
Efficient Document-at-a-time and Score-at-a-time Query Evaluation for Learned Sparse Representations. ACM Trans. Inf. Syst. 41(4): 96:1-96:28 (2023) - [j62]Sheng-Chieh Lin
, Jimmy Lin
:
A Dense Representation Framework for Lexical and Semantic Matching. ACM Trans. Inf. Syst. 41(4): 110:1-110:29 (2023) - [c341]Ehsan Kamalloo, Xinyu Zhang, Odunayo Ogundepo, Nandan Thakur, David Alfonso-Hermelo, Mehdi Rezagholizadeh, Jimmy Lin:
Evaluating Embedding APIs for Information Retrieval. ACL (industry) 2023: 518-526 - [c340]Aleksandra Piktus, Odunayo Ogundepo, Christopher Akiki, Akintunde Oladipo, Xinyu Zhang, Hailey Schoelkopf, Stella Biderman, Martin Potthast, Jimmy Lin:
GAIA Search: Hugging Face and Pyserini Interoperability for NLP Training Data Exploration. ACL (demo) 2023: 588-598 - [c339]Luyu Gao, Xueguang Ma, Jimmy Lin, Jamie Callan:
Precise Zero-Shot Dense Retrieval without Relevance Labels. ACL (1) 2023: 1762-1777 - [c338]Ji Xin, Raphael Tang, Zhiying Jiang, Yaoliang Yu, Jimmy Lin:
Operator Selection and Ordering in a Pipeline Approach to Efficiency Optimizations for Transformers. ACL (Findings) 2023: 2870-2882 - [c337]Raphael Tang, Linqing Liu, Akshat Pandey, Zhiying Jiang, Gefei Yang, Karun Kumar, Pontus Stenetorp, Jimmy Lin, Ferhan Ture:
What the DAAM: Interpreting Stable Diffusion Using Cross Attention. ACL (1) 2023: 5644-5659 - [c336]Zhiying Jiang, Matthew Y. R. Yang, Mikhail Tsirlin, Raphael Tang, Yiqin Dai, Jimmy Lin:
"Low-Resource" Text Classification: A Parameter-Free Classification Method with Compressors. ACL (Findings) 2023: 6810-6828 - [c335]Minghan Li
, Sheng-Chieh Lin, Barlas Oguz, Asish Ghoshal, Jimmy Lin, Yashar Mehdad, Wen-tau Yih, Xilun Chen:
CITADEL: Conditional Token Interaction via Dynamic Lexical Routing for Efficient and Effective Multi-Vector Retrieval. ACL (1) 2023: 11891-11907 - [c334]Xueguang Ma
, Tommaso Teofili
, Jimmy Lin
:
Anserini Gets Dense Retrieval: Integration of Lucene's HNSW Indexes. CIKM 2023: 5366-5370 - [c333]Wei Zhong
, Yuqing Xie
, Jimmy Lin
:
Answer Retrieval for Math Questions Using Structural and Dense Retrieval. CLEF 2023: 209-223 - [c332]Ronak Pradeep, Haonan Chen, Lingwei Gu, Manveer Singh Tamber, Jimmy Lin:
PyGaggle: A Gaggle of Resources for Open-Domain Question Answering. ECIR (3) 2023: 148-162 - [c331]Manveer Singh Tamber, Ronak Pradeep, Jimmy Lin:
Pre-processing Matters! Improved Wikipedia Corpora for Open-Domain Question Answering. ECIR (3) 2023: 163-176 - [c330]Christopher Akiki, Odunayo Ogundepo, Aleksandra Piktus, Xinyu Zhang, Akintunde Oladipo, Jimmy Lin, Martin Potthast:
Spacerini: Plug-and-play Search Engines with Pyserini and Hugging Face. EMNLP (Demos) 2023: 140-148 - [c329]Wei Zhong
, Sheng-Chieh Lin
, Jheng-Hong Yang
, Jimmy Lin
:
One Blade for One Purpose: Advancing Math Information Retrieval using Hybrid Search. SIGIR 2023: 141-151 - [c328]Minghan Li
, Sheng-Chieh Lin
, Xueguang Ma
, Jimmy Lin
:
SLIM: Sparsified Late Interaction for Multi-Vector Retrieval with Inverted Indexes. SIGIR 2023: 1954-1959 - [c327]Chris Kamphuis
, Aileen Lin
, Siwen Yang
, Jimmy Lin
, Arjen P. de Vries
, Faegheh Hasibi
:
MMEAD: MS MARCO Entity Annotations and Disambiguations. SIGIR 2023: 2817-2825 - [c326]Nandan Thakur
, Kexin Wang
, Iryna Gurevych
, Jimmy Lin
:
SPRINT: A Unified Toolkit for Evaluating and Demystifying Zero-shot Neural Sparse Retrieval. SIGIR 2023: 2964-2974 - [c325]Jheng-Hong Yang
, Carlos Lassance
, Rafael Sampaio de Rezende
, Krishna Srinivasan
, Miriam Redi
, Stéphane Clinchant
, Jimmy Lin
:
AToMiC: An Image/Text Retrieval Test Collection to Support Multimedia Content Creation. SIGIR 2023: 2975-2984 - [c324]Luyu Gao
, Xueguang Ma
, Jimmy Lin
, Jamie Callan
:
Tevatron: An Efficient and Flexible Toolkit for Neural Retrieval. SIGIR 2023: 3120-3124 - [c323]Xueguang Ma
, Hengxin Fun
, Xusen Yin
, Antonio Mallia
, Jimmy Lin
:
Enhancing Sparse Retrieval via Unsupervised Learning. SIGIR-AP 2023: 150-157 - [i157]Shi Zong, Josh Seltzer, Jiahua Pan, Kathy Cheng, Jimmy Lin:
Which Model Shall I Choose? Cost/Quality Trade-offs for Text Classification Tasks. CoRR abs/2301.07006 (2023) - [i156]Minghan Li, Sheng-Chieh Lin, Xueguang Ma, Jimmy Lin:
SLIM: Sparsified Late Interaction for Multi-Vector Retrieval with Inverted Indexes. CoRR abs/2302.06587 (2023) - [i155]Xinyu Zhang, Minghan Li, Jimmy Lin:
Improving Out-of-Distribution Generalization of Neural Rerankers with Contextualized Late Interaction. CoRR abs/2302.06589 (2023) - [i154]Sheng-Chieh Lin, Akari Asai, Minghan Li, Barlas Oguz, Jimmy Lin, Yashar Mehdad, Wen-tau Yih, Xilun Chen:
How to Train Your DRAGON: Diverse Augmentation Towards Generalizable Dense Retrieval. CoRR abs/2302.07452 (2023) - [i153]Christopher Akiki, Odunayo Ogundepo, Aleksandra Piktus, Xinyu Zhang, Akintunde Oladipo, Jimmy Lin, Martin Potthast:
Spacerini: Plug-and-play Search Engines with Pyserini and Hugging Face. CoRR abs/2302.14534 (2023) - [i152]Jimmy Lin, David Alfonso-Hermelo, Vitor Jeronymo, Ehsan Kamalloo, Carlos Lassance, Rodrigo Frassetto Nogueira, Odunayo Ogundepo, Mehdi Rezagholizadeh, Nandan Thakur, Jheng-Hong Yang, Xinyu Zhang:
Simple Yet Effective Neural Ranking and Reranking Baselines for Cross-Lingual Information Retrieval. CoRR abs/2304.01019 (2023) - [i151]Jheng-Hong Yang, Carlos Lassance, Rafael Sampaio de Rezende, Krishna Srinivasan, Miriam Redi, Stéphane Clinchant, Jimmy Lin:
AToMiC: An Image/Text Retrieval Test Collection to Support Multimedia Content Creation. CoRR abs/2304.01961 (2023) - [i150]Xueguang Ma, Tommaso Teofili, Jimmy Lin:
Anserini Gets Dense Retrieval: Integration of Lucene's HNSW Indexes. CoRR abs/2304.12139 (2023) - [i149]Xueguang Ma, Xinyu Zhang, Ronak Pradeep, Jimmy Lin:
Zero-Shot Listwise Document Reranking with a Large Language Model. CoRR abs/2305.02156 (2023) - [i148]Ehsan Kamalloo, Xinyu Zhang, Odunayo Ogundepo, Nandan Thakur, David Alfonso-Hermelo, Mehdi Rezagholizadeh, Jimmy Lin:
Evaluating Embedding APIs for Information Retrieval. CoRR abs/2305.06300 (2023) - [i147]Josh Seltzer, Jiahua Pan, Kathy Cheng, Yuxiao Sun, Santosh Kolagati, Jimmy Lin, Shi Zong:
SmartProbe: A Virtual Moderator for Market Research Surveys. CoRR abs/2305.08271 (2023) - [i146]Ronak Pradeep, Kai Hui, Jai Gupta, Ádám Dániel Lelkes, Honglei Zhuang, Jimmy Lin, Donald Metzler, Vinh Q. Tran:
How Does Generative Retrieval Scale to Millions of Passages? CoRR abs/2305.11841 (2023) - [i145]Vanessa Liao, Syed Shariyar Murtaza, Yifan Nie, Jimmy Lin:
Regex-augmented Domain Transfer Topic Classification based on a Pre-trained Language Model: An application in Financial Domain. CoRR abs/2305.18324 (2023) - [i144]Aleksandra Piktus, Odunayo Ogundepo, Christopher Akiki, Akintunde Oladipo, Xinyu Zhang, Hailey Schoelkopf, Stella Biderman, Martin Potthast, Jimmy Lin:
GAIA Search: Hugging Face and Pyserini Interoperability for NLP Training Data Exploration. CoRR abs/2306.01481 (2023) - [i143]Ehsan Kamalloo, Nandan Thakur, Carlos Lassance, Xueguang Ma, Jheng-Hong Yang, Jimmy Lin:
Resources for Brewing BEIR: Reproducible Reference Models and an Official Leaderboard. CoRR abs/2306.07471 (2023) - [i142]Nandan Thakur, Kexin Wang, Iryna Gurevych, Jimmy Lin:
SPRINT: A Unified Toolkit for Evaluating and Demystifying Zero-shot Neural Sparse Retrieval. CoRR abs/2307.10488 (2023) - [i141]Ehsan Kamalloo, Aref Jafari, Xinyu Zhang, Nandan Thakur, Jimmy Lin:
HAGRID: A Human-LLM Collaborative Dataset for Generative Information-Seeking with Attribution. CoRR abs/2307.16883 (2023) - [i140]Cynthia Huang, Yuqing Xie, Zhiying Jiang, Jimmy Lin, Ming Li:
Approximating Human-Like Few-shot Learning with GPT-based Compression. CoRR abs/2308.06942 (2023) - [i139]Jimmy Lin, Ronak Pradeep, Tommaso Teofili, Jasper Xian:
Vector Search with OpenAI Embeddings: Lucene Is All You Need. CoRR abs/2308.14963 (2023) - [i138]Zijun Wu, Anup Anand Deshmukh, Yongkang Wu, Jimmy Lin, Lili Mou:
Unsupervised Chunking with Hierarchical RNN. CoRR abs/2309.04919 (2023) - [i137]Chris Kamphuis, Aileen Lin, Siwen Yang, Jimmy Lin, Arjen P. de Vries, Faegheh Hasibi:
MMEAD: MS MARCO Entity Annotations and Disambiguations. CoRR abs/2309.07574 (2023) - [i136]Ronak Pradeep, Sahel Sharifymoghaddam, Jimmy Lin:
RankVicuna: Zero-Shot Listwise Document Reranking with Open-Source Large Language Models. CoRR abs/2309.15088 (2023) - [i135]Raphael Tang, Xinyu Zhang, Xueguang Ma, Jimmy Lin, Ferhan Ture:
Found in the Middle: Permutation Self-Consistency Improves Listwise Ranking in Large Language Models. CoRR abs/2310.07712 (2023) - [i134]Xueguang Ma, Liang Wang, Nan Yang, Furu Wei, Jimmy Lin:
Fine-Tuning LLaMA for Multi-Stage Text Retrieval. CoRR abs/2310.08319 (2023) - [i133]Nandan Thakur, Jianmo Ni, Gustavo Hernández Ábrego, John Wieting, Jimmy Lin, Daniel Cer:
Leveraging LLMs for Synthesizing Training Data Across Many Languages in Multilingual Dense Retrieval. CoRR abs/2311.05800 (2023) - [i132]Minghan Li, Honglei Zhuang, Kai Hui, Zhen Qin, Jimmy Lin, Rolf Jagerman, Xuanhui Wang, Michael Bendersky:
Generate, Filter, and Fuse: Query Expansion via Multi-Step Keyword Generation for Zero-Shot Neural Rankers. CoRR abs/2311.09175 (2023) - [i131]Haonan Chen, Carlos Lassance, Jimmy Lin:
End-to-End Retrieval with Learned Dense and Sparse Representations Using Lucene. CoRR abs/2311.18503 (2023) - [i130]Raphael Tang, Xinyu Zhang, Jimmy Lin, Ferhan Ture:
What Do Llamas Really Think? Revealing Preference Biases in Language Model Representations. CoRR abs/2311.18812 (2023) - 2022
- [c322]Sankeerth Durvasula, Raymond Kiguru, Samarth Mathur, Jenny Xu, Jimmy Lin, Nandita Vijaykumar:
VoxelCache: Accelerating Online Mapping in Robotics and 3D Reconstruction Tasks. PACT 2022: 239-251 - [c321]Hang Li
, Shengyao Zhuang
, Xueguang Ma
, Jimmy Lin
, Guido Zuccon
:
Pseudo-Relevance Feedback with Dense Retrievers in Pyserini. ADCS 2022: 1:1-1:6 - [c320]Wei Zhong, Yuqing Xie, Jimmy Lin:
Applying Structural and Dense Semantic Matching for the ARQMath Lab 2022, CLEF. CLEF (Working Notes) 2022: 147-170 - [c319]Chris Kamphuis, Faegheh Hasibi, Jimmy Lin, Arjen P. de Vries:
REBL: Entity Linking at Scale (prototype). DESIRES 2022: 68-75 - [c318]Hang Li
, Shengyao Zhuang
, Ahmed Mourad
, Xueguang Ma, Jimmy Lin
, Guido Zuccon
:
Improving Query Representations for Dense Retrieval with Pseudo Relevance Feedback: A Reproducibility Study. ECIR (1) 2022: 599-612 - [c317]Xueguang Ma, Kai Sun, Ronak Pradeep, Minghan Li
, Jimmy Lin:
Another Look at DPR: Reproduction of Training and Replication of Retrieval. ECIR (1) 2022: 613-626 - [c316]Ronak Pradeep, Yuqi Liu, Xinyu Zhang, Yilin Li, Andrew Yates, Jimmy Lin:
Squeezing Water from a Stone: A Bag of Tricks for Further Improving Cross-Encoder Effectiveness for Reranking. ECIR (1) 2022: 655-670 - [c315]Raphael Tang, Karun Kumar, Gefei Yang, Akshat Pandey, Yajie Mao, Vladislav Belyaev, Madhuri Emmadi, G. Craig Murray, Ferhan Ture, Jimmy Lin:
SpeechNet: Weakly Supervised, End-to-End Speech Recognition at Industrial Scale. EMNLP (Industry Track) 2022: 285-293 - [c314]Minghan Li
, Xinyu Zhang, Ji Xin, Hongyang Zhang, Jimmy Lin:
Certified Error Control of Candidate Set Pruning for Two-Stage Relevance Ranking. EMNLP 2022: 333-345 - [c313]Yizhen Zhong, Jiajie Xiao, Thomas Vetterli, Mahan Matin, Ellen Loo, Jimmy Lin, Richard Bourgon, Ofer Shapira:
Improving Precancerous Case Characterization via Transformer-based Ensemble Learning. EMNLP (Industry Track) 2022: 379-389 - [c312]Wei Zhong, Jheng-Hong Yang, Yuqing Xie, Jimmy Lin:
Evaluating Token-Level and Passage-Level Dense Retrieval Models for Math Information Retrieval. EMNLP (Findings) 2022: 1092-1102 - [c311]Peng Shi, Rui Zhang, He Bai, Jimmy Lin:
XRICL: Cross-lingual Retrieval-Augmented In-Context Learning for Cross-lingual Text-to-SQL Semantic Parsing. EMNLP (Findings) 2022: 5248-5259 - [c310]Peng Shi, Linfeng Song, Lifeng Jin, Haitao Mi, He Bai, Jimmy Lin, Dong Yu:
Cross-lingual Text-to-SQL Semantic Parsing with Representation Mixup. EMNLP (Findings) 2022: 5296-5306 - [c309]Odunayo Ogundepo, Xinyu Zhang, Shuo Sun, Kevin Duh, Jimmy Lin:
AfriCLIRMatrix: Enabling Cross-Lingual Information Retrieval for African Languages. EMNLP 2022: 8721-8728 - [c308]Raphael Tang, Karun Kumar, Ji Xin, Piyush Vyas, Wenyan Li, Gefei Yang, Yajie Mao, G. Craig Murray, Jimmy Lin:
Temporal Early Exiting for Streaming Speech Commands Recognition. ICASSP 2022: 7567-7571 - [c307]Matthew Y. R. Yang, Siwen Yang, Jimmy Lin:
Integration of text and geospatial search for hydrographic datasets using the lucene search library. JCDL 2022: 36 - [c306]Zhiying Jiang, Yiqin Dai, Ji Xin, Ming Li, Jimmy Lin:
Few-Shot Non-Parametric Learning with Deep Latent Variable Model. NeurIPS 2022 - [c305]Ronak Pradeep, Yilin Li, Yuetong Wang, Jimmy Lin:
Neural Query Synthesis and Domain-Specific Ranking Templates for Multi-Stage Clinical Trial Matching. SIGIR 2022: 2325-2330 - [c304]Hang Li, Shuai Wang, Shengyao Zhuang, Ahmed Mourad, Xueguang Ma, Jimmy Lin, Guido Zuccon:
To Interpolate or not to Interpolate: PRF, Dense and Sparse Retrievers. SIGIR 2022: 2495-2500 - [c303]Yuqi Liu, Chengcheng Hu, Jimmy Lin:
Another Look at Information Retrieval as Statistical Translation. SIGIR 2022: 2749-2754 - [c302]Jimmy Lin, Daniel Campos, Nick Craswell, Bhaskar Mitra, Emine Yilmaz:
Fostering Coopetition While Plugging Leaks: The Design and Implementation of the MS MARCO Leaderboards. SIGIR 2022: 2939-2948 - [c301]Ellen M. Voorhees, Nick Craswell, Jimmy Lin:
Too Many Relevants: Whither Cranfield Test Collections? SIGIR 2022: 2970-2980 - [c300]Xueguang Ma, Ronak Pradeep, Rodrigo Frassetto Nogueira, Jimmy Lin:
Document Expansion Baselines and Learned Sparse Lexical Representations for MS MARCO V1 and V2. SIGIR 2022: 3187-3197 - [c299]Andrew Trotman, Joel Mackenzie, Pradeesh Parameswaran, Jimmy Lin:
A Common Framework for Exploring Document-at-a-Time and Score-at-a-Time Retrieval Methods. SIGIR 2022: 3229-3234 - [c298]Josh Seltzer, Kathy Cheng, Shi Zong, Jimmy Lin:
Flipping the Script: Inverse Information Seeking Dialogues for Market Research. SIGIR 2022: 3380-3383 - [c297]Nick Craswell, Bhaskar Mitra, Emine Yilmaz, Daniel Campos, Jimmy Lin, Ellen M. Voorhees, Ian Soboroff:
Overview of the TREC 2022 Deep Learning Track. TREC 2022 - [c296]Jimmy Lin, David Alfonso-Hermelo, Vitor Jeronymo, Ehsan Kamalloo, Carlos Lassance, Rodrigo Frassetto Nogueira, Odunayo Ogundepo, Mehdi Rezagholizadeh, Nandan Thakur, Jheng-Hong Yang, Xinyu Zhang:
Simple Yet Effective Neural Ranking and Reranking Baselines for Cross-Lingual Information Retrieval. TREC 2022 - [c295]Josh Devins, Julie Tibshirani, Jimmy Lin:
Aligning the Research and Practice of Building Search Applications: Elasticsearch and Pyserini. WSDM 2022: 1573-1576 - [i129]Ellen M. Voorhees, Ian Soboroff, Jimmy Lin:
Can Old TREC Collections Reliably Evaluate Modern Neural Retrieval Models? CoRR abs/2201.11086 (2022) - [i128]Luyu Gao, Xueguang Ma, Jimmy Lin, Jamie Callan:
Tevatron: An Efficient and Flexible Toolkit for Dense Retrieval. CoRR abs/2203.05765 (2022) - [i127]Wei Zhong, Jheng-Hong Yang, Jimmy Lin:
Evaluating Token-Level and Passage-Level Dense Retrieval Models for Math Information Retrieval. CoRR abs/2203.11163 (2022) - [i126]Xinyu Zhang, Kelechi Ogueji, Xueguang Ma, Jimmy Lin:
Towards Best Practices for Training Multilingual Dense Retrieval Models. CoRR abs/2204.02363 (2022) - [i125]Hang Li, Shuai Wang, Shengyao Zhuang, Ahmed Mourad, Xueguang Ma, Jimmy Lin, Guido Zuccon:
To Interpolate or not to Interpolate: PRF, Dense and Sparse Retrievers. CoRR abs/2205.00235 (2022) - [i124]Minghan Li, Xinyu Zhang, Ji Xin, Hongyang Zhang, Jimmy Lin:
Certified Error Control of Candidate Set Pruning for Two-Stage Relevance Ranking. CoRR abs/2205.09638 (2022) - [i123]Nandan Thakur, Nils Reimers, Jimmy Lin:
Domain Adaptation for Memory-Efficient Dense Retrieval. CoRR abs/2205.11498 (2022) - [i122]Sheng-Chieh Lin, Jimmy Lin:
A Dense Representation Framework for Lexical and Semantic Matching. CoRR abs/2206.09912 (2022) - [i121]Zhiying Jiang, Yiqin Dai, Ji Xin, Ming Li, Jimmy Lin:
Few-Shot Non-Parametric Learning with Deep Latent Variable Model. CoRR abs/2206.11573 (2022) - [i120]Ji Xin, Raphael Tang, Zhiying Jiang, Yaoliang Yu, Jimmy Lin:
Building an Efficiency Pipeline: Commutativity and Cumulativeness of Efficiency Operators for Transformers. CoRR abs/2208.00483 (2022) - [i119]Sheng-Chieh Lin, Minghan Li, Jimmy Lin:
Aggretriever: A Simple Approach to Aggregate Textual Representation for Robust Dense Passage Retrieval. CoRR abs/2208.00511 (2022) - [i118]Raphael Tang, Akshat Pandey, Zhiying Jiang, Gefei Yang, Karun Kumar, Jimmy Lin, Ferhan Ture:
What the DAAM: Interpreting Stable Diffusion Using Cross Attention. CoRR abs/2210.04885 (2022) - [i117]Odunayo Ogundepo, Xinyu Zhang, Jimmy Lin:
Better Than Whitespace: Information Retrieval for Languages without Custom Tokenizers. CoRR abs/2210.05481 (2022) - [i116]Linqing Liu, Minghan Li, Jimmy Lin, Sebastian Riedel, Pontus Stenetorp:
Query Expansion Using Contextual Clue Sampling with Language Models. CoRR abs/2210.07093 (2022) - [i115]Sankeerth Durvasula, Raymond Kiguru, Samarth Mathur, Jenny Xu, Jimmy Lin, Nandita Vijaykumar:
VoxelCache: Accelerating Online Mapping in Robotics and 3D Reconstruction Tasks. CoRR abs/2210.08729 (2022) - [i114]Xinyu Zhang, Nandan Thakur, Odunayo Ogundepo, Ehsan Kamalloo, David Alfonso-Hermelo, Xiaoguang Li, Qun Liu, Mehdi Rezagholizadeh, Jimmy Lin:
Making a MIRACL: Multilingual Information Retrieval Across a Continuum of Languages. CoRR abs/2210.09984 (2022) - [i113]Peng Shi, Rui Zhang, He Bai
, Jimmy Lin:
XRICL: Cross-lingual Retrieval-Augmented In-Context Learning for Cross-lingual Text-to-SQL Semantic Parsing. CoRR abs/2210.13693 (2022) - [i112]Jimmy Lin:
On the Interaction Between Differential Privacy and Gradient Compression in Deep Learning. CoRR abs/2211.00734 (2022) - [i111]Minghan Li, Sheng-Chieh Lin, Barlas Oguz, Asish Ghoshal, Jimmy Lin, Yashar Mehdad, Wen-tau Yih, Xilun Chen:
CITADEL: Conditional Token Interaction via Dynamic Lexical Routing for Efficient and Effective Multi-Vector Retrieval. CoRR abs/2211.10411 (2022) - [i110]Raphael Tang, Karun Kumar, Gefei Yang, Akshat Pandey, Yajie Mao, Vladislav Belyaev, Madhuri Emmadi, G. Craig Murray, Ferhan Ture, Jimmy Lin:
SpeechNet: Weakly Supervised, End-to-End Speech Recognition at Industrial Scale. CoRR abs/2211.11740 (2022) - [i109]Yizhen Zhong, Jiajie Xiao, Thomas Vetterli, Mahan Matin, Ellen Loo, Jimmy Lin, Richard Bourgon, Ofer Shapira:
Improving Precancerous Case Characterization via Transformer-based Ensemble Learning. CoRR abs/2212.05150 (2022) - [i108]Zhiying Jiang, Matthew Y. R. Yang, Mikhail Tsirlin, Raphael Tang, Jimmy Lin:
Less is More: Parameter-Free Text Classification with Gzip. CoRR abs/2212.09410 (2022) - [i107]Luyu Gao, Xueguang Ma, Jimmy Lin, Jamie Callan:
Precise Zero-Shot Dense Retrieval without Relevance Labels. CoRR abs/2212.10496 (2022) - [i106]Jimmy Lin:
Building a Culture of Reproducibility in Academic Research. CoRR abs/2212.13534 (2022) - 2021
- [b3]Jimmy Lin, Rodrigo Frassetto Nogueira
, Andrew Yates:
Pretrained Transformers for Text Ranking: BERT and Beyond. Synthesis Lectures on Human Language Technologies, Morgan & Claypool Publishers 2021, ISBN 978-3-031-01053-8, pp. 1-325 - [j61]Samantha Fritz, Ian Milligan, Nick Ruest, Jimmy Lin:
Fostering Community Engagement through Datathon Events: The Archives Unleashed Experience. Digit. Humanit. Q. 15(1) (2021) - [j60]Martin Gauch
, Juliane Mai
, Jimmy Lin:
The proper care and feeding of CAMELS: How limited training data affects streamflow prediction. Environ. Model. Softw. 135: 104926 (2021) - [j59]Jimmy Lin:
A proposed conceptual framework for a representational approach to information retrieval. SIGIR Forum 55(2): 4:1-4:29 (2021) - [j58]Sheng-Chieh Lin, Jheng-Hong Yang, Rodrigo Frassetto Nogueira
, Ming-Feng Tsai, Chuan-Ju Wang, Jimmy Lin:
Multi-Stage Conversational Passage Retrieval: An Approach to Fusing Term Importance Estimation and Neural Query Rewriting. ACM Trans. Inf. Syst. 39(4): 48:1-48:29 (2021) - [c294]He Bai
, Peng Shi, Jimmy Lin, Yuqing Xie, Luchen Tan, Kun Xiong, Wen Gao, Ming Li:
Segatron: Segment-Aware Transformer for Language Modeling and Understanding. AAAI 2021: 12526-12534 - [c293]He Bai
, Peng Shi, Jimmy Lin, Luchen Tan, Kun Xiong, Wen Gao, Jie Liu, Ming Li:
Semantics of the Unwritten: The Effect of End of Paragraph and Sequence Tokens on Text Generation with GPT2. ACL (student) 2021: 148-162 - [c292]Kelvin Jiang, Ronak Pradeep, Jimmy Lin:
Exploring Listwise Evidence Reasoning with T5 for Fact Verification. ACL/IJCNLP (2) 2021: 402-410 - [c291]Ji Xin, Raphael Tang, Yaoliang Yu, Jimmy Lin:
The Art of Abstention: Selective Prediction and Error Regularization for Natural Language Processing. ACL/IJCNLP (1) 2021: 1040-1051 - [c290]Ronak Pradeep, Xueguang Ma, Rodrigo Frassetto Nogueira, Jimmy Lin:
Scientific Claim Verification with VerT5erini. LOUHI@EACL 2021: 94-103 - [c289]Zhiying Jiang, Raphael Tang, Ji Xin, Jimmy Lin:
How Does BERT Rerank Passages? An Attribution Analysis with Information Bottlenecks. BlackboxNLP@EMNLP 2021: 496-509 - [c288]Wei Zhong, Xinyu Zhang, Ji Xin, Richard Zanibbi, Jimmy Lin:
Approach Zero and Anserini at the CLEF-2021 ARQMath Track: Applying Substructure Search and BM25 on Operator Tree Path Tokens. CLEF (Working Notes) 2021: 133-156 - [c287]Mayank Anand, Jiarui Zhang, Shane Ding, Ji Xin, Jimmy Lin:
Serverless BM25 Search and BERT Reranking. DESIRES 2021: 3-9 - [c286]Jimmy Lin, Xueguang Ma, Joel Mackenzie, Antonio Mallia:
On the Separation of Logical and Physical Ranking Models for Text Retrieval Applications. DESIRES 2021: 176-178 - [c285]