


default search action
Transactions of the Association for Computational Linguistics, Volume 12
Volume 12, 2024
- Max Glockner, Ieva Staliunaite, James Thorne, Gisela Vallejo, Andreas Vlachos

, Iryna Gurevych:
AmbiFC: Fact-Checking Ambiguous Claims with Evidence. 1-18 - Alan Ramponi

:
Language Varieties of Italy: Technology Challenges and Opportunities. 19-38 - Tianyi Zhang, Faisal Ladhak, Esin Durmus, Percy Liang, Kathleen R. McKeown, Tatsunori B. Hashimoto:

Benchmarking Large Language Models for News Summarization. 39-57 - Oleh Shliazhko, Alena Fenogenova

, Maria Tikhonova, Anastasia Kozlova, Vladislav Mikhailov, Tatiana Shavrina:
mGPT: Few-Shot Learners Go Multilingual. 58-79 - Yong Cao, Yova Kementchedjhieva, Ruixiang Cui

, Antonia Karamolegkou
, Li Zhou, Megan Dare, Lucia Donatelli, Daniel Hershcovich:
Cultural Adaptation of Recipes. 80-99 - Shiman Zhao, Yutao Xie, Wei Chen, Tengjiao Wang, Jiahui Yao, Jiabin Zheng:

Metric-Free Learning Network with Dual Relations Propagation for Few-Shot Aspect Category Sentiment Analysis. 100-119 - Zhengping Jiang, Anqi Liu, Benjamin Van Durme:

Addressing the Binning Problem in Calibration Assessment through Scalar Annotations. 120-136 - Cheng Yang, Guoping Huang, Mo Yu, Zhirui Zhang, Siheng Li, Mingming Yang, Shuming Shi, Yujiu Yang

, Lemao Liu:
An Energy-based Model for Word-level AutoCompletion in Computer-aided Translation. 137-156 - Nelson F. Liu, Kevin Lin, John Hewitt, Ashwin Paranjape, Michele Bevilacqua, Fabio Petroni

, Percy Liang:
Lost in the Middle: How Language Models Use Long Contexts. 157-173 - Zhouxing Shi

, Yihan Wang, Fan Yin, Xiangning Chen, Kai-Wei Chang, Cho-Jui Hsieh:
Red Teaming Language Model Detectors with Language Models. 174-189 - Lei Sha

, Thomas Lukasiewicz:
Text Attribute Control via Closed-Loop Disentanglement. 190-209 - Shujie Li, Liang Li, Ruiying Geng, Min Yang, Binhua Li, Guanghu Yuan, Wanwei He, Shao Yuan, Can Ma, Fei Huang, Yongbin Li:

Unifying Structured Data as Graph for Data-to-Text Pre-Training. 210-228 - Zhiwei He, Tian Liang, Wenxiang Jiao, Zhuosheng Zhang

, Yujiu Yang
, Rui Wang
, Zhaopeng Tu, Shuming Shi, Xing Wang:
Exploring Human-Like Translation Strategy with Large Language Models. 229-246 - Dingmin Wang, Qiuyuan Huang, Matthew Jackson, Jianfeng Gao:

Retrieve What You Need: A Mutual Learning Framework for Open-domain Question Answering. 247-263 - Wolf Nuyts, Ruben Cartuyvels

, Marie-Francine Moens:
Explicitly Representing Syntax Improves Sentence-to-Layout Prediction of Unexpected Situations. 264-282 - Roi Cohen, Eden Biran, Ori Yoran, Amir Globerson, Mor Geva:

Evaluating the Ripple Effects of Knowledge Editing in Language Models. 283-298 - Aina Garí Soler

, Matthieu Labeau, Chloé Clavel:
The Impact of Word Splitting on the Semantic Content of Contextualized Word Representations. 299-320 - Vijay Viswanathan, Kiril Gashteovski, Carolin Lawrence, Tongshuang Wu, Graham Neubig:

Large Language Models Enable Few-Shot Clustering. 321-333 - Fengzhu Zeng, Wei Gao:

JustiLM: Few-shot Justification Generation for Explainable Fact-Checking of Real-world Claims. 334-354 - Jiaming Luo, Colin Cherry, George F. Foster:

To Diverge or Not to Diverge: A Morphosyntactic Perspective on Machine Translation vs Human Translation. 355-371 - Ankita Pasad, Chung-Ming Chien, Shane Settle, Karen Livescu

:
What Do Self-Supervised Speech Models Know About Words? 372-391 - Lukas Edman, Gabriele Sarti

, Antonio Toral
, Gertjan van Noord
, Arianna Bisazza:
Are Character-level Translations Worth the Wait? Comparing ByT5 and mT5 for Machine Translation. 392-410 - Valentin Hofmann, Goran Glavas, Nikola Ljubesic, Janet B. Pierrehumbert, Hinrich Schütze:

Geographic Adaptation of Pretrained Language Models. 411-431 - Sweta Agrawal, Marine Carpuat:

Do Text Simplification Systems Preserve Meaning? A Human Evaluation via Reading Comprehension. 432-448 - Pengtao Xie, Xingchen Zhao, Xuehai He:

Simultaneous Selection and Adaptation of Source Data via Four-Level Optimization. 449-466 - Sarah E. Finch

, Jinho D. Choi:
ConvoSense: Overcoming Monotonous Commonsense Inferences for Conversational AI. 467-483 - Liangming Pan, Michael Saxon, Wenda Xu, Deepak Nathani, Xinyi Wang, William Yang Wang:

Automatically Correcting Large Language Models: Surveying the Landscape of Diverse Automated Correction Strategies. 484-506 - Jiho Jin, Jiseon Kim, Nayeon Lee

, Haneul Yoo, Alice Oh, Hwaran Lee:
KoBBQ: Korean Bias Benchmark for Question Answering. 507-524 - Han Zhou

, Xingchen Wan, Ivan Vulic, Anna Korhonen:
AutoPEFT: Automatic Configuration Search for Parameter-Efficient Fine-Tuning. 525-542 - Lena Strobl, William Merrill, Gail Weiss, David Chiang, Dana Angluin:

What Formal Languages Can Transformers Express? A Survey. 543-561 - Michael Staniek, Raphael Schumann, Maike Züfle, Stefan Riezler:

Text-to-OverpassQL: A Natural Language Interface for Complex Geodata Querying of OpenStreetMap. 562-575 - Jiahuan Li, Hao Zhou, Shujian Huang, Shanbo Cheng, Jiajun Chen:

Eliciting the Translation Ability of Large Language Models via Multilingual Finetuning with Translation Instructions. 576-592 - Filip Miletic

, Sabine Schulte im Walde:
Semantics of Multiword Expressions in Transformer-Based Models: A Survey. 593-612 - Ponrawee Prasertsom, Apiwat Jaroonpol, Attapol T. Rutherford:

The Thai Discourse Treebank: Annotating and Classifying Thai Discourse Connectives. 613-629 - Nuria Rodríguez Barroso, Eugenio Martínez Cámara

, José Camacho-Collados
, María Victoria Luzón, Francisco Herrera:
Federated Learning for Exploiting Annotators' Disagreements in Natural Language Processing. 630-648 - Hajime Senuma, Akiko Aizawa:

Computational Complexity of Natural Morphology Revisited. 649-663 - Sohee Yang, Jonghyeon Kim, Joel Jang, Seonghyeon Ye, Hyunji Lee, Minjoon Seo:

Improving Probability-based Prompt Selection Through Unified Evaluation and Analysis. 664-680 - Vaibhav Adlakha, Parishad BehnamGhader, Xing Han Lù, Nicholas Meade, Siva Reddy:

Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering. 681-699 - Josef Valvoda, Alec Thompson, Ryan Cotterell, Simone Teufel:

The Ethics of Automating Legal Actors. 700-720 - Maxwell J. Yin, Boyu Wang

, Yue Dong
, Charles Ling:
Source-Free Domain Adaptation for Question Answering with Masked Self-training. 721-737 - Gaurav Kamath, Sebastian Schuster

, Sowmya Vajjala, Siva Reddy:
Scope Ambiguities in Large Language Models. 738-754 - Leanne Nortje, Dan Oneata, Yevgen Matusevych, Herman Kamper:

Visually Grounded Speech Models Have a Mutual Exclusivity Bias. 755-770 - Jens Lehmann, Dhananjay Bhandiwad, Preetam Gattogi, Sahar Vahdati:

Beyond Boundaries: A Human-like Approach for Question Answering over Structured and Unstructured Information Sources. 786-802 - Cameron R. Jones, Sean Trott, Benjamin Bergen

:
Comparing Humans and Large Language Models on an Experimental Protocol Inventory for Theory of Mind Evaluation (EPITOME). 803-819 - Juri Opitz

:
A Closer Look at Classification Evaluation Metrics and a Critical Reflection of Common Evaluation Practice. 820-836 - Heather C. Lent, Kushal Tatariya, Raj Dabre, Yiyi Chen

, Marcell Fekete, Esther Ploeger
, Li Zhou, Ruth-Ann Armstrong, Abee Eijansantos
, Catriona Malau, Hans Erik Heje, Ernests Lavrinovics
, Diptesh Kanojia, Paul Belony, Marcel Bollmann, Loïc Grobol, Miryam de Lhoneux
, Daniel Hershcovich, Michel DeGraff, Anders Søgaard, Johannes Bjerva
:
CreoleVal: Multilingual Multitask Benchmarks for Creoles. 950-978 - Javier Iranzo-Sánchez, Jorge Iranzo-Sánchez, Adrià Giménez

, Jorge Civera
, Alfons Juan:
Segmentation-Free Streaming Machine Translation. 1104-1121 - Neha Srikanth, Marine Carpuat, Rachel Rudinger

:
How Often Are Errors in Natural Language Reasoning Due to Paraphrastic Variability? 1143-1162 - Moran Mizrahi, Guy Kaplan, Dan Malkin, Rotem Dror, Dafna Shahaf, Gabriel Stanovsky:

State of What Art? A Call for Multi-Prompt LLM Evaluation. 933-949 - Ansong Ni, Pengcheng Yin, Yilun Zhao, Martin Riddell, Troy Feng, Rui Shen, Stephen Yin, Ye Liu, Semih Yavuz, Caiming Xiong, Shafiq Joty, Yingbo Zhou, Dragomir Radev, Arman Cohan:

L2CEval: Evaluating Language-to-Code Generation Capabilities of Large Language Models. 1311-1329 - Masamune Kobayashi, Masato Mita, Mamoru Komachi:

Revisiting Meta-evaluation for Grammatical Error Correction. 837-855 - Sweta Agrawal, M. Amin Farajian, Patrick Fernandes, Ricardo Rei, André F. T. Martins:

Assessing the Role of Context in Chat Translation Evaluation: Is Context Helpful and Under What Conditions? 1250-1267 - Xuanli He, Qiongkai Xu

, Jun Wang, Benjamin I. P. Rubinstein, Trevor Cohn:
SEEP: Training Dynamics Grounds Latent Representation Search for Mitigating Backdoor Poisoning Attacks. 996-1010 - Yongxue Shan, Jie Zhou, Jie Peng, Xin Zhou, Jiaqian Yin, Xiaodong Wang:

Multi-level Shared Knowledge Guided Learning for Knowledge Graph Completion. 1027-1042 - Huy Hien Vu, Hidetaka Kamigaito, Taro Watanabe:

Context-Aware Machine Translation with Source Coreference Explanation. 856-874 - Ohad Rubin, Jonathan Berant:

Retrieval-Pretrained Transformer: Long-range Language Modeling with Self-retrieval. 1197-1213 - Jay DeYoung, Stephanie C. Martinez, Iain James Marshall, Byron C. Wallace:

Do Multi-Document Summarization Models Synthesize? 1043-1062 - Xiyan Fu, Anette Frank:

Exploring Continual Learning of Compositional Generalization in NLI. 912-932 - Huiyao Chen

, Yu Zhao, Zulong Chen, Mengjia Wang, Liangyue Li, Meishan Zhang, Min Zhang:
Retrieval-style In-context Learning for Few-shot Hierarchical Text Classification. 1214-1231 - Jiaang Li, Yova Kementchedjhieva, Constanza Fierro, Anders Søgaard:

Do Vision and Language Models Share Concepts? A Vector Space Alignment Study. 1232-1249 - Max Müller-Eberstein

, Dianna Yee, Karren D. Yang, Gautam Varma Mantena, Colin Lea:
Hypernetworks for Personalizing ASR to Atypical Speech. 1182-1196 - Ukyo Honda, Tatsushi Oka

, Peinan Zhang, Masato Mita:
Not Eliminate but Aggregate: Post-Hoc Control over Mixture-of-Experts to Address Shortcut Shifts in Natural Language Understanding. 1268-1289 - Nuno Miguel Guerreiro, Ricardo Rei, Daan van Stigt, Luísa Coheur

, Pierre Colombo, André F. T. Martins:
xcomet : Transparent Machine Translation Evaluation through Fine-grained Error Detection. 979-995 - Jessy Lin, Nicholas Tomlin, Jacob Andreas, Jason Eisner:

Decision-Oriented Dialogue for Human-AI Collaboration. 892-911 - Cristina Aggazzotti, Nicholas Andrews, Elizabeth Allyn Smith:

Can Authorship Attribution Models Distinguish Speakers in Speech Transcripts? 875-891 - Lindia Tjuatja, Valerie Chen, Tongshuang Wu, Ameet Talwalkwar, Graham Neubig:

Do LLMs Exhibit Human-like Response Biases? A Case Study in Survey Design. 1011-1026 - Zhivar Sourati, Filip Ilievski

, Pia Sommerauer
, Yifan Jiang:
ARN: Analogical Reasoning on Narratives. 1063-1086 - Cyril Chhun

, Fabian M. Suchanek, Chloé Clavel:
Do Language Models Enjoy Their Own Stories? Prompting Large Language Models for Automatic Story Evaluation. 1122-1142 - George Chrysostomou, Zhixue Zhao

, Miles Williams, Nikolaos Aletras:
Investigating Hallucinations in Pruned Large Language Models for Abstractive Summarization. 1163-1181 - Melanie Subbiah, Sean Zhang, Lydia B. Chilton, Kathleen R. McKeown:

Reading Subtext: Evaluating Large Language Models on Short Story Summarization with Writers. 1290-1310 - Jitkapat Sawatphol, Can Udomcharoenchaikit, Sarana Nutanong:

Addressing Topic Leakage in Cross-Topic Evaluation for Authorship Verification. 1363-1377 - Xunyu Zhu

, Jian Li, Yong Liu, Can Ma, Weiping Wang:
A Survey on Model Compression for Large Language Models. 1556-1577 - Abhinav Patil, Jaap Jumelet, Yu Ying Chiu, Andy Lapastora, Peter Shen, Lexie Wang, Clevis Willrich, Shane Steinert-Threlkeld:

Filtered Corpus Training (FiCT) Shows that Language Models Can Generalize from Indirect Evidence. 1597-1615 - Giulio Corallo, Paolo Papotti

:
FINCH: Prompt-guided Key-Value Cache Compression for Large Language Models. 1517-1532 - Giannis Karamanolakis, Daniel Hsu, Luis Gravaano:

Interactive Machine Teaching by Labeling Rules and Instances. 1441-1459 - Jiaxin Guo

, C. L. Philip Chen, Shuzhen Li, Tong Zhang:
Deuce: Dual-diversity Enhancement and Uncertainty-awareness for Cold-start Active Learning. 1736-1754 - Fajri Koto, Rahmad Mahendra, Nurul Aisyah, Timothy Baldwin:

IndoCulture: Exploring Geographically Influenced Cultural Commonsense Reasoning Across Eleven Indonesian Provinces. 1703-1719 - Vagrant Gautam, Eileen Bingert, Dawei Zhu, Anne Lauscher

, Dietrich Klakow:
Robust Pronoun Fidelity with English LLMs: Are they Reasoning, Repeating, or Just Biased? 1755-1779 - Andreas Waldis, Yotam Perlitz, Leshem Choshen, Yufang Hou

, Iryna Gurevych:
Holmes ⌕ A Benchmark to Assess the Linguistic Competence of Language Models. 1616-1647 - Ryo Kamoi, Yusen Zhang, Nan Zhang, Jiawei Han, Rui Zhang:

When Can LLMs Actually Correct Their Own Mistakes? A Critical Survey of Self-Correction of LLMs. 1417-1440 - Chrysoula Zerva

, André F. T. Martins:
Conformalizing Machine Translation Evaluation. 1460-1478 - Ning Pang, Xiang Zhao, Weixin Zeng, Zhen Tan, Weidong Xiao:

SCL: Selective Contrastive Learning for Data-driven Zero-shot Relation Extraction. 1720-1735 - Zheng Wei Lim, Ekaterina Vylomova, Charles Kemp

, Trevor Cohn:
Predicting Human Translation Difficulty with Neural Machine Translation. 1479-1496 - Rami Aly, Andreas Vlachos

:
TabVer: Tabular Fact Verification with Natural Logic. 1648-1671 - Tanise Ceron, Neele Falk, Ana Baric, Dmitry Nikolaev

, Sebastian Padó:
Beyond Prompt Brittleness: Evaluating the Reliability and Consistency of Political Worldviews in LLMs. 1378-1400 - Yangyang Zhao, Mehdi Dastani, Jinchuan Long, Zhenyu Wang, Shihan Wang:

Rescue Conversations from Dead-ends: Efficient Exploration for Task-oriented Dialogue Policy Optimization. 1578-1596 - Tom Hosking, Hao Tang, Mirella Lapata:

Hierarchical Indexing for Retrieval-Augmented Opinion Summarization. 1533-1555 - Asaf Achi Mordechai, Yoav Goldberg, Reut Tsarfaty:

NoviCode: Generating Programs from Natural Language Utterances by Novices. 1330-1345 - Tyler A. Chang, Zhuowen Tu, Benjamin K. Bergen:

Characterizing Learning Curves During Language Model Pre-Training: Learning, Forgetting, and Stability. 1346-1362 - Margarida M. Campos, António Farinhas, Chrysoula Zerva

, Mário A. T. Figueiredo, André F. T. Martins:
Conformal Prediction for Natural Language Processing: A Survey. 1497-1516 - Seong-Il Park, Jay-Yoon Lee:

Toward Robust RALMs: Revealing the Impact of Imperfect Retrieval on Retrieval-Augmented Language Models. 1686-1702 - Yuyin Lu, Hegang Chen, Pengbo Mao, Yanghui Rao, Haoran Xie

, Fu Lee Wang, Qing Li:
Self-supervised Topic Taxonomy Discovery in the Box Embedding Space. 1401-1416

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














