


default search action
NAACL-HLT 2025: Albuquerque, New Mexico, USA - Volume 3: Industry Track
- Weizhu Chen, Yi Yang, Mohammad Kachuee, Xue-Yong Fu:
Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL 2025 - Volume 3: Industry Track, Albuquerque, New Mexico, USA, April 30, 2025. Association for Computational Linguistics 2025, ISBN 979-8-89176-194-0 - Chanjun Park, Hyeonwoo Kim:
Understanding LLM Development Through Longitudinal Study: Insights from the Open Ko-LLM Leaderboard. 1-8 - Sanjay Agrawal, Vivek Sembium:
RTSM: Knowledge Distillation with Diverse Signals for Efficient Real-Time Semantic Matching in E-Commerce. 9-19 - Hanchao Liu, Rongjun Li, Weimin Xiong, Ziyu Zhou, Wei Peng:
WorkTeam: Constructing Workflows from Natural Language with Multi-Agents. 20-35 - Qiang Li, Mingkun Tan, Xun Zhao, Dan Zhang, Daoan Zhang, Shengzhao Lei, Anderson S. Chu, Lujun Li, Porawit Kamnoedboon:
How LLMs React to Industrial Spatio-Temporal Data? Assessing Hallucination with a Novel Traffic Incident Benchmark Dataset. 36-53 - Gao yu Zhu, Wei Shao, Xichou Zhu, Lei Yu, Jiafeng Guo, Xueqi Cheng:
Text2Sql: Pure Fine-Tuning and Pure Knowledge Distillation. 54-61 - Vinay Kumar Verma, Shreyas Sunil Kulkarni, Happy Mittal, Deepak Gupta:
MoEMoE: Question Guided Dense and Scalable Sparse Mixture-of-Expert for Multi-source Multi-modal Answering. 62-69 - Yuki Tagawa, Yohei Momoki, Norihisa Nakano, Ryota Ozaki, Motoki Taniguchi, Masatoshi Hori, Noriyuki Tomiyama:
Finding-Centric Structuring of Japanese Radiology Reports and Analysis of Performance Gaps for Multiple Facilities. 70-85 - Xuanqing Liu, Luyang Kong, Wei Niu, Afshin Khashei, Belinda Zeng, Steve Johnson, Jon Jay, Davor Golac, Matt Pope:
Learning LLM Preference over Intra-Dialogue Pairs: A Framework for Utterance-level Understandings. 86-98 - Yi-Chang Chen, Po-Chun Hsu, Chan-Jan Hsu, Da-shan Shiu:
Enhancing Function-Calling Capabilities in LLMs: Strategies for Prompt Formats, Data Integration, and Multilingual Translation. 99-111 - George Kour, Naama Zwerdling, Marcel Zalmanovici, Ateret Anaby-Tavor, Ora Nova Fandina, Eitan Farchi:
Exploring Straightforward Methods for Automatic Conversational Red-Teaming. 112-128 - Jiaming Luo, Weiyi Luo, Guoqing Sun, Mengchen Zhu, Haifeng Tang, Kenny Q. Zhu, Mengyue Wu:
A Diverse and Effective Retrieval-Based Debt Collection System with Expert Knowledge. 129-137 - Sosuke Nishikawa, Jun Hirako, Nobuhiro Kaji, Koki Watanabe, Hiroki Asano, Souta Yamashiro, Shumpei Sano:
Search Query Embeddings via User-behavior-driven Contrastive Learning. 138-147 - Dezhi Ye, Haomei Jia, Junwei Hu, Bowen Tian, Jie Liu, Haijin Liang, Jin Ma, Wenmin Wang:
QSpell 250K: A Large-Scale, Practical Dataset for Chinese Search Query Spell Correction. 148-155 - Yifan Zhang, Xue Yang:
CONSTRUCTA: Automating Commercial Construction Schedules in Fabrication Facilities with Large Language Models. 156-172 - Bing Zhang, Guang-Jie Ren:
Challenges and Remedies of Domain-Specific Classifiers as LLM Guardrails: Self-Harm as a Case Study. 173-182 - Alonso Palomino, Andreas Fischer, David Buschhüter, Roland Roller, Niels Pinkwart, Benjamin Paassen:
Mitigating Bias in Item Retrieval for Enhancing Exam Assembly in Vocational Education Services. 183-193 - Somnath Banerjee, Avik Halder, Rajarshi Mandal, Sayan Layek, Ian Soboroff, Rima Hazra, Animesh Mukherjee:
Breaking Boundaries: Investigating the Effects of Model Editing on Cross-linguistic Performance. 194-209 - Hyowon Cho, Minjoon Seo:
Towards Reliable and Practical Phishing Detection. 210-225 - Zijian Chen, John-Michael Gamble, Jimmy Lin:
Zero-Shot ATC Coding with Large Language Models for Clinical Assessments. 226-232 - Yukyung Lee, Soonwon Ka, Bokyung Son, Pilsung Kang, Jaewook Kang:
Navigating the Path of Writing: Outline-guided Text Generation with Large Language Models. 233-250 - Jennifer Zhu, Dmitriy Bespalov, Liwen You, Ninad Kulkarni, Yanjun Qi:
TaeBench: Improving Quality of Toxic Adversarial Examples. 251-265 - Hyeonwoo Kim, Dahyun Kim, Jihoo Kim, Sukyung Lee, Yungi Kim, Chanjun Park:
Open Ko-LLM Leaderboard2: Bridging Foundational and Practical Evaluation for Korean LLMs. 266-273 - Zukang Yang, Zixuan Zhu, Jennifer Zhu:
CuriousLLM: Elevating Multi-Document Question Answering with LLM-Enhanced Knowledge Graph Reasoning. 274-286 - Jeiyoon Park, Chanjun Park, Heuiseok Lim:
CharacterGPT: A Persona Reconstruction Framework for Role-Playing Agents. 287-303 - Arijit Nag, Soumen Chakrabarti, Animesh Mukherjee, Niloy Ganguly:
Efficient Continual Pre-training of LLMs for Low-resource Languages. 304-317 - Pei Guo, Enjie Liu, Ruichao Zhong, Mochi Gao, Yunzhi Tan, Bo Hu, Zang Li:
DSRAG: A Double-Stream Retrieval-Augmented Generation Framework for Countless Intent Detection. 318-328 - Wei Chen, Zhiyuan Li, Mingyuan Ma:
Octopus: On-device language model for function calling of software APIs. 329-339 - Jean Seo, Jaeyoon Kim, Hyopil Shin:
MoFE: Mixture of Frozen Experts Architecture. 340-348 - Kang Zhang, Osamu Yoshie, Lichao Sun, Weiran Huang:
FinLLM-B: When Large Language Models Meet Financial Breakout Trading. 349-357 - Nitin Ramrakhiyani, Delton Myalil, Sachin Pawar, Manoj Apte, Rajan MA, Divyesh Saglani, Imtiyazuddin Shaik:
QueryShield: A Platform to Mitigate Enterprise Data Leakage in Queries to External LLMs. 358-369 - Lukas Fischer, Yingqiang Gao, Alexa Lintner, Annette Rios, Sarah Ebling:
SwissADT: An Audio Description Translation System for Swiss Languages. 370-379 - Jiahao Zhu, Jipeng Qiang, Ran Bai, Chenyu Liu, Xiaoye Ouyang:
Chinese Morph Resolution in E-commerce Live Streaming Scenarios. 380-389 - Sebastian Steindl, Ulrich Schäfer, Bernd Ludwig:
MonoTODia: Translating Monologue Requests to Task-Oriented Dialogues. 390-403 - Haoan Jin, Jiacheng Shi, Hanhui Xu, Kenny Q. Zhu, Mengyue Wu:
MedEthicEval: Evaluating Large Language Models Based on Chinese Medical Ethics. 404-421 - Tirthankar Dasgupta, Manjira Sinha, Sudeshna Jana:
Predicting ICU Length of Stay for Patients using Latent Categorization of Health Conditions. 422-430 - Jiban Adhikary, Mohammad Alqudah, Arun Palghat Udayashankar:
RevieWeaver: Weaving Together Review Insights by Leveraging LLMs and Semantic Similarity. 431-448 - Krishanu Das Baksi, Elijah Soba, John J. Higgins, Ravi Saini, Jaden Wood, Jane Cook, Jack Scott, Nirmala Pudota, Tim Weninger, Edward Bowen, Sanmitra Bhattacharya:
MedCodER: A Generative AI Assistant for Medical Coding. 449-459 - Jiaying Gong, Ming Cheng, Hongda Shen, Pierre-Yves Vandenbussche, Janet Jenq, Hoda Eldardiry:
Visual Zero-Shot E-Commerce Product Attribute Value Extraction. 460-469 - Grigor Nalbandyan, Rima Shahbazyan, Evelina Bakhturina:
SCORE: Systematic COnsistency and Robustness Evaluation for Large Language Models. 470-484 - Bing Zhang, Mikio Takeuchi, Ryo Kawahara, Shubhi Asthana, Md. Maruf Hossain, Guang-Jie Ren, Kate Soule, Yifan Mai, Yada Zhu:
Evaluating Large Language Models with Enterprise Benchmarks. 485-505 - Xiliang Zhu, Elena Khasanova, Cheng Chen:
Can Post-Training Quantization Benefit from an Additional QLoRA Integration? 506-514 - Victor Barres, Clifton James McFate, Aditya Kalyanpur, Kailash Karthik Saravanakumar, Lori Moon, Natnael Seifu, Abraham Bautista-Castillo:
From Generating Answers to Building Explanations: Integrating Multi-Round RAG and Causal Modeling for Scientific QA. 515-522 - Aman Goel, Xian Carrie Wu, Zhe Wang, Dmitriy Bespalov, Yanjun Qi:
TurboFuzzLLM: Turbocharging Mutation-based Fuzzing for Effectively Jailbreaking Large Language Models in Practice. 523-534 - Md Kowsher, Nusrat Jahan Prottasha, Chun-Nam Yu, Ozlem O. Garibay, Niloofar Yousefi:
Does Self-Attention Need Separate Weights in Transformers? 535-543 - Chening Yang, Duy-Khanh Vu, Minh-Tien Nguyen, Xuan-Quang Nguyen, Linh Nguyen, Hung Le:
SuperRAG: Beyond RAG with Layout-Aware Graph Modeling. 544-557 - Hitesh Laxmichand Patel, Amit Agarwal, Arion Das, Bhargava Kumar, Srikant Panda, Priyaranjan Pattnayak, Taki Hasan Rafi, Tejaswini Kumar, Dong-Kyu Chae:
SweEval: Do LLMs Really Swear? A Safety Benchmark for Testing Limits for Enterprise Use. 558-582 - Naoki Otani, Nikita Bhutani, Estevam Hruschka:
Natural Language Processing for Human Resources: A Survey. 583-597 - Syed Shariyar Murtaza, Yifan Nie, Elias Avan, Utkarsh Soni, Wanyu Liao, Adam Carnegie, Cyril John Mathias, Junlin Jiang, Eugene Wen:
Implementing Retrieval Augmented Generation Technique on Unstructured and Structured Data Sources in a Call Center of a Large Financial Institution. 598-606 - Inkit Padhi, Manish Nagireddy, Giandomenico Cornacchia, Subhajit Chaudhury, Tejaswini Pedapati, Pierre L. Dognin, Keerthiram Murugesan, Erik Miehling, Martín Santillán Cooper, Kieran Fraser, Giulio Zizzo, Muhammad Zaid Hameed, Mark Purcell, Michael Desmond, Qian Pan, Inge Vejsbjerg, Elizabeth M. Daly, Michael Hind, Werner Geyer, Ambrish Rawat, Kush R. Varshney, Prasanna Sattigeri:
Granite Guardian: Comprehensive LLM Safeguarding. 607-615 - Yang Li, Yuan Shangguan, Yuhao Wang, Liangzhen Lai, Ernie Chang, Changsheng Zhao, Yangyang Shi, Vikas Chandra:
Breaking Down Power Barriers in On-Device Streaming ASR: Insights and Solutions. 616-626 - Swapnil Gupta, Lucas Pereira Carlini, Prateek Sircar, Deepak Gupta:
Break-Ideate-Generate (BrIdGe): Moving beyond Translations for Localization using LLMs. 627-637 - Emmanuel Aboah Boateng, Cassiano O. Becker, Nabiha Asghar, Kabir Walia, Ashwin Srinivasan, Ehi Nosakhare, Soundar Srinivasan, Victor Dibia:
Concept Distillation from Strong to Weak Models via Hypotheses-to-Theories Prompting. 638-654 - Kevin Shukang Wang, Karel Joshua Harjono, Ramon Lawrence:
Towards Reliable Agents: Benchmarking Customized LLM-Based Retrieval-Augmented Generation Frameworks with Deployment Validation. 655-661 - Minji Seo, Youngwon Lee, Seung-won Hwang, Seoho Song, Hee-Cheol Seo, Young-In Song:
Query Variant Detection Using Retriever as Environment. 662-671 - Hayate Iso, Pouya Pezeshkpour, Nikita Bhutani, Estevam Hruschka:
Evaluating Bias in LLMs for Job-Resume Matching: Gender, Race, and Education. 672-683 - Aniya Aggarwal, Ankush Gupta, Shivangi Bithel, Arvind Agarwal:
Goal-Driven Data Story, Narrations and Explanations. 684-694 - Vishnu Prabhakaran, Purav Aggarwal, Vishruit Kulshreshtha, Arunita Das, Sahini Venkata Sitaram Sruti, Anoop Saladi:
VIT-Pro: Visual Instruction Tuning for Product Images. 695-707 - Rishav Sahay, Arihant Jain, Purav Aggarwal, Anoop Saladi:
AutoKB: Automated Creation of Structured Knowledge Bases for Domain-Specific Support. 708-723 - Khai Le-Duc, David Thulke, Hung-Phong Tran, Long Vo-Dang, Khai-Nguyen Nguyen, Truong-Son Hy, Ralf Schlüter:
Medical Spoken Named Entity Recognition. 724-783 - Jaeseong Lee, Hojae Han, Jongyoon Kim, Seung-won Hwang, Naun Kang, KyungJun An, Sungho Jang:
PLEX: Adaptive Parameter-Efficient Fine-Tuning for Code LLMs using Lottery-Tickets. 784-793 - Yuyang Li, Philip J. M. Kerbusch, Raimon H. R. Pruim, Tobias Käfer:
Evaluating the Performance of RAG Methods for Conversational AI in the Airport Domain. 794-808 - Prasanjit Rath, Hari Shrawgi, Parag Agrawal, Sandipan Dandapat:
LLM Safety for Children. 809-821 - Akshay Jagatap, Srujana Merugu, Prakash Mandayam Comar:
RxLens: Multi-Agent LLM-powered Scan and Order for Pharmacy. 822-832 - Cong Duy Vu Hoang, Gioacchino Tangari, Clémence Lanfranchi, Dalu Guo, Paul Cayet, Steve Siu, Don Dharmasiri, Yuan-Fang Li, Long Duong, Damien Hilloulin, Rhicheek Patra, Sungpack Hong, Hassan Chafi:
Distill-C: Enhanced NL2SQL via Distilled Customization with LLMs. 833-848 - Luis Antonio Gutiérrez Guanilo, Mir Tafseer Nayeem, Cristian Jose Lopez Del Alamo, Davood Rafiei:
eC-Tab2Text: Aspect-Based Text Generation from e-Commerce Product Tables. 849-867 - Tzu-Lin Kuo, Fengting Liao, Mu-Wei Hsieh, Fu-Chieh Chang, Po-Chun Hsu, Da-shan Shiu:
RAD-Bench: Evaluating Large Language Models' Capabilities in Retrieval Augmented Dialogues. 868-902 - Seong-Jin Park, Youn-Gyu Jin, Hyun-Young Moon, Bong-Hyuck Choi, Seung Hwan Lee, Ohjoon Kwon, Kang-Min Kim:
Conflict and Overlap Classification in Construction Standards Using a Large Language Model. 903-917 - Ala Jararweh, Oladimeji Macaulay, David Arredondo, Yue Hu, Luis E. Tafoya, Kushal Virupakshappa, Avinash Sahu:
Protein2Text: Resampling Mechanism to Translate Protein Sequences into Human-Interpretable Text. 918-937 - Fajri Koto:
Cracking the Code: Multi-domain LLM Evaluation on Real-World Professional Exams in Indonesia. 938-948 - Ashlesha Akella, Abhijit Manatkar, Krishnasuri Narayanam, Sameep Mehta:
CodeGenWrangler: Data Wrangling task automation using Code-Generating Models. 949-960 - Mengze Hong, Chen Jason Zhang, Chaotao Chen, Rongzhong Lian, Di Jiang:
Dialogue Language Model with Large-Scale Persona Data Engineering. 961-970 - Song Wang, Xun Wang, Jie Mei, Yujia Xie, Si-Qing Chen, Wayne Xiong:
Developing a Reliable, Fast, General-Purpose Hallucination Detection and Mitigation Service. 971-978 - Siddharth Tumre, Sangameshwar Patil, Alok Kumar:
Improved Near-Duplicate Detection for Aggregated and Paywalled News-Feeds. 979-987 - Ivan Bondarenko, Daniil Grebenkin, Oleg Sedukhin, Mikhail Klementev, Derunets Roman, Lyudmila Budneva:
Pisets: A Robust Speech Recognition System for Lectures and Interviews. 988-997 - Kaixin Wu, Yixin Ji, Zeyuan Chen, Qiang Wang, Cunxiang Wang, Hong Liu, Baijun Ji, Jia Xu, Zhongyi Liu, Jinjie Gu, Yuan Zhou, Linjian Mo:
CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search. 998-1008 - Nitin Gupta, Manish Kesarwani, Sambit Ghosh, Sameep Mehta, Carlos Eberhardt, Dan Debrunner:
Schema and Natural Language Aware In-Context Learning for Improved GraphQL Query Generation. 1009-1015 - Lucas Spangher, Tianle Li, William F. Arnold, Nick Masiewicki, Xerxes Dotiwalla, Rama Kumar Pasumarthi, Peter Grabowski, Eugene Ie, Daniel Gruhl:
Chatbot Arena Estimate: towards a generalized performance benchmark for LLM capabilities. 1016-1025 - Arvind Krishna Sridhar, Yinyi Guo, Erik Visser:
Enhancing Temporal Understanding in Audio Question Answering for Large Audio Language Models. 1026-1035 - Rishi Kalra, Zekun Wu, Ayesha Gulley, Airlie Hilliard, Xin Guan, Adriano S. Koshiyama, Philip C. Treleaven:
HyPA-RAG: A Hybrid Parameter Adaptive Retrieval-Augmented Generation System for AI Legal and Policy Applications. 1036-1054 - Pengyu Gao, Jinming Zhao, Xinyue Chen, Yilin Long:
An Efficient Context-Dependent Memory Framework for LLM-Centric Agents. 1055-1069

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.