default search action
RANLP 2023: Varna, Bulgaria
- Ruslan Mitkov, Galia Angelova:
Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing, RANLP 2023, Varna, Bulgaria, 4-6 September 2023. INCOMA Ltd., Shoumen, Bulgaria 2023, ISBN 978-954-452-092-2 - Frontmatter.
- Tosin P. Adewumi, Isabella Södergren, Lama Alkhaled, Sana Sabah Al-Azzawi, Foteini Simistira Liwicki, Marcus Liwicki:
Bipol: Multi-Axes Evaluation of Bias with Explainability in Benchmark Datasets. 1-10 - Aditya Agarwal, Radhika Mamidi:
Automatically Generating Hindi Wikipedia Pages Using Wikidata as a Knowledge Graph: A Domain-Specific Template Sentences Approach. 11-21 - Shareefa Al Amer, Mark Lee, Phillip Smith:
Cross-lingual Classification of Crisis-related Tweets Using Machine Translation. 22-31 - Vera Aleksic, Mona Brems, Anna Mathes, Theresa Bertele:
Lexicon-Driven Automatic Sentence Generation for the Skills Section in a Job Posting. 32-40 - Abinew Ali Ayele, Skadi Dinter, Seid Muhie Yimam, Chris Biemann:
Multilingual Racial Hate Speech Detection Using Transfer Learning. 41-48 - Abinew Ali Ayele, Seid Muhie Yimam, Tadesse Destaw Belay, Tesfa Tegegne Asfaw, Chris Biemann:
Exploring Amharic Hate Speech Data Collection and Classification Approaches. 49-59 - Imran Ali, Praveen Gatla:
Bhojpuri WordNet: Problems in Translating Hindi Synsets into Bhojpuri. 60-68 - Fatemah Almeman, Hadi Sheikhi, Luis Espinosa Anke:
3D-EX: A Unified Dataset of Definitions and Dictionary Examples. 69-79 - Ghadi Alnafesah, Phillip Smith, Mark Lee:
Are You Not moved? Incorporating Sensorimotor Knowledge to Improve Metaphor Detection. 80-89 - Sarah Alnefaie, Eric Atwell, Mohammad Ammar Alsalka:
HAQA and QUQA: Constructing Two Arabic Question-Answering Corpora for the Quran and Hadith. 90-97 - Sultan Alsarra, Luay Abdeljaber, Wooseong Yang, Niamat Zawad, Latifur Khan, Patrick T. Brandt, Javier Osorio, Vito D'Orazio:
ConfliBERT-Arabic: A Pre-trained Arabic Language Model for Politics, Conflicts and Violence. 98-108 - Fabio Yanez, Andrés Montoyo, Yoan Gutiérrez, Rafael Muñoz, Armando Suárez:
A Review in Knowledge Extraction from Knowledge Bases. 109-116 - Isuri Anuradha Nanomi Arachchige, Le An Ha, Ruslan Mitkov, Vinita Nahar:
Evaluating of Large Language Models in Relationship Extraction from Unstructured Data: Empirical Study from Holocaust Testimonies. 117-123 - Ratchakrit Arreerard, Scott Piao:
Impact of Emojis on Automatic Analysis of Individual Emotion Categories. 124-131 - Santiago Arróniz, Sandra Kübler:
Was That a Question? Automatic Classification of Discourse Meaning in Spanish. 132-142 - Ana-Maria Barbu, Elena Irimia, Carmen Mîrzea Vasile, Vasile Pais:
Designing the LECOR Learner Corpus for Romanian. 143-152 - Florian Baud, Alex Aussem:
Non-Parametric Memory Guidance for Multi-Document Summarization. 153-158 - Ahmed Belkhir, Fatiha Sadat:
Beyond Information: Is ChatGPT Empathetic Enough? 159-169 - Meriem Beloucif, Mihir Bansal, Chris Biemann:
Using Wikidata for Enhancing Compositionality in Pretrained Language Models. 170-178 - Jishnu Bhardwaj, Anurag Balakrishnan, Satyam Pathak, Ishan Unnarkar, Aniruddha Gawande, Benyamin Ahmadnia:
Multimodal Learning for Accurate Visual Question Answering: An Attention-Based Approach. 179-186 - Savita Bhat, Vasudeva Varma, Niranjan Pedanekar:
Generative Models For Indic Languages: Evaluating Content Generation Capabilities. 187-195 - Angana Borah, Daria Pylypenko, Cristina España-Bonet, Josef van Genabith:
Measuring Spurious Correlation in Classification: "Clever Hans" in Translationese. 196-206 - Hsuvas Borkakoty, Luis Espinosa Anke:
WIKITIDE: A Wikipedia-Based Timestamped Definition Pairs Dataset. 207-216 - Pablo Botton da Costa, Matheus Camasmie Pavan, Wesley Ramos dos Santos, Samuel Caetano da Silva, Ivandré Paraboni:
BERTabaporu: Assessing a Genre-Specific Language Model for Portuguese NLP. 217-223 - Ivelina Bozhinova, Andrey Tagarev:
Comparison of Multilingual Entity Linking Approaches. 224-233 - Ana-Maria Bucur, Andreea Dinca, Madalina Chitez, Roxana Rogobete:
Automatic Extraction of the Romanian Academic Word List: Data and Methods. 234-241 - Lais Carraro Leme Cavalheiro, Matheus Camasmie Pavan, Ivandré Paraboni:
Stance Prediction from Multimodal Social Media Data. 242-248 - Mason Choey:
From Stigma to Support: A Parallel Monolingual Corpus and NLP Approach for Neutralizing Mental Illness Bias. 249-254 - Leonardo de Andrade, Karin Becker:
BB25HLegalSum: Leveraging BM25 and BERT-Based Clustering for the Summarization of Legal Documents. 255-263 - André Mediote de Sousa, Karin Becker:
SSSD: Leveraging Pre-trained Models and Semantic Search for Semi-supervised Stance Detection. 264-273 - Daryna Dementieva, Nikolay Babakov, Alexander Panchenko:
Detecting Text Formality: A Study of Text Classification Approaches. 274-284 - Hannah Devinney, Anton Eklund, Igor Ryazanov, Jingwen Cai:
Developing a Multilingual Corpus of Wikipedia Biographies. 285-294 - Liviu P. Dinu, Ana Sabina Uban:
A Computational Analysis of the Voices of Shakespeare's Characters. 295-300 - Fahad Ebrahim, Mike Joy:
Source Code Plagiarism Detection with Pre-Trained Model Embeddings and Automated Machine Learning. 301-309 - Deniz Ekin Yavas, Laura Kallmeyer, Rainer Osswald, Elisabetta Jezek, Marta Ricchiardi, Long Chen:
Identifying Semantic Argument Types in Predication and Copredication Contexts: A Zero-Shot Cross-Lingual Approach. 310-320 - Isabel Espinosa-Zaragoza, José Ignacio Abreu Salas, Elena Lloret, Paloma Moreda, Manuel Palomar:
A Review of Research-Based Automatic Text Simplification Tools. 321-330 - Michael Färber, Nicholas Popovic:
Vocab-Expander: A System for Creating Domain-Specific Vocabularies Based on Word Embeddings. 331-335 - Elisabetta Fersini, Antonio Candelieri, Lorenzo Pastore:
On the Generalization of Projection-Based Gender Debiasing in Word Embedding. 336-343 - Nelson Filipe Costa, Nadia Sheikh, Leila Kosseim:
Mapping Explicit and Implicit Discourse Relations between the RST-DT and the PDTB 3.0. 344-352 - Matthew Fort, Zuoyu Tian, Elizabeth Gabel, Nina Georgiades, Noah Sauer, Daniel Dakota, Sandra Kübler:
Bigfoot in Big Tech: Detecting Out of Domain Conspiracy Theories. 353-363 - Emma Franklin, Tharindu Ranasinghe:
Deep Learning Approaches to Detecting Safeguarding Concerns in Schoolchildren's Online Conversations. 364-372 - Paolo Gajo, Arianna Muti, Katerina Korre, Silvia Bernardini, Alberto Barrón-Cedeño:
On the Identification and Forecasting of Hate Speech in Inceldom. 373-384 - Santiago Galiano, Rafael Muñoz, Yoan Gutiérrez, Andrés Montoyo, José Ignacio Abreu, Luis Alfonso Ureña López:
T2KG: Transforming Multimodal Document to Knowledge Graph. 385-391 - Federico Garcea, Margherita Martinelli, Maja Milicevic Petrovic, Alberto Barrón-Cedeño:
!Translate: When You Cannot Cook Up a Translation, Explain. 392-398 - Harritxu Gete, Thierry Etchegoyhen:
An Evaluation of Source Factors in Concatenation-Based Context-Aware Neural Machine Translation. 399-407 - Iacopo Ghinassi, Lin Wang, Chris Newell, Matthew Purver:
Lessons Learnt from Linear Text Segmentation: a Fair Comparison of Architectural and Sentence Encoding Strategies for Successful Segmentation. 408-418 - Serge Gladkoff, Lifeng Han, Goran Nenadic:
Student's t-Distribution: On Measuring the Inter-Rater Reliability When the Observations are Scarce. 419-428 - Anna Glazkova:
Data Augmentation for Fake News Detection by Combining Seq2seq and NLI. 429-439 - Vishwani Gupta, Astrid Viciano, Holger Wormer, Najmehsadat Mousavinezhad:
Exploring Unsupervised Semantic Similarity Methods for Claim Verification in Health Care News Articles. 440-447 - Najet Hadj Mohamed, Malak Rassem, Lifeng Han, Goran Nenadic:
AlphaMWE-Arabic: Arabic Edition of Multilingual Parallel Corpora with Multiword Expression Annotations. 448-457 - Abdelhalim Hafedh Dahou, Mohamed Amine Chéragui, Ahmed Abdelali:
Performance Analysis of Arabic Pre-trained Models on Named Entity Recognition Task. 458-467 - Blaise Hanel, Leila Kosseim:
Discourse Analysis of Argumentative Essays of English Learners Based on CEFR Level. 468-474 - Mathias Hans Erik Stenlund, Mathilde Nanni, Micaella Bruton, Meriem Beloucif:
Improving Translation Quality for Low-Resource Inuktitut with Various Preprocessing Techniques. 475-479 - Momchil Hardalov, Ivan Koychev, Preslav Nakov:
Enriched Pre-trained Transformers for Joint Slot Filling and Intent Detection. 480-493 - Muzhaffar Hazman, Susan McKeever, Josephine Griffith:
Unimodal Intermediate Training for Multimodal Meme Sentiment Classification. 494-506 - Hansi Hettiarachchi, Tharindu Ranasinghe:
Explainable Event Detection with Event Trigger Identification as Rationale Extraction. 507-518 - Anton Hristov, Petar Ivanov, Anna Aksenova, Tsvetan Asamov, Pavlin Gyurov, Todor Primov, Svetla Boytcheva:
Clinical Text Classification to SNOMED CT Codes Using Transformers Trained on Linked Open Medical Ontologies. 519-526 - Rudali Huidrom, Anya Belz:
Towards a Consensus Taxonomy for Annotating Errors in Automatically Generated Text. 527-540 - Jinha Hwang, Carol Gudumotu, Benyamin Ahmadnia:
Uncertainty Quantification of Text Classification in a Multi-Label Setting for Risk-Sensitive Systems. 541-547 - Tatsuya Ishigaki, Yui Uehara, Goran Topic, Hiroya Takamura:
Pretraining Language- and Domain-Specific BERT on Automatically Translated Text. 548-555 - Ye Jiang, Xingyi Song, Carolina Scarton, Iknoor Singh, Ahmet Aker, Kalina Bontcheva:
Categorising Fine-to-Coarse Grained Misinformation: An Empirical Study of the COVID-19 Infodemic. 556-567 - Shun Kiyono, Sho Takase, Shengzhe Li, Toshinori Sato:
Bridging the Gap between Subword and Character Segmentation in Pretrained Language Models. 568-577 - Jordan Koontz, Maite Oronoz, Alicia Pérez:
Evaluating Data Augmentation for Medication Identification in Clinical Notes. 578-585 - Andriy Kosar, Guy De Pauw, Walter Daelemans:
Advancing Topical Text Classification: A Novel Distance-Based Method with Contextual Embeddings. 586-597 - Saranya Krishnamoorthy, Ayush Singh:
Taxonomy-Based Automation of Prior Approval Using Clinical Guidelines. 598-607 - Maria Kunilovskaya, Heike Przybyl, Ekaterina Lapshinova-Koltunski, Elke Teich:
Simultaneous Interpreting as a Noisy Channel: How Much Information Gets Through. 608-618 - Fabian Lechner, Allison Lahnala, Charles Welch, Lucie Flek:
Challenges of GPT-3-Based Conversational Agents for Healthcare. 619-630 - João Augusto Leite, Carolina Scarton, Diego F. Silva:
Noisy Self-Training with Data Augmentations for Offensive and Hate Speech Detection Tasks. 631-640 - Yinheng Li:
A Practical Survey on Zero-Shot Prompt Design for In-Context Learning. 641-647 - Yue Li, Carolina Scarton, Xingyi Song, Kalina Bontcheva:
Classifying COVID-19 Vaccine Narratives. 648-657 - Jacky Li, Jaren Gerdes, James Gojit, Austin Tao, Samyak Katke, Kate Nguyen, Benyamin Ahmadnia:
Sign Language Recognition and Translation: A Multi-Modal Approach Using Computer Vision and Natural Language Processing. 658-665 - Tianyu Liang, Yida Mu, Soonho Kim, Darline Kengne Kuate, Julie Lang, Rob Vos, Xingyi Song:
Classification-Aware Neural Topic Model Combined with Interpretable Analysis - for Conflict Classification. 666-672 - Ming Liu, Massimo Poesio:
Data Augmentation for Fake Reviews Detection. 673-680 - Congda Ma, Kotaro Funakoshi, Kiyoaki Shirai, Manabu Okumura:
Coherent Story Generation with Structured Knowledge. 681-690 - Eliot Maës, Thierry Legou, Leonor Becerra, Philippe Blache:
Studying Common Ground Instantiation Using Audio, Video and Brain Behaviours: The BrainKT Corpus. 691-702 - Ole Magnus Holter, Basil Ell:
Reading between the Lines: Information Extraction from Industry Requirements. 703-711 - Iva Marinova, Kiril Simov, Petya Osenova:
Transformer-Based Language Models for Bulgarian. 712-720 - Alimuddin Melleng, Anna Jurek-Loughrey, Deepak P:
Multi-task Ensemble Learning for Fake Reviews Detection and Helpfulness Prediction: A Novel Approach. 721-729 - Alimuddin Melleng, Anna Jurek-Loughrey, Deepak P:
Data Fusion for Better Fake Reviews Detection. 730-738 - Pascale Moreira, Yuri Bizzoni:
Dimensions of Quality: Contrasting Stylistic vs. Semantic Features for Modelling Literary Quality in 9, 000 Novels. 739-747 - Md. Motahar Mahtab, Monirul Haque, Md. Mehedi Hasan Shawon, Farig Sadeque:
BanglaBait: Semi-Supervised Adversarial Approach for Clickbait Detection on Bangla Clickbait Dataset. 748-758 - Attila Nagy, Dorina Lakatos, Botond Barta, Judit Ács:
TreeSwap: Data Augmentation for Machine Translation via Dependency Subtree Swapping. 759-768 - Kamel Nebhi, György Szaszák:
Automatic Assessment Of Spoken English Proficiency Based on Multimodal and Multitask Transformers. 769-776 - Vasudevan Nedumpozhimana, Sneha Rautmare, Meegan Gower, Nishtha Jain, Maja Popovic, Patricia Buffini, John D. Kelleher:
Medical Concept Mention Identification in Social Media Posts Using a Small Number of Sample References. 777-784 - Jan Nehring, René Marcel Berk, Stefan Hillmann:
Context-Aware Module Selection in Modular Dialog Systems. 785-791 - Boyu Niu, Céline Manetta, Frédérique Segond:
Human Value Detection from Bilingual Sensory Product Reviews. 792-802 - Magali Norré, Rémi Cardon, Vincent Vandeghinste, Thomas François:
Word Sense Disambiguation for Automatic Translation of Medical Dialogues into Pictographs. 803-812 - John E. Ortega, Kenneth Church:
A Research-Based Guide for the Creation and Deployment of a Low-Resource Machine Translation System. 813-823 - Jan Pasek, Jakub Sido, Miloslav Konopík, Ondrej Prazák:
MQDD: Pre-training of Multimodal Question Duplicity Detection for Software Engineering Domain. 824-835 - Nilay Patel, Jeffrey Flanigan:
Forming Trees with Treeformers. 836-845 - Judicael Poumay, Ashwin Ittoo:
Evaluating Unsupervised Hierarchical Topic Models Using a Labeled Dataset. 846-853 - Judicael Poumay, Ashwin Ittoo:
HTMOT: Hierarchical Topic Modelling over Time. 854-863 - Karan Praharaj, Irina Matveeva:
Multilingual Continual Learning Approaches for Text Classification. 864-870 - Damith Premasiri, Tharindu Ranasinghe, Ruslan Mitkov:
Can Model Fusing Help Transformers in Long Document Classification? An Empirical Study. 871-878 - Damith Premasiri, Amal Haddad Haddad, Tharindu Ranasinghe, Ruslan Mitkov:
Deep Learning Methods for Identification of Multiword Flower and Plant Names. 879-887 - Pavel Pribán, Ondrej Prazák:
Improving Aspect-Based Sentiment with End-to-End Semantic Role Labeling Model. 888-897 - Noémi Prótár, Dávid Márk Nemeskey:
huPWKP: A Hungarian Text Simplification Corpus. 898-907 - Mahfuzur Rahman Chowdhury, Intesur Ahmed, Farig Sadeque, Muhammad Yanhaona:
Topic Modeling Using Community Detection on a Word Association Graph. 908-917 - Bharathi Raja Chakravarthi, Prasanna Kumar Kumaresan, Rahul Ponnusamy, John P. McCrae, Michaela Comerford, Jay Megaro, Deniz Keles, Last Feremenga:
Exploring Techniques to Detect and Mitigate Non-Inclusive Language Bias in Marketing Communications Using a Dictionary-Based Approach. 918-925 - Geetanjali Rakshit, Jeffrey Flanigan:
Does the "Most Sinfully Decadent Cake Ever" Taste Good? Answering Yes/No Questions from Figurative Contexts. 926-936 - Leonardo Ranaldi, Giulia Pucci, Fabio Massimo Zanzotto:
Modeling Easiness for Training Transformers with Curriculum Learning. 937-948 - Leonardo Ranaldi, Aria Nourbakhsh, Elena Sofia Ruzzetti, Arianna Patrizi, Dario Onorati, Michele Mastromattei, Francesca Fallucchi, Fabio Massimo Zanzotto:
The Dark Side of the Language: Pre-trained Transformers in the DarkNet. 949-960 - Leonardo Ranaldi, Elena Sofia Ruzzetti, Fabio Massimo Zanzotto:
PreCog: Exploring the Relation between Memorization and Performance in Pre-trained Language Models. 961-967 - Tharindu Ranasinghe, Alistair Plum, Christoph Purschke, Marcos Zampieri:
Publish or Hold? Automatic Comment Moderation in Luxembourgish News Articles. 968-978 - Amaan Rizvi, Anupam Jamatia, Dwijen Rudrapal, Kunal Chakma, Björn Gambäck:
Cross-Lingual Speaker Identification for Indian Languages. 979-987 - Pattabhi Rk Rao, Sobha Lalitha Devi:
'ChemXtract' A System for Extraction of Chemical Events from Patent Documents. 988-995 - Julia Romberg:
Mind the User! Measures to More Accurately Evaluate the Practical Value of Active Learning Strategies. 996-1006 - Sumukh S, Abhinav Appidi Reddy, Manish Shrivastava:
Event Annotation and Detection in Kannada-English Code-Mixed Social Media Data. 1007-1014 - Branislava Sandrih Todorovic, Katarina Josipovic, Jurij Kodre:
Three Approaches to Client Email Topic Classification. 1015-1022 - Parth Saxena, Mo El-Haj:
Exploring Abstractive Text Summarisation for Podcasts: A Comparative Study of BART and T5 Models. 1023-1033 - Tim Schopf, Karim Arabi, Florian Matthes:
Exploring the Landscape of Natural Language Processing Research. 1034-1045 - Tim Schopf, Dennis N. Schneider, Florian Matthes:
Efficient Domain Adaptation of Sentence Embeddings Using Adapters. 1046-1053 - Tim Schopf, Emanuel Gerber, Malte Ostendorff, Florian Matthes:
AspectCSE: Sentence Embeddings for Aspect-Based Semantic Textual Similarity Using Contrastive Learning and Structured Knowledge. 1054-1065 - Sadat Shahriar, Arjun Mukherjee:
Tackling the Myriads of Collusion Scams on YouTube Comments of Cryptocurrency Videos. 1066-1075 - Sadat Shahriar, Arjun Mukherjee, Omprakash Gnawali:
Exploring Deceptive Domain Transfer Strategies: Mitigating the Differences among Deceptive Domains. 1076-1084 - Sanjeepan Sivapiran, Charangan Vasantharajan, Uthayasanker Thayasivam:
Party Extraction from Legal Contract Using Contextualized Span Representations of Parties. 1085-1094 - Razvan-Alexandru Smadu, Sebastian-Vasile Echim, Dumitru-Clementin Cercel, Iuliana Marin, Florin Pop:
From Fake to Hyperpartisan News Detection Using Domain Adaptation. 1095-1109 - Jakub Smíd, Pavel Pribán:
Prompt-Based Approach for Czech Sentiment Analysis. 1110-1120 - Nasim Sobhani, Kinshuk Sengupta, Sarah Jane Delany:
Measuring Gender Bias in Natural Language Processing: Incorporating Gender-Neutral Linguistic Forms for Non-Binary Gender Identities in Abusive Speech Detection. 1121-1131 - Sanja Stajner, Daniel Ibanez, Horacio Saggion:
LeSS: A Computationally-Light Lexical Simplifier for Spanish. 1132-1142 - R. Vijay Sundar Ram, Sobha Lalitha Devi:
Hindi to Dravidian Language Neural Machine Translation Systems. 1143-1150 - Irina Temnikova, Iva Marinova, Silvia Gargova, Ruslana Margova, Ivan Koychev:
Looking for Traces of Textual Deepfakes in Bulgarian on Social Media. 1151-1161 - Natalia Vanetik, Marina Litvak, Egor Reviakin, Margarita Tiamanova:
Propaganda Detection in Russian Telegram Posts in the Scope of the Russian Invasion of Ukraine. 1162-1170 - Stalin Varanasi, Muhammad Umer Tariq Butt, Guenter Neumann:
Auto-Encoding Questions with Retrieval Augmented Decoding for Unsupervised Passage Retrieval and Zero-Shot Question Generation. 1171-1179 - Francielle Alves Vargas, Isabelle Carvalho, Wolfgang Schmeisser-Nieto, Fabrício Benevenuto, Thiago A. S. Pardo:
NoHateBrazil: A Brazilian Portuguese Text Offensiveness Analysis System. 1180-1186 - Francielle Vargas, Isabelle Carvalho, Ali Hürriyetoglu, Thiago A. S. Pardo, Fabrício Benevenuto:
Socially Responsible Hate Speech Detection: Can Classifiers Reflect Social Stereotypes? 1187-1196 - Francielle Vargas, Kokil Jaidka, Thiago A. S. Pardo, Fabrício Benevenuto:
Predicting Sentence-Level Factuality of News and Bias of Media Outlets. 1197-1206 - Shubham Vatsal, Adam Meyers, John E. Ortega:
Classification of US Supreme Court Cases Using BERT-Based Techniques. 1207-1215 - Devika Verma, Ramprasad S. Joshi, Aiman A Shivani, Rohan D. Gupta:
Kāraka-Based Answer Retrieval for Question Answering in Indic Languages. 1216-1224 - Gayashan Weerasundara, Nisansa de Silva:
Comparative Analysis of Named Entity Recognition in the Dungeons and Dragons Domain. 1225-1233 - Yizhou Xu, Kata Gábor, Jérôme Milleret, Frédérique Segond:
Comparative Analysis of Anomaly Detection Algorithms in Text Data. 1234-1245