


Marcos Zampieri
Person information
- affiliation: George Mason University, Fairfax, VA, USA
- affiliation (former): Rochester Institute of Technology, Rochester, NY, USA
2020 – today
- 2025
- [j21]Kai North, Tharindu Ranasinghe, Matthew Shardlow, Marcos Zampieri:
Deep learning approaches to lexical simplification: A survey. J. Intell. Inf. Syst. 63(1): 111-134 (2025) - [j20]Tharindu Ranasinghe, Isuri Anuradha, Damith Premasiri, Kanishka Silva, Hansi Hettiarachchi, Lasitha Uyangodage, Marcos Zampieri:
SOLD: Sinhala offensive language dataset. Lang. Resour. Evaluation 59(1): 297-337 (2025) - [c120]Sujan Dutta, Deepak Pandita, Tharindu Cyril Weerasooriya, Marcos Zampieri, Christopher M. Homan, Ashiqur R. KhudaBukhsh:
ARTICLE: Annotator Reliability Through In-Context Learning. AAAI 2025: 14230-14237 - [c119]Sujan Dutta, Deepak Pandita, Tharindu Cyril Weerasooriya, Marcos Zampieri, Christopher M. Homan, Ashiqur R. KhudaBukhsh:
ARTICLE: Annotator Reliability Through In-Context Learning (Student Abstract). AAAI 2025: 29356-29358 - [c118]Nishat Raihan, Mohammed Latif Siddiq, Joanna C. S. Santos, Marcos Zampieri:
Large Language Models in Computer Science Education: A Systematic Literature Review. SIGCSE (1) 2025: 938-944 - [i88]Nishat Raihan, Marcos Zampieri:
TigerLLM - A Family of Bangla Large Language Models. CoRR abs/2503.10995 (2025) - [i87]Ana-Maria Bucur, Andreea-Codrina Moldovan, Krutika Parvatikar, Marcos Zampieri, Ashiqur R. KhudaBukhsh, Liviu P. Dinu:
Datasets for Depression Modeling in Social Media: An Overview. CoRR abs/2503.21513 (2025) - 2024
- [j19]Md. Mushfiqur Rahman, Mohammad Sabik Irbaz, Kai North, Michelle S. Williams, Marcos Zampieri, Kevin Lybarger:
Health text simplification: An annotated corpus for digestive cancer education and novel strategies for reinforcement learning. J. Biomed. Informatics 158: 104727 (2024) - [j18]Alphaeus Dmonte, Shrey Satapara, Rehab Alsudais, Tharindu Ranasinghe, Marcos Zampieri:
On the effects of machine translation on offensive language detection. Soc. Netw. Anal. Min. 14(1): 242 (2024) - [c117]Alphaeus Dmonte, Tejas Arya, Tharindu Ranasinghe, Marcos Zampieri:
Towards Generalized Offensive Language Identification. ASONAM (1) 2024: 271-286 - [c116]Matthew Shardlow, Fernando Alva-Manchego, Riza Batista-Navarro, Stefan Bott, Saúl Calderón Ramírez, Rémi Cardon, Thomas François, Akio Hayakawa, Andrea Horbach, Anna Hülsing, Yusuke Ide, Joseph Marvin Imperial, Adam Nohejl, Kai North, Laura Occhipinti, Nelson Perez-Rojas, Nishat Raihan, Tharindu Ranasinghe, Martin Solis-Salazar, Sanja Stajner, Marcos Zampieri, Horacio Saggion:
The BEA 2024 Shared Task on the Multilingual Lexical Simplification Pipeline. BEA 2024: 571-589 - [c115]Dhiman Goswami, Kai North, Marcos Zampieri:
GMU at MLSP 2024: Multilingual Lexical Simplification with Transformer Models. BEA 2024: 627-634 - [c114]Alphaeus Dmonte, Eunmi Ko, Marcos Zampieri:
An Evaluation of Large Language Models in Financial Sentiment Analysis. IEEE Big Data 2024: 4869-4874 - [c113]Nishat Raihan, Christian D. Newman, Marcos Zampieri:
Code LLMs: A Taxonomy-based Survey. IEEE Big Data 2024: 5402-5411 - [c112]Amrita Ganguly, Al Nahian Bin Emran, Sadiya Sayara Chowdhury Puspo, Md. Nishat Raihan, Dhiman Goswami, Marcos Zampieri:
MasonPerplexity at Multimodal Hate Speech Event Detection 2024: Hate Speech and Target Detection Using Transformer Ensembles. CASE 2024: 125-131 - [c111]Marcos Zampieri, Kai North, Tommi Jauhiainen, Mariano Felice, Neha Kumari, Nishant Nair, Yash Mahesh Bangera:
Language Variety Identification with True Labels. LREC/COLING 2024: 10100-10109 - [c110]Md. Nishat Raihan, Sadiya Sayara Chowdhury Puspo, Shafkat Farabi, Ana-Maria Bucur, Tharindu Ranasinghe, Marcos Zampieri:
MentalHelp: A Multi-Task Dataset for Mental Health in Social Media. LREC/COLING 2024: 11196-11203 - [c109]Roland Oruche, Marcos Zampieri, Prasad Calyam:
Deep Contrastive Active Learning for Out-of-domain Filtering in Dialog Systems. DSAA 2024: 1-10 - [c108]Deepak Pandita, Tharindu Cyril Weerasooriya, Sujan Dutta, Sarah Luger, Tharindu Ranasinghe, Ashiqur R. KhudaBukhsh, Marcos Zampieri, Christopher Homan:
Rater Cohesion and Quality from a Vicarious Perspective. EMNLP (Findings) 2024: 5149-5162 - [c107]Shafkat Farabi, Tharindu Ranasinghe, Diptesh Kanojia, Yu Kong, Marcos Zampieri:
A Survey of Multimodal Sarcasm Detection. IJCAI 2024: 8020-8028 - [c106]Md. Nishat Raihan, Dhiman Goswami, Sadiya Sayara Chowdhury Puspo, Christian D. Newman, Tharindu Ranasinghe, Marcos Zampieri:
CSEPrompts: A Benchmark of Introductory Computer Science Prompts. ISMIS 2024: 45-54 - [c105]Dhiman Goswami, Sharanya Thilagan, Kai North, Shervin Malmasi, Marcos Zampieri:
Native Language Identification in Texts: A Survey. NAACL-HLT 2024: 3149-3160 - [c104]Elijah Bass, Massimiliano Albanese, Marcos Zampieri:
DISC: A Dataset for Information Security Classification. SECRYPT 2024: 175-185 - [c103]Alphaeus Dmonte, Marcos Zampieri, Kevin Lybarger, Massimiliano Albanese, Genya Coulter:
Classifying Human-Generated and AI-Generated Election Claims in Social Media. SECRYPT 2024: 237-248 - [c102]Md. Nishat Raihan, Dhiman Goswami, Al Nahian Bin Emran, Sadiya Sayara Chowdhury Puspo, Amrita Ganguly, Marcos Zampieri:
MasonTigers at SemEval-2024 Task 9: Solving Puzzles with an Ensemble of Chain-of-Thought Prompts. SemEval@NAACL 2024: 1358-1363 - [c101]Dhiman Goswami, Sadiya Sayara Chowdhury Puspo, Md. Nishat Raihan, Al Nahian Bin Emran, Amrita Ganguly, Marcos Zampieri:
MasonTigers at SemEval-2024 Task 1: An Ensemble Approach for Semantic Textual Relatedness. SemEval@NAACL 2024: 1380-1390 - [e12]Yang (Trista) Cao, Isabel Papadimitriou, Anaelia Ovalle, Marcos Zampieri, Francis Ferraro, Swabha Swayamdipta:
Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Student Research Workshop, NAACL 2024, Mexico City, Mexico, June 18, 2024. Association for Computational Linguistics 2024, ISBN 979-8-89176-117-9 [contents] - [i86]Md. Mushfiqur Rahman, Mohammad Sabik Irbaz, Kai North, Michelle S. Williams, Marcos Zampieri, Kevin Lybarger:
Health Text Simplification: An Annotated Corpus for Digestive Cancer Education and Novel Strategies for Reinforcement Learning. CoRR abs/2401.15043 (2024) - [i85]Amrita Ganguly, Al Nahian Bin Emran, Sadiya Sayara Chowdhury Puspo, Md. Nishat Raihan, Dhiman Goswami, Marcos Zampieri:
MasonPerplexity at Multimodal Hate Speech Event Detection 2024: Hate Speech and Target Detection Using Transformer Ensembles. CoRR abs/2402.01967 (2024) - [i84]Kai North, Tharindu Ranasinghe, Matthew Shardlow, Marcos Zampieri:
MultiLS: A Multi-task Lexical Simplification Framework. CoRR abs/2402.14972 (2024) - [i83]Md. Nishat Raihan, Dhiman Goswami, Al Nahian Bin Emran, Sadiya Sayara Chowdhury Puspo, Amrita Ganguly, Marcos Zampieri:
MasonTigers at SemEval-2024 Task 9: Solving Puzzles with an Ensemble of Chain-of-Thoughts. CoRR abs/2403.14982 (2024) - [i82]Dhiman Goswami, Sadiya Sayara Chowdhury Puspo, Md. Nishat Raihan, Al Nahian Bin Emran, Amrita Ganguly, Marcos Zampieri:
MasonTigers at SemEval-2024 Task 1: An Ensemble Approach for Semantic Textual Relatedness. CoRR abs/2403.14990 (2024) - [i81]Md. Nishat Raihan, Dhiman Goswami, Sadiya Sayara Chowdhury Puspo, Christian D. Newman, Tharindu Ranasinghe, Marcos Zampieri:
CSEPrompts: A Benchmark of Introductory Computer Science Prompts. CoRR abs/2404.02540 (2024) - [i80]Marcos Zampieri, Damith Premasiri, Tharindu Ranasinghe:
A Federated Learning Approach to Privacy Preserving Offensive Language Identification. CoRR abs/2404.11470 (2024) - [i79]Alphaeus Dmonte, Marcos Zampieri, Kevin Lybarger, Massimiliano Albanese, Genya Coulter:
Classifying Human-Generated and AI-Generated Election Claims in Social Media. CoRR abs/2404.16116 (2024) - [i78]Sungsoo Ray Hong, Marcos Zampieri, Brittany N. Hand, Vivian Motti, Dongjun Chung, Özlem Uzuner:
Collaborative Design for Job-Seekers with Autism: A Conceptual Framework for Future Research. CoRR abs/2405.06078 (2024) - [i77]Md. Nishat Raihan, Dhiman Goswami, Antara Mahmud, Antonios Anastasopoulos, Marcos Zampieri:
EmoMix-3L: A Code-Mixed Dataset for Bangla-English-Hindi Emotion Detection. CoRR abs/2405.06922 (2024) - [i76]Alphaeus Dmonte, Tejas Arya, Tharindu Ranasinghe, Marcos Zampieri:
Towards Generalized Offensive Language Identification. CoRR abs/2407.18738 (2024) - [i75]Deepak Pandita, Tharindu Cyril Weerasooriya, Sujan Dutta, Sarah Luger, Tharindu Ranasinghe, Ashiqur R. KhudaBukhsh, Marcos Zampieri, Christopher M. Homan:
Rater Cohesion and Quality from a Vicarious Perspective. CoRR abs/2408.08411 (2024) - [i74]Alphaeus Dmonte, Roland Oruche, Marcos Zampieri, Prasad Calyam, Isabelle Augenstein:
Claim Verification in the Age of Large Language Models: A Survey. CoRR abs/2408.14317 (2024) - [i73]Sujan Dutta, Deepak Pandita, Tharindu Cyril Weerasooriya, Marcos Zampieri, Christopher M. Homan, Ashiqur R. KhudaBukhsh:
ARTICLE: Annotator Reliability Through In-Context Learning. CoRR abs/2409.12218 (2024) - [i72]Ana-Maria Bucur, Andreea-Codrina Moldovan, Krutika Parvatikar, Marcos Zampieri, Ashiqur R. KhudaBukhsh, Liviu P. Dinu:
On the State of NLP Approaches to Modeling Depression in Social Media: A Post-COVID-19 Outlook. CoRR abs/2410.08793 (2024) - [i71]Nishat Raihan, Antonios Anastasopoulos, Marcos Zampieri:
mHumanEval - A Multilingual Benchmark to Evaluate Large Language Models for Code Generation. CoRR abs/2410.15037 (2024) - [i70]Mamadou K. Keita, Christopher Homan, Sofiane Abdoulaye Hamani, Adwoa Bremang, Marcos Zampieri, Habibatou Abdoulaye Alfari, Elysabhete Amadou Ibrahim, Dennis Owusu:
Grammatical Error Correction for Low-Resource Languages: The Case of Zarma. CoRR abs/2410.15539 (2024) - [i69]Nishat Raihan, Mohammed Latif Siddiq, Joanna C. S. Santos, Marcos Zampieri:
Large Language Models in Computer Science Education: A Systematic Literature Review. CoRR abs/2410.16349 (2024) - [i68]Nishat Raihan, Joanna C. S. Santos, Marcos Zampieri:
MojoBench: Language Modeling and Benchmarks for Mojo. CoRR abs/2410.17736 (2024) - [i67]Shafkat Farabi, Tharindu Ranasinghe, Diptesh Kanojia, Yu Kong, Marcos Zampieri:
A Survey of Multimodal Sarcasm Detection. CoRR abs/2410.18882 (2024) - [i66]Nishat Raihan, Christian D. Newman, Marcos Zampieri:
Code LLMs: A Taxonomy-based Survey. CoRR abs/2412.08291 (2024) - 2023
- [j17]Kai North, Marcos Zampieri, Matthew Shardlow:
Lexical Complexity Prediction: An Overview. ACM Comput. Surv. 55(9): 179:1-179:42 (2023) - [j16]Kai North, Marcos Zampieri:
Features of lexical complexity: insights from L1 and L2 speakers. Frontiers Artif. Intell. 6 (2023) - [j15]Marcos Zampieri, Tharindu Ranasinghe, Diptanu Sarkar, Alexander Ororbia:
Offensive language identification with multi-task learning. J. Intell. Inf. Syst. 60(3): 613-630 (2023) - [j14]Marcos Zampieri, Isabelle Augenstein, Siddharth Krishnan, Joshua Melton, Preslav Nakov:
Preface: Special issue on NLP approaches to offensive content online. Nat. Lang. Eng. 29(6): 1415 (2023) - [j13]Marcos Zampieri, Sara Rosenthal, Preslav Nakov, Alphaeus Dmonte, Tharindu Ranasinghe:
OffensEval 2023: Offensive language identification in the age of Large Language Models. Nat. Lang. Eng. 29(6): 1416-1435 (2023) - [c100]Marcos Zampieri, Skye Morgan, Kai North, Tharindu Ranasinghe, Austin Simmmons, Paridhi Khandelwal, Sara Rosenthal, Preslav Nakov:
Target-Based Offensive Language Identification. ACL (2) 2023: 762-770 - [c99]Tharindu Ranasinghe, Marcos Zampieri:
Teacher and Student Models of Offensive Language in Social Media. ACL (Findings) 2023: 3910-3922 - [c98]Kai North, Alphaeus Dmonte, Tharindu Ranasinghe, Matthew Shardlow, Marcos Zampieri:
ALEXSIS+: Improving Substitute Generation and Selection for Lexical Simplification with Information Retrieval. BEA@ACL 2023: 404-413 - [c97]Niloofar Kalantari, Amirreza Payandeh, Marcos Zampieri, Vivian Genaro Motti:
Understanding the Language of ADHD and Autism Communities on Social Media. IEEE Big Data 2023: 2188-2195 - [c96]Tharindu Cyril Weerasooriya, Sujan Dutta, Tharindu Ranasinghe, Marcos Zampieri, Christopher Homan, Ashiqur R. KhudaBukhsh:
Vicarious Offense and Noise Audit of Offensive Speech Classifiers: Unifying Human and Machine Disagreement on What is Offensive. EMNLP 2023: 11648-11668 - [c95]Tharindu Ranasinghe, Koyel Ghosh, Aditya Shankar Pal, Apurbalal Senapati, Alphaeus Eric Dmonte, Marcos Zampieri, Sandip Modha, Shrey Satapara:
Overview of the HASOC Subtracks at FIRE 2023: Hate Speech and Offensive Content Identification in Assamese, Bengali, Bodo, Gujarati and Sinhala. FIRE 2023: 13-15 - [c94]Shrey Satapara, Hiren Madhu, Tharindu Ranasinghe, Alphaeus Eric Dmonte, Marcos Zampieri, Pavan Pandya, Nisarg Shah, Sandip Modha, Prasenjit Majumder, Thomas Mandl:
Overview of the HASOC Subtrack at FIRE 2023: Hate-Speech Identification in Sinhala and Gujarati. FIRE (Working Notes) 2023: 344-350 - [c93]Tharindu Ranasinghe, Marcos Zampieri:
A Text-to-Text Model for Multilingual Offensive Language Identification. IJCNLP (Findings) 2023: 375-384 - [c92]Tharindu Ranasinghe, Alistair Plum, Christoph Purschke, Marcos Zampieri:
Publish or Hold? Automatic Comment Moderation in Luxembourgish News Articles. RANLP 2023: 968-978 - [c91]Noëmi Aepli, Çagri Çöltekin, Rob van der Goot, Tommi Jauhiainen, Mourhaf Kazzaz, Nikola Ljubesic, Kai North, Barbara Plank, Yves Scherrer, Marcos Zampieri:
Findings of the VarDial Evaluation Campaign 2023. VarDial@EACL 2023: 251-261 - [e11]Yves Scherrer, Tommi Jauhiainen, Nikola Ljubesic, Preslav Nakov, Jörg Tiedemann, Marcos Zampieri:
Tenth Workshop on NLP for Similar Languages, Varieties and Dialects, VarDial@EACL 2023, Dubrovnik, Croatia, May 5, 2023. Association for Computational Linguistics 2023, ISBN 978-1-959429-50-0 [contents] - [i65]Tharindu Cyril Weerasooriya, Sujan Dutta, Tharindu Ranasinghe, Marcos Zampieri, Christopher M. Homan, Ashiqur R. KhudaBukhsh:
Vicarious Offense and Noise Audit of Offensive Speech Classifiers. CoRR abs/2301.12534 (2023) - [i64]Horacio Saggion, Sanja Stajner, Daniel Ferrés, Kim Cheng Sheang, Matthew Shardlow, Kai North, Marcos Zampieri:
Findings of the TSAR-2022 Shared Task on Multilingual Lexical Simplification. CoRR abs/2302.02888 (2023) - [i63]Marcos Zampieri, Kai North, Tommi Jauhiainen, Mariano Felice, Neha Kumari, Nishant Nair, Yash Bangera:
Language Variety Identification with True Labels. CoRR abs/2303.01490 (2023) - [i62]Kai North, Marcos Zampieri, Matthew Shardlow:
Lexical Complexity Prediction: An Overview. CoRR abs/2303.04851 (2023) - [i61]Kai North, Tharindu Ranasinghe, Matthew Shardlow, Marcos Zampieri:
Deep Learning Approaches to Lexical Simplification: A Survey. CoRR abs/2305.12000 (2023) - [i60]Noëmi Aepli, Çagri Çöltekin, Rob van der Goot, Tommi Jauhiainen, Mourhaf Kazzaz, Nikola Ljubesic, Kai North, Barbara Plank, Yves Scherrer, Marcos Zampieri:
Findings of the VarDial Evaluation Campaign 2023. CoRR abs/2305.20080 (2023) - [i59]Md. Nishat Raihan, Dhiman Goswami, Antara Mahmud, Antonios Anstasopoulos, Marcos Zampieri:
SentMix-3L: A Bangla-English-Hindi Code-Mixed Dataset for Sentiment Analysis. CoRR abs/2310.18023 (2023) - [i58]Dhiman Goswami, Md. Nishat Raihan, Antara Mahmud, Antonios Anstasopoulos, Marcos Zampieri:
OffMix-3L: A Novel Code-Mixed Dataset in Bangla-English-Hindi for Offensive Language Identification. CoRR abs/2310.18387 (2023) - [i57]Md. Nishat Raihan, Umma Hani Tanmoy, Anika Binte Islam, Kai North, Tharindu Ranasinghe, Antonios Anastasopoulos, Marcos Zampieri:
Offensive Language Identification in Transliterated and Code-Mixed Bangla. CoRR abs/2311.15023 (2023) - [i56]Md. Nishat Raihan, Dhiman Goswami, Sadiya Sayara Chowdhury Puspo, Marcos Zampieri:
nlpBDpatriots at BLP-2023 Task 1: A Two-Step Classification for Violence Inciting Text Detection in Bangla. CoRR abs/2311.15029 (2023) - [i55]Dhiman Goswami, Md. Nishat Raihan, Sadiya Sayara Chowdhury Puspo, Marcos Zampieri:
nlpBDpatriots at BLP-2023 Task 2: A Transfer Learning Approach to Bangla Sentiment Analysis. CoRR abs/2311.15032 (2023) - [i54]Tharindu Ranasinghe, Marcos Zampieri:
A Text-to-Text Model for Multilingual Offensive Language Identification. CoRR abs/2312.03379 (2023) - 2022
- [j12]Sanja Stajner, Daniel Ferrés, Matthew Shardlow, Kai North, Marcos Zampieri, Horacio Saggion:
Lexical simplification benchmarks for English, Portuguese, and Spanish. Frontiers Artif. Intell. 5 (2022) - [j11]Matthew Shardlow, Richard Evans, Marcos Zampieri:
Predicting lexical complexity in English texts: the Complex 2.0 dataset. Lang. Resour. Evaluation 56(4): 1153-1194 (2022) - [j10]Marcos Zampieri, Tharindu Ranasinghe, Mrinal Chaudhari, Saurabh Gaikwad, Prajwal Krishna, Mayuresh Nene, Shrunali Paygude:
Predicting the type and target of offensive social media posts in Marathi. Soc. Netw. Anal. Min. 12(1): 77 (2022) - [j9]Tharindu Ranasinghe, Marcos Zampieri:
Multilingual Offensive Language Identification for Low-resource Languages. ACM Trans. Asian Low Resour. Lang. Inf. Process. 21(1): 4:1-4:13 (2022) - [j8]Christian D. Newman, Michael John Decker, Reem S. Alsuhaibani, Anthony Peruma, Mohamed Wiem Mkaouer, Satyajit Mohapatra, Tejal Vishnoi, Marcos Zampieri, Timothy J. Sheldon, Emily Hill:
An Ensemble Approach for Annotating Source Code Identifiers With Part-of-Speech Tags. IEEE Trans. Software Eng. 48(9): 3506-3522 (2022) - [c90]Kai North, Marcos Zampieri, Tharindu Ranasinghe:
ALEXSIS-PT: A New Resource for Portuguese Lexical Simplification. COLING 2022: 6057-6062 - [c89]Shrey Satapara, Prasenjit Majumder, Thomas Mandl, Sandip Modha, Hiren Madhu, Tharindu Ranasinghe, Marcos Zampieri, Kai North, Damith Premasiri:
Overview of the HASOC Subtrack at FIRE 2022: Hate Speech and Offensive Content Identification in English and Indo-Aryan Languages. FIRE 2022: 4-7 - [c88]Tharindu Ranasinghe, Kai North, Damith Premasiri, Marcos Zampieri:
Overview of the HASOC Subtrack at FIRE 2022: Offensive Language Identification in Marathi. FIRE (Working Notes) 2022: 489-501 - [c87]Farhad Akhbardeh, Marcos Zampieri, Cecilia Ovesdotter Alm, Travis Desell:
Transfer Learning Methods for Domain Adaptation in Technical Logbook Datasets. LREC 2022: 4235-4244 - [e10]Luciana Benotti, Naoaki Okazaki, Yves Scherrer, Marcos Zampieri:
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics, ACL 2022 - Tutorial Abstracts, Dublin, Ireland, May 22-27, 2022. Association for Computational Linguistics 2022, ISBN 978-1-955917-20-9 [contents] - [e9]Philipp Koehn, Loïc Barrault, Ondrej Bojar, Fethi Bougares, Rajen Chatterjee, Marta R. Costa-jussà, Christian Federmann, Mark Fishel, Alexander Fraser, Markus Freitag, Yvette Graham, Roman Grundkiewicz, Paco Guzman, Barry Haddow, Matthias Huck, Antonio Jimeno-Yepes, Tom Kocmi, André Martins, Makoto Morishita, Christof Monz, Masaaki Nagata, Toshiaki Nakazawa, Matteo Negri, Aurélie Névéol, Mariana Neves, Martin Popel, Marco Turchi, Marcos Zampieri:
Proceedings of the Seventh Conference on Machine Translation, WMT 2022, Abu Dhabi, United Arab Emirates (Hybrid), December 7-8, 2022. Association for Computational Linguistics 2022, ISBN 978-1-959429-29-6 [contents] - [i53]Sanja Stajner, Daniel Ferrés, Matthew Shardlow, Kai North, Marcos Zampieri, Horacio Saggion:
Lexical Simplification Benchmarks for English, Portuguese, and Spanish. CoRR abs/2209.05301 (2022) - [i52]Kai North, Marcos Zampieri, Tharindu Ranasinghe:
ALEXSIS-PT: A New Resource for Portuguese Lexical Simplification. CoRR abs/2209.09034 (2022) - [i51]Tharindu Ranasinghe, Kai North, Damith Premasiri, Marcos Zampieri:
Overview of the HASOC Subtrack at FIRE 2022: Offensive Language Identification in Marathi. CoRR abs/2211.10163 (2022) - [i50]Marcos Zampieri, Tharindu Ranasinghe, Mrinal Chaudhari, Saurabh Gaikwad, Prajwal Krishna, Mayuresh Nene, Shrunali Paygude:
Predicting the Type and Target of Offensive Social Media Posts in Marathi. CoRR abs/2211.12570 (2022) - [i49]Tharindu Ranasinghe, Isuri Anuradha, Damith Premasiri, Kanishka Silva, Hansi Hettiarachchi, Lasitha Uyangodage, Marcos Zampieri:
SOLD: Sinhala Offensive Language Dataset. CoRR abs/2212.00851 (2022) - 2021
- [j7]Hanna Béchara, Constantin Orasan, Carla Parra Escartín, Marcos Zampieri, William Lowe:
The Role of Machine Translation Quality Estimation in the Post-Editing Workflow. Informatics 8(3): 61 (2021) - [j6]Tharindu Ranasinghe, Marcos Zampieri:
An Evaluation of Multilingual Offensive Language Identification Methods for the Languages of India. Inf. 12(8): 306 (2021) - [c86]Sara Rosenthal, Pepa Atanasova, Georgi Karadzhov, Marcos Zampieri, Preslav Nakov:
SOLID: A Large-Scale Semi-Supervised Dataset for Offensive Language Identification. ACL/IJCNLP (Findings) 2021: 915-928 - [c85]Ana-Maria Bucur, Marcos Zampieri, Liviu P. Dinu:
An Exploratory Analysis of the Relation between Offensive Language and Mental Health. ACL/IJCNLP (Findings) 2021: 3600-3606 - [c84]Farhad Akhbardeh, Cecilia Ovesdotter Alm, Marcos Zampieri, Travis Desell:
Handling Extreme Class Imbalance in Technical Logbook Datasets. ACL/IJCNLP (1) 2021: 4034-4045 - [c83]Diptanu Sarkar, Marcos Zampieri, Tharindu Ranasinghe, Alexander G. Ororbia II:
fBERT: A Neural Transformer for Identifying Offensive Content. EMNLP (Findings) 2021: 1792-1798 - [c82]Liviu P. Dinu, Ioan-Bogdan Iordache, Ana Sabina Uban, Marcos Zampieri:
A Computational Exploration of Pejorative Language in Social Media. EMNLP (Findings) 2021: 3493-3498 - [c81]Thomas Mandl, Sandip Modha, Gautam Kishore Shahi, Hiren Madhu, Shrey Satapara, Prasenjit Majumder, Johannes Schäfer, Tharindu Ranasinghe, Marcos Zampieri, Durgesh Nandini, Amit Kumar Jaiswal:
Overview of the HASOC Subtrack at FIRE 2021: HateSpeech and Offensive Content Identification in English and Indo-Aryan Languages. FIRE (Working Notes) 2021: 1-19 - [c80]Sandip Modha, Thomas Mandl, Gautam Kishore Shahi, Hiren Madhu, Shrey Satapara, Tharindu Ranasinghe, Marcos Zampieri:
Overview of the HASOC Subtrack at FIRE 2021: Hate Speech and Offensive Content Identification in English and Indo-Aryan Languages and Conversational Hate Speech. FIRE 2021: 1-3 - [c79]Mayuresh Nene, Kai North, Tharindu Ranasinghe, Marcos Zampieri:
Transformer Models for Offensive Language Identification in Marathi. FIRE (Working Notes) 2021: 273-282 - [c78]Skye Morgan, Tharindu Ranasinghe, Marcos Zampieri:
WLV-RIT at GermEval 2021: Multitask Learning with Transformers to Detect Toxic, Engaging, and Fact-Claiming Comments. GermEval@KONVENS 2021: 32-38 - [c77]Tharindu Ranasinghe, Marcos Zampieri:
MUDES: Multilingual Detection of Offensive Spans. NAACL-HLT (Demonstrations) 2021: 144-152 - [c76]Saurabh Sampatrao Gaikwad, Tharindu Ranasinghe, Marcos Zampieri, Christopher Homan:
Cross-lingual Offensive Language Identification for Low Resource Languages: The Case of Marathi. RANLP 2021: 437-443 - [c75]Matthew Shardlow, Richard Evans, Gustavo Henrique Paetzold, Marcos Zampieri:
SemEval-2021 Task 1: Lexical Complexity Prediction. SemEval@ACL/IJCNLP 2021: 1-16 - [c74]Abhinandan Tejalkumar Desai, Kai North, Marcos Zampieri, Christopher Homan:
LCP-RIT at SemEval-2021 Task 1: Exploring Linguistic Features for Lexical Complexity Prediction. SemEval@ACL/IJCNLP 2021: 548-553 - [c73]Tharindu Ranasinghe, Diptanu Sarkar, Marcos Zampieri, Alexander G. Ororbia II:
WLV-RIT at SemEval-2021 Task 5: A Neural Transformer Framework for Detecting Toxic Spans. SemEval@ACL/IJCNLP 2021: 833-840 - [c72]Bharathi Raja Chakravarthi, Mihaela Gaman, Radu Tudor Ionescu, Heidi Jauhiainen, Tommi Jauhiainen, Krister Lindén, Nikola Ljubesic, Niko Partanen, Ruba Priyadharshini, Christoph Purschke, Eswari Rajagopal, Yves Scherrer, Marcos Zampieri:
Findings of the VarDial Evaluation Campaign 2021. VarDial@EACL 2021: 1-11 - [c71]Tommi Jauhiainen, Tharindu Ranasinghe, Marcos Zampieri:
Comparing Approaches to Dravidian Language Identification. VarDial@EACL 2021: 120-127 - [c70]Farhad Akhbardeh, Arkady Arkhangorodsky, Magdalena Biesialska, Ondrej Bojar, Rajen Chatterjee, Vishrav Chaudhary, Marta R. Costa-jussà, Cristina España-Bonet, Angela Fan, Christian Federmann, Markus Freitag, Yvette Graham, Roman Grundkiewicz, Barry Haddow, Leonie Harter, Kenneth Heafield, Christopher Homan, Matthias Huck, Kwabena Amponsah-Kaakyire, Jungo Kasai, Daniel Khashabi, Kevin Knight, Tom Kocmi, Philipp Koehn, Nicholas Lourie, Christof Monz, Makoto Morishita, Masaaki Nagata, Ajay Nagesh, Toshiaki Nakazawa, Matteo Negri, Santanu Pal, Allahsera Auguste Tapo, Marco Turchi, Valentin Vydrin, Marcos Zampieri:
Findings of the 2021 Conference on Machine Translation (WMT21). WMT@EMNLP 2021: 1-88 - [e8]Marcos Zampieri, Preslav Nakov, Nikola Ljubesic, Jörg Tiedemann, Yves Scherrer, Tommi Jauhiainen:
Proceedings of the Eighth Workshop on NLP for Similar Languages, Varieties and Dialects, VarDial@EACL 2021, Kiyv, Ukraine, April 20, 2021. Association for Computational Linguistics 2021, ISBN 978-1-954085-12-1 [contents] - [i48]Matthew Shardlow, Richard Evans, Marcos Zampieri:
Predicting Lexical Complexity in English Texts. CoRR abs/2102.08773 (2021) - [i47]Tharindu Ranasinghe, Marcos Zampieri:
MUDES: Multilingual Detection of Offensive Spans. CoRR abs/2102.09665 (2021) - [i46]Tommi Jauhiainen, Tharindu Ranasinghe, Marcos Zampieri:
Comparing Approaches to Dravidian Language Identification. CoRR abs/2103.05552 (2021) - [i45]Allahsera Auguste Tapo, Michael Leventhal, Sarah Luger, Christopher M. Homan, Marcos Zampieri:
Domain-specific MT for Low-resource Languages: The case of Bambara-French. AfricaNLP 2021 - [i44]Tharindu Ranasinghe, Diptanu Sarkar, Marcos Zampieri, Alexander G. Ororbia II:
WLV-RIT at SemEval-2021 Task 5: A Neural Transformer Framework for Detecting Toxic Spans. CoRR abs/2104.04630 (2021) - [i43]Tharindu Ranasinghe, Marcos Zampieri:
Multilingual Offensive Language Identification for Low-resource Languages. CoRR abs/2105.05996 (2021) - [i42]Abhinandan Desai, Kai North, Marcos Zampieri, Christopher M. Homan:
LCP-RIT at SemEval-2021 Task 1: Exploring Linguistic Features for Lexical Complexity Prediction. CoRR abs/2105.08780 (2021) - [i41]Ana-Maria Bucur, Marcos Zampieri, Liviu P. Dinu:
An Exploratory Analysis of the Relation Between Offensive Language and Mental Health. CoRR abs/2105.14888 (2021) - [i40]Matthew Shardlow, Richard Evans, Gustavo Henrique Paetzold, Marcos Zampieri:
SemEval-2021 Task 1: Lexical Complexity Prediction. CoRR abs/2106.00473 (2021) - [i39]Skye Morgan, Tharindu Ranasinghe, Marcos Zampieri:
WLV-RIT at GermEval 2021: Multitask Learning with Transformers to Detect Toxic, Engaging, and Fact-Claiming Comments. CoRR abs/2108.00057 (2021) - [i38]Christian D. Newman, Michael John Decker, Reem S. Alsuhaibani, Anthony Peruma, Satyajit Mohapatra, Tejal Vishnoi, Marcos Zampieri, Mohamed Wiem Mkaouer, Timothy J. Sheldon, Emily Hill:
An Ensemble Approach for Annotating Source Code Identifiers with Part-of-speech Tags. CoRR abs/2109.00629 (2021) - [i37]Saurabh Gaikwad, Tharindu Ranasinghe, Marcos Zampieri, Christopher M. Homan:
Cross-lingual Offensive Language Identification for Low Resource Languages: The Case of Marathi. CoRR abs/2109.03552 (2021) - [i36]Diptanu Sarkar, Marcos Zampieri, Tharindu Ranasinghe, Alexander G. Ororbia II:
FBERT: A Neural Transformer for Identifying Offensive Content. CoRR abs/2109.05074 (2021) - [i35]Thomas Mandl, Sandip Modha, Gautam Kishore Shahi, Hiren Madhu, Shrey Satapara, Prasenjit Majumder, Johannes Schäfer, Tharindu Ranasinghe, Marcos Zampieri, Durgesh Nandini, Amit Kumar Jaiswal:
Overview of the HASOC Subtrack at FIRE 2021: Hate Speech and Offensive Content Identification in English and Indo-Aryan Languages. CoRR abs/2112.09301 (2021) - 2020
- [j5]Marcos Zampieri, Preslav Nakov, Yves Scherrer:
Natural language processing for similar languages, varieties, and dialects: A survey. Nat. Lang. Eng. 26(6): 595-612 (2020) - [c69]Sarah Luger, Martina Anto-Ocrah, Allahsera Tapo, Christopher Homan, Marcos Zampieri, Michael Leventhal:
Health Care Misinformation: an Artificial Intelligence Challenge for Low-resource languages. AI4SG@AAAI Fall Symposium 2020 - [c68]Ritesh Kumar, Atul Kr. Ojha, Shervin Malmasi, Marcos Zampieri:
Evaluating Aggression Identification in Social Media. TRAC@LREC 2020: 1-5 - [c67]Farhad Akhbardeh, Travis Desell, Marcos Zampieri:
MaintNet: A Collaborative Open-Source Library for Predictive Maintenance Language Resources. COLING (Demonstrations) 2020: 7-11 - [c66]Tharindu Ranasinghe, Marcos Zampieri:
Multilingual Offensive Language Identification with Cross-lingual Embeddings. EMNLP (1) 2020: 5838-5844 - [c65]Tharindu Ranasinghe, Sarthak Gupte, Marcos Zampieri, Ifeoma Nwogu:
WLV-RIT at HASOC-Dravidian-CodeMix-FIRE2020: Offensive Language Identification in Code-switched YouTube Comments. FIRE (Working Notes) 2020: 417-426 - [c64]Farhad Akhbardeh, Travis Desell, Marcos Zampieri:
NLP Tools for Predictive Maintenance Records in MaintNet. AACL/IJCNLP (System Demonstrations) 2020: 26-32 - [c63]Matthew Shardlow, Marcos Zampieri, Michael Cooper:
CompLex - A New Corpus for Lexical Complexity Predicition from LikertScale Data. READI@LREC 2020: 57-62 - [c62]Zeses Pitenis, Marcos Zampieri, Tharindu Ranasinghe:
Offensive Language Identification in Greek. LREC 2020: 5113-5119 - [c61]Marcos Zampieri, Preslav Nakov, Sara Rosenthal, Pepa Atanasova, Georgi Karadzhov, Hamdy Mubarak, Leon Derczynski, Zeses Pitenis, Çagri Çöltekin:
SemEval-2020 Task 12: Multilingual Offensive Language Identification in Social Media (OffensEval 2020). SemEval@COLING 2020: 1425-1447 - [c60]Mihaela Gaman, Dirk Hovy, Radu Tudor Ionescu, Heidi Jauhiainen, Tommi Jauhiainen, Krister Lindén, Nikola Ljubesic, Niko Partanen, Christoph Purschke, Yves Scherrer, Marcos Zampieri:
A Report on the VarDial Evaluation Campaign 2020. VarDial@COLING 2020: 1-14 - [c59]Loïc Barrault, Magdalena Biesialska, Ondrej Bojar, Marta R. Costa-jussà, Christian Federmann, Yvette Graham, Roman Grundkiewicz, Barry Haddow, Matthias Huck, Eric Joanis, Tom Kocmi, Philipp Koehn, Chi-kiu Lo, Nikola Ljubesic, Christof Monz, Makoto Morishita, Masaaki Nagata, Toshiaki Nakazawa, Santanu Pal, Matt Post, Marcos Zampieri:
Findings of the 2020 Conference on Machine Translation (WMT20). WMT@EMNLP 2020: 1-55 - [c58]Santanu Pal, Marcos Zampieri:
Neural Machine Translation for Similar Languages: The Case of Indo-Aryan Languages. WMT@EMNLP 2020: 424-429 - [c57]Michael Leventhal, Allahsera Tapo, Sarah Luger, Marcos Zampieri, Christopher M. Homan:
Assessing Human Translations from French to Bambara for Machine Learning: a Pilot Study. AfricaNLP 2020 - [e7]Ritesh Kumar, Atul Kr. Ojha, Bornini Lahiri, Marcos Zampieri, Shervin Malmasi, Vanessa Murdock, Daniel Kadar:
Proceedings of the Second Workshop on Trolling, Aggression and Cyberbullying, TRAC@LREC 2020, Marseille, France, May 2020. European Language Resources Association (ELRA) 2020, ISBN 979-10-95546-56-6 [contents] - [e6]Marcos Zampieri, Preslav Nakov, Nikola Ljubesic, Jörg Tiedemann, Yves Scherrer:
Proceedings of the 7th Workshop on NLP for Similar Languages, Varieties and Dialects, VarDial@COLING 2020, Barcelona, Spain (Online), December 13, 2020. International Committee on Computational Linguistics (ICCL) 2020, ISBN 978-1-952148-47-7 [contents] - [d1]Marcos Zampieri, Preslav Nakov, Sara Rosenthal, Pepa Atanasova, Georgi Karadzhov, Hamdy Mubarak, Leon Derczynski, Zeses Pitenis, Çagri Çöltekin:
SemEval-2020 Task 12: Multilingual Offensive Language Identification in Social Media (OffensEval 2020). Zenodo, 2020 - [i34]Matthew Shardlow, Michael Cooper, Marcos Zampieri:
CompLex - A New Corpus for Lexical Complexity Predicition from Likert Scale Data. CoRR abs/2003.07008 (2020) - [i33]Zeses Pitenis, Marcos Zampieri, Tharindu Ranasinghe:
Offensive Language Identification in Greek. CoRR abs/2003.07459 (2020) - [i32]Sara Rosenthal, Pepa Atanasova, Georgi Karadzhov, Marcos Zampieri, Preslav Nakov:
A Large-Scale Semi-Supervised Dataset for Offensive Language Identification. CoRR abs/2004.14454 (2020) - [i31]Farhad Akhbardeh, Travis Desell, Marcos Zampieri:
MaintNet: A Collaborative Open-Source Library for Predictive Maintenance Language Resources. CoRR abs/2005.12443 (2020) - [i30]Marcos Zampieri, Preslav Nakov, Sara Rosenthal, Pepa Atanasova, Georgi Karadzhov, Hamdy Mubarak, Leon Derczynski, Zeses Pitenis, Çagri Çöltekin:
SemEval-2020 Task 12: Multilingual Offensive Language Identification in Social Media (OffensEval 2020). CoRR abs/2006.07235 (2020) - [i29]Tharindu Ranasinghe, Marcos Zampieri:
Multilingual Offensive Language Identification with Cross-lingual Embeddings. CoRR abs/2010.05324 (2020) - [i28]Tharindu Ranasinghe, Sarthak Gupte, Marcos Zampieri, Ifeoma Nwogu:
WLV-RIT at HASOC-Dravidian-CodeMix-FIRE2020: Offensive Language Identification in Code-switched YouTube Comments. CoRR abs/2011.00559 (2020) - [i27]Allahsera Auguste Tapo, Bakary Coulibaly, Sébastien Diarra, Christopher Homan, Julia Kreutzer, Sarah Luger, Arthur Nagashima, Marcos Zampieri, Michael Leventhal:
Neural Machine Translation for Extremely Low-Resource African Languages: A Case Study on Bambara. CoRR abs/2011.05284 (2020)
2010 – 2019
- 2019
- [j4]Tommi Jauhiainen, Marco Lui, Marcos Zampieri, Timothy Baldwin, Krister Lindén:
Automatic Language Identification in Texts: A Survey. J. Artif. Intell. Res. 65: 675-782 (2019) - [j3]Marcos Zampieri, Preslav Nakov:
Preface. Nat. Lang. Eng. 25(5): 559 (2019) - [c56]Alistair Plum, Marcos Zampieri, Constantin Orasan, Eveline Wandl-Vogt, Ruslan Mitkov:
Large-scale Data Harvesting for Biographical Data. BD 2019: 66-72 - [c55]Tharindu Ranasinghe, Marcos Zampieri, Hansi Hettiarachchi:
BRUMS at HASOC 2019: Deep Learning Models for Multilingual Hate Speech and Offensive Language Identification. FIRE (Working Notes) 2019: 199-207 - [c54]Mihaela Vela, Santanu Pal, Marcos Zampieri, Sudip Kumar Naskar, Josef van Genabith:
Improving CAT Tools in the Translation Workflow: New Approaches and Evaluation. MTSummit (2) 2019: 8-15 - [c53]Marcos Zampieri, Shervin Malmasi, Preslav Nakov, Sara Rosenthal, Noura Farra, Ritesh Kumar:
Predicting the Type and Target of Offensive Posts in Social Media. NAACL-HLT (1) 2019: 1415-1420 - [c52]Marcos Zampieri, Shervin Malmasi, Preslav Nakov, Sara Rosenthal, Noura Farra, Ritesh Kumar:
SemEval-2019 Task 6: Identifying and Categorizing Offensive Language in Social Media (OffensEval). SemEval@NAACL-HLT 2019: 75-86 - [c51]Gustavo Henrique Paetzold, Marcos Zampieri, Shervin Malmasi:
UTFPR at SemEval-2019 Task 5: Hate Speech Identification with Recurrent Neural Networks. SemEval@NAACL-HLT 2019: 519-523 - [c50]Loïc Barrault, Ondrej Bojar, Marta R. Costa-jussà, Christian Federmann, Mark Fishel, Yvette Graham, Barry Haddow, Matthias Huck, Philipp Koehn, Shervin Malmasi, Christof Monz, Mathias Müller, Santanu Pal, Matt Post, Marcos Zampieri:
Findings of the 2019 Conference on Machine Translation (WMT19). WMT (2) 2019: 1-61 - [c49]Santanu Pal, Marcos Zampieri, Josef van Genabith:
UDS-DFKI Submission to the WMT2019 Czech-Polish Similar Language Translation Shared Task. WMT (3) 2019: 219-223 - [i26]Marcos Zampieri, Shervin Malmasi, Preslav Nakov, Sara Rosenthal, Noura Farra, Ritesh Kumar:
Predicting the Type and Target of Offensive Posts in Social Media. CoRR abs/1902.09666 (2019) - [i25]Marcos Zampieri, Shervin Malmasi, Preslav Nakov, Sara Rosenthal, Noura Farra, Ritesh Kumar:
SemEval-2019 Task 6: Identifying and Categorizing Offensive Language in Social Media (OffensEval). CoRR abs/1903.08983 (2019) - [i24]Gustavo Henrique Paetzold, Shervin Malmasi, Marcos Zampieri:
UTFPR at SemEval-2019 Task 5: Hate Speech Identification with Recurrent Neural Networks. CoRR abs/1904.07839 (2019) - [i23]Gustavo Henrique Paetzold, Marcos Zampieri:
Experiments in Cuneiform Language Identification. CoRR abs/1904.12087 (2019) - [i22]Santanu Pal, Marcos Zampieri, Josef van Genabith:
UDS-DFKI Submission to the WMT2019 Similar Language Translation Shared Task. CoRR abs/1908.06138 (2019) - [i21]Mihaela Vela, Santanu Pal, Marcos Zampieri, Sudip Kumar Naskar, Josef van Genabith:
Improving CAT Tools in the Translation Workflow: New Approaches and Evaluation. CoRR abs/1908.06140 (2019) - 2018
- [j2]Shervin Malmasi, Marcos Zampieri:
Challenges in discriminating profanity from hate speech. J. Exp. Theor. Artif. Intell. 30(2): 187-202 (2018) - [c48]Fernando Benites, Shervin Malmasi, Marcos Zampieri:
Classifying Patent Applications with Ensemble Methods. ALTA 2018: 89-92 - [c47]Seid Muhie Yimam, Chris Biemann, Shervin Malmasi, Gustavo Paetzold, Lucia Specia, Sanja Stajner, Anaïs Tack, Marcos Zampieri:
A Report on the Complex Word Identification Shared Task 2018. BEA@NAACL-HLT 2018: 66-78 - [c46]Iria del Río Gayo, Marcos Zampieri, Shervin Malmasi:
A Portuguese Native Language Identification Dataset. BEA@NAACL-HLT 2018: 291-296 - [c45]Ritesh Kumar, Atul Kr. Ojha, Shervin Malmasi, Marcos Zampieri:
Benchmarking Aggression Identification in Social Media. TRAC@COLING 2018 2018: 1-11 - [c44]Diego Moussallem, Thiago Castro Ferreira, Marcos Zampieri, Maria Cláudia Cavalcanti, Geraldo Xexéo, Mariana L. Neves, Axel-Cyrille Ngonga Ngomo:
RDF2PT: Generating Brazilian Portuguese Texts from RDF Data. LREC 2018 - [c43]Diego Moussallem, Mohamed Ahmed Sherif, Diego Esteves, Marcos Zampieri, Axel-Cyrille Ngonga Ngomo:
LIdioms: A Multilingual Linked Idioms Data Set. LREC 2018 - [c42]Shervin Malmasi, Iria del Río, Marcos Zampieri:
Portuguese Native Language Identification. PROPOR 2018: 115-124 - [c41]Marcos Zampieri, Shervin Malmasi, Preslav Nakov, Ahmed Ali, Suwon Shon, James R. Glass, Yves Scherrer, Tanja Samardzic, Nikola Ljubesic, Jörg Tiedemann, Chris van der Lee, Stefan Grondelaers, Nelleke Oostdijk, Dirk Speelman, Antal van den Bosch, Ritesh Kumar, Bornini Lahiri, Mayank Jain:
Language Identification and Morphosyntactic Tagging: The Second VarDial Evaluation Campaign. VarDial@COLING 2018 2018: 1-17 - [c40]Alina Maria Ciobanu, Marcos Zampieri, Shervin Malmasi, Santanu Pal, Liviu P. Dinu:
Discriminating between Indo-Aryan Languages Using SVM Ensembles. VarDial@COLING 2018 2018: 178-184 - [c39]Marta R. Costa-jussà, Marcos Zampieri, Santanu Pal:
A Neural Approach to Language Variety Translation. VarDial@COLING 2018 2018: 275-282 - [e5]Ritesh Kumar, Atul Kr. Ojha, Marcos Zampieri, Shervin Malmasi:
Proceedings of the First Workshop on Trolling, Aggression and Cyberbullying, TRAC@COLING 2018, Santa Fe, New Mexico, USA, August 25, 2018. Association for Computational Linguistics 2018, ISBN 978-1-948087-60-5 [contents] - [e4]Marcos Zampieri, Preslav Nakov, Nikola Ljubesic, Jörg Tiedemann, Shervin Malmasi, Ahmed Ali:
Proceedings of the Fifth Workshop on NLP for Similar Languages, Varieties and Dialects, VarDial@COLING 2018, Santa Fe, New Mexico, USA, August 20, 2018. Association for Computational Linguistics 2018, ISBN 978-1-948087-55-1 [contents] - [i20]Diego Moussallem, Mohamed Ahmed Sherif, Diego Esteves, Marcos Zampieri, Axel-Cyrille Ngonga Ngomo:
LIDIOMS: A Multilingual Linked Idioms Data Set. CoRR abs/1802.08148 (2018) - [i19]Diego Moussallem, Thiago Castro Ferreira, Marcos Zampieri, Maria Cláudia Cavalcanti, Geraldo Xexéo, Mariana L. Neves, Axel-Cyrille Ngonga Ngomo:
RDF2PT: Generating Brazilian Portuguese Texts from RDF Data. CoRR abs/1802.08150 (2018) - [i18]Shervin Malmasi, Marcos Zampieri:
Challenges in Discriminating Profanity from Hate Speech. CoRR abs/1803.05495 (2018) - [i17]Tommi Jauhiainen, Marco Lui, Marcos Zampieri, Timothy Baldwin, Krister Lindén:
Automatic Language Identification in Texts: A Survey. CoRR abs/1804.08186 (2018) - [i16]Seid Muhie Yimam, Chris Biemann, Shervin Malmasi, Gustavo H. Paetzold, Lucia Specia, Sanja Stajner, Anaïs Tack, Marcos Zampieri:
A Report on the Complex Word Identification Shared Task 2018. CoRR abs/1804.09132 (2018) - [i15]Iria del Río, Marcos Zampieri, Shervin Malmasi:
A Portuguese Native Language Identification Dataset. CoRR abs/1804.11346 (2018) - [i14]Marta R. Costa-jussà, Marcos Zampieri, Santanu Pal:
A Neural Approach to Language Variety Translation. CoRR abs/1807.00651 (2018) - [i13]Alina Maria Ciobanu, Marcos Zampieri, Shervin Malmasi, Santanu Pal, Liviu P. Dinu:
Discriminating between Indo-Aryan Languages Using SVM Ensembles. CoRR abs/1807.03108 (2018) - [i12]Liviu P. Dinu, Alina Maria Ciobanu, Marcos Zampieri, Shervin Malmasi:
Classifier Ensembles for Dialect and Language Variety Identification. CoRR abs/1808.04800 (2018) - [i11]Fernando Benites, Shervin Malmasi, Marcos Zampieri:
Classifying Patent Applications with Ensemble Methods. CoRR abs/1811.04695 (2018) - 2017
- [c38]Marcos Zampieri, Shervin Malmasi, Gustavo Paetzold, Lucia Specia:
Complex Word Identification: Challenges in Data Annotation and System Performance. NLP-TEA@IJCNLP 2017: 59-63 - [c37]Marcos Zampieri, Alina Maria Ciobanu, Liviu P. Dinu:
Native Language Identification on Text and Speech. BEA@EMNLP 2017: 398-404 - [c36]Alina Maria Ciobanu, Marcos Zampieri, Shervin Malmasi, Liviu P. Dinu:
Including Dialects and Language Varieties in Author Profiling. CLEF (Working Notes) 2017 - [c35]Octavia-Maria Sulea, Marcos Zampieri, Shervin Malmasi, Mihaela Vela, Liviu P. Dinu, Josef van Genabith:
Exploring the Use of Text Classification in the Legal Domain. ASAIL@ICAIL 2017 - [c34]Shervin Malmasi, Marcos Zampieri:
Detecting Hate Speech in Social Media. RANLP 2017: 467-472 - [c33]Octavia-Maria Sulea, Marcos Zampieri, Mihaela Vela, Josef van Genabith:
Predicting the Law Area and Decisions of French Supreme Court Cases. RANLP 2017: 716-722 - [c32]Marcos Zampieri, Shervin Malmasi, Nikola Ljubesic, Preslav Nakov, Ahmed Ali, Jörg Tiedemann, Yves Scherrer, Noëmi Aepli:
Findings of the VarDial Evaluation Campaign 2017. VarDial 2017: 1-15 - [c31]Shervin Malmasi, Marcos Zampieri:
German Dialect Identification in Interview Transcriptions. VarDial 2017: 164-169 - [c30]Shervin Malmasi, Marcos Zampieri:
Arabic Dialect Identification Using iVectors and ASR Transcripts. VarDial 2017: 178-183 - [e3]Preslav Nakov, Marcos Zampieri, Nikola Ljubesic, Jörg Tiedemann, Shervin Malmasi, Ahmed Ali:
Proceedings of the Fourth Workshop on NLP for Similar Languages, Varieties and Dialects, VarDial 2017, Valencia, Spain, April 3, 2017. Association for Computational Linguistics 2017, ISBN 978-1-945626-43-2 [contents] - [i10]Alina Maria Ciobanu, Marcos Zampieri, Shervin Malmasi, Liviu P. Dinu:
Including Dialects and Language Varieties in Author Profiling. CoRR abs/1707.00621 (2017) - [i9]Marcos Zampieri, Alina Maria Ciobanu, Liviu P. Dinu:
Native Language Identification on Text and Speech. CoRR abs/1707.07182 (2017) - [i8]Octavia-Maria Sulea, Marcos Zampieri, Mihaela Vela, Josef van Genabith:
Predicting the Law Area and Decisions of French Supreme Court Cases. CoRR abs/1708.01681 (2017) - [i7]Ekaterina Lapshinova-Koltunski, Marcos Zampieri:
Linguistic Features of Genre and Method Variation in Translation: A Computational Perspective. CoRR abs/1709.04359 (2017) - [i6]Marcos Zampieri:
Compiling and Processing Historical and Contemporary Portuguese Corpora. CoRR abs/1710.00803 (2017) - [i5]Marcos Zampieri, Shervin Malmasi, Gustavo Paetzold, Lucia Specia:
Complex Word Identification: Challenges in Data Annotation and System Performance. CoRR abs/1710.04989 (2017) - [i4]Octavia-Maria Sulea, Marcos Zampieri, Shervin Malmasi, Mihaela Vela, Liviu P. Dinu, Josef van Genabith:
Exploring the Use of Text Classification in the Legal Domain. CoRR abs/1710.09306 (2017) - [i3]Shervin Malmasi, Marcos Zampieri:
Detecting Hate Speech in Social Media. CoRR abs/1712.06427 (2017) - 2016
- [j1]Rohit Gupta, Constantin Orasan, Marcos Zampieri, Mihaela Vela, Josef van Genabith, Ruslan Mitkov:
Improving translation memory matching and retrieval using paraphrases. Mach. Transl. 30(1-2): 19-40 (2016) - [c29]Santanu Pal, Sudip Kumar Naskar, Marcos Zampieri, Tapas Nayak, Josef van Genabith:
CATaLog Online: A Web-based CAT Tool for Distributed Translation with Data Capture for APE and Translation Process Research. COLING (Demos) 2016: 98-102 - [c28]Cyril Goutte, Serge Léger, Shervin Malmasi, Marcos Zampieri:
Discriminating Similar Languages: Evaluations and Explorations. LREC 2016 - [c27]Santanu Pal, Marcos Zampieri, Sudip Kumar Naskar, Tapas Nayak, Mihaela Vela, Josef van Genabith:
CATaLog Online: Porting a Post-editing Tool to the Web. LREC 2016 - [c26]Marcos Zampieri, Shervin Malmasi, Mark Dras:
Modeling Language Change in Historical Corpora: The Case of Portuguese. LREC 2016 - [c25]Shervin Malmasi, Marcos Zampieri, Mark Dras:
Predicting Post Severity in Mental Health Forums. CLPsych@HLT-NAACL 2016: 133-137 - [c24]Shervin Malmasi, Marcos Zampieri:
MAZA at SemEval-2016 Task 11: Detecting Lexical Complexity Using a Decision Stump Meta-Classifier. SemEval@NAACL-HLT 2016: 991-995 - [c23]Shervin Malmasi, Mark Dras, Marcos Zampieri:
LTG at SemEval-2016 Task 11: Complex Word Identification with Classifier Ensembles. SemEval@NAACL-HLT 2016: 996-1000 - [c22]Marcos Zampieri, Liling Tan, Josef van Genabith:
MacSaar at SemEval-2016 Task 11: Zipfian and Character Features for ComplexWord Identification. SemEval@NAACL-HLT 2016: 1001-1005 - [c21]Eckhard Bick, Marcos Zampieri:
Grammatical Annotation of Historical Portuguese: Generating a Corpus-Based Diachronic Dictionary. TSD 2016: 3-11 - [c20]Shervin Malmasi, Marcos Zampieri, Nikola Ljubesic, Preslav Nakov, Ahmed Ali, Jörg Tiedemann:
Discriminating between Similar Languages and Arabic Dialect Identification: A Report on the Third DSL Shared Task. VarDial@COLING 2016: 1-14 - [c19]Shervin Malmasi, Marcos Zampieri:
Arabic Dialect Identification in Speech Transcripts. VarDial@COLING 2016: 106-113 - [c18]Ondrej Bojar, Rajen Chatterjee, Christian Federmann, Yvette Graham, Barry Haddow, Matthias Huck, Antonio Jimeno-Yepes, Philipp Koehn, Varvara Logacheva, Christof Monz, Matteo Negri, Aurélie Névéol, Mariana L. Neves, Martin Popel, Matt Post, Raphael Rubino, Carolina Scarton, Lucia Specia, Marco Turchi, Karin Verspoor, Marcos Zampieri:
Findings of the 2016 Conference on Machine Translation. WMT 2016: 131-198 - [c17]Santanu Pal, Marcos Zampieri, Josef van Genabith:
USAAR: An Operation Sequential Model for Automatic Statistical Post-Editing. WMT 2016: 759-763 - [e2]Preslav Nakov, Marcos Zampieri, Liling Tan, Nikola Ljubesic, Jörg Tiedemann, Shervin Malmasi:
Proceedings of the Third Workshop on NLP for Similar Languages, Varieties and Dialects, VarDial@COLING 2016, Osaka, Japan, December 12, 2016. The COLING 2016 Organizing Committee 2016, ISBN 978-4-87974-716-7 [contents] - [i2]Marcos Zampieri, Shervin Malmasi, Mark Dras:
Modeling Language Change in Historical Corpora: The Case of Portuguese. CoRR abs/1610.00030 (2016) - [i1]Cyril Goutte, Serge Léger, Shervin Malmasi, Marcos Zampieri:
Discriminating Similar Languages: Evaluations and Explorations. CoRR abs/1610.00031 (2016) - 2015
- [c16]Rohit Gupta, Constantin Orasan, Marcos Zampieri, Mihaela Vela, Josef van Genabith:
Can Translation Memories afford not to use paraphrasing? EAMT 2015 - [c15]Carolina Scarton, Marcos Zampieri, Mihaela Vela, Josef van Genabith, Lucia Specia:
Searching for Context: a Study on Document-Level Labels for Translation Quality Estimation. EAMT 2015 - [c14]Marcos Zampieri, Alina Maria Ciobanu, Vlad Niculae, Liviu P. Dinu:
AMBRA: A Ranking Approach to Temporal Text Classification. SemEval@NAACL-HLT 2015: 851-855 - [c13]Marcos Zampieri, Ekaterina Lapshinova-Koltunski:
Investigating Genre and Method Variation in Translation Using Text Classification. TSD 2015: 41-50 - 2014
- [c12]Vlad Niculae, Marcos Zampieri, Liviu P. Dinu, Alina Maria Ciobanu:
Temporal Text Ranking and Automatic Dating of Texts. EACL 2014: 17-21 - [c11]Marcos Zampieri, Mihaela Vela:
Quantifying the Influence of MT Output in the Translators' Performance: A Case Study in Technical Translation. HaCaT@EACL 2014: 93-98 - [c10]Marcos Zampieri, Liling Tan:
Grammatical Error Detection with Limited Training Data: The Case of Chinese. ICCE 2014 - [c9]Marcos Zampieri, Binyam Gebrekidan Gebre:
VarClass: An Open-source Language Identification Tool for Language Varieties. LREC 2014: 3305-3308 - [c8]Marcos Zampieri, Renato Cordeiro de Amorim:
Between Sound and Spelling: Combining Phonetics and Clustering Algorithms to Improve Target Word Recovery. PolTAL 2014: 438-449 - [c7]Marcos Zampieri, Liling Tan, Nikola Ljubesic, Jörg Tiedemann:
A Report on the DSL Shared Task 2014. VarDial@COLING 2014: 58-67 - [e1]Marcos Zampieri, Liling Tan, Nikola Ljubesic, Jörg Tiedemann:
Proceedings of the First Workshop on Applying NLP Tools to Similar Languages, Varieties and Dialects, VarDial@COLING 2014, Dublin, Ireland, August 23, 2014. Association for Computational Linguistics and Dublin City University 2014, ISBN 978-1-873769-39-3 [contents] - 2013
- [c6]Binyam Gebrekidan Gebre, Marcos Zampieri, Peter Wittenburg, Tom Heskes:
Improving Native Language Identification with TF-IDF Weighting. BEA@NAACL-HLT 2013: 216-223 - [c5]Renato Cordeiro de Amorim, Marcos Zampieri:
Effective Spell Checking Methods Using Clustering Algorithms. RANLP 2013: 172-178 - [c4]Marcos Zampieri, Binyam Gebrekidan Gebre, Sascha Diwersy:
N-gram Language Models and POS Distribution for the Identification of Spanish Varieties (Ngrammes et Traits Morphosyntaxiques pour la Identification de Variétés de l'Espagnol) [in French]. TALN (2) 2013: 580-587 - [c3]Sanja Stajner, Marcos Zampieri:
Stylistic Changes for Temporal Text Classification. TSD 2013: 519-526 - 2012
- [c2]Marcos Zampieri, Binyam Gebrekidan Gebre:
Automatic identification of language varieties: The case of Portuguese. KONVENS 2012: 233-237 - 2010
- [c1]Jorge Baptista, Neuza Costa, Joaquim Guerra, Marcos Zampieri, Maria Cabral, Nuno J. Mamede:
P-AWL: Academic Word List for Portuguese. PROPOR 2010: 120-123
last updated on 2025-04-20 23:46 CEST by the dblp team
all metadata released as open data under CC0 1.0 license