
Serge Sharoff
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2020
- [j14]Serge Sharoff:
Finding next of kin: Cross-lingual embedding spaces for related languages. Nat. Lang. Eng. 26(2): 163-182 (2020) - [c47]Reinhard Rapp, Pierre Zweigenbaum, Serge Sharoff:
Overview of the Fourth BUCC Shared Task: Bilingual Dictionary Induction from Comparable Corpora. BUCC@LREC 2020: 6-13 - [c46]Dmitri Roussinov, Serge Sharoff
, Nadezhda Puchnina:
Recognizing Semantic Relations: Attention-Based Transformers vs. Recurrent Models. ECIR (1) 2020: 561-574 - [c45]Yu Yuan, Serge Sharoff:
Sentence Level Human Translation Quality Estimation with Attention-based Neural Networks. LREC 2020: 1858-1865 - [c44]Serge Sharoff:
Know thy Corpus! Robust Methods for Digital Curation of Web corpora. LREC 2020: 2453-2460 - [c43]Dmitri Roussinov, Serge Sharoff, Nadezhda Puchnina:
Recognizing Semantic Relations by Combining Transformers and Fully Connected Models. LREC 2020: 5838-5845 - [e7]Reinhard Rapp, Pierre Zweigenbaum, Serge Sharoff:
Proceedings of the 13th Workshop on Building and Using Comparable Corpora, BUCC@LREC 2020, Marseille, France, May, 2020. European Language Resources Association 2020, ISBN 979-10-95546-42-9 [contents] - [i2]Yu Yuan, Serge Sharoff:
Sentence Level Human Translation Quality Estimation with Attention-based Neural Networks. CoRR abs/2003.06381 (2020) - [i1]Serge Sharoff:
Know thy corpus! Robust methods for digital curation of Web corpora. CoRR abs/2003.06389 (2020)
2010 – 2019
- 2019
- [c42]Maria Kunilovskaya, Serge Sharoff:
Towards Functionally Similar Corpus Resources for Translation. RANLP 2019: 583-592 - [p6]Reinhard Rapp, Vivian Xu, Michael Zock, Serge Sharoff, Richard S. Forsyth, Bogdan Babych, Chenhui Chu, Toshiaki Nakazawa, Sadao Kurohashi:
New Areas of Application of Comparable Corpora. Using Comparable Corpora for Under-Resourced Areas of Machine Translation 2019: 255-290 - 2018
- [c41]Serge Sharoff:
Language adaptation experiments via cross-lingual embeddings for related languages. LREC 2018 - [c40]Yu Yuan, Yuze Gao, Yue Zhang, Serge Sharoff:
Cross-lingual Terminology Extraction for Translation Quality Estimation. LREC 2018 - [c39]Yu Yuan, Serge Sharoff:
Investigating the Influence of Bilingual MWU on Trainee Translation Quality. LREC 2018 - [c38]Pierre Zweigenbaum, Serge Sharoff, Reinhard Rapp:
A Multilingual Dataset for Evaluating Parallel Sentence Extraction from Comparable Corpora. LREC 2018 - 2017
- [c37]Serge Sharoff:
Toward Pan-Slavic NLP: Some Experiments with Language Adaptation. BSNLP@EACL 2017: 1-2 - [c36]Pierre Zweigenbaum, Serge Sharoff, Reinhard Rapp:
Overview of the Second BUCC Shared Task: Spotting Parallel Sentences in Comparable Corpora. BUCC@ACL 2017: 60-67 - [e6]Serge Sharoff, Pierre Zweigenbaum, Reinhard Rapp:
Proceedings of the 10th Workshop on Building and Using Comparable Corpora, BUCC@ACL 2017, Vancouver, Canada, August 3, 2017. Association for Computational Linguistics 2017, ISBN 978-1-945626-61-6 [contents] - 2016
- [j13]Noushin Rezapour Asheghi, Serge Sharoff, Katja Markert:
Crowdsourcing for web genre annotation. Lang. Resour. Evaluation 50(3): 603-641 (2016) - [j12]Reinhard Rapp, Serge Sharoff, Pierre Zweigenbaum:
Preface. Nat. Lang. Eng. 22(4): 497-500 (2016) - [j11]Reinhard Rapp, Serge Sharoff, Pierre Zweigenbaum:
Recent advances in machine translation using comparable corpora. Nat. Lang. Eng. 22(4): 501-516 (2016) - [j10]Miguel Rios, Serge Sharoff:
Language Adaptation for Extending Post-Editing Estimates for Closely Related Languages. Prague Bull. Math. Linguistics 106: 181-192 (2016) - [c35]Erika Dalan, Serge Sharoff:
Genre classification for a corpus of academic webpages. WAC@ACL 2016: 90-98 - [c34]Roger Evans, Alexander F. Gelbukh, Gregory Grefenstette, Patrick Hanks, Milos Jakubícek, Diana McCarthy, Martha Palmer, Ted Pedersen
, Michael Rundell, Pavel Rychlý, Serge Sharoff, David Tugwell:
Adam Kilgarriff's Legacy to Computational Linguistics and Beyond. CICLing (1) 2016: 3-25 - [c33]Yu Yuan, Serge Sharoff, Bogdan Babych:
MoBiL: A Hybrid Feature Set for Automatic Human Translation Quality Assessment. LREC 2016 - 2015
- [j9]Serge Sharoff:
Web Corpus Construction Roland Schäfer and Felix Bildhauer (Freie Universität Berlin) Morgan & Claypool (Synthesis Lectures on Human Language Technologies, edited by Graeme Hirst, volume 22), 2013, 145 pages, paper-bound, ISBN 9781608459834, doi: 10.2200/S00508ED1V01Y201305HLT022. Comput. Linguistics 41(1): 161-163 (2015) - [c32]Anisya Katinskaya, Serge Sharoff:
Applying Multi-Dimensional Analysis to a Russian Webcorpus: Searching for Evidence of Genres. BSNLP@RANLP 2015: 65-74 - [c31]Miguel Rios, Serge Sharoff:
Obtaining SMT dictionaries for related languages. BUCC@ACL/IJCNLP 2015: 68-73 - [c30]Serge Sharoff, Pierre Zweigenbaum, Reinhard Rapp:
BUCC Shared Task: Cross-Language Document Similarity. BUCC@ACL/IJCNLP 2015: 74-78 - [e5]Pierre Zweigenbaum, Serge Sharoff, Reinhard Rapp:
Proceedings of the Eighth Workshop on Building and Using Comparable Corpora, BUCC@ACL/IJCNLP 2015, Beijing, China, July 30, 2015. Association for Computational Linguistics 2015, ISBN 978-1-941643-60-0 [contents] - 2014
- [j8]Richard S. Forsyth, Serge Sharoff:
Document dissimilarity within and across languages: A benchmarking study. Lit. Linguistic Comput. 29(1): 6-22 (2014) - [j7]Serge Sharoff, Stefania Spina, Sofie Johansson Kokkinakis:
Introduction to the special issue on Resources and Tools for Language Learners. Lang. Resour. Evaluation 48(1): 1-3 (2014) - [j6]Adam Kilgarriff, Frieda Charalabopoulou, Maria Gavrilidou, Janne Bondi Johannessen, Saussan Khalil, Sofie Johansson Kokkinakis, Robert Lew
, Serge Sharoff, Ravikiran Vadlapudi, Elena Volodina:
Corpus-based vocabulary lists for language learners for nine languages. Lang. Resour. Evaluation 48(1): 121-163 (2014) - [c29]Marilena Di Bari, Serge Sharoff, Martin Thomas:
Multiple views as aid to linguistic annotation error analysis. LAW@COLING 2014: 82-86 - [c28]Reinhard Rapp, Serge Sharoff:
Extracting Multiword Translations from Aligned Comparable Documents. HyTra@EACL 2014: 87-95 - [c27]Noushin Rezapour Asheghi, Serge Sharoff, Katja Markert:
Designing and Evaluating a Reliable Corpus of Web Genres via Crowd-Sourcing. LREC 2014: 1339-1346 - [c26]Noushin Rezapour Asheghi, Katja Markert, Serge Sharoff:
Semi-supervised Graph-based Genre Classification for Web Pages. TextGraphs@EMNLP 2014: 39-47 - 2013
- [c25]Pavel Braslavski, Alexander Beloborodov, Maxim Khalilov, Serge Sharoff:
English-to-Russian MT evaluation campaign. ACL (2) 2013: 262-267 - [c24]Marilena Di Bari, Serge Sharoff, Martin Thomas:
SentiML: functional annotation for multilingual sentiment analysis. DH-CASE@DocEng 2013: 15:1-15:7 - [p5]Serge Sharoff, Reinhard Rapp, Pierre Zweigenbaum:
Overviewing Important Aspects of the Last Twenty Years of Research in Comparable Corpora. Building and Using Comparable Corpora 2013: 1-17 - [p4]Serge Sharoff:
Measuring the Distance Between Comparable Corpora Between Languages. Building and Using Comparable Corpora 2013: 113-130 - [e4]Serge Sharoff, Reinhard Rapp, Pierre Zweigenbaum, Pascale Fung:
Building and Using Comparable Corpora. Springer Berlin Heidelberg 2013, ISBN 978-3-642-20127-1 [contents] - [e3]Serge Sharoff, Pierre Zweigenbaum, Reinhard Rapp:
Proceedings of the Sixth Workshop on Building and Using Comparable Corpora, BUCC@ACL 2013, Sofia, Bulgaria, August 8, 2013. Association for Computational Linguistics 2013, ISBN 978-1-937284-60-2 [contents] - 2012
- [c23]Bogdan Babych, Kurt Eberle, Johanna Geiß, Mireia Ginestí-Rosell, Anthony Hartley, Reinhard Rapp, Serge Sharoff, Martin Thomas:
Design of a hybrid high quality machine translation system. ESIRMT/HyTra@EACL 2012: 101-112 - [c22]Reinhard Rapp, Serge Sharoff, Bogdan Babych:
Identifying Word Translations from Comparable Documents Without a Seed Lexicon. LREC 2012: 460-466 - 2011
- [p3]Marina Santini, Alexander Mehler, Serge Sharoff:
Riding the Rough Waves of Genre on the Web. Genres on the Web 2011: 3-30 - [p2]Serge Sharoff:
In the Garden and in the Jungle. Genres on the Web 2011: 149-166 - [p1]Marina Santini, Serge Sharoff, Alexander Mehler:
Any Land in Sight? Genres on the Web 2011: 351-354 - [e2]Alexander Mehler, Serge Sharoff, Marina Santini:
Genres on the Web. Text, Speech and Language Technology 42, Springer 2011, ISBN 978-90-481-9177-2 [contents] - [e1]Pierre Zweigenbaum, Reinhard Rapp, Serge Sharoff:
Proceedings of the 4th Workshop on Building and Using Comparable Corpora: Comparable Corpora and the Web, BUCC@ACL 2011, Portland, OR, USA, June 24, 2011. Association for Computational Linguistics 2011, ISBN 9781937284015 [contents] - 2010
- [j5]Yun-hua Qu, Tian-jiong Tao, Serge Sharoff, Narisong Jin, Ruo-yuan Gao, Nan Zhang, Yu-ting Yang, Cheng-zhi Xu:
Using an integrated feature set to generalize and justify the Chinese-to-English transferring rule of the 'ZHE' aspect. J. Zhejiang Univ. Sci. C 11(9): 663-676 (2010) - [j4]Paul Rayson
, Scott Piao
, Serge Sharoff, Stefan Evert, Begoña Villada Moirón:
Multiword expressions: hard going or plain sailing? Lang. Resour. Evaluation 44(1-2): 1-5 (2010) - [c21]Zhili Wu, Katja Markert, Serge Sharoff:
Fine-Grained Genre Classification Using Structural Learning Algorithms. ACL 2010: 749-759 - [c20]Serge Sharoff, Zhili Wu, Katja Markert:
The Web Library of Babel: evaluating genre collections. LREC 2010 - [c19]James Wilson, Anthony Hartley, Serge Sharoff, Paul Stephenson:
Advanced Corpus Solutions for Humanities Researchers. PACLIC 2010: 769-778
2000 – 2009
- 2009
- [j3]Marina Santini, Serge Sharoff:
Web Genre Benchmark Under Construction. J. Lang. Technol. Comput. Linguistics 24(1): 129-145 (2009) - [j2]Serge Sharoff, Bogdan Babych
, Anthony Hartley:
'Irrefragable answers' using comparable corpora to retrieve translation equivalents. Lang. Resour. Evaluation 43(1): 15-25 (2009) - [c18]Bogdan Babych, Anthony Hartley, Serge Sharoff:
Evaluation-Guided Pre-Editing of Source Text: Improving MT-Tractability of Light Verb Constructions. EAMT 2009 - 2008
- [c17]Bogdan Babych, Serge Sharoff, Anthony Hartley:
Generalising Lexical Translation Strategies for MT Using Comparable Corpora. LREC 2008 - [c16]Marco Baroni, Francis Chantree, Adam Kilgarriff, Serge Sharoff:
Cleaneval: a Competition for Cleaning Web Pages. LREC 2008 - [c15]Svitlana Kurella, Serge Sharoff, Anthony Hartley:
Corpus-Based Tools for Computer-Assisted Acquisition of Reading Abilities in Cognate Languages. LREC 2008 - [c14]Serge Sharoff, Mikhail Kopotev, Tomaz Erjavec, Anna Feldman, Dagmar Divjak:
Designing and Evaluating a Russian Tagset. LREC 2008 - 2007
- [j1]J. Delin, Serge Sharoff, S. Lillford, C. Barnes:
Linguistic support for concept selection decisions. Artif. Intell. Eng. Des. Anal. Manuf. 21(2): 123-135 (2007) - [c13]Bogdan Babych, Anthony Hartley, Serge Sharoff, Olga Mudraya:
Assisting Translators in Indirect Lexical Transfer. ACL 2007 - 2006
- [c12]Serge Sharoff, Bogdan Babych, Anthony Hartley:
Using Comparable Corpora to Solve Problems Difficult for Human Translators. ACL 2006 - [c11]Serge Sharoff, Bogdan Babych, Paul Rayson, Olga Mudraya, Scott Piao:
ASSIST: Automated Semantic Assistance for Translators. EACL 2006 - [c10]Serge Sharoff, Bogdan Babych, Anthony Hartley:
Using collocations from comparable corpora to find translation equivalents. LREC 2006: 465-470 - [c9]Serge Sharoff:
A Uniform Interface to Large-Scale Linguistic Resources. LREC 2006: 539-542 - [c8]Dragoç Ciobanu, Tony Hartley, Serge Sharoff:
Using Richly Annotated Trilingual Language Resources for Acquiring Reading Skills in a Foreign Language. LREC 2006: 543-548 - 2004
- [c7]Serge Sharoff:
Towards Basic Categories for Describing Properties of Texts in a Corpus. LREC 2004 - 2002
- [c6]Serge Sharoff:
Meaning as use: exploitation of aligned corpora for the contrastive study of lexical semantics. LREC 2002 - 2001
- [c5]Dafydd Gibbon, Thorsten Trippel, Serge Sharoff:
Concordancing for parallel spoken language corpora. INTERSPEECH 2001: 2063-2066 - 2000
- [c4]Geert-Jan M. Kruijff, Elke Teich, John A. Bateman, Ivana Kruijff-Korbayová, Hana Skoumalová, Serge Sharoff, Elena G. Sokolova, Tony Hartley, Kamenka Staykova, Jiri Hana:
Multilinguality in a Text Generation System For Three Slavic Languages. COLING 2000: 474-480 - [c3]John A. Bateman, Elke Teich, Geert-Jan M. Kruijff, Ivana Kruijff-Korbayová, Serge Sharoff, Hana Skoumalová:
Resources for Multilingual Text Generation in Three Slavic Languages. LREC 2000
1990 – 1999
- 1999
- [c2]Serge Sharoff, Vlad Zhigalov:
Register-domain Separation as a Methodology for Development of Natural Language Interfaces to Databases. INTERACT 1999: 79-85 - 1996
- [c1]Irina Kononenko
, Serge Sharoff:
Understanding Short Texts with Integration of Knowledge Representation Methods. Ershov Memorial Conference 1996: 111-121
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
load content from web.archive.org
Privacy notice: By enabling the option above, your browser will contact the API of web.archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
Tweets on dblp homepage
Show tweets from on the dblp homepage.
Privacy notice: By enabling the option above, your browser will contact twitter.com and twimg.com to load tweets curated by our Twitter account. At the same time, Twitter will persistently store several cookies with your web browser. While we did signal Twitter to not track our users by setting the "dnt" flag, we do not have any control over how Twitter uses your data. So please proceed with care and consider checking the Twitter privacy policy.
last updated on 2020-11-11 21:44 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint