default search action
Alexei Baevski
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [j1]Vineel Pratap, Andros Tjandra, Bowen Shi, Paden Tomasello, Arun Babu, Sayani Kundu, Ali Elkahky, Zhaoheng Ni, Apoorv Vyas, Maryam Fazel-Zarandi, Alexei Baevski, Yossi Adi, Xiaohui Zhang, Wei-Ning Hsu, Alexis Conneau, Michael Auli:
Scaling Speech Technology to 1, 000+ Languages. J. Mach. Learn. Res. 25: 97:1-97:52 (2024) - 2023
- [c33]Derek Xu, Shuyan Dong, Changhan Wang, Suyoun Kim, Zhaojiang Lin, Bing Liu, Akshat Shrivastava, Shang-Wen Li, Liang-Hsuan Tseng, Guan-Ting Lin, Alexei Baevski, Hung-yi Lee, Yizhou Sun, Wei Wang:
Introducing Semantics into Speech Encoders. ACL (1) 2023: 11413-11429 - [c32]Jiachen Lian, Alexei Baevski, Wei-Ning Hsu, Michael Auli:
Av-Data2Vec: Self-Supervised Learning of Audio-Visual Speech Representations with Contextualized Target Representations. ASRU 2023: 1-8 - [c31]Ju-Chieh Chou, Chung-Ming Chien, Wei-Ning Hsu, Karen Livescu, Arun Babu, Alexis Conneau, Alexei Baevski, Michael Auli:
Toward Joint Language Modeling for Speech Units and Text. EMNLP (Findings) 2023: 6582-6593 - [c30]Ramon Sanabria, Wei-Ning Hsu, Alexei Baevski, Michael Auli:
Measuring the Impact of Domain Factors in Self-Supervised Pre-Training. ICASSP Workshops 2023: 1-5 - [c29]Alexei Baevski, Arun Babu, Wei-Ning Hsu, Michael Auli:
Efficient Self-supervised Learning with Contextualized Target Representations for Vision, Speech and Language. ICML 2023: 1416-1429 - [i37]Jiachen Lian, Alexei Baevski, Wei-Ning Hsu, Michael Auli:
AV-data2vec: Self-supervised Learning of Audio-Visual Speech Representations with Contextualized Target Representations. CoRR abs/2302.06419 (2023) - [i36]Karmesh Yadav, Arjun Majumdar, Ram Ramrakhya, Naoki Yokoyama, Alexei Baevski, Zsolt Kira, Oleksandr Maksymets, Dhruv Batra:
OVRL-V2: A simple state-of-art baseline for ImageNav and ObjectNav. CoRR abs/2303.07798 (2023) - [i35]Vineel Pratap, Andros Tjandra, Bowen Shi, Paden Tomasello, Arun Babu, Sayani Kundu, Ali Elkahky, Zhaoheng Ni, Apoorv Vyas, Maryam Fazel-Zarandi, Alexei Baevski, Yossi Adi, Xiaohui Zhang, Wei-Ning Hsu, Alexis Conneau, Michael Auli:
Scaling Speech Technology to 1, 000+ Languages. CoRR abs/2305.13516 (2023) - [i34]Ju-Chieh Chou, Chung-Ming Chien, Wei-Ning Hsu, Karen Livescu, Arun Babu, Alexis Conneau, Alexei Baevski, Michael Auli:
Toward Joint Language Modeling for Speech Units and Text. CoRR abs/2310.08715 (2023) - 2022
- [c28]Yun Tang, Hongyu Gong, Ning Dong, Changhan Wang, Wei-Ning Hsu, Jiatao Gu, Alexei Baevski, Xian Li, Abdelrahman Mohamed, Michael Auli, Juan Miguel Pino:
Unified Speech-Text Pre-training for Speech Translation and Recognition. ACL (1) 2022: 1488-1499 - [c27]Andros Tjandra, Diptanu Gon Choudhury, Frank Zhang, Kritika Singh, Alexis Conneau, Alexei Baevski, Assaf Sela, Yatharth Saraf, Michael Auli:
Improved Language Identification Through Cross-Lingual Self-Supervised Learning. ICASSP 2022: 6877-6881 - [c26]Alexei Baevski, Wei-Ning Hsu, Qiantong Xu, Arun Babu, Jiatao Gu, Michael Auli:
data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language. ICML 2022: 1298-1312 - [c25]Alexander H. Liu, Cheng-I Lai, Wei-Ning Hsu, Michael Auli, Alexei Baevski, James R. Glass:
Simple and Effective Unsupervised Speech Synthesis. INTERSPEECH 2022: 843-847 - [c24]Qiantong Xu, Alexei Baevski, Michael Auli:
Simple and Effective Zero-shot Cross-lingual Phoneme Recognition. INTERSPEECH 2022: 2113-2117 - [c23]Arun Babu, Changhan Wang, Andros Tjandra, Kushal Lakhotia, Qiantong Xu, Naman Goyal, Kritika Singh, Patrick von Platen, Yatharth Saraf, Juan Pino, Alexei Baevski, Alexis Conneau, Michael Auli:
XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale. INTERSPEECH 2022: 2278-2282 - [c22]Apoorv Vyas, Wei-Ning Hsu, Michael Auli, Alexei Baevski:
On-demand compute reduction with stochastic wav2vec 2.0. INTERSPEECH 2022: 3048-3052 - [c21]Anuroop Sriram, Michael Auli, Alexei Baevski:
Wav2Vec-Aug: Improved self-supervised training with limited data. INTERSPEECH 2022: 4950-4954 - [c20]Po-Yao Huang, Hu Xu, Juncheng Li, Alexei Baevski, Michael Auli, Wojciech Galuba, Florian Metze, Christoph Feichtenhofer:
Masked Autoencoders that Listen. NeurIPS 2022 - [c19]Alexander H. Liu, Wei-Ning Hsu, Michael Auli, Alexei Baevski:
Towards End-to-End Unsupervised Speech Recognition. SLT 2022: 221-228 - [i33]Alexei Baevski, Wei-Ning Hsu, Qiantong Xu, Arun Babu, Jiatao Gu, Michael Auli:
data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language. CoRR abs/2202.03555 (2022) - [i32]Ramon Sanabria, Wei-Ning Hsu, Alexei Baevski, Michael Auli:
Measuring the Impact of Individual Domain Factors in Self-Supervised Pre-Training. CoRR abs/2203.00648 (2022) - [i31]Alexander H. Liu, Wei-Ning Hsu, Michael Auli, Alexei Baevski:
Towards End-to-end Unsupervised Speech Recognition. CoRR abs/2204.02492 (2022) - [i30]Alexander H. Liu, Cheng-I Jeff Lai, Wei-Ning Hsu, Michael Auli, Alexei Baevski, James R. Glass:
Simple and Effective Unsupervised Speech Synthesis. CoRR abs/2204.02524 (2022) - [i29]Yun Tang, Hongyu Gong, Ning Dong, Changhan Wang, Wei-Ning Hsu, Jiatao Gu, Alexei Baevski, Xian Li, Abdelrahman Mohamed, Michael Auli, Juan Miguel Pino:
Unified Speech-Text Pre-training for Speech Translation and Recognition. CoRR abs/2204.05409 (2022) - [i28]Apoorv Vyas, Wei-Ning Hsu, Michael Auli, Alexei Baevski:
On-demand compute reduction with stochastic wav2vec 2.0. CoRR abs/2204.11934 (2022) - [i27]Karmesh Yadav, Ram Ramrakhya, Arjun Majumdar, Vincent-Pierre Berges, Sachit Kuhar, Dhruv Batra, Alexei Baevski, Oleksandr Maksymets:
Offline Visual Representation Learning for Embodied Navigation. CoRR abs/2204.13226 (2022) - [i26]Anuroop Sriram, Michael Auli, Alexei Baevski:
Wav2Vec-Aug: Improved self-supervised training with limited data. CoRR abs/2206.13654 (2022) - [i25]Po-Yao Huang, Hu Xu, Juncheng Li, Alexei Baevski, Michael Auli, Wojciech Galuba, Florian Metze, Christoph Feichtenhofer:
Masked Autoencoders that Listen. CoRR abs/2207.06405 (2022) - [i24]Derek Xu, Shuyan Dong, Changhan Wang, Suyoun Kim, Zhaojiang Lin, Akshat Shrivastava, Shang-Wen Li, Liang-Hsuan Tseng, Alexei Baevski, Guan-Ting Lin, Hung-yi Lee, Yizhou Sun, Wei Wang:
Introducing Semantics into Speech Encoders. CoRR abs/2211.08402 (2022) - [i23]Alexei Baevski, Arun Babu, Wei-Ning Hsu, Michael Auli:
Efficient Self-supervised Learning with Contextualized Target Representations for Vision, Speech and Language. CoRR abs/2212.07525 (2022) - 2021
- [c18]Xian Li, Changhan Wang, Yun Tang, Chau Tran, Yuqing Tang, Juan Miguel Pino, Alexei Baevski, Alexis Conneau, Michael Auli:
Multilingual Speech Translation from Efficient Finetuning of Pretrained Models. ACL/IJCNLP (1) 2021: 827-838 - [c17]Sheng Shen, Alexei Baevski, Ari S. Morcos, Kurt Keutzer, Michael Auli, Douwe Kiela:
Reservoir Transformers. ACL/IJCNLP (1) 2021: 4294-4309 - [c16]Qiantong Xu, Alexei Baevski, Tatiana Likhomanenko, Paden Tomasello, Alexis Conneau, Ronan Collobert, Gabriel Synnaeve, Michael Auli:
Self-Training and Pre-Training are Complementary for Speech Recognition. ICASSP 2021: 3030-3034 - [c15]Henry Zhou, Alexei Baevski, Michael Auli:
A Comparison of Discrete Latent Variable Models for Speech Representation Learning. ICASSP 2021: 3050-3054 - [c14]Wei-Ning Hsu, Anuroop Sriram, Alexei Baevski, Tatiana Likhomanenko, Qiantong Xu, Vineel Pratap, Jacob Kahn, Ann Lee, Ronan Collobert, Gabriel Synnaeve, Michael Auli:
Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training. Interspeech 2021: 721-725 - [c13]Changhan Wang, Anne Wu, Juan Pino, Alexei Baevski, Michael Auli, Alexis Conneau:
Large-Scale Self- and Semi-Supervised Learning for Speech Translation. Interspeech 2021: 2242-2246 - [c12]Alexis Conneau, Alexei Baevski, Ronan Collobert, Abdelrahman Mohamed, Michael Auli:
Unsupervised Cross-Lingual Representation Learning for Speech Recognition. Interspeech 2021: 2426-2430 - [c11]Alexei Baevski, Wei-Ning Hsu, Alexis Conneau, Michael Auli:
Unsupervised Speech Recognition. NeurIPS 2021: 27826-27839 - [i22]Kushal Lakhotia, Evgeny Kharitonov, Wei-Ning Hsu, Yossi Adi, Adam Polyak, Benjamin Bolte, Tu Anh Nguyen, Jade Copet, Alexei Baevski, Adelrahman Mohamed, Emmanuel Dupoux:
Generative Spoken Language Modeling from Raw Audio. CoRR abs/2102.01192 (2021) - [i21]Wei-Ning Hsu, Anuroop Sriram, Alexei Baevski, Tatiana Likhomanenko, Qiantong Xu, Vineel Pratap, Jacob Kahn, Ann Lee, Ronan Collobert, Gabriel Synnaeve, Michael Auli:
Robust wav2vec 2.0: Analyzing Domain Shift in Self-Supervised Pre-Training. CoRR abs/2104.01027 (2021) - [i20]Changhan Wang, Anne Wu, Juan Miguel Pino, Alexei Baevski, Michael Auli, Alexis Conneau:
Large-Scale Self- and Semi-Supervised Learning for Speech Translation. CoRR abs/2104.06678 (2021) - [i19]Alexei Baevski, Wei-Ning Hsu, Alexis Conneau, Michael Auli:
Unsupervised Speech Recognition. CoRR abs/2105.11084 (2021) - [i18]Andros Tjandra, Diptanu Gon Choudhury, Frank Zhang, Kritika Singh, Alexei Baevski, Assaf Sela, Yatharth Saraf, Michael Auli:
Improved Language Identification Through Cross-Lingual Self-Supervised Learning. CoRR abs/2107.04082 (2021) - [i17]Qiantong Xu, Alexei Baevski, Michael Auli:
Simple and Effective Zero-shot Cross-lingual Phoneme Recognition. CoRR abs/2109.11680 (2021) - [i16]Arun Babu, Changhan Wang, Andros Tjandra, Kushal Lakhotia, Qiantong Xu, Naman Goyal, Kritika Singh, Patrick von Platen, Yatharth Saraf, Juan Pino, Alexei Baevski, Alexis Conneau, Michael Auli:
XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale. CoRR abs/2111.09296 (2021) - 2020
- [c10]Alexei Baevski, Abdelrahman Mohamed:
Effectiveness of Self-Supervised Pre-Training for ASR. ICASSP 2020: 7694-7698 - [c9]Alexei Baevski, Steffen Schneider, Michael Auli:
vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations. ICLR 2020 - [c8]Alexei Baevski, Yuhao Zhou, Abdelrahman Mohamed, Michael Auli:
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations. NeurIPS 2020 - [i15]Alexei Baevski, Henry Zhou, Abdelrahman Mohamed, Michael Auli:
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations. CoRR abs/2006.11477 (2020) - [i14]Alexis Conneau, Alexei Baevski, Ronan Collobert, Abdelrahman Mohamed, Michael Auli:
Unsupervised Cross-lingual Representation Learning for Speech Recognition. CoRR abs/2006.13979 (2020) - [i13]Qiantong Xu, Alexei Baevski, Tatiana Likhomanenko, Paden Tomasello, Alexis Conneau, Ronan Collobert, Gabriel Synnaeve, Michael Auli:
Self-training and Pre-training are Complementary for Speech Recognition. CoRR abs/2010.11430 (2020) - [i12]Henry Zhou, Alexei Baevski, Michael Auli:
A Comparison of Discrete Latent Variable Models for Speech Representation Learning. CoRR abs/2010.14230 (2020) - [i11]Tu Anh Nguyen, Maureen de Seyssel, Patricia Rozé, Morgane Rivière, Evgeny Kharitonov, Alexei Baevski, Ewan Dunbar, Emmanuel Dupoux:
The Zero Resource Speech Benchmark 2021: Metrics and baselines for unsupervised spoken language modeling. CoRR abs/2011.11588 (2020) - [i10]Sheng Shen, Alexei Baevski, Ari S. Morcos, Kurt Keutzer, Michael Auli, Douwe Kiela:
Reservoir Transformer. CoRR abs/2012.15045 (2020)
2010 – 2019
- 2019
- [c7]Alexei Baevski, Sergey Edunov, Yinhan Liu, Luke Zettlemoyer, Michael Auli:
Cloze-driven Pretraining of Self-attention Networks. EMNLP/IJCNLP (1) 2019: 5359-5368 - [c6]Alexei Baevski, Michael Auli:
Adaptive Input Representations for Neural Language Modeling. ICLR (Poster) 2019 - [c5]Felix Wu, Angela Fan, Alexei Baevski, Yann N. Dauphin, Michael Auli:
Pay Less Attention with Lightweight and Dynamic Convolutions. ICLR 2019 - [c4]Steffen Schneider, Alexei Baevski, Ronan Collobert, Michael Auli:
wav2vec: Unsupervised Pre-Training for Speech Recognition. INTERSPEECH 2019: 3465-3469 - [c3]Myle Ott, Sergey Edunov, Alexei Baevski, Angela Fan, Sam Gross, Nathan Ng, David Grangier, Michael Auli:
fairseq: A Fast, Extensible Toolkit for Sequence Modeling. NAACL-HLT (Demonstrations) 2019: 48-53 - [c2]Sergey Edunov, Alexei Baevski, Michael Auli:
Pre-trained language model representations for language generation. NAACL-HLT (1) 2019: 4052-4059 - [c1]Nathan Ng, Kyra Yee, Alexei Baevski, Myle Ott, Michael Auli, Sergey Edunov:
Facebook FAIR's WMT19 News Translation Task Submission. WMT (2) 2019: 314-319 - [i9]Felix Wu, Angela Fan, Alexei Baevski, Yann N. Dauphin, Michael Auli:
Pay Less Attention with Lightweight and Dynamic Convolutions. CoRR abs/1901.10430 (2019) - [i8]Alexei Baevski, Sergey Edunov, Yinhan Liu, Luke Zettlemoyer, Michael Auli:
Cloze-driven Pretraining of Self-attention Networks. CoRR abs/1903.07785 (2019) - [i7]Sergey Edunov, Alexei Baevski, Michael Auli:
Pre-trained Language Model Representations for Language Generation. CoRR abs/1903.09722 (2019) - [i6]Myle Ott, Sergey Edunov, Alexei Baevski, Angela Fan, Sam Gross, Nathan Ng, David Grangier, Michael Auli:
fairseq: A Fast, Extensible Toolkit for Sequence Modeling. CoRR abs/1904.01038 (2019) - [i5]Steffen Schneider, Alexei Baevski, Ronan Collobert, Michael Auli:
wav2vec: Unsupervised Pre-training for Speech Recognition. CoRR abs/1904.05862 (2019) - [i4]Nathan Ng, Kyra Yee, Alexei Baevski, Myle Ott, Michael Auli, Sergey Edunov:
Facebook FAIR's WMT19 News Translation Task Submission. CoRR abs/1907.06616 (2019) - [i3]Alexei Baevski, Steffen Schneider, Michael Auli:
vq-wav2vec: Self-Supervised Learning of Discrete Speech Representations. CoRR abs/1910.05453 (2019) - [i2]Alexei Baevski, Michael Auli, Abdelrahman Mohamed:
Effectiveness of self-supervised pre-training for speech recognition. CoRR abs/1911.03912 (2019) - 2018
- [i1]Alexei Baevski, Michael Auli:
Adaptive Input Representations for Neural Language Modeling. CoRR abs/1809.10853 (2018)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-09-18 00:14 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint