Abdel-rahman Mohamed

Abdelrahman Mohamed

2020 – today

2025
[c68]
  - https://dblp.org/rec/conf/aisi/AkramMMAA25
Omar Akram, Abdelrahman Mohamed, Hager Magdy, Mariam M. Abdellatif, Sara Abdelghafar:
Comparative Analysis of Custom CNN Architecture and MobileNet for Deepfake Image Detection. AISI 2025: 58-68
[i67]
  - https://dblp.org/rec/journals/corr/abs-2503-21910
Karima Kadaoui, Hanin Atwany, Hamdan Al-Ali, Abdelrahman Mohamed, Ali Mekky, Sergei Tilga, Natalia Fedorova, Ekaterina Artemova, Hanan Aldarmaki, Yova Kementchedjhieva:
JEEM: Vision-Language Understanding in Four Arabic Dialects. CoRR abs/2503.21910 (2025)
2024
[j12]
  - https://dblp.org/rec/journals/taslp/YangCHLLWSCTHFCLCHTLLMWL24
Shu-Wen Yang, Heng-Jui Chang, Zili Huang, Andy T. Liu, Cheng-I Lai, Haibin Wu, Jiatong Shi, Xuankai Chang, Hsiang-Sheng Tsai, Wen-Chin Huang, Tzu-hsun Feng, Po-Han Chi, Yist Y. Lin, Yung-Sung Chuang, Tzu-Hsien Huang, Wei-Cheng Tseng, Kushal Lakhotia, Shang-Wen Li, Abdelrahman Mohamed, Shinji Watanabe, Hung-yi Lee:
A Large-Scale Evaluation of Speech Foundation Models. IEEE ACM Trans. Audio Speech Lang. Process. 32: 2884-2899 (2024)
[c67]
  - https://dblp.org/rec/conf/acl/Peng00MH24
Puyuan Peng, Po-Yao Huang, Shang-Wen Li, Abdelrahman Mohamed, David Harwath:
VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild. ACL (1) 2024: 12442-12462
[c66]
  - https://dblp.org/rec/conf/acl/AlwajihNBMA24
Fakhraddin Alwajih, El Moatez Billah Nagoudi, Gagan Bhatia, Abdelrahman Mohamed, Muhammad Abdul-Mageed:
Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks. ACL (1) 2024: 12753-12776
[c65]
  - https://dblp.org/rec/conf/emnlp/TalafhaKMHCEZTA24
Bashar Talafha, Karima Kadaoui, Samar Mohamed Magdy, Mariem Habiboullah, Chafei Mohamed Chafei, Ahmed Oumar El-Shangiti, Hiba Zayed, Mohamedou Cheikh Tourad, Rahaf Alhamouri, Rwaa Assi, Aisha Alraeesi, Hour Mohamed, Fakhraddin Alwajih, Abdelrahman Mohamed, Abdellah El Mekki, El Moatez Billah Nagoudi, Benelhadj Saadia, Hamzah A. Alsayadi, Walid Al-Dhabyani, Sara Shatnawi, Yasir Ech-Chammakhy, Amal Makouar, Yousra Berrachedi, Mustafa Jarrar, Shady Shehata, Ismail Berrada, Muhammad Abdul-Mageed:
Casablanca: Data and Models for Multidialectal Arabic Speech Recognition. EMNLP 2024: 21745-21758
[c64]
  - https://dblp.org/rec/conf/icassp/TsengBCCLLPSWW024
Yuan Tseng, Layne Berry, Yiting Chen, I-Hsiang Chiu, Hsuan-Hao Lin, Max Liu, Puyuan Peng, Yi-Jen Shih, Hung-Yu Wang, Haibin Wu, Poyao Huang, Chun-Mao Lai, Shang-Wen Li, David Harwath, Yu Tsao, Abdelrahman Mohamed, Chi-Luen Feng, Hung-Yi Lee:
AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models. ICASSP 2024: 6890-6894
[c63]
  - https://dblp.org/rec/conf/icassp/ChoMBA24
Cheol Jun Cho, Abdelrahman Mohamed, Alan W. Black, Gopala Krishna Anumanchipalli:
Self-Supervised Models of Speech Infer Universal Articulatory Kinematics. ICASSP 2024: 12061-12065
[c62]
  - https://dblp.org/rec/conf/icassp/ChoM0BA24
Cheol Jun Cho, Abdelrahman Mohamed, Shang-Wen Li, Alan W. Black, Gopala Krishna Anumanchipalli:
SD-HuBERT: Sentence-Level Self-Distillation Induces Syllabic Organization in HuBERT. ICASSP 2024: 12076-12080
[c61]
  - https://dblp.org/rec/conf/icassp/LinLCW0MLL24
Chyi-Jiunn Lin, Guan-Ting Lin, Yung-Sung Chuang, Wei-Lun Wu, Shang-Wen Li, Abdelrahman Mohamed, Hung-Yi Lee, Lin-Shan Lee:
SpeechDPR: End-to-End Spoken Passage Retrieval for Open-Domain Spoken Question Answering. ICASSP 2024: 12476-12480
[i66]
  - https://dblp.org/rec/journals/corr/abs-2401-13463
Chyi-Jiunn Lin, Guan-Ting Lin, Yung-Sung Chuang, Wei-Lun Wu, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Lin-Shan Lee:
SpeechDPR: End-to-End Spoken Passage Retrieval for Open-Domain Spoken Question Answering. CoRR abs/2401.13463 (2024)
[i65]
  - https://dblp.org/rec/journals/corr/abs-2403-01031
Fakhraddin Alwajih, El Moatez Billah Nagoudi, Gagan Bhatia, Abdelrahman Mohamed, Muhammad Abdul-Mageed:
Peacock: A Family of Arabic Multimodal Large Language Models and Benchmarks. CoRR abs/2403.01031 (2024)
[i64]
  - https://dblp.org/rec/journals/corr/abs-2403-16973
Puyuan Peng, Po-Yao Huang, Daniel Li, Abdelrahman Mohamed, David Harwath:
VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild. CoRR abs/2403.16973 (2024)
[i63]
  - https://dblp.org/rec/journals/corr/abs-2404-09385
Shu-Wen Yang, Heng-Jui Chang, Zili Huang, Andy T. Liu, Cheng-I Lai, Haibin Wu, Jiatong Shi, Xuankai Chang, Hsiang-Sheng Tsai, Wen-Chin Huang, Tzu-hsun Feng, Po-Han Chi, Yist Y. Lin, Yung-Sung Chuang, Tzu-Hsien Huang, Wei-Cheng Tseng, Kushal Lakhotia, Shang-Wen Li, Abdelrahman Mohamed, Shinji Watanabe, Hung-yi Lee:
A Large-Scale Evaluation of Speech Foundation Models. CoRR abs/2404.09385 (2024)
[i62]
  - https://dblp.org/rec/journals/corr/abs-2409-19641
Xinyue Zhang, Jiaqi Yang, Xiangting Meng, Abdelrahman Mohamed, Laurent Kneip:
fCOP: Focal Length Estimation from Category-level Object Priors. CoRR abs/2409.19641 (2024)
[i61]
  - https://dblp.org/rec/journals/corr/abs-2410-04527
Bashar Talafha, Karima Kadaoui, Samar Mohamed Magdy, Mariem Habiboullah, Chafei Mohamed Chafei, Ahmed Oumar El-Shangiti, Hiba Zayed, Mohamedou Cheikh Tourad, Rahaf Alhamouri, Rwaa Assi, Aisha Alraeesi, Hour Mohamed, Fakhraddin Alwajih, Abdelrahman Mohamed, Abdellah El Mekki, El Moatez Billah Nagoudi, Benelhadj Djelloul Mama Saadia, Hamzah A. Alsayadi, Walid Al-Dhabyani, Sara Shatnawi, Yasir Ech-Chammakhy, Amal Makouar, Yousra Berrachedi, Mustafa Jarrar, Shady Shehata, Ismail Berrada, Muhammad Abdul-Mageed:
Casablanca: Data and Models for Multidialectal Arabic Speech Recognition. CoRR abs/2410.04527 (2024)
2023
[j11]
  - https://dblp.org/rec/journals/spm/YuGPRHPZSCBM23
Dong Yu, Yifan Gong, Michael A. Picheny, Bhuvana Ramabhadran, Dilek Hakkani-Tür, Rohit Prasad, Heiga Zen, Jan Skoglund, Jan Honza Cernocký, Lukás Burget, Abdelrahman Mohamed:
Twenty-Five Years of Evolution in Speech and Language Processing. IEEE Signal Process. Mag. 40(5): 27-39 (2023)
[j10]
  - https://dblp.org/rec/journals/tacl/NguyenKCAHETASM23
Tu Anh Nguyen, Eugene Kharitonov, Jade Copet, Yossi Adi, Wei-Ning Hsu, Ali Elkahky, Paden Tomasello, Robin Algayres, Benoît Sagot, Abdelrahman Mohamed, Emmanuel Dupoux:
Generative Spoken Dialogue Language Modeling. Trans. Assoc. Comput. Linguistics 11: 250-266 (2023)
[j9]
  - https://dblp.org/rec/journals/taslp/DalmiaOLEWMZM23
Siddharth Dalmia, Dmytro Okhonko, Mike Lewis, Sergey Edunov, Shinji Watanabe, Florian Metze, Luke Zettlemoyer, Abdelrahman Mohamed:
LegoNN: Building Modular Encoder-Decoder Models. IEEE ACM Trans. Audio Speech Lang. Process. 31: 3112-3126 (2023)
[c60]
  - https://dblp.org/rec/conf/asru/ShiCBWHHCCTLMLW23
Jiatong Shi, William Chen, Dan Berrebbi, Hsiu-Hsuan Wang, Wei-Ping Huang, En-Pei Hu, Ho-Lam Chung, Xuankai Chang, Yuxun Tang, Shang-Wen Li, Abdelrahman Mohamed, Hung-Yi Lee, Shinji Watanabe:
Findings of the 2023 ML-SUPERB Challenge: Pre-Training and Evaluation over More Languages and Beyond. ASRU 2023: 1-8
[c59]
  - https://dblp.org/rec/conf/cvpr/MohamedGJKK23
Abdelrahman Mohamed, Rushali Grandhe, K. J. Joseph, Salman H. Khan, Fahad Shahbaz Khan:
D³Former: Debiased Dual Distilled Transformer for Incremental Learning. CVPR Workshops 2023: 2421-2430
[c58]
  - https://dblp.org/rec/conf/icassp/ChoWMA23
Cheol Jun Cho, Peter Wu, Abdelrahman Mohamed, Gopala Krishna Anumanchipalli:
Evidence of Vocal Tract Articulation in Self-Supervised Learning of Speech. ICASSP 2023: 1-5
[c57]
  - https://dblp.org/rec/conf/icassp/DiwanYHTCHM23
Anuj Diwan, Ching-Feng Yeh, Wei-Ning Hsu, Paden Tomasello, Eunsol Choi, David Harwath, Abdelrahman Mohamed:
Continual Learning for On-Device Speech Recognition Using Disentangled Conformers. ICASSP 2023: 1-5
[c56]
  - https://dblp.org/rec/conf/icassp/ElkahkyHTNAACDM23
Ali Elkahky, Wei-Ning Hsu, Paden Tomasello, Tu Anh Nguyen, Robin Algayres, Yossi Adi, Jade Copet, Emmanuel Dupoux, Abdelrahman Mohamed:
Do Coarser Units Benefit Cluster Prediction-Based Speech Pre-Training? ICASSP 2023: 1-5
[c55]
  - https://dblp.org/rec/conf/icassp/TjandraSZKMLS23
Andros Tjandra, Nayan Singhal, David Zhang, Ozlem Kalinli, Abdelrahman Mohamed, Duc Le, Michael L. Seltzer:
Massively Multilingual ASR on 70 Languages: Tokenization, Architecture, and Generalization Capabilities. ICASSP 2023: 1-5
[c54]
  - https://dblp.org/rec/conf/interspeech/Peng0RMH23
Puyuan Peng, Shang-Wen Li, Okko Räsänen, Abdelrahman Mohamed, David Harwath:
Syllable Discovery and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model. INTERSPEECH 2023: 391-395
[c53]
  - https://dblp.org/rec/conf/interspeech/ShiBCHHCC0ML023
Jiatong Shi, Dan Berrebbi, William Chen, En-Pei Hu, Wei-Ping Huang, Ho-Lam Chung, Xuankai Chang, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Shinji Watanabe:
ML-SUPERB: Multilingual Speech Universal PERformance Benchmark. INTERSPEECH 2023: 884-888
[c52]
  - https://dblp.org/rec/conf/interspeech/KreyssigSGSMW23
Florian L. Kreyssig, Yangyang Shi, Jinxi Guo, Leda Sari, Abdel-rahman Mohamed, Philip C. Woodland:
Biased Self-supervised Learning for ASR. INTERSPEECH 2023: 4948-4952
[c51]
  - https://dblp.org/rec/conf/wanlp/MohamedANIA23
Abdelrahman Mohamed, Fakhraddin Alwajih, El Moatez Billah Nagoudi, Alcides Alcoba Inciarte, Muhammad Abdul-Mageed:
Violet: A Vision-Language Model for Arabic Image Captioning with Gemini Decoder. ArabicNLP 2023: 1-11
[i60]
  - https://dblp.org/rec/journals/corr/abs-2301-00652
Ching-Feng Yeh, Wei-Ning Hsu, Paden Tomasello, Abdelrahman Mohamed:
Efficient Speech Representation Learning with Low-Bit Quantization. CoRR abs/2301.00652 (2023)
[i59]
  - https://dblp.org/rec/journals/corr/abs-2305-10615
Jiatong Shi, Dan Berrebbi, William Chen, Ho-Lam Chung, En-Pei Hu, Wei-Ping Huang, Xuankai Chang, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Shinji Watanabe:
ML-SUPERB: Multilingual Speech Universal PERformance Benchmark. CoRR abs/2305.10615 (2023)
[i58]
  - https://dblp.org/rec/journals/corr/abs-2305-11435
Puyuan Peng, Shang-Wen Li, Okko Räsänen, Abdelrahman Mohamed, David Harwath:
Syllable Discovery and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model. CoRR abs/2305.11435 (2023)
[i57]
  - https://dblp.org/rec/journals/corr/abs-2309-10787
Yuan Tseng, Layne Berry, Yi-Ting Chen, I-Hsiang Chiu, Hsuan-Hao Lin, Max Liu, Puyuan Peng, Yi-Jen Shih, Hung-Yu Wang, Haibin Wu, Po-Yao Huang, Chun-Mao Lai, Shang-Wen Li, David Harwath, Yu Tsao, Shinji Watanabe, Abdelrahman Mohamed, Chi-Luen Feng, Hung-yi Lee:
AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models. CoRR abs/2309.10787 (2023)
[i56]
  - https://dblp.org/rec/journals/corr/abs-2309-17020
Po-Chun Hsu, Ali Elkahky, Wei-Ning Hsu, Yossi Adi, Tu Anh Nguyen, Jade Copet, Emmanuel Dupoux, Hung-yi Lee, Abdelrahman Mohamed:
Low-Resource Self-Supervised Learning with SSL-Enhanced TTS. CoRR abs/2309.17020 (2023)
[i55]
  - https://dblp.org/rec/journals/corr/abs-2310-05513
Jiatong Shi, William Chen, Dan Berrebbi, Hsiu-Hsuan Wang, Wei-Ping Huang, En-Pei Hu, Ho-Lam Chung, Xuankai Chang, Yuxun Tang, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Shinji Watanabe:
Findings of the 2023 ML-SUPERB Challenge: Pre-Training and Evaluation over More Languages and Beyond. CoRR abs/2310.05513 (2023)
[i54]
  - https://dblp.org/rec/journals/corr/abs-2310-10788
Cheol Jun Cho, Abdelrahman Mohamed, Alan W. Black, Gopala Krishna Anumanchipalli:
Self-Supervised Models of Speech Infer Universal Articulatory Kinematics. CoRR abs/2310.10788 (2023)
[i53]
  - https://dblp.org/rec/journals/corr/abs-2310-10803
Cheol Jun Cho, Abdelrahman Mohamed, Shang-Wen Li, Alan W. Black, Gopala Krishna Anumanchipalli:
SD-HuBERT: Self-Distillation Induces Syllabic Organization in HuBERT. CoRR abs/2310.10803 (2023)
[i52]
  - https://dblp.org/rec/journals/corr/abs-2311-08844
Abdelrahman Mohamed, Fakhraddin Alwajih, El Moatez Billah Nagoudi, Alcides Alcoba Inciarte, Muhammad Abdul-Mageed:
Violet: A Vision-Language Model for Arabic Image Captioning with Gemini Decoder. CoRR abs/2311.08844 (2023)
2022
[j8]
  - https://dblp.org/rec/journals/jstsp/LeeWLMS22
Hung-Yi Lee, Shinji Watanabe, Karen Livescu, Abdelrahman Mohamed, Tara N. Sainath:
Editorial Editorial of Special Issue on Self-Supervised Learning for Speech and Audio Processing. IEEE J. Sel. Top. Signal Process. 16(6): 1174-1178 (2022)
[j7]
  - https://dblp.org/rec/journals/jstsp/MohamedLBHEIKLL22
Abdelrahman Mohamed, Hung-yi Lee, Lasse Borgholt, Jakob D. Havtorn, Joakim Edin, Christian Igel, Katrin Kirchhoff, Shang-Wen Li, Karen Livescu, Lars Maaløe, Tara N. Sainath, Shinji Watanabe:
Self-Supervised Speech Representation Learning: A Review. IEEE J. Sel. Top. Signal Process. 16(6): 1179-1210 (2022)
[j6]
  - https://dblp.org/rec/journals/tacl/AlgayresRKLZMSD22
Robin Algayres, Tristan Ricoul, Julien Karadayi, Hugo Laurençon, Mohamed Salah Zaïem, Abdelrahman Mohamed, Benoît Sagot, Emmanuel Dupoux:
DP-Parse: Finding Word Boundaries from Raw Speech with an Instance Lexicon. Trans. Assoc. Comput. Linguistics 10: 1051-1065 (2022)
[c50]
  - https://dblp.org/rec/conf/acl/TangGDWHGBLMAP22
Yun Tang, Hongyu Gong, Ning Dong, Changhan Wang, Wei-Ning Hsu, Jiatao Gu, Alexei Baevski, Xian Li, Abdelrahman Mohamed, Michael Auli, Juan Miguel Pino:
Unified Speech-Text Pre-training for Speech Translation and Recognition. ACL (1) 2022: 1488-1499
[c49]
  - https://dblp.org/rec/conf/acl/TsaiCHHLYDLLSCH22
Hsiang-Sheng Tsai, Heng-Jui Chang, Wen-Chin Huang, Zili Huang, Kushal Lakhotia, Shu-Wen Yang, Shuyan Dong, Andy T. Liu, Cheng-I Lai, Jiatong Shi, Xuankai Chang, Phil Hall, Hsuan-Jui Chen, Shang-Wen Li, Shinji Watanabe, Abdelrahman Mohamed, Hung-yi Lee:
SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities. ACL (1) 2022: 8479-8492
[c48]
  - https://dblp.org/rec/conf/acl/KharitonovLPACL22
Eugene Kharitonov, Ann Lee, Adam Polyak, Yossi Adi, Jade Copet, Kushal Lakhotia, Tu Anh Nguyen, Morgane Rivière, Abdelrahman Mohamed, Emmanuel Dupoux, Wei-Ning Hsu:
Text-Free Prosody-Aware Generative Spoken Language Modeling. ACL (1) 2022: 8666-8681
[c47]
  - https://dblp.org/rec/conf/emnlp/KreukPCKNRHMDA22
Felix Kreuk, Adam Polyak, Jade Copet, Eugene Kharitonov, Tu Anh Nguyen, Morgane Rivière, Wei-Ning Hsu, Abdelrahman Mohamed, Emmanuel Dupoux, Yossi Adi:
Textless Speech Emotion Conversion using Discrete & Decomposed Representations. EMNLP 2022: 11200-11214
[c46]
  - https://dblp.org/rec/conf/iclr/ShiHLM22
Bowen Shi, Wei-Ning Hsu, Kushal Lakhotia, Abdelrahman Mohamed:
Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction. ICLR 2022
[c45]
  - https://dblp.org/rec/conf/icml/PillutlaMMRS022
Krishna Pillutla, Kshitiz Malik, Abdelrahman Mohamed, Michael G. Rabbat, Maziar Sanjabi, Lin Xiao:
Federated Learning with Partial Model Personalization. ICML 2022: 17716-17758
[c44]
  - https://dblp.org/rec/conf/interspeech/ShiHM22
Bowen Shi, Wei-Ning Hsu, Abdelrahman Mohamed:
Robust Self-Supervised Audio-Visual Speech Recognition. INTERSPEECH 2022: 2118-2122
[c43]
  - https://dblp.org/rec/conf/interspeech/ShiMH22
Bowen Shi, Abdelrahman Mohamed, Wei-Ning Hsu:
Learning Lip-Based Audio-Visual Speaker Embeddings with AV-HuBERT. INTERSPEECH 2022: 4785-4789
[c42]
  - https://dblp.org/rec/conf/interspeech/ZhengXKL0FKSM22
Weiyi Zheng, Alex Xiao, Gil Keren, Duc Le, Frank Zhang, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Abdelrahman Mohamed:
Scaling ASR Improves Zero and Few Shot Learning. INTERSPEECH 2022: 5135-5139
[c41]
  - https://dblp.org/rec/conf/interspeech/LinCCYCD0MLL22
Guan-Ting Lin, Yung-Sung Chuang, Ho-Lam Chung, Shu-Wen Yang, Hsuan-Jui Chen, Shuyan Annie Dong, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Lin-Shan Lee:
DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question Answering. INTERSPEECH 2022: 5165-5169
[c40]
  - https://dblp.org/rec/conf/slt/TomaselloSLHLSECHAANDZM22
Paden Tomasello, Akshat Shrivastava, Daniel Lazar, Po-Chun Hsu, Duc Le, Adithya Sagar, Ali Elkahky, Jade Copet, Wei-Ning Hsu, Yossi Adi, Robin Algayres, Tu Anh Nguyen, Emmanuel Dupoux, Luke Zettlemoyer, Abdelrahman Mohamed:
STOP: A Dataset for Spoken Task Oriented Semantic Parsing. SLT 2022: 991-998
[c39]
  - https://dblp.org/rec/conf/slt/FengDYYLSCHWCWMLL22
Tzu-hsun Feng, Shuyan Annie Dong, Ching-Feng Yeh, Shu-Wen Yang, Tzu-Quan Lin, Jiatong Shi, Kai-Wei Chang, Zili Huang, Haibin Wu, Xuankai Chang, Shinji Watanabe, Abdelrahman Mohamed, Shang-Wen Li, Hung-yi Lee:
SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning. SLT 2022: 1096-1103
[i51]
  - https://dblp.org/rec/journals/corr/abs-2201-01763
Bowen Shi, Wei-Ning Hsu, Abdelrahman Mohamed:
Robust Self-Supervised Audio-Visual Speech Recognition. CoRR abs/2201.01763 (2022)
[i50]
  - https://dblp.org/rec/journals/corr/abs-2201-02184
Bowen Shi, Wei-Ning Hsu, Kushal Lakhotia, Abdelrahman Mohamed:
Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction. CoRR abs/2201.02184 (2022)
[i49]
  - https://dblp.org/rec/journals/corr/abs-2201-08763
Hashmat Shadab Malik, Ikboljon Sobirov, Abdelrahman Mohamed:
Object Detection in Aerial Images: What Improves the Accuracy? CoRR abs/2201.08763 (2022)
[i48]
  - https://dblp.org/rec/journals/corr/abs-2202-07359
Eugene Kharitonov, Jade Copet, Kushal Lakhotia, Tu Anh Nguyen, Paden Tomasello, Ann Lee, Ali Elkahky, Wei-Ning Hsu, Abdelrahman Mohamed, Emmanuel Dupoux, Yossi Adi:
textless-lib: a Library for Textless Spoken Language Processing. CoRR abs/2202.07359 (2022)
[i47]
  - https://dblp.org/rec/journals/corr/abs-2203-04911
Guan-Ting Lin, Yung-Sung Chuang, Ho-Lam Chung, Shu-Wen Yang, Hsuan-Jui Chen, Shuyan Dong, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Lin-Shan Lee:
DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question Answering. CoRR abs/2203.04911 (2022)
[i46]
  - https://dblp.org/rec/journals/corr/abs-2203-06849
Hsiang-Sheng Tsai, Heng-Jui Chang, Wen-Chin Huang, Zili Huang, Kushal Lakhotia, Shu-Wen Yang, Shuyan Dong, Andy T. Liu, Cheng-I Jeff Lai, Jiatong Shi, Xuankai Chang, Phil Hall, Hsuan-Jui Chen, Shang-Wen Li, Shinji Watanabe, Abdelrahman Mohamed, Hung-yi Lee:
SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities. CoRR abs/2203.06849 (2022)
[i45]
  - https://dblp.org/rec/journals/corr/abs-2203-16502
Tu Anh Nguyen, Eugene Kharitonov, Jade Copet, Yossi Adi, Wei-Ning Hsu, Ali Elkahky, Paden Tomasello, Robin Algayres, Benoît Sagot, Abdelrahman Mohamed, Emmanuel Dupoux:
Generative Spoken Dialogue Language Modeling. CoRR abs/2203.16502 (2022)
[i44]
  - https://dblp.org/rec/journals/corr/abs-2204-03809
Krishna Pillutla, Kshitiz Malik, Abdelrahman Mohamed, Michael G. Rabbat, Maziar Sanjabi, Lin Xiao:
Federated Learning with Partial Model Personalization. CoRR abs/2204.03809 (2022)
[i43]
  - https://dblp.org/rec/journals/corr/abs-2204-05409
Yun Tang, Hongyu Gong, Ning Dong, Changhan Wang, Wei-Ning Hsu, Jiatao Gu, Alexei Baevski, Xian Li, Abdelrahman Mohamed, Michael Auli, Juan Miguel Pino:
Unified Speech-Text Pre-training for Speech Translation and Recognition. CoRR abs/2204.05409 (2022)
[i42]
  - https://dblp.org/rec/journals/corr/abs-2205-07180
Bowen Shi, Abdelrahman Mohamed, Wei-Ning Hsu:
Learning Lip-Based Audio-Visual Speaker Embeddings with AV-HuBERT. CoRR abs/2205.07180 (2022)
[i41]
  - https://dblp.org/rec/journals/corr/abs-2205-10643
Abdelrahman Mohamed, Hung-yi Lee, Lasse Borgholt, Jakob D. Havtorn, Joakim Edin, Christian Igel, Katrin Kirchhoff, Shang-Wen Li, Karen Livescu, Lars Maaløe, Tara N. Sainath, Shinji Watanabe:
Self-Supervised Speech Representation Learning: A Review. CoRR abs/2205.10643 (2022)
[i40]
  - https://dblp.org/rec/journals/corr/abs-2206-03318
Siddharth Dalmia, Dmytro Okhonko, Mike Lewis, Sergey Edunov, Shinji Watanabe, Florian Metze, Luke Zettlemoyer, Abdelrahman Mohamed:
LegoNN: Building Modular Encoder-Decoder Models. CoRR abs/2206.03318 (2022)
[i39]
  - https://dblp.org/rec/journals/corr/abs-2206-11332
Robin Algayres, Tristan Ricoul, Julien Karadayi, Hugo Laurençon, Mohamed Salah Zaïem, Abdelrahman Mohamed, Benoît Sagot, Emmanuel Dupoux:
DP-Parse: Finding Word Boundaries from Raw Speech with an Instance Lexicon. CoRR abs/2206.11332 (2022)
[i38]
  dblp key:
  - journals/corr/abs-2207-10643
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-10643
Paden Tomasello, Akshat Shrivastava, Daniel Lazar, Po-Chun Hsu, Duc Le, Adithya Sagar, Ali Elkahky, Jade Copet, Wei-Ning Hsu, Yossef Mordechay, Robin Algayres, Tu Anh Nguyen, Emmanuel Dupoux, Luke Zettlemoyer, Abdelrahman Mohamed:
STOP: A dataset for Spoken Task Oriented Semantic Parsing. CoRR abs/2207.10643 (2022)
[i37]
  dblp key:
  - journals/corr/abs-2208-00777
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-00777
Abdelrahman Mohamed, Rushali Grandhe, K. J. Joseph, Salman Khan, Fahad Shahbaz Khan:
D³Former: Debiased Dual Distilled Transformer for Incremental Learning. CoRR abs/2208.00777 (2022)
[i36]
  dblp key:
  - journals/corr/abs-2210-08634
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-08634
Tzu-hsun Feng, Shuyan Annie Dong, Ching-Feng Yeh, Shu-Wen Yang, Tzu-Quan Lin, Jiatong Shi, Kai-Wei Chang, Zili Huang, Haibin Wu, Xuankai Chang, Shinji Watanabe, Abdelrahman Mohamed, Shang-Wen Li, Hung-yi Lee:
SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning. CoRR abs/2210.08634 (2022)
[i35]
  dblp key:
  - journals/corr/abs-2210-11723
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-11723
Cheol Jun Cho, Peter Wu, Abdelrahman Mohamed, Gopala Krishna Anumanchipalli:
Evidence of Vocal Tract Articulation in Self-Supervised Learning of Speech. CoRR abs/2210.11723 (2022)
[i34]
  dblp key:
  - journals/corr/abs-2211-02536
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-02536
Florian L. Kreyssig, Yangyang Shi, Jinxi Guo, Leda Sari, Abdelrahman Mohamed, Philip C. Woodland:
Biased Self-supervised learning for ASR. CoRR abs/2211.02536 (2022)
[i33]
  dblp key:
  - journals/corr/abs-2211-05756
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-05756
Andros Tjandra, Nayan Singhal, David Zhang, Ozlem Kalinli, Abdelrahman Mohamed, Duc Le, Michael L. Seltzer:
Massively Multilingual ASR on 70 Languages: Tokenization, Architecture, and Generalization Capabilities. CoRR abs/2211.05756 (2022)
[i32]
  dblp key:
  - journals/corr/abs-2212-01393
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-01393
Anuj Diwan, Ching-Feng Yeh, Wei-Ning Hsu, Paden Tomasello, Eunsol Choi, David Harwath, Abdelrahman Mohamed:
Continual Learning for On-Device Speech Recognition using Disentangled Conformers. CoRR abs/2212.01393 (2022)
2021
[j5]
  dblp key:
  - journals/taslp/HsuBTLSM21
  persistent URL:
  - https://dblp.org/rec/journals/taslp/HsuBTLSM21
Wei-Ning Hsu, Benjamin Bolte, Yao-Hung Hubert Tsai, Kushal Lakhotia, Ruslan Salakhutdinov, Abdelrahman Mohamed:
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units. IEEE ACM Trans. Audio Speech Lang. Process. 29: 3451-3460 (2021)
[c38]
  dblp key:
  - conf/asru/ManoharLXHCSZM21
  persistent URL:
  - https://dblp.org/rec/conf/asru/ManoharLXHCSZM21
Vimal Manohar, Tatiana Likhomanenko, Qiantong Xu, Wei-Ning Hsu, Ronan Collobert, Yatharth Saraf, Geoffrey Zweig, Abdelrahman Mohamed:
Kaizen: Continuously Improving Teacher Using Exponential Moving Average for Semi-Supervised Speech Recognition. ASRU 2021: 518-525
[c37]
  dblp key:
  - conf/icassp/XiaoFM21
  persistent URL:
  - https://dblp.org/rec/conf/icassp/XiaoFM21
Alex Xiao, Christian Fuegen, Abdelrahman Mohamed:
Contrastive Semi-Supervised Learning for ASR. ICASSP 2021: 3870-3874
[c36]
  dblp key:
  - conf/icassp/HsuTBSM21
  persistent URL:
  - https://dblp.org/rec/conf/icassp/HsuTBSM21
Wei-Ning Hsu, Yao-Hung Hubert Tsai, Benjamin Bolte, Ruslan Salakhutdinov, Abdelrahman Mohamed:
HuBERT: How Much Can a Bad Teacher Benefit ASR Pre-Training? ICASSP 2021: 6533-6537
[c35]
  dblp key:
  - conf/interspeech/YangCCLLLLSCLHT21
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YangCCLLLLSCLHT21
Shu-Wen Yang, Po-Han Chi, Yung-Sung Chuang, Cheng-I Jeff Lai, Kushal Lakhotia, Yist Y. Lin, Andy T. Liu, Jiatong Shi, Xuankai Chang, Guan-Ting Lin, Tzu-Hsien Huang, Wei-Cheng Tseng, Ko-tik Lee, Da-Rong Liu, Zili Huang, Shuyan Dong, Shang-Wen Li, Shinji Watanabe, Abdelrahman Mohamed, Hung-yi Lee:
SUPERB: Speech Processing Universal PERformance Benchmark. Interspeech 2021: 1194-1198
[c34]
  dblp key:
  - conf/interspeech/ConneauBCMA21
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ConneauBCMA21
Alexis Conneau, Alexei Baevski, Ronan Collobert, Abdelrahman Mohamed, Michael Auli:
Unsupervised Cross-Lingual Representation Learning for Speech Recognition. Interspeech 2021: 2426-2430
[c33]
  dblp key:
  - conf/interspeech/PolyakACKLHMD21
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PolyakACKLHMD21
Adam Polyak, Yossi Adi, Jade Copet, Eugene Kharitonov, Kushal Lakhotia, Wei-Ning Hsu, Abdelrahman Mohamed, Emmanuel Dupoux:
Speech Resynthesis from Discrete Disentangled Self-Supervised Representations. Interspeech 2021: 3615-3619
[i31]
  dblp key:
  - journals/corr/abs-2103-05149
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-05149
Alex Xiao, Christian Fuegen, Abdelrahman Mohamed:
Contrastive Semi-supervised Learning for ASR. CoRR abs/2103.05149 (2021)
[i30]
  dblp key:
  - journals/corr/abs-2104-00355
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-00355
Adam Polyak, Yossi Adi, Jade Copet, Eugene Kharitonov, Kushal Lakhotia, Wei-Ning Hsu, Abdelrahman Mohamed, Emmanuel Dupoux:
Speech Resynthesis from Discrete Disentangled Self-Supervised Representations. CoRR abs/2104.00355 (2021)
[i29]
  dblp key:
  - journals/corr/abs-2105-01051
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2105-01051
Shu-Wen Yang, Po-Han Chi, Yung-Sung Chuang, Cheng-I Jeff Lai, Kushal Lakhotia, Yist Y. Lin, Andy T. Liu, Jiatong Shi, Xuankai Chang, Guan-Ting Lin, Tzu-Hsien Huang, Wei-Cheng Tseng, Ko-tik Lee, Da-Rong Liu, Zili Huang, Shuyan Dong, Shang-Wen Li, Shinji Watanabe, Abdelrahman Mohamed, Hung-yi Lee:
SUPERB: Speech processing Universal PERformance Benchmark. CoRR abs/2105.01051 (2021)
[i28]
  dblp key:
  - journals/corr/abs-2106-07447
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-07447
Wei-Ning Hsu, Benjamin Bolte, Yao-Hung Hubert Tsai, Kushal Lakhotia, Ruslan Salakhutdinov, Abdelrahman Mohamed:
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units. CoRR abs/2106.07447 (2021)
[i27]
  dblp key:
  - journals/corr/abs-2106-07759
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-07759
Vimal Manohar, Tatiana Likhomanenko, Qiantong Xu, Wei-Ning Hsu, Ronan Collobert, Yatharth Saraf, Geoffrey Zweig, Abdelrahman Mohamed:
Kaizen: Continuously improving teacher using Exponential Moving Average for semi-supervised speech recognition. CoRR abs/2106.07759 (2021)
[i26]
  dblp key:
  - journals/corr/abs-2109-03264
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2109-03264
Eugene Kharitonov, Ann Lee, Adam Polyak, Yossi Adi, Jade Copet, Kushal Lakhotia, Tu Anh Nguyen, Morgane Rivière, Abdelrahman Mohamed, Emmanuel Dupoux, Wei-Ning Hsu:
Text-Free Prosody-Aware Generative Spoken Language Modeling. CoRR abs/2109.03264 (2021)
[i25]
  dblp key:
  - journals/corr/abs-2111-05948
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-05948
Alex Xiao, Weiyi Zheng, Gil Keren, Duc Le, Frank Zhang, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Abdelrahman Mohamed:
Scaling ASR Improves Zero and Few Shot Learning. CoRR abs/2111.05948 (2021)
[i24]
  dblp key:
  - journals/corr/abs-2111-07402
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2111-07402
Felix Kreuk, Adam Polyak, Jade Copet, Eugene Kharitonov, Tu Anh Nguyen, Morgane Rivière, Wei-Ning Hsu, Abdelrahman Mohamed, Emmanuel Dupoux, Yossi Adi:
Textless Speech Emotion Conversion using Decomposed and Discrete Representations. CoRR abs/2111.07402 (2021)
2020
[c32]
  dblp key:
  - conf/acl/LewisLGGMLSZ20
  persistent URL:
  - https://dblp.org/rec/conf/acl/LewisLGGMLSZ20
Mike Lewis, Yinhan Liu, Naman Goyal, Marjan Ghazvininejad, Abdelrahman Mohamed, Omer Levy, Veselin Stoyanov, Luke Zettlemoyer:
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension. ACL 2020: 7871-7880
[c31]
  dblp key:
  - conf/icassp/WangMLLXMHTZZFZ20
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WangMLLXMHTZZFZ20
Yongqiang Wang, Abdelrahman Mohamed, Duc Le, Chunxi Liu, Alex Xiao, Jay Mahadeokar, Hongzhao Huang, Andros Tjandra, Xiaohui Zhang, Frank Zhang, Christian Fuegen, Geoffrey Zweig, Michael L. Seltzer:
Transformer-Based Acoustic Modeling for Hybrid Speech Recognition. ICASSP 2020: 6874-6878
[c30]
  dblp key:
  - conf/icassp/KahnRZKXMKLCFLS20
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KahnRZKXMKLCFLS20
Jacob Kahn, Morgane Rivière, Weiyi Zheng, Evgeny Kharitonov, Qiantong Xu, Pierre-Emmanuel Mazaré, Julien Karadayi, Vitaliy Liptchinsky, Ronan Collobert, Christian Fuegen, Tatiana Likhomanenko, Gabriel Synnaeve, Armand Joulin, Abdelrahman Mohamed, Emmanuel Dupoux:
Libri-Light: A Benchmark for ASR with Limited or No Supervision. ICASSP 2020: 7669-7673
[c29]
  dblp key:
  - conf/icassp/BaevskiM20
  persistent URL:
  - https://dblp.org/rec/conf/icassp/BaevskiM20
Alexei Baevski, Abdelrahman Mohamed:
Effectiveness of Self-Supervised Pre-Training for ASR. ICASSP 2020: 7694-7698
[c28]
  dblp key:
  - conf/icassp/SinghOLWZGEPSZM20
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SinghOLWZGEPSZM20
Kritika Singh, Dmytro Okhonko, Jun Liu, Yongqiang Wang, Frank Zhang, Ross B. Girshick, Sergey Edunov, Fuchun Peng, Yatharth Saraf, Geoffrey Zweig, Abdelrahman Mohamed:
Training ASR Models By Generation of Contextual Information. ICASSP 2020: 7864-7868
[c27]
  dblp key:
  - conf/interspeech/SinghMXEGLFSZM20
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SinghMXEGLFSZM20
Kritika Singh, Vimal Manohar, Alex Xiao, Sergey Edunov, Ross B. Girshick, Vitaliy Liptchinsky, Christian Fuegen, Yatharth Saraf, Geoffrey Zweig, Abdelrahman Mohamed:
Large Scale Weakly and Semi-Supervised Learning for Low-Resource Video ASR. INTERSPEECH 2020: 3770-3774
[c26]
  dblp key:
  - conf/nips/BaevskiZMA20
  persistent URL:
  - https://dblp.org/rec/conf/nips/BaevskiZMA20
Alexei Baevski, Yuhao Zhou, Abdelrahman Mohamed, Michael Auli:
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations. NeurIPS 2020
[i23]
  dblp key:
  - journals/corr/abs-2005-07850
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-07850
Kritika Singh, Vimal Manohar, Alex Xiao, Sergey Edunov, Ross B. Girshick, Vitaliy Liptchinsky, Christian Fuegen, Yatharth Saraf, Geoffrey Zweig, Abdelrahman Mohamed:
Large scale weakly and semi-supervised learning for low-resource video ASR. CoRR abs/2005.07850 (2020)
[i22]
  dblp key:
  - journals/corr/abs-2006-11477
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-11477
Alexei Baevski, Henry Zhou, Abdelrahman Mohamed, Michael Auli:
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations. CoRR abs/2006.11477 (2020)
[i21]
  dblp key:
  - journals/corr/abs-2006-13979
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-13979
Alexis Conneau, Alexei Baevski, Ronan Collobert, Abdelrahman Mohamed, Michael Auli:
Unsupervised Cross-lingual Representation Learning for Speech Recognition. CoRR abs/2006.13979 (2020)

2010 – 2019

2019
[i20]
  dblp key:
  - journals/corr/abs-1904-11660
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-11660
Abdelrahman Mohamed, Dmytro Okhonko, Luke Zettlemoyer:
Transformers with convolutional context for ASR. CoRR abs/1904.11660 (2019)
[i19]
  dblp key:
  - journals/corr/abs-1910-09799
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-09799
Yongqiang Wang, Abdelrahman Mohamed, Duc Le, Chunxi Liu, Alex Xiao, Jay Mahadeokar, Hongzhao Huang, Andros Tjandra, Xiaohui Zhang, Frank Zhang, Christian Fuegen, Geoffrey Zweig, Michael L. Seltzer:
Transformer-based Acoustic Modeling for Hybrid Speech Recognition. CoRR abs/1910.09799 (2019)
[i18]
  dblp key:
  - journals/corr/abs-1910-12367
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-12367
Kritika Singh, Dmytro Okhonko, Jun Liu, Yongqiang Wang, Frank Zhang, Ross B. Girshick, Sergey Edunov, Fuchun Peng, Yatharth Saraf, Geoffrey Zweig, Abdelrahman Mohamed:
Training ASR models by Generation of Contextual Information. CoRR abs/1910.12367 (2019)
[i17]
  dblp key:
  - journals/corr/abs-1910-13461
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1910-13461
Mike Lewis, Yinhan Liu, Naman Goyal, Marjan Ghazvininejad, Abdelrahman Mohamed, Omer Levy, Veselin Stoyanov, Luke Zettlemoyer:
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension. CoRR abs/1910.13461 (2019)
[i16]
  dblp key:
  - journals/corr/abs-1911-03782
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-03782
Siddharth Dalmia, Abdelrahman Mohamed, Mike Lewis, Florian Metze, Luke Zettlemoyer:
Enforcing Encoder-Decoder Modularity in Sequence-to-Sequence Models. CoRR abs/1911.03782 (2019)
[i15]
  dblp key:
  - journals/corr/abs-1911-03912
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-03912
Alexei Baevski, Michael Auli, Abdelrahman Mohamed:
Effectiveness of self-supervised pre-training for speech recognition. CoRR abs/1911.03912 (2019)
[i14]
  dblp key:
  - journals/corr/abs-1912-07875
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1912-07875
Jacob Kahn, Morgane Rivière, Weiyi Zheng, Evgeny Kharitonov, Qiantong Xu, Pierre-Emmanuel Mazaré, Julien Karadayi, Vitaliy Liptchinsky, Ronan Collobert, Christian Fuegen, Tatiana Likhomanenko, Gabriel Synnaeve, Armand Joulin, Abdelrahman Mohamed, Emmanuel Dupoux:
Libri-Light: A Benchmark for ASR with Limited or No Supervision. CoRR abs/1912.07875 (2019)
2018
[j4]
  dblp key:
  - journals/ral/KhalilMSMTMHHKM18
  persistent URL:
  - https://dblp.org/rec/journals/ral/KhalilMSMTMHHKM18
Islam S. M. Khalil, Dalia Mahdy, Ahmed El Sharkawy, Ramez R. Moustafa, Ahmet Fatih Tabak, Mohamed E. Mitwally, Sarah Hesham, Nabila Hamdi, Anke Klingner, Abdelrahman Mohamed, Metin Sitti:
Mechanical Rubbing of Blood Clots Using Helical Robots Under Ultrasound Guidance. IEEE Robotics Autom. Lett. 3(2): 1112-1119 (2018)
[c25]
  dblp key:
  - conf/slt/FakoorKSWMS18
  persistent URL:
  - https://dblp.org/rec/conf/slt/FakoorKSWMS18
Rasool Fakoor, Amanjit Kainth, Siamak Shakeri, Christopher Winestock, Abdel-rahman Mohamed, Ruhi Sarikaya:
Direct Optimization of F-Measure for Retrieval-Based Personal Question Answering. SLT 2018: 815-822
[i13]
  dblp key:
  - journals/corr/abs-1810-00679
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1810-00679
Rasool Fakoor, Amanjit Kainth, Siamak Shakeri, Christopher Winestock, Abdel-rahman Mohamed, Ruhi Sarikaya:
Direct optimization of F-measure for retrieval-based personal question answering. CoRR abs/1810.00679 (2018)
[i12]
  dblp key:
  - journals/corr/abs-1810-12464
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1810-12464
Thomas Powers, Rasool Fakoor, Siamak Shakeri, Abhinav Sethy, Amanjit Kainth, Abdel-rahman Mohamed, Ruhi Sarikaya:
Differentiable Greedy Networks. CoRR abs/1810.12464 (2018)
2017
[c24]
  dblp key:
  - conf/iclr/ParisottoMS0ZK17
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ParisottoMS0ZK17
Emilio Parisotto, Abdel-rahman Mohamed, Rishabh Singh, Lihong Li, Dengyong Zhou, Pushmeet Kohli:
Neuro-Symbolic Program Synthesis. ICLR (Poster) 2017
[c23]
  dblp key:
  - conf/iclr/UrbanGKAWMPRC17
  persistent URL:
  - https://dblp.org/rec/conf/iclr/UrbanGKAWMPRC17
Gregor Urban, Krzysztof J. Geras, Samira Ebrahimi Kahou, Özlem Aslan, Shengjie Wang, Abdelrahman Mohamed, Matthai Philipose, Matthew Richardson, Rich Caruana:
Do Deep Convolutional Nets Really Need to be Deep and Convolutional? ICLR (Poster) 2017
[c22]
  dblp key:
  - conf/icml/DevlinUBSMK17
  persistent URL:
  - https://dblp.org/rec/conf/icml/DevlinUBSMK17
Jacob Devlin, Jonathan Uesato, Surya Bhupatiraju, Rishabh Singh, Abdel-rahman Mohamed, Pushmeet Kohli:
RobustFill: Neural Program Learning under Noisy I/O. ICML 2017: 990-998
[c21]
  dblp key:
  - conf/icml/WangWHMZD17
  persistent URL:
  - https://dblp.org/rec/conf/icml/WangWHMZD17
Chong Wang, Yining Wang, Po-Sen Huang, Abdelrahman Mohamed, Dengyong Zhou, Li Deng:
Sequence Modeling via Segmentations. ICML 2017: 3674-3683
[i11]
  dblp key:
  - journals/corr/WangWHMZD17
  persistent URL:
  - https://dblp.org/rec/journals/corr/WangWHMZD17
Chong Wang, Yining Wang, Po-Sen Huang, Abdelrahman Mohamed, Dengyong Zhou, Li Deng:
Sequence Modeling via Segmentations. CoRR abs/1702.07463 (2017)
[i10]
  dblp key:
  - journals/corr/DevlinUBSMK17
  persistent URL:
  - https://dblp.org/rec/journals/corr/DevlinUBSMK17
Jacob Devlin, Jonathan Uesato, Surya Bhupatiraju, Rishabh Singh, Abdel-rahman Mohamed, Pushmeet Kohli:
RobustFill: Neural Program Learning under Noisy I/O. CoRR abs/1703.07469 (2017)
[i9]
  dblp key:
  - journals/corr/BhupatirajuSMK17
  persistent URL:
  - https://dblp.org/rec/journals/corr/BhupatirajuSMK17
Surya Bhupatiraju, Rishabh Singh, Abdel-rahman Mohamed, Pushmeet Kohli:
Deep API Programmer: Learning to Program with APIs. CoRR abs/1704.04327 (2017)
[i8]
  dblp key:
  - journals/corr/abs-1709-00503
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1709-00503
Kavosh Asadi, Cameron Allen, Melrose Roderick, Abdel-rahman Mohamed, George Dimitri Konidaris, Michael L. Littman:
Mean Actor Critic. CoRR abs/1709.00503 (2017)
2016
[c20]
  dblp key:
  - conf/icassp/LiMZG16
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiMZG16
Jinyu Li, Abdelrahman Mohamed, Geoffrey Zweig, Yifan Gong:
Exploring multidimensional LSTMs for large vocabulary ASR. ICASSP 2016: 4940-4944
[c19]
  dblp key:
  - conf/icml/WangMCBPRGUA16
  persistent URL:
  - https://dblp.org/rec/conf/icml/WangMCBPRGUA16
Shengjie Wang, Abdel-rahman Mohamed, Rich Caruana, Jeff A. Bilmes, Matthai Philipose, Matthew Richardson, Krzysztof J. Geras, Gregor Urban, Özlem Aslan:
Analysis of Deep Neural Networks with Extended Data Jacobian Matrix. ICML 2016: 718-726
[i7]
  dblp key:
  - conf/tac/NogueiraKMAKM16
  persistent URL:
  - https://dblp.org/rec/conf/tac/NogueiraKMAKM16
Rodrigo Nogueira, Taesup Kim, Abdel-rahman Mohamed, Ahmed Hassan Awadallah, Pushmeet Kohli, Ahmed Mohamed:
MSR System Description - TAC 2016 KBP Cold Start Slot Filling Track. TAC 2016
[i6]
  dblp key:
  - journals/corr/UrbanGKAWCMPR16
  persistent URL:
  - https://dblp.org/rec/journals/corr/UrbanGKAWCMPR16
Gregor Urban, Krzysztof J. Geras, Samira Ebrahimi Kahou, Özlem Aslan, Shengjie Wang, Rich Caruana, Abdelrahman Mohamed, Matthai Philipose, Matthew Richardson:
Do Deep Convolutional Nets Really Need to be Deep (Or Even Convolutional)? CoRR abs/1603.05691 (2016)
[i5]
  dblp key:
  - journals/corr/ParisottoMSLZK16
  persistent URL:
  - https://dblp.org/rec/journals/corr/ParisottoMSLZK16
Emilio Parisotto, Abdel-rahman Mohamed, Rishabh Singh, Lihong Li, Dengyong Zhou, Pushmeet Kohli:
Neuro-Symbolic Program Synthesis. CoRR abs/1611.01855 (2016)
[i4]
  dblp key:
  - journals/corr/FakoorMMKK16
  persistent URL:
  - https://dblp.org/rec/journals/corr/FakoorMMKK16
Rasool Fakoor, Abdel-rahman Mohamed, Margaret Mitchell, Sing Bing Kang, Pushmeet Kohli:
Memory-augmented Attention Modelling for Videos. CoRR abs/1611.02261 (2016)
2015
[j3]
  dblp key:
  - journals/nn/SainathKSSMDR15
  persistent URL:
  - https://dblp.org/rec/journals/nn/SainathKSSMDR15
Tara N. Sainath, Brian Kingsbury, George Saon, Hagen Soltau, Abdel-rahman Mohamed, George E. Dahl, Bhuvana Ramabhadran:
Deep Convolutional Neural Networks for Large-scale Speech Tasks. Neural Networks 64: 39-48 (2015)
[c18]
  dblp key:
  - conf/acl/WangMH15
  persistent URL:
  - https://dblp.org/rec/conf/acl/WangMH15
Tong Wang, Abdelrahman Mohamed, Graeme Hirst:
Learning Lexical Embeddings with Syntactic and Lexicographic Knowledge. ACL (2) 2015: 458-463
[c17]
  dblp key:
  - conf/asru/MohamedSYDSZP15
  persistent URL:
  - https://dblp.org/rec/conf/asru/MohamedSYDSZP15
Abdel-rahman Mohamed, Frank Seide, Dong Yu, Jasha Droppo, Andreas Stolcke, Geoffrey Zweig, Gerald Penn:
Deep bi-directional recurrent networks over spectral windows. ASRU 2015: 78-83
[c16]
  dblp key:
  - conf/asru/LiMZG15
  persistent URL:
  - https://dblp.org/rec/conf/asru/LiMZG15
Jinyu Li, Abdelrahman Mohamed, Geoffrey Zweig, Yifan Gong:
LSTM time and frequency recurrence for automatic speech recognition. ASRU 2015: 187-191
[i3]
  dblp key:
  - journals/corr/GerasMCUWAPRS15
  persistent URL:
  - https://dblp.org/rec/journals/corr/GerasMCUWAPRS15
Krzysztof J. Geras, Abdel-rahman Mohamed, Rich Caruana, Gregor Urban, Shengjie Wang, Özlem Aslan, Matthai Philipose, Matthew Richardson, Charles Sutton:
Compressing LSTMs into CNNs. CoRR abs/1511.06433 (2015)
2014
[b1]
  dblp key:
  - phd/ca/Mohamed14
  persistent URL:
  - https://dblp.org/rec/phd/ca/Mohamed14
Abdel-rahman Mohamed:
Deep Neural Network Acoustic Models for ASR. University of Toronto, Canada, 2014
[j2]
  dblp key:
  - journals/taslp/Abdel-HamidMJDPY14
  persistent URL:
  - https://dblp.org/rec/journals/taslp/Abdel-HamidMJDPY14
Ossama Abdel-Hamid, Abdel-rahman Mohamed, Hui Jiang, Li Deng, Gerald Penn, Dong Yu:
Convolutional Neural Networks for Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 22(10): 1533-1545 (2014)
[c15]
  dblp key:
  - conf/icassp/SainathKMSR14
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SainathKMSR14
Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, George Saon, Bhuvana Ramabhadran:
Improvements to filterbank and delta learning within a deep neural network framework. ICASSP 2014: 6839-6843
2013
[c14]
  dblp key:
  - conf/asru/GravesJM13
  persistent URL:
  - https://dblp.org/rec/conf/asru/GravesJM13
Alex Graves, Navdeep Jaitly, Abdel-rahman Mohamed:
Hybrid speech recognition with Deep Bidirectional LSTM. ASRU 2013: 273-278
[c13]
  dblp key:
  - conf/asru/SainathKMR13
  persistent URL:
  - https://dblp.org/rec/conf/asru/SainathKMR13
Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, Bhuvana Ramabhadran:
Learning filter banks within a deep neural network framework. ASRU 2013: 297-302
[c12]
  dblp key:
  - conf/asru/SainathKMDSSBAR13
  persistent URL:
  - https://dblp.org/rec/conf/asru/SainathKMDSSBAR13
Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, George E. Dahl, George Saon, Hagen Soltau, Tomás Beran, Aleksandr Y. Aravkin, Bhuvana Ramabhadran:
Improvements to Deep Convolutional Neural Networks for LVCSR. ASRU 2013: 315-320
[c11]
  dblp key:
  - conf/icassp/GravesMH13
  persistent URL:
  - https://dblp.org/rec/conf/icassp/GravesMH13
Alex Graves, Abdel-rahman Mohamed, Geoffrey E. Hinton:
Speech recognition with deep recurrent neural networks. ICASSP 2013: 6645-6649
[c10]
  dblp key:
  - conf/icassp/SainathMKR13
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SainathMKR13
Tara N. Sainath, Abdel-rahman Mohamed, Brian Kingsbury, Bhuvana Ramabhadran:
Deep convolutional neural networks for LVCSR. ICASSP 2013: 8614-8618
[i2]
  dblp key:
  - journals/corr/abs-1303-5778
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1303-5778
Alex Graves, Abdel-rahman Mohamed, Geoffrey E. Hinton:
Speech Recognition with Deep Recurrent Neural Networks. CoRR abs/1303.5778 (2013)
[i1]
  dblp key:
  - journals/corr/SainathKMDSSBAR13
  persistent URL:
  - https://dblp.org/rec/journals/corr/SainathKMDSSBAR13
Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, George E. Dahl, George Saon, Hagen Soltau, Tomás Beran, Aleksandr Y. Aravkin, Bhuvana Ramabhadran:
Improvements to deep convolutional neural networks for LVCSR. CoRR abs/1309.1501 (2013)
2012
[j1]
- dblp key: journals/taslp/MohamedDH12
- persistent URL: https://dblp.org/rec/journals/taslp/MohamedDH12
Abdel-rahman Mohamed, George E. Dahl, Geoffrey E. Hinton:
Acoustic Modeling Using Deep Belief Networks. IEEE Trans. Speech Audio Process. 20(1): 14-22 (2012)
[c9]
- dblp key: conf/icassp/MohamedHP12
- persistent URL: https://dblp.org/rec/conf/icassp/MohamedHP12
Abdel-rahman Mohamed, Geoffrey E. Hinton, Gerald Penn:
Understanding how Deep Belief Networks perform acoustic modelling. ICASSP 2012: 4273-4276
[c8]
- dblp key: conf/icassp/Abdel-HamidMJP12
- persistent URL: https://dblp.org/rec/conf/icassp/Abdel-HamidMJP12
Ossama Abdel-Hamid, Abdel-rahman Mohamed, Hui Jiang, Gerald Penn:
Applying Convolutional Neural Networks concepts to hybrid NN-HMM model for speech recognition. ICASSP 2012: 4277-4280
[c7]
- dblp key: journals/jmlr/TangM12
- persistent URL: https://dblp.org/rec/journals/jmlr/TangM12
Yichuan Tang, Abdel-rahman Mohamed:
Multiresolution Deep Belief Networks. AISTATS 2012: 1203-1211
2011
[c6]
- dblp key: conf/asru/SainathKRFNM11
- persistent URL: https://dblp.org/rec/conf/asru/SainathKRFNM11
Tara N. Sainath, Brian Kingsbury, Bhuvana Ramabhadran, Petr Fousek, Petr Novák, Abdel-rahman Mohamed:
Making Deep Belief Networks effective for large vocabulary continuous speech recognition. ASRU 2011: 30-35
[c5]
- dblp key: conf/icassp/MohamedSDRHP11
- persistent URL: https://dblp.org/rec/conf/icassp/MohamedSDRHP11
Abdel-rahman Mohamed, Tara N. Sainath, George E. Dahl, Bhuvana Ramabhadran, Geoffrey E. Hinton, Michael A. Picheny:
Deep Belief Networks using discriminative features for phone recognition. ICASSP 2011: 5060-5063
2010
[c4]
- dblp key: conf/icassp/MohamedH10
- persistent URL: https://dblp.org/rec/conf/icassp/MohamedH10
Abdel-rahman Mohamed, Geoffrey E. Hinton:
Phone recognition using Restricted Boltzmann Machines. ICASSP 2010: 4354-4357
[c3]
- dblp key: conf/interspeech/DengSYAMH10
- persistent URL: https://dblp.org/rec/conf/interspeech/DengSYAMH10
Li Deng, Michael L. Seltzer, Dong Yu, Alex Acero, Abdel-rahman Mohamed, Geoffrey E. Hinton:
Binary coding of speech spectrograms using a deep auto-encoder. INTERSPEECH 2010: 1692-1695
[c2]
- dblp key: conf/interspeech/MohamedYD10
- persistent URL: https://dblp.org/rec/conf/interspeech/MohamedYD10
Abdel-rahman Mohamed, Dong Yu, Li Deng:
Investigation of full-sequence training of deep belief networks for speech recognition. INTERSPEECH 2010: 2846-2849
[c1]
- dblp key: conf/nips/DahlRMH10
- persistent URL: https://dblp.org/rec/conf/nips/DahlRMH10
George E. Dahl, Marc'Aurelio Ranzato, Abdel-rahman Mohamed, Geoffrey E. Hinton:
Phone Recognition with the Mean-Covariance Restricted Boltzmann Machine. NIPS 2010: 469-477
