


Остановите войну!
for scientists:


default search action
Abdel-rahman Mohamed
Abdelrahman Mohamed
Person information

Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2023
- [i54]Ching-Feng Yeh, Wei-Ning Hsu, Paden Tomasello, Abdelrahman Mohamed:
Efficient Speech Representation Learning with Low-Bit Quantization. CoRR abs/2301.00652 (2023) - [i53]Jiatong Shi, Dan Berrebbi, William Chen, Ho-Lam Chung, En-Pei Hu, Wei-Ping Huang, Xuankai Chang, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Shinji Watanabe:
ML-SUPERB: Multilingual Speech Universal PERformance Benchmark. CoRR abs/2305.10615 (2023) - [i52]Puyuan Peng, Shang-Wen Li, Okko Räsänen, Abdelrahman Mohamed, David Harwath:
Syllable Discovery and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Mode. CoRR abs/2305.11435 (2023) - 2022
- [j8]Hung-Yi Lee, Shinji Watanabe
, Karen Livescu, Abdelrahman Mohamed, Tara N. Sainath
:
Editorial Editorial of Special Issue on Self-Supervised Learning for Speech and Audio Processing. IEEE J. Sel. Top. Signal Process. 16(6): 1174-1178 (2022) - [j7]Abdelrahman Mohamed, Hung-yi Lee
, Lasse Borgholt, Jakob D. Havtorn, Joakim Edin
, Christian Igel, Katrin Kirchhoff, Shang-Wen Li, Karen Livescu, Lars Maaløe, Tara N. Sainath
, Shinji Watanabe
:
Self-Supervised Speech Representation Learning: A Review. IEEE J. Sel. Top. Signal Process. 16(6): 1179-1210 (2022) - [j6]Robin Algayres, Tristan Ricoul, Julien Karadayi, Hugo Laurençon, Mohamed Salah Zaïem, Abdelrahman Mohamed, Benoît Sagot, Emmanuel Dupoux:
DP-Parse: Finding Word Boundaries from Raw Speech with an Instance Lexicon. Trans. Assoc. Comput. Linguistics 10: 1051-1065 (2022) - [c50]Yun Tang, Hongyu Gong, Ning Dong, Changhan Wang, Wei-Ning Hsu, Jiatao Gu, Alexei Baevski, Xian Li, Abdelrahman Mohamed, Michael Auli, Juan Miguel Pino:
Unified Speech-Text Pre-training for Speech Translation and Recognition. ACL (1) 2022: 1488-1499 - [c49]Hsiang-Sheng Tsai, Heng-Jui Chang, Wen-Chin Huang, Zili Huang, Kushal Lakhotia, Shu-Wen Yang, Shuyan Dong, Andy T. Liu, Cheng-I Lai, Jiatong Shi, Xuankai Chang, Phil Hall, Hsuan-Jui Chen, Shang-Wen Li, Shinji Watanabe, Abdelrahman Mohamed, Hung-yi Lee:
SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities. ACL (1) 2022: 8479-8492 - [c48]Eugene Kharitonov, Ann Lee, Adam Polyak, Yossi Adi, Jade Copet, Kushal Lakhotia, Tu Anh Nguyen, Morgane Rivière, Abdelrahman Mohamed, Emmanuel Dupoux, Wei-Ning Hsu:
Text-Free Prosody-Aware Generative Spoken Language Modeling. ACL (1) 2022: 8666-8681 - [c47]Felix Kreuk, Adam Polyak, Jade Copet, Eugene Kharitonov, Tu Anh Nguyen, Morgane Rivière, Wei-Ning Hsu, Abdelrahman Mohamed, Emmanuel Dupoux, Yossi Adi:
Textless Speech Emotion Conversion using Discrete & Decomposed Representations. EMNLP 2022: 11200-11214 - [c46]Bowen Shi, Wei-Ning Hsu, Kushal Lakhotia, Abdelrahman Mohamed:
Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction. ICLR 2022 - [c45]Krishna Pillutla, Kshitiz Malik, Abdelrahman Mohamed, Michael G. Rabbat, Maziar Sanjabi, Lin Xiao:
Federated Learning with Partial Model Personalization. ICML 2022: 17716-17758 - [c44]Bowen Shi, Wei-Ning Hsu, Abdelrahman Mohamed:
Robust Self-Supervised Audio-Visual Speech Recognition. INTERSPEECH 2022: 2118-2122 - [c43]Bowen Shi, Abdelrahman Mohamed, Wei-Ning Hsu:
Learning Lip-Based Audio-Visual Speaker Embeddings with AV-HuBERT. INTERSPEECH 2022: 4785-4789 - [c42]Weiyi Zheng, Alex Xiao, Gil Keren, Duc Le, Frank Zhang, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Abdelrahman Mohamed:
Scaling ASR Improves Zero and Few Shot Learning. INTERSPEECH 2022: 5135-5139 - [c41]Guan-Ting Lin, Yung-Sung Chuang, Ho-Lam Chung, Shu-Wen Yang, Hsuan-Jui Chen, Shuyan Annie Dong, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Lin-Shan Lee:
DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question Answering. INTERSPEECH 2022: 5165-5169 - [c40]Paden Tomasello, Akshat Shrivastava, Daniel Lazar, Po-Chun Hsu, Duc Le, Adithya Sagar, Ali Elkahky, Jade Copet, Wei-Ning Hsu, Yossi Adi, Robin Algayres, Tu Ahn Nguyen, Emmanuel Dupoux, Luke Zettlemoyer, Abdelrahman Mohamed:
Stop: A Dataset for Spoken Task Oriented Semantic Parsing. SLT 2022: 991-998 - [c39]Tzu-hsun Feng, Shuyan Annie Dong, Ching-Feng Yeh, Shu-Wen Yang, Tzu-Quan Lin, Jiatong Shi, Kai-Wei Chang, Zili Huang, Haibin Wu, Xuankai Chang, Shinji Watanabe
, Abdelrahman Mohamed, Shang-Wen Li, Hung-yi Lee:
Superb @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning. SLT 2022: 1096-1103 - [i51]Bowen Shi, Wei-Ning Hsu, Abdelrahman Mohamed:
Robust Self-Supervised Audio-Visual Speech Recognition. CoRR abs/2201.01763 (2022) - [i50]Bowen Shi, Wei-Ning Hsu, Kushal Lakhotia, Abdelrahman Mohamed:
Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction. CoRR abs/2201.02184 (2022) - [i49]Hashmat Shadab Malik, Ikboljon Sobirov, Abdelrahman Mohamed:
Object Detection in Aerial Images: What Improves the Accuracy? CoRR abs/2201.08763 (2022) - [i48]Eugene Kharitonov, Jade Copet, Kushal Lakhotia, Tu Anh Nguyen, Paden Tomasello, Ann Lee, Ali Elkahky, Wei-Ning Hsu, Abdelrahman Mohamed, Emmanuel Dupoux, Yossi Adi:
textless-lib: a Library for Textless Spoken Language Processing. CoRR abs/2202.07359 (2022) - [i47]Guan-Ting Lin, Yung-Sung Chuang, Ho-Lam Chung, Shu-Wen Yang, Hsuan-Jui Chen, Shuyan Dong, Shang-Wen Li, Abdelrahman Mohamed, Hung-yi Lee, Lin-Shan Lee:
DUAL: Discrete Spoken Unit Adaptive Learning for Textless Spoken Question Answering. CoRR abs/2203.04911 (2022) - [i46]Hsiang-Sheng Tsai, Heng-Jui Chang, Wen-Chin Huang, Zili Huang, Kushal Lakhotia, Shu-Wen Yang, Shuyan Dong, Andy T. Liu, Cheng-I Jeff Lai, Jiatong Shi, Xuankai Chang, Phil Hall, Hsuan-Jui Chen, Shang-Wen Li, Shinji Watanabe, Abdelrahman Mohamed, Hung-yi Lee:
SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities. CoRR abs/2203.06849 (2022) - [i45]Tu Anh Nguyen, Eugene Kharitonov, Jade Copet, Yossi Adi, Wei-Ning Hsu, Ali Elkahky, Paden Tomasello, Robin Algayres, Benoît Sagot, Abdelrahman Mohamed, Emmanuel Dupoux:
Generative Spoken Dialogue Language Modeling. CoRR abs/2203.16502 (2022) - [i44]Krishna Pillutla, Kshitiz Malik, Abdelrahman Mohamed, Michael G. Rabbat, Maziar Sanjabi, Lin Xiao:
Federated Learning with Partial Model Personalization. CoRR abs/2204.03809 (2022) - [i43]Yun Tang, Hongyu Gong, Ning Dong, Changhan Wang, Wei-Ning Hsu, Jiatao Gu, Alexei Baevski, Xian Li, Abdelrahman Mohamed, Michael Auli, Juan Miguel Pino:
Unified Speech-Text Pre-training for Speech Translation and Recognition. CoRR abs/2204.05409 (2022) - [i42]Bowen Shi, Abdelrahman Mohamed, Wei-Ning Hsu:
Learning Lip-Based Audio-Visual Speaker Embeddings with AV-HuBERT. CoRR abs/2205.07180 (2022) - [i41]Abdelrahman Mohamed, Hung-yi Lee, Lasse Borgholt, Jakob D. Havtorn, Joakim Edin, Christian Igel, Katrin Kirchhoff, Shang-Wen Li, Karen Livescu, Lars Maaløe, Tara N. Sainath, Shinji Watanabe
:
Self-Supervised Speech Representation Learning: A Review. CoRR abs/2205.10643 (2022) - [i40]Siddharth Dalmia, Dmytro Okhonko, Mike Lewis, Sergey Edunov, Shinji Watanabe
, Florian Metze, Luke Zettlemoyer, Abdelrahman Mohamed:
LegoNN: Building Modular Encoder-Decoder Models. CoRR abs/2206.03318 (2022) - [i39]Robin Algayres, Tristan Ricoul, Julien Karadayi, Hugo Laurençon, Mohamed Salah Zaïem, Abdelrahman Mohamed, Benoît Sagot, Emmanuel Dupoux:
DP-Parse: Finding Word Boundaries from Raw Speech with an Instance Lexicon. CoRR abs/2206.11332 (2022) - [i38]Paden Tomasello, Akshat Shrivastava, Daniel Lazar, Po-Chun Hsu, Duc Le, Adithya Sagar, Ali Elkahky, Jade Copet, Wei-Ning Hsu, Yossef Mordechay, Robin Algayres, Tu Anh Nguyen, Emmanuel Dupoux, Luke Zettlemoyer, Abdelrahman Mohamed:
STOP: A dataset for Spoken Task Oriented Semantic Parsing. CoRR abs/2207.10643 (2022) - [i37]Abdelrahman Mohamed, Rushali Grandhe, K. J. Joseph, Salman Khan, Fahad Shahbaz Khan:
D3Former: Debiased Dual Distilled Transformer for Incremental Learning. CoRR abs/2208.00777 (2022) - [i36]Tzu-hsun Feng, Shuyan Annie Dong, Ching-Feng Yeh, Shu-Wen Yang, Tzu-Quan Lin, Jiatong Shi, Kai-Wei Chang, Zili Huang, Haibin Wu, Xuankai Chang, Shinji Watanabe
, Abdelrahman Mohamed, Shang-Wen Li, Hung-yi Lee:
SUPERB @ SLT 2022: Challenge on Generalization and Efficiency of Self-Supervised Speech Representation Learning. CoRR abs/2210.08634 (2022) - [i35]Cheol Jun Cho, Peter Wu, Abdelrahman Mohamed, Gopala Krishna Anumanchipalli:
Evidence of Vocal Tract Articulation in Self-Supervised Learning of Speech. CoRR abs/2210.11723 (2022) - [i34]Florian L. Kreyssig, Yangyang Shi, Jinxi Guo, Leda Sari, Abdelrahman Mohamed, Philip C. Woodland:
Biased Self-supervised learning for ASR. CoRR abs/2211.02536 (2022) - [i33]Andros Tjandra, Nayan Singhal, David Zhang, Ozlem Kalinli, Abdelrahman Mohamed, Duc Le, Michael L. Seltzer:
Massively Multilingual ASR on 70 Languages: Tokenization, Architecture, and Generalization Capabilities. CoRR abs/2211.05756 (2022) - [i32]Anuj Diwan, Ching-Feng Yeh, Wei-Ning Hsu, Paden Tomasello, Eunsol Choi, David Harwath, Abdelrahman Mohamed:
Continual Learning for On-Device Speech Recognition using Disentangled Conformers. CoRR abs/2212.01393 (2022) - 2021
- [j5]Wei-Ning Hsu
, Benjamin Bolte, Yao-Hung Hubert Tsai, Kushal Lakhotia, Ruslan Salakhutdinov, Abdelrahman Mohamed:
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units. IEEE ACM Trans. Audio Speech Lang. Process. 29: 3451-3460 (2021) - [c38]Vimal Manohar, Tatiana Likhomanenko, Qiantong Xu, Wei-Ning Hsu, Ronan Collobert, Yatharth Saraf, Geoffrey Zweig, Abdelrahman Mohamed:
Kaizen: Continuously Improving Teacher Using Exponential Moving Average for Semi-Supervised Speech Recognition. ASRU 2021: 518-525 - [c37]Alex Xiao, Christian Fuegen, Abdelrahman Mohamed:
Contrastive Semi-Supervised Learning for ASR. ICASSP 2021: 3870-3874 - [c36]Wei-Ning Hsu, Yao-Hung Hubert Tsai, Benjamin Bolte, Ruslan Salakhutdinov, Abdelrahman Mohamed:
Hubert: How Much Can a Bad Teacher Benefit ASR Pre-Training? ICASSP 2021: 6533-6537 - [c35]Shu-Wen Yang, Po-Han Chi, Yung-Sung Chuang, Cheng-I Jeff Lai, Kushal Lakhotia, Yist Y. Lin, Andy T. Liu, Jiatong Shi, Xuankai Chang, Guan-Ting Lin, Tzu-Hsien Huang, Wei-Cheng Tseng, Ko-tik Lee, Da-Rong Liu, Zili Huang, Shuyan Dong, Shang-Wen Li, Shinji Watanabe
, Abdelrahman Mohamed, Hung-yi Lee:
SUPERB: Speech Processing Universal PERformance Benchmark. Interspeech 2021: 1194-1198 - [c34]Alexis Conneau, Alexei Baevski, Ronan Collobert, Abdelrahman Mohamed, Michael Auli:
Unsupervised Cross-Lingual Representation Learning for Speech Recognition. Interspeech 2021: 2426-2430 - [c33]Adam Polyak, Yossi Adi, Jade Copet, Eugene Kharitonov, Kushal Lakhotia, Wei-Ning Hsu, Abdelrahman Mohamed, Emmanuel Dupoux:
Speech Resynthesis from Discrete Disentangled Self-Supervised Representations. Interspeech 2021: 3615-3619 - [i31]Alex Xiao, Christian Fuegen, Abdelrahman Mohamed:
Contrastive Semi-supervised Learning for ASR. CoRR abs/2103.05149 (2021) - [i30]Adam Polyak, Yossi Adi, Jade Copet, Eugene Kharitonov, Kushal Lakhotia, Wei-Ning Hsu, Abdelrahman Mohamed, Emmanuel Dupoux
:
Speech Resynthesis from Discrete Disentangled Self-Supervised Representations. CoRR abs/2104.00355 (2021) - [i29]Shu-Wen Yang, Po-Han Chi, Yung-Sung Chuang, Cheng-I Jeff Lai, Kushal Lakhotia, Yist Y. Lin, Andy T. Liu, Jiatong Shi, Xuankai Chang, Guan-Ting Lin, Tzu-Hsien Huang, Wei-Cheng Tseng, Ko-tik Lee, Da-Rong Liu, Zili Huang, Shuyan Dong, Shang-Wen Li, Shinji Watanabe, Abdelrahman Mohamed, Hung-yi Lee:
SUPERB: Speech processing Universal PERformance Benchmark. CoRR abs/2105.01051 (2021) - [i28]Wei-Ning Hsu, Benjamin Bolte, Yao-Hung Hubert Tsai, Kushal Lakhotia, Ruslan Salakhutdinov, Abdelrahman Mohamed:
HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units. CoRR abs/2106.07447 (2021) - [i27]Vimal Manohar, Tatiana Likhomanenko, Qiantong Xu, Wei-Ning Hsu, Ronan Collobert, Yatharth Saraf, Geoffrey Zweig, Abdelrahman Mohamed:
Kaizen: Continuously improving teacher using Exponential Moving Average for semi-supervised speech recognition. CoRR abs/2106.07759 (2021) - [i26]Eugene Kharitonov, Ann Lee, Adam Polyak, Yossi Adi, Jade Copet, Kushal Lakhotia, Tu Anh Nguyen, Morgane Rivière, Abdelrahman Mohamed, Emmanuel Dupoux, Wei-Ning Hsu:
Text-Free Prosody-Aware Generative Spoken Language Modeling. CoRR abs/2109.03264 (2021) - [i25]Alex Xiao, Weiyi Zheng, Gil Keren, Duc Le, Frank Zhang, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Abdelrahman Mohamed:
Scaling ASR Improves Zero and Few Shot Learning. CoRR abs/2111.05948 (2021) - [i24]Felix Kreuk, Adam Polyak, Jade Copet, Eugene Kharitonov, Tu Anh Nguyen, Morgane Rivière, Wei-Ning Hsu, Abdelrahman Mohamed, Emmanuel Dupoux, Yossi Adi:
Textless Speech Emotion Conversion using Decomposed and Discrete Representations. CoRR abs/2111.07402 (2021) - 2020
- [c32]Mike Lewis, Yinhan Liu, Naman Goyal, Marjan Ghazvininejad, Abdelrahman Mohamed, Omer Levy, Veselin Stoyanov, Luke Zettlemoyer:
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension. ACL 2020: 7871-7880 - [c31]Yongqiang Wang, Abdelrahman Mohamed, Duc Le, Chunxi Liu, Alex Xiao, Jay Mahadeokar, Hongzhao Huang, Andros Tjandra, Xiaohui Zhang, Frank Zhang, Christian Fuegen, Geoffrey Zweig, Michael L. Seltzer:
Transformer-Based Acoustic Modeling for Hybrid Speech Recognition. ICASSP 2020: 6874-6878 - [c30]Jacob Kahn, Morgane Rivière, Weiyi Zheng, Evgeny Kharitonov, Qiantong Xu, Pierre-Emmanuel Mazaré, Julien Karadayi, Vitaliy Liptchinsky, Ronan Collobert, Christian Fuegen, Tatiana Likhomanenko, Gabriel Synnaeve, Armand Joulin, Abdelrahman Mohamed, Emmanuel Dupoux
:
Libri-Light: A Benchmark for ASR with Limited or No Supervision. ICASSP 2020: 7669-7673 - [c29]Alexei Baevski, Abdelrahman Mohamed:
Effectiveness of Self-Supervised Pre-Training for ASR. ICASSP 2020: 7694-7698 - [c28]Kritika Singh, Dmytro Okhonko, Jun Liu, Yongqiang Wang, Frank Zhang, Ross B. Girshick, Sergey Edunov, Fuchun Peng, Yatharth Saraf, Geoffrey Zweig, Abdelrahman Mohamed:
Training ASR Models By Generation of Contextual Information. ICASSP 2020: 7864-7868 - [c27]Kritika Singh, Vimal Manohar, Alex Xiao, Sergey Edunov, Ross B. Girshick, Vitaliy Liptchinsky, Christian Fuegen, Yatharth Saraf, Geoffrey Zweig, Abdelrahman Mohamed:
Large Scale Weakly and Semi-Supervised Learning for Low-Resource Video ASR. INTERSPEECH 2020: 3770-3774 - [c26]Alexei Baevski, Yuhao Zhou, Abdelrahman Mohamed, Michael Auli:
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations. NeurIPS 2020 - [i23]Kritika Singh, Vimal Manohar, Alex Xiao, Sergey Edunov, Ross B. Girshick, Vitaliy Liptchinsky, Christian Fuegen, Yatharth Saraf, Geoffrey Zweig, Abdelrahman Mohamed:
Large scale weakly and semi-supervised learning for low-resource video ASR. CoRR abs/2005.07850 (2020) - [i22]Alexei Baevski, Henry Zhou, Abdelrahman Mohamed, Michael Auli:
wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations. CoRR abs/2006.11477 (2020) - [i21]Alexis Conneau, Alexei Baevski, Ronan Collobert, Abdelrahman Mohamed, Michael Auli:
Unsupervised Cross-lingual Representation Learning for Speech Recognition. CoRR abs/2006.13979 (2020)
2010 – 2019
- 2019
- [i20]Abdelrahman Mohamed, Dmytro Okhonko, Luke Zettlemoyer:
Transformers with convolutional context for ASR. CoRR abs/1904.11660 (2019) - [i19]Yongqiang Wang, Abdelrahman Mohamed, Duc Le, Chunxi Liu, Alex Xiao, Jay Mahadeokar, Hongzhao Huang, Andros Tjandra, Xiaohui Zhang, Frank Zhang, Christian Fuegen, Geoffrey Zweig, Michael L. Seltzer:
Transformer-based Acoustic Modeling for Hybrid Speech Recognition. CoRR abs/1910.09799 (2019) - [i18]Kritika Singh, Dmytro Okhonko, Jun Liu, Yongqiang Wang, Frank Zhang, Ross B. Girshick, Sergey Edunov, Fuchun Peng, Yatharth Saraf, Geoffrey Zweig, Abdelrahman Mohamed:
Training ASR models by Generation of Contextual Information. CoRR abs/1910.12367 (2019) - [i17]Mike Lewis, Yinhan Liu, Naman Goyal, Marjan Ghazvininejad, Abdelrahman Mohamed, Omer Levy, Veselin Stoyanov, Luke Zettlemoyer:
BART: Denoising Sequence-to-Sequence Pre-training for Natural Language Generation, Translation, and Comprehension. CoRR abs/1910.13461 (2019) - [i16]Siddharth Dalmia, Abdelrahman Mohamed, Mike Lewis, Florian Metze, Luke Zettlemoyer:
Enforcing Encoder-Decoder Modularity in Sequence-to-Sequence Models. CoRR abs/1911.03782 (2019) - [i15]Alexei Baevski, Michael Auli, Abdelrahman Mohamed:
Effectiveness of self-supervised pre-training for speech recognition. CoRR abs/1911.03912 (2019) - [i14]Jacob Kahn, Morgane Rivière, Weiyi Zheng, Evgeny Kharitonov, Qiantong Xu, Pierre-Emmanuel Mazaré, Julien Karadayi, Vitaliy Liptchinsky, Ronan Collobert, Christian Fuegen, Tatiana Likhomanenko, Gabriel Synnaeve, Armand Joulin, Abdelrahman Mohamed, Emmanuel Dupoux:
Libri-Light: A Benchmark for ASR with Limited or No Supervision. CoRR abs/1912.07875 (2019) - 2018
- [j4]Islam S. M. Khalil
, Dalia Mahdy, Ahmed El Sharkawy, Ramez R. Moustafa, Ahmet Fatih Tabak
, Mohamed E. Mitwally, Sarah Hesham
, Nabila Hamdi, Anke Klingner
, Abdelrahman Mohamed, Metin Sitti
:
Mechanical Rubbing of Blood Clots Using Helical Robots Under Ultrasound Guidance. IEEE Robotics Autom. Lett. 3(2): 1112-1119 (2018) - [c25]Rasool Fakoor, Amanjit Kainth, Siamak Shakeri, Christopher Winestock, Abdel-rahman Mohamed, Ruhi Sarikaya:
Direct Optimization of F-Measure for Retrieval-Based Personal Question Answering. SLT 2018: 815-822 - [i13]Rasool Fakoor, Amanjit Kainth, Siamak Shakeri, Christopher Winestock, Abdel-rahman Mohamed, Ruhi Sarikaya:
Direct optimization of F-measure for retrieval-based personal question answering. CoRR abs/1810.00679 (2018) - [i12]Thomas Powers, Rasool Fakoor, Siamak Shakeri, Abhinav Sethy, Amanjit Kainth, Abdel-rahman Mohamed, Ruhi Sarikaya:
Differentiable Greedy Networks. CoRR abs/1810.12464 (2018) - 2017
- [c24]Emilio Parisotto, Abdel-rahman Mohamed, Rishabh Singh, Lihong Li, Dengyong Zhou, Pushmeet Kohli:
Neuro-Symbolic Program Synthesis. ICLR (Poster) 2017 - [c23]Gregor Urban, Krzysztof J. Geras, Samira Ebrahimi Kahou, Özlem Aslan, Shengjie Wang, Abdelrahman Mohamed, Matthai Philipose, Matthew Richardson, Rich Caruana:
Do Deep Convolutional Nets Really Need to be Deep and Convolutional? ICLR (Poster) 2017 - [c22]Jacob Devlin, Jonathan Uesato, Surya Bhupatiraju, Rishabh Singh, Abdel-rahman Mohamed, Pushmeet Kohli:
RobustFill: Neural Program Learning under Noisy I/O. ICML 2017: 990-998 - [c21]Chong Wang, Yining Wang, Po-Sen Huang, Abdelrahman Mohamed, Dengyong Zhou, Li Deng:
Sequence Modeling via Segmentations. ICML 2017: 3674-3683 - [i11]Chong Wang, Yining Wang, Po-Sen Huang, Abdelrahman Mohamed, Dengyong Zhou, Li Deng:
Sequence Modeling via Segmentations. CoRR abs/1702.07463 (2017) - [i10]Jacob Devlin, Jonathan Uesato, Surya Bhupatiraju, Rishabh Singh, Abdel-rahman Mohamed, Pushmeet Kohli:
RobustFill: Neural Program Learning under Noisy I/O. CoRR abs/1703.07469 (2017) - [i9]Surya Bhupatiraju, Rishabh Singh, Abdel-rahman Mohamed, Pushmeet Kohli:
Deep API Programmer: Learning to Program with APIs. CoRR abs/1704.04327 (2017) - [i8]Kavosh Asadi, Cameron Allen, Melrose Roderick, Abdel-rahman Mohamed, George Dimitri Konidaris, Michael L. Littman:
Mean Actor Critic. CoRR abs/1709.00503 (2017) - 2016
- [c20]Jinyu Li
, Abdelrahman Mohamed, Geoffrey Zweig, Yifan Gong:
Exploring multidimensional lstms for large vocabulary ASR. ICASSP 2016: 4940-4944 - [c19]Shengjie Wang, Abdel-rahman Mohamed, Rich Caruana, Jeff A. Bilmes, Matthai Philipose, Matthew Richardson, Krzysztof J. Geras, Gregor Urban, Özlem Aslan:
Analysis of Deep Neural Networks with Extended Data Jacobian Matrix. ICML 2016: 718-726 - [i7]Rodrigo Nogueira, Taesup Kim, Abdel-rahman Mohamed, Ahmed Hassan Awadallah, Pushmeet Kohli, Ahmed Mohamed:
MSR System Description - TAC 2016 KBP Cold Start Slof Filling Track. TAC 2016 - [i6]Gregor Urban, Krzysztof J. Geras, Samira Ebrahimi Kahou, Özlem Aslan, Shengjie Wang, Rich Caruana, Abdelrahman Mohamed, Matthai Philipose, Matthew Richardson:
Do Deep Convolutional Nets Really Need to be Deep (Or Even Convolutional)? CoRR abs/1603.05691 (2016) - [i5]Emilio Parisotto, Abdel-rahman Mohamed, Rishabh Singh, Lihong Li, Dengyong Zhou, Pushmeet Kohli:
Neuro-Symbolic Program Synthesis. CoRR abs/1611.01855 (2016) - [i4]Rasool Fakoor, Abdel-rahman Mohamed, Margaret Mitchell, Sing Bing Kang, Pushmeet Kohli:
Memory-augmented Attention Modelling for Videos. CoRR abs/1611.02261 (2016) - 2015
- [j3]Tara N. Sainath, Brian Kingsbury, George Saon, Hagen Soltau, Abdel-rahman Mohamed, George E. Dahl, Bhuvana Ramabhadran:
Deep Convolutional Neural Networks for Large-scale Speech Tasks. Neural Networks 64: 39-48 (2015) - [c18]Tong Wang, Abdelrahman Mohamed, Graeme Hirst:
Learning Lexical Embeddings with Syntactic and Lexicographic Knowledge. ACL (2) 2015: 458-463 - [c17]Abdel-rahman Mohamed, Frank Seide, Dong Yu, Jasha Droppo
, Andreas Stolcke, Geoffrey Zweig, Gerald Penn
:
Deep bi-directional recurrent networks over spectral windows. ASRU 2015: 78-83 - [c16]Jinyu Li
, Abdelrahman Mohamed, Geoffrey Zweig, Yifan Gong:
LSTM time and frequency recurrence for automatic speech recognition. ASRU 2015: 187-191 - [i3]Krzysztof J. Geras, Abdel-rahman Mohamed, Rich Caruana, Gregor Urban, Shengjie Wang, Özlem Aslan, Matthai Philipose, Matthew Richardson, Charles Sutton:
Compressing LSTMs into CNNs. CoRR abs/1511.06433 (2015) - 2014
- [b1]Abdel-rahman Mohamed:
Deep Neural Network Acoustic Models for ASR. University of Toronto, Canada, 2014 - [j2]Ossama Abdel-Hamid, Abdel-rahman Mohamed, Hui Jiang, Li Deng, Gerald Penn
, Dong Yu:
Convolutional Neural Networks for Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 22(10): 1533-1545 (2014) - [c15]Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, George Saon, Bhuvana Ramabhadran:
Improvements to filterbank and delta learning within a deep neural network framework. ICASSP 2014: 6839-6843 - 2013
- [c14]Alex Graves, Navdeep Jaitly, Abdel-rahman Mohamed:
Hybrid speech recognition with Deep Bidirectional LSTM. ASRU 2013: 273-278 - [c13]Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, Bhuvana Ramabhadran:
Learning filter banks within a deep neural network framework. ASRU 2013: 297-302 - [c12]Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, George E. Dahl, George Saon, Hagen Soltau, Tomás Beran, Aleksandr Y. Aravkin, Bhuvana Ramabhadran:
Improvements to Deep Convolutional Neural Networks for LVCSR. ASRU 2013: 315-320 - [c11]Alex Graves, Abdel-rahman Mohamed, Geoffrey E. Hinton:
Speech recognition with deep recurrent neural networks. ICASSP 2013: 6645-6649 - [c10]Tara N. Sainath, Abdel-rahman Mohamed, Brian Kingsbury, Bhuvana Ramabhadran:
Deep convolutional neural networks for LVCSR. ICASSP 2013: 8614-8618 - [i2]Alex Graves, Abdel-rahman Mohamed, Geoffrey E. Hinton:
Speech Recognition with Deep Recurrent Neural Networks. CoRR abs/1303.5778 (2013) - [i1]Tara N. Sainath, Brian Kingsbury, Abdel-rahman Mohamed, George E. Dahl, George Saon, Hagen Soltau, Tomás Beran, Aleksandr Y. Aravkin, Bhuvana Ramabhadran:
Improvements to deep convolutional neural networks for LVCSR. CoRR abs/1309.1501 (2013) - 2012
- [j1]Abdel-rahman Mohamed, George E. Dahl, Geoffrey E. Hinton:
Acoustic Modeling Using Deep Belief Networks. IEEE Trans. Speech Audio Process. 20(1): 14-22 (2012) - [c9]Abdel-rahman Mohamed, Geoffrey E. Hinton, Gerald Penn
:
Understanding how Deep Belief Networks perform acoustic modelling. ICASSP 2012: 4273-4276 - [c8]Ossama Abdel-Hamid, Abdel-rahman Mohamed, Hui Jiang, Gerald Penn
:
Applying Convolutional Neural Networks concepts to hybrid NN-HMM model for speech recognition. ICASSP 2012: 4277-4280 - [c7]Yichuan Tang, Abdel-rahman Mohamed:
Multiresolution Deep Belief Networks. AISTATS 2012: 1203-1211 - 2011
- [c6]Tara N. Sainath, Brian Kingsbury, Bhuvana Ramabhadran, Petr Fousek, Petr Novák, Abdel-rahman Mohamed:
Making Deep Belief Networks effective for large vocabulary continuous speech recognition. ASRU 2011: 30-35 - [c5]Abdel-rahman Mohamed, Tara N. Sainath, George E. Dahl, Bhuvana Ramabhadran, Geoffrey E. Hinton, Michael A. Picheny:
Deep Belief Networks using discriminative features for phone recognition. ICASSP 2011: 5060-5063 - 2010
- [c4]Abdel-rahman Mohamed, Geoffrey E. Hinton:
Phone recognition using Restricted Boltzmann Machines. ICASSP 2010: 4354-4357 - [c3]Li Deng, Michael L. Seltzer, Dong Yu, Alex Acero, Abdel-rahman Mohamed, Geoffrey E. Hinton:
Binary coding of speech spectrograms using a deep auto-encoder. INTERSPEECH 2010: 1692-1695 - [c2]Abdel-rahman Mohamed, Dong Yu, Li Deng:
Investigation of full-sequence training of deep belief networks for speech recognition. INTERSPEECH 2010: 2846-2849 - [c1]George E. Dahl, Marc'Aurelio Ranzato, Abdel-rahman Mohamed, Geoffrey E. Hinton:
Phone Recognition with the Mean-Covariance Restricted Boltzmann Machine. NIPS 2010: 469-477