Christian Fügen
Christian Fuegen
2020 – today
- 2024
- [c65]Ju Lin, Niko Moritz, Yiteng Huang, Ruiming Xie, Ming Sun, Christian Fuegen, Frank Seide:
AGADIR: Towards Array-Geometry Agnostic Directional Speech Recognition. ICASSP 2024: 11951-11955 - [c64]Egor Lakomkin, Chunyang Wu, Yassir Fathullah, Ozlem Kalinli, Michael L. Seltzer, Christian Fuegen:
End-to-End Speech Recognition Contextualization with Large Language Models. ICASSP 2024: 12406-12410 - [c63]Jinxi Guo, Niko Moritz, Yingyi Ma, Frank Seide, Chunyang Wu, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Effective Internal Language Model Training and Fusion for Factorized Transducer Model. ICASSP 2024: 12687-12691 - [c62]Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Ke Li, Jinxi Guo, Wenhan Xiong, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Prompting Large Language Models with Speech Recognition Abilities. ICASSP 2024: 13351-13355 - [c61]Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Ke Li, Junteng Jia, Yuan Shangguan, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
AudioChatLlama: Towards General-Purpose Speech Abilities for LLMs. NAACL-HLT 2024: 5522-5532 - [i35]Ju Lin, Niko Moritz, Yiteng Huang, Ruiming Xie, Ming Sun, Christian Fuegen, Frank Seide:
AGADIR: Towards Array-Geometry Agnostic Directional Speech Recognition. CoRR abs/2401.10411 (2024) - [i34]Jinxi Guo, Niko Moritz, Yingyi Ma, Frank Seide, Chunyang Wu, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Effective internal language model training and fusion for factorized transducer model. CoRR abs/2404.01716 (2024) - 2023
- [c60]Xubo Liu, Egor Lakomkin, Konstantinos Vougioukas, Pingchuan Ma, Honglie Chen, Ruiming Xie, Morrie Doulaty, Niko Moritz, Jáchym Kolár, Stavros Petridis, Maja Pantic, Christian Fuegen:
SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision. CVPR 2023: 18806-18815 - [c59]Pingchuan Ma, Niko Moritz, Stavros Petridis, Christian Fuegen, Maja Pantic:
Streaming Audio-Visual Speech Recognition with Alignment Regularization. INTERSPEECH 2023: 1598-1602 - [c58]Ju Lin, Niko Moritz, Ruiming Xie, Kaustubh Kalgaonkar, Christian Fuegen, Frank Seide:
Directional Speech Recognition for Speaker Disambiguation and Cross-talk Suppression. INTERSPEECH 2023: 3522-3526 - [i33]Xubo Liu, Egor Lakomkin, Konstantinos Vougioukas, Pingchuan Ma, Honglie Chen, Ruiming Xie, Morrie Doulaty, Niko Moritz, Jáchym Kolár, Stavros Petridis, Maja Pantic, Christian Fuegen:
SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision. CoRR abs/2303.17200 (2023) - [i32]Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Ke Li, Jinxi Guo, Wenhan Xiong, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Prompting Large Language Models with Speech Recognition Abilities. CoRR abs/2307.11795 (2023) - [i31]Egor Lakomkin, Chunyang Wu, Yassir Fathullah, Ozlem Kalinli, Michael L. Seltzer, Christian Fuegen:
End-to-End Speech Recognition Contextualization with Large Language Models. CoRR abs/2309.10917 (2023) - [i30]Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Towards General-Purpose Speech Abilities for Large Language Models Using Unpaired Data. CoRR abs/2311.06753 (2023) - 2022
- [c57]Kristen Grauman, Andrew Westbury, Eugene Byrne, Zachary Chavis, Antonino Furnari, Rohit Girdhar, Jackson Hamburger, Hao Jiang, Miao Liu, Xingyu Liu, Miguel Martin, Tushar Nagarajan, Ilija Radosavovic, Santhosh Kumar Ramakrishnan, Fiona Ryan, Jayant Sharma, Michael Wray, Mengmeng Xu, Eric Zhongcong Xu, Chen Zhao, Siddhant Bansal, Dhruv Batra, Vincent Cartillier, Sean Crane, Tien Do, Morrie Doulaty, Akshay Erapalli, Christoph Feichtenhofer, Adriano Fragomeni, Qichen Fu, Abrham Gebreselasie, Cristina González, James Hillis, Xuhua Huang, Yifei Huang, Wenqi Jia, Weslie Khoo, Jáchym Kolár, Satwik Kottur, Anurag Kumar, Federico Landini, Chao Li, Yanghao Li, Zhenqiang Li, Karttikeya Mangalam, Raghava Modhugu, Jonathan Munro, Tullie Murrell, Takumi Nishiyasu, Will Price, Paola Ruiz Puentes, Merey Ramazanova, Leda Sari, Kiran Somasundaram, Audrey Southerland, Yusuke Sugano, Ruijie Tao, Minh Vo, Yuchen Wang, Xindi Wu, Takuma Yagi, Ziwei Zhao, Yunyi Zhu, Pablo Arbeláez, David Crandall, Dima Damen, Giovanni Maria Farinella, Christian Fuegen, Bernard Ghanem, Vamsi Krishna Ithapu, C. V. Jawahar, Hanbyul Joo, Kris Kitani, Haizhou Li, Richard A. Newcombe, Aude Oliva, Hyun Soo Park, James M. Rehg, Yoichi Sato, Jianbo Shi, Mike Zheng Shou, Antonio Torralba, Lorenzo Torresani, Mingfei Yan, Jitendra Malik:
Ego4D: Around the World in 3,000 Hours of Egocentric Video. CVPR 2022: 18973-18990 - [c56]Suyoun Kim, Duc Le, Weiyi Zheng, Tarun Singh, Abhinav Arora, Xiaoyu Zhai, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric. INTERSPEECH 2022: 3978-3982 - [c55]Weiyi Zheng, Alex Xiao, Gil Keren, Duc Le, Frank Zhang, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Abdelrahman Mohamed:
Scaling ASR Improves Zero and Few Shot Learning. INTERSPEECH 2022: 5135-5139 - [c54]Niko Moritz, Frank Seide, Duc Le, Jay Mahadeokar, Christian Fuegen:
An Investigation of Monotonic Transducers for Large-Scale Automatic Speech Recognition. SLT 2022: 324-330 - [i29]Niko Moritz, Frank Seide, Duc Le, Jay Mahadeokar, Christian Fuegen:
An Investigation of Monotonic Transducers for Large-Scale Automatic Speech Recognition. CoRR abs/2204.08858 (2022) - [i28]Pingchuan Ma, Niko Moritz, Stavros Petridis, Christian Fuegen, Maja Pantic:
Streaming Audio-Visual Speech Recognition with Alignment Regularization. CoRR abs/2211.02133 (2022) - 2021
- [c53]Alex Xiao, Christian Fuegen, Abdelrahman Mohamed:
Contrastive Semi-Supervised Learning for ASR. ICASSP 2021: 3870-3874 - [c52]Ju Lin, Yun Wang, Kaustubh Kalgaonkar, Gil Keren, Didi Zhang, Christian Fuegen:
A Time-Domain Convolutional Recurrent Network for Packet Loss Concealment. ICASSP 2021: 7148-7152 - [c51]Suyoun Kim, Yuan Shangguan, Jay Mahadeokar, Antoine Bruguier, Christian Fuegen, Michael L. Seltzer, Duc Le:
Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer. ICASSP 2021: 7333-7337 - [c50]Ganesh Venkatesh, Alagappan Valliappan, Jay Mahadeokar, Yuan Shangguan, Christian Fuegen, Michael L. Seltzer, Vikas Chandra:
Memory-Efficient Speech Recognition on Smart Devices. ICASSP 2021: 8368-8372 - [c49]Chunyang Wu, Zhiping Xiu, Yangyang Shi, Ozlem Kalinli, Christian Fuegen, Thilo Köhler, Qing He:
Transformer-Based Acoustic Modeling for Streaming Speech Synthesis. Interspeech 2021: 146-150 - [c48]Anurag Kumar, Yun Wang, Vamsi Krishna Ithapu, Christian Fuegen:
Do Sound Event Representations Generalize to Other Audio Tasks? A Case Study in Audio Transfer Learning. Interspeech 2021: 1214-1218 - [c47]Ju Lin, Yun Wang, Kaustubh Kalgaonkar, Gil Keren, Didi Zhang, Christian Fuegen:
A Two-Stage Approach to Speech Bandwidth Extension. Interspeech 2021: 1689-1693 - [c46]Duc Le, Mahaveer Jain, Gil Keren, Suyoun Kim, Yangyang Shi, Jay Mahadeokar, Julian Chan, Yuan Shangguan, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Michael L. Seltzer:
Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion. Interspeech 2021: 1772-1776 - [c45]Suyoun Kim, Abhinav Arora, Duc Le, Ching-Feng Yeh, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding. Interspeech 2021: 1977-1981 - [c44]Yangyang Shi, Varun Nagaraja, Chunyang Wu, Jay Mahadeokar, Duc Le, Rohit Prabhavalkar, Alex Xiao, Ching-Feng Yeh, Julian Chan, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Dynamic Encoder Transducer: A Flexible Solution for Trading Off Accuracy for Latency. Interspeech 2021: 2042-2046 - [c43]Jay Mahadeokar, Yangyang Shi, Yuan Shangguan, Chunyang Wu, Alex Xiao, Hang Su, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Flexi-Transducer: Optimizing Latency, Accuracy and Compute for Multi-Domain On-Device Scenarios. Interspeech 2021: 2107-2111 - [c42]Yuan Shangguan, Rohit Prabhavalkar, Hang Su, Jay Mahadeokar, Yangyang Shi, Jiatong Zhou, Chunyang Wu, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Dissecting User-Perceived Latency of On-Device E2E Speech Recognition. Interspeech 2021: 4553-4557 - [c41]Jay Mahadeokar, Yuan Shangguan, Duc Le, Gil Keren, Hang Su, Thong Le, Ching-Feng Yeh, Christian Fuegen, Michael L. Seltzer:
Alignment Restricted Streaming Recurrent Neural Network Transducer. SLT 2021: 52-59 - [c40]Duc Le, Gil Keren, Julian Chan, Jay Mahadeokar, Christian Fuegen, Michael L. Seltzer:
Deep Shallow Fusion for RNN-T Personalization. SLT 2021: 251-257 - [i27]Ganesh Venkatesh, Alagappan Valliappan, Jay Mahadeokar, Yuan Shangguan, Christian Fuegen, Michael L. Seltzer, Vikas Chandra:
Memory-efficient Speech Recognition on Smart Devices. CoRR abs/2102.11531 (2021) - [i26]Alex Xiao, Christian Fuegen, Abdelrahman Mohamed:
Contrastive Semi-supervised Learning for ASR. CoRR abs/2103.05149 (2021) - [i25]Suyoun Kim, Abhinav Arora, Duc Le, Ching-Feng Yeh, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding. CoRR abs/2104.02138 (2021) - [i24]Yangyang Shi, Varun Nagaraja, Chunyang Wu, Jay Mahadeokar, Duc Le, Rohit Prabhavalkar, Alex Xiao, Ching-Feng Yeh, Julian Chan, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency. CoRR abs/2104.02176 (2021) - [i23]Duc Le, Mahaveer Jain, Gil Keren, Suyoun Kim, Yangyang Shi, Jay Mahadeokar, Julian Chan, Yuan Shangguan, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Michael L. Seltzer:
Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion. CoRR abs/2104.02194 (2021) - [i22]Yuan Shangguan, Rohit Prabhavalkar, Hang Su, Jay Mahadeokar, Yangyang Shi, Jiatong Zhou, Chunyang Wu, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Dissecting User-Perceived Latency of On-Device E2E Speech Recognition. CoRR abs/2104.02207 (2021) - [i21]Jay Mahadeokar, Yangyang Shi, Yuan Shangguan, Chunyang Wu, Alex Xiao, Hang Su, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Flexi-Transducer: Optimizing Latency, Accuracy and Compute for Multi-Domain On-Device Scenarios. CoRR abs/2104.02232 (2021) - [i20]Anurag Kumar, Yun Wang, Vamsi Krishna Ithapu, Christian Fuegen:
Do sound event representations generalize to other audio tasks? A case study in audio transfer learning. CoRR abs/2106.11335 (2021) - [i19]Suyoun Kim, Duc Le, Weiyi Zheng, Tarun Singh, Abhinav Arora, Xiaoyu Zhai, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric. CoRR abs/2110.05376 (2021) - [i18]Kristen Grauman, Andrew Westbury, Eugene Byrne, Zachary Chavis, Antonino Furnari, Rohit Girdhar, Jackson Hamburger, Hao Jiang, Miao Liu, Xingyu Liu, Miguel Martin, Tushar Nagarajan, Ilija Radosavovic, Santhosh Kumar Ramakrishnan, Fiona Ryan, Jayant Sharma, Michael Wray, Mengmeng Xu, Eric Zhongcong Xu, Chen Zhao, Siddhant Bansal, Dhruv Batra, Vincent Cartillier, Sean Crane, Tien Do, Morrie Doulaty, Akshay Erapalli, Christoph Feichtenhofer, Adriano Fragomeni, Qichen Fu, Christian Fuegen, Abrham Gebreselasie, Cristina González, James Hillis, Xuhua Huang, Yifei Huang, Wenqi Jia, Weslie Khoo, Jáchym Kolár, Satwik Kottur, Anurag Kumar, Federico Landini, Chao Li, Yanghao Li, Zhenqiang Li, Karttikeya Mangalam, Raghava Modhugu, Jonathan Munro, Tullie Murrell, Takumi Nishiyasu, Will Price, Paola Ruiz Puentes, Merey Ramazanova, Leda Sari, Kiran Somasundaram, Audrey Southerland, Yusuke Sugano, Ruijie Tao, Minh Vo, Yuchen Wang, Xindi Wu, Takuma Yagi, Yunyi Zhu, Pablo Arbeláez, David Crandall, Dima Damen, Giovanni Maria Farinella, Bernard Ghanem, Vamsi Krishna Ithapu, C. V. Jawahar, Hanbyul Joo, Kris Kitani, Haizhou Li, Richard A. Newcombe, Aude Oliva, Hyun Soo Park, James M. Rehg, Yoichi Sato, Jianbo Shi, Mike Zheng Shou, Antonio Torralba, Lorenzo Torresani, Mingfei Yan, Jitendra Malik:
Ego4D: Around the World in 3,000 Hours of Egocentric Video. CoRR abs/2110.07058 (2021) - [i17]Alex Xiao, Weiyi Zheng, Gil Keren, Duc Le, Frank Zhang, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Abdelrahman Mohamed:
Scaling ASR Improves Zero and Few Shot Learning. CoRR abs/2111.05948 (2021) - 2020
- [c39]Duc Le, Thilo Köhler, Christian Fuegen, Michael L. Seltzer:
G2G: TTS-Driven Pronunciation Learning for Graphemic Hybrid ASR. ICASSP 2020: 6869-6873 - [c38]Yongqiang Wang, Abdelrahman Mohamed, Duc Le, Chunxi Liu, Alex Xiao, Jay Mahadeokar, Hongzhao Huang, Andros Tjandra, Xiaohui Zhang, Frank Zhang, Christian Fuegen, Geoffrey Zweig, Michael L. Seltzer:
Transformer-Based Acoustic Modeling for Hybrid Speech Recognition. ICASSP 2020: 6874-6878 - [c37]Weipeng He, Lu Lu, Biqiao Zhang, Jay Mahadeokar, Kaustubh Kalgaonkar, Christian Fuegen:
Spatial Attention for Far-Field Speech Recognition with Deep Beamforming Neural Networks. ICASSP 2020: 7499-7503 - [c36]Jacob Kahn, Morgane Rivière, Weiyi Zheng, Evgeny Kharitonov, Qiantong Xu, Pierre-Emmanuel Mazaré, Julien Karadayi, Vitaliy Liptchinsky, Ronan Collobert, Christian Fuegen, Tatiana Likhomanenko, Gabriel Synnaeve, Armand Joulin, Abdelrahman Mohamed, Emmanuel Dupoux:
Libri-Light: A Benchmark for ASR with Limited or No Supervision. ICASSP 2020: 7669-7673 - [c35]Kritika Singh, Vimal Manohar, Alex Xiao, Sergey Edunov, Ross B. Girshick, Vitaliy Liptchinsky, Christian Fuegen, Yatharth Saraf, Geoffrey Zweig, Abdelrahman Mohamed:
Large Scale Weakly and Semi-Supervised Learning for Low-Resource Video ASR. INTERSPEECH 2020: 3770-3774 - [c34]Yang Gao, Weiyi Zheng, Zhaojun Yang, Thilo Köhler, Christian Fuegen, Qing He:
Interactive Text-to-Speech System via Joint Style Analysis. INTERSPEECH 2020: 4447-4451 - [c33]Yangyang Shi, Yongqiang Wang, Chunyang Wu, Christian Fuegen, Frank Zhang, Duc Le, Ching-Feng Yeh, Michael L. Seltzer:
Weak-Attention Suppression for Transformer Based Speech Recognition. INTERSPEECH 2020: 4996-5000 - [i16]Yang Gao, Weiyi Zheng, Zhaojun Yang, Thilo Köhler, Christian Fuegen, Qing He:
Interactive Text-to-Speech via Semi-supervised Style Transfer Learning. CoRR abs/2002.06758 (2020) - [i15]Kritika Singh, Vimal Manohar, Alex Xiao, Sergey Edunov, Ross B. Girshick, Vitaliy Liptchinsky, Christian Fuegen, Yatharth Saraf, Geoffrey Zweig, Abdelrahman Mohamed:
Large scale weakly and semi-supervised learning for low-resource video ASR. CoRR abs/2005.07850 (2020) - [i14]Yangyang Shi, Yongqiang Wang, Chunyang Wu, Christian Fuegen, Frank Zhang, Duc Le, Ching-Feng Yeh, Michael L. Seltzer:
Weak-Attention Suppression For Transformer Based Speech Recognition. CoRR abs/2005.09137 (2020) - [i13]Suyoun Kim, Yuan Shangguan, Jay Mahadeokar, Antoine Bruguier, Christian Fuegen, Michael L. Seltzer, Duc Le:
Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer. CoRR abs/2010.13878 (2020) - [i12]Jay Mahadeokar, Yuan Shangguan, Duc Le, Gil Keren, Hang Su, Thong Le, Ching-Feng Yeh, Christian Fuegen, Michael L. Seltzer:
Alignment Restricted Streaming Recurrent Neural Network Transducer. CoRR abs/2011.03072 (2020) - [i11]Duc Le, Gil Keren, Julian Chan, Jay Mahadeokar, Christian Fuegen, Michael L. Seltzer:
Deep Shallow Fusion for RNN-T Personalization. CoRR abs/2011.07754 (2020)
2010 – 2019
- 2019
- [c32]Duc Le, Xiaohui Zhang, Weiyi Zheng, Christian Fügen, Geoffrey Zweig, Michael L. Seltzer:
From Senones to Chenones: Tied Context-Dependent Graphemes for Hybrid Speech Recognition. ASRU 2019: 457-464 - [c31]Zhehuai Chen, Mahaveer Jain, Yongqiang Wang, Michael L. Seltzer, Christian Fuegen:
End-to-end Contextual Speech Recognition Using Class Language Models and a Token Passing Decoder. ICASSP 2019: 6186-6190 - [c30]Zhehuai Chen, Mahaveer Jain, Yongqiang Wang, Michael L. Seltzer, Christian Fuegen:
Joint Grapheme and Phoneme Embeddings for Contextual End-to-End ASR. INTERSPEECH 2019: 3490-3494 - [i10]Duc Le, Xiaohui Zhang, Weiyi Zheng, Christian Fügen, Geoffrey Zweig, Michael L. Seltzer:
From Senones to Chenones: Tied Context-Dependent Graphemes for Hybrid Speech Recognition. CoRR abs/1910.01493 (2019) - [i9]Yongqiang Wang, Abdelrahman Mohamed, Duc Le, Chunxi Liu, Alex Xiao, Jay Mahadeokar, Hongzhao Huang, Andros Tjandra, Xiaohui Zhang, Frank Zhang, Christian Fuegen, Geoffrey Zweig, Michael L. Seltzer:
Transformer-based Acoustic Modeling for Hybrid Speech Recognition. CoRR abs/1910.09799 (2019) - [i8]Duc Le, Thilo Köhler, Christian Fuegen, Michael L. Seltzer:
G2G: TTS-Driven Pronunciation Learning for Graphemic Hybrid ASR. CoRR abs/1910.12612 (2019) - [i7]Ching-Feng Yeh, Jay Mahadeokar, Kaustubh Kalgaonkar, Yongqiang Wang, Duc Le, Mahaveer Jain, Kjell Schubert, Christian Fuegen, Michael L. Seltzer:
Transformer-Transducer: End-to-End Speech Recognition with Self-Attention. CoRR abs/1910.12977 (2019) - [i6]Mahaveer Jain, Kjell Schubert, Jay Mahadeokar, Ching-Feng Yeh, Kaustubh Kalgaonkar, Anuroop Sriram, Christian Fuegen, Michael L. Seltzer:
RNN-T For Latency Controlled ASR With Improved Beam Search. CoRR abs/1911.01629 (2019) - [i5]Weipeng He, Lu Lu, Biqiao Zhang, Jay Mahadeokar, Kaustubh Kalgaonkar, Christian Fuegen:
Spatial Attention for Far-field Speech Recognition with Deep Beamforming Neural Networks. CoRR abs/1911.02115 (2019) - [i4]Jacob Kahn, Morgane Rivière, Weiyi Zheng, Evgeny Kharitonov, Qiantong Xu, Pierre-Emmanuel Mazaré, Julien Karadayi, Vitaliy Liptchinsky, Ronan Collobert, Christian Fuegen, Tatiana Likhomanenko, Gabriel Synnaeve, Armand Joulin, Abdelrahman Mohamed, Emmanuel Dupoux:
Libri-Light: A Benchmark for ASR with Limited or No Supervision. CoRR abs/1912.07875 (2019) - 2018
- [c29]Anurag Kumar, Maksim Khadkevich, Christian Fügen:
Knowledge Transfer from Weakly Labeled Audio Using Convolutional Neural Network for Sound Events and Scenes. ICASSP 2018: 326-330 - [c28]Dmitriy Serdyuk, Yongqiang Wang, Christian Fuegen, Anuj Kumar, Baiyang Liu, Yoshua Bengio:
Towards End-to-end Spoken Language Understanding. ICASSP 2018: 5754-5758 - [i3]Dmitriy Serdyuk, Yongqiang Wang, Christian Fuegen, Anuj Kumar, Baiyang Liu, Yoshua Bengio:
Towards end-to-end spoken language understanding. CoRR abs/1802.08395 (2018) - [i2]Zhehuai Chen, Mahaveer Jain, Yongqiang Wang, Michael L. Seltzer, Christian Fuegen:
End-to-end contextual speech recognition using class language models and a token passing decoder. CoRR abs/1812.02142 (2018) - 2017
- [i1]Anurag Kumar, Maksim Khadkevich, Christian Fügen:
Knowledge Transfer from Weakly Labeled Audio using Convolutional Neural Network for Sound Events and Scenes. CoRR abs/1711.01369 (2017) - 2013
- [c27]Matthias Sperber, Graham Neubig, Christian Fügen, Satoshi Nakamura, Alex Waibel:
Efficient speech transcription through respeaking. INTERSPEECH 2013: 1087-1091 - [c26]Eunah Cho, Christian Fügen, Teresa Herrmann, Kevin Kilgour, Mohammed Mediani, Christian Mohr, Jan Niehues, Kay Rottmann, Christian Saam, Sebastian Stüker, Alex Waibel:
A real-world system for simultaneous translation of German lectures. INTERSPEECH 2013: 3473-3477 - [c25]Evgeniy Shin, Sebastian Stüker, Kevin Kilgour, Christian Fügen, Alex Waibel:
Maximum entropy language modeling for Russian ASR. IWSLT 2013
2000 – 2009
- 2009
- [b1]Christian Fügen:
A System for Simultaneous Translation of Lectures and Speeches. Karlsruhe Institute of Technology, 2009 - [c24]Olivier Hamon, Christian Fügen, Djamel Mostefa, Victoria Arranz, Muntsin Kolss, Alex Waibel, Khalid Choukri:
End-to-End Evaluation in Simultaneous Translation. EACL 2009: 345-353 - 2008
- [j3]Alex Waibel, Christian Fügen:
Spoken language translation. IEEE Signal Process. Mag. 25(3): 70-79 (2008) - 2007
- [j2]Christian Fügen, Alex Waibel, Muntsin Kolss:
Simultaneous translation of lectures and speeches. Mach. Transl. 21(4): 209-252 (2007) - [j1]Rainer Stiefelhagen, Hazim Kemal Ekenel, Christian Fügen, Petra Gieselmann, Hartwig Holzapfel, Florian Kraft, Kai Nickel, Michael Voit, Alex Waibel:
Enabling Multimodal Human-Robot Interaction for the Karlsruhe Humanoid Robot. IEEE Trans. Robotics 23(5): 840-851 (2007) - [c23]Kornel Laskowski, Christian Fügen, Tanja Schultz:
Simultaneous multispeaker segmentation for automatic meeting recognition. EUSIPCO 2007: 1294-1298 - [c22]Sebastian Stüker, Matthias Paulik, Muntsin Kolss, Christian Fügen, Alex Waibel:
Speech Translation Enhanced ASR for European Parliament Speeches - On the Influence of ASR Performance on Speech Translation. ICASSP (4) 2007: 1293-1296 - [c21]Sebastian Stüker, Christian Fügen, Florian Kraft, Matthias Wölfel:
The ISL 2007 English speech transcription system for european parliament speeches. INTERSPEECH 2007: 2609-2612 - [c20]Christian Fügen, Muntsin Kolss:
The influence of utterance chunking on machine translation performance. INTERSPEECH 2007: 2837-2840 - 2006
- [c19]Christian Fügen, Muntsin Kolss, Dietmar Bernreuther, Matthias Paulik, Sebastian Stüker, Stephan Vogel, Alex Waibel:
Open Domain Speech Recognition & Translation: Lectures and Speeches. ICASSP (1) 2006: 569-572 - [c18]Christian Fügen, Matthias Wölfel, John W. McDonough, Shajith Ikbal, Florian Kraft, Kornel Laskowski, Mari Ostendorf, Sebastian Stüker, Ken'ichi Kumatani:
Advances in lecture recognition: the ISL RT-06s evaluation system. INTERSPEECH 2006 - [c17]Tobias Gehrig, Ulrich Klee, John W. McDonough, Shajith Ikbal, Matthias Wölfel, Christian Fügen:
Tracking and beamforming for multiple simultaneous speakers with probabilistic data association filters. INTERSPEECH 2006 - [c16]Sebastian Stüker, Christian Fügen, Susanne Burger, Matthias Wölfel:
Cross-system adaptation and combination for continuous speech recognition: the influence of phoneme set and acoustic front-end. INTERSPEECH 2006 - [c15]Matthias Wölfel, Christian Fügen, Shajith Ikbal, John W. McDonough:
Multi-source far-distance microphone selection and combination for automatic transcription of lectures. INTERSPEECH 2006 - [c14]Christian Fügen, Shajith Ikbal, Florian Kraft, Ken'ichi Kumatani, Kornel Laskowski, John W. McDonough, Mari Ostendorf, Sebastian Stüker, Matthias Wölfel:
The ISL RT-06S Speech-to-Text System. MLMI 2006: 407-418 - 2005
- [c13]Florian Metze, Christian Fügen, Yue Pan, Alex Waibel:
Automatically Transcribing Meetings using Distant Microphones. ICASSP (1) 2005: 989-992 - [c12]Thilo Köhler, Christian Fügen, Sebastian Stüker, Alex Waibel:
Rapid porting of ASR-systems to mobile devices. INTERSPEECH 2005: 233-236 - [c11]Matthias Paulik, Christian Fügen, Sebastian Stüker, Tanja Schultz, Thomas Schaaf, Alex Waibel:
Document driven machine translation enhanced ASR. INTERSPEECH 2005: 2261-2264 - 2004
- [c10]Alex Waibel, Tanja Schultz, Stephan Vogel, Christian Fügen, Matthias Honal, Muntsin Kolss, Jürgen Reichert, Sebastian Stüker:
Towards language portability in statistical speech translation. ICASSP (3) 2004: 765-768 - [c9]Hagen Soltau, Hua Yu, Florian Metze, Christian Fügen, Qin Jin, Szu-Chen Stan Jou:
The 2003 ISL rich transcription system for conversational telephony speech. ICASSP (1) 2004: 773-776 - [c8]Christian Fügen, Hartwig Holzapfel, Alex Waibel:
Tight coupling of speech recognition and dialog management - dialog-context dependent grammar weighting for speech recognition. INTERSPEECH 2004: 169-172 - [c7]Tanja Schultz, Qin Jin, Kornel Laskowski, Yue Pan, Florian Metze, Christian Fügen:
Issues in meeting transcription - the ISL meeting transcription system. INTERSPEECH 2004: 1709-1712 - [c6]Rainer Stiefelhagen, Christian Fügen, Petra Gieselmann, Hartwig Holzapfel, Kai Nickel, Alex Waibel:
Natural human-robot interaction using speech, head pose and gestures. IROS 2004: 2422-2427 - 2002
- [c5]Hagen Soltau, Florian Metze, Christian Fügen, Alex Waibel:
Efficient language model lookahead through polymorphic linguistic context assignment. ICASSP 2002: 709-712 - [c4]Hartwig Holzapfel, Christian Fügen, Matthias Denecke, Alex Waibel:
Integrating Emotional Cues into a Framework for Dialogue Management. ICMI 2002: 141-148 - [c3]Manuel Kauers, Stephan Vogel, Christian Fügen, Alex Waibel:
Interlingua based statistical machine translation. INTERSPEECH 2002: 1909-1912 - 2001
- [c2]Christian Fügen, Martin Westphal, Mike Schneider, Tanja Schultz, Alex Waibel:
LingWear: A Mobile Tourist Information System. HLT 2001 - 2000
- [c1]Christian Fügen, Ivica Rogina:
Integrating dynamic speech modalities into context decision trees. ICASSP 2000: 1277-1280