Christian Fügen

Christian Fuegen


2020 – today

2025
[j4]
  - https://dblp.org/rec/journals/pami/GraumanWBCCFGHJKLLMNRRR25
Kristen Grauman, Andrew Westbury, Eugene Byrne, Vincent Cartillier, Zachary Chavis, Antonino Furnari, Rohit Girdhar, Jackson Hamburger, Hao Jiang, Devansh Kukreja, Miao Liu, Xingyu Liu, Miguel Martin, Tushar Nagarajan, Ilija Radosavovic, Santhosh Kumar Ramakrishnan, Fiona Ryan, Jayant Sharma, Michael Wray, Mengmeng Xu, Eric Zhongcong Xu, Chen Zhao, Siddhant Bansal, Dhruv Batra, Sean Crane, Tien Do, Morrie Doulaty, Akshay Erapalli, Christoph Feichtenhofer, Adriano Fragomeni, Qichen Fu, Abrham Gebreselasie, Cristina González, James Hillis, Xuhua Huang, Yifei Huang, Wenqi Jia, Weslie Khoo, Jáchym Kolár, Satwik Kottur, Anurag Kumar, Federico Landini, Chao Li, Yanghao Li, Zhenqiang Li, Karttikeya Mangalam, Raghava Modhugu, Jonathan Munro, Tullie Murrell, Takumi Nishiyasu, Will Price, Paola Ruiz Puentes, Merey Ramazanova, Leda Sari, Kiran K. Somasundaram, Audrey Southerland, Yusuke Sugano, Ruijie Tao, Minh Vo, Yuchen Wang, Xindi Wu, Takuma Yagi, Ziwei Zhao, Yunyi Zhu, Pablo Arbeláez, David Crandall, Dima Damen, Giovanni Maria Farinella, Christian Fuegen, Bernard Ghanem, Vamsi Krishna Ithapu, C. V. Jawahar, Hanbyul Joo, Kris Kitani, Haizhou Li, Richard A. Newcombe, Aude Oliva, Hyun Soo Park, James M. Rehg, Yoichi Sato, Jianbo Shi, Mike Zheng Shou, Antonio Torralba, Lorenzo Torresani, Mingfei Yan, Jitendra Malik:
Ego4D: Around the World in 3,600 Hours of Egocentric Video. IEEE Trans. Pattern Anal. Mach. Intell. 47(11): 9468-9509 (2025)
[c67]
  - https://dblp.org/rec/conf/icassp/MoritzXG0MASF25
Niko Moritz, Ruiming Xie, Yashesh Gaur, Ke Li, Simone Merello, Zeeshan Ahmed, Frank Seide, Christian Fuegen:
Transcribing and Translating, Fast and Slow: Joint Speech Translation and Recognition. ICASSP 2025: 1-5
[c66]
  - https://dblp.org/rec/conf/icassp/ZhaoMLXXZAGLF25
Jinzheng Zhao, Niko Moritz, Egor Lakomkin, Ruiming Xie, Zhiping Xiu, Katerina Zmolíková, Zeeshan Ahmed, Yashesh Gaur, Duc Le, Christian Fuegen:
Textless Streaming Speech-to-Speech Translation using Semantic Speech Tokens. ICASSP 2025: 1-5
[i39]
  - https://dblp.org/rec/journals/corr/abs-2503-22051
Zeeshan Ahmed, Frank Seide, Zhe Liu, Rastislav Rabatin, Jáchym Kolár, Niko Moritz, Ruiming Xie, Simone Merello, Christian Fuegen:
Non-Monotonic Attention-based Read/Write Policy Learning for Simultaneous Translation. CoRR abs/2503.22051 (2025)
[i38]
  - https://dblp.org/rec/journals/corr/abs-2508-13358
Zeeshan Ahmed, Frank Seide, Niko Moritz, Ju Lin, Ruiming Xie, Simone Merello, Zhe Liu, Christian Fuegen:
Overcoming Latency Bottlenecks in On-Device Speech Translation: A Cascaded Approach with Alignment-Based Streaming MT. CoRR abs/2508.13358 (2025)
[i37]
  - https://dblp.org/rec/journals/corr/abs-2510-23276
Thai-Binh Nguyen, Katerina Zmolíková, Pingchuan Ma, Ngoc Quan Pham, Christian Fuegen, Alexander Waibel:
A Cocktail-Party Benchmark: Multi-Modal dataset and Comparative Evaluation Results. CoRR abs/2510.23276 (2025)
2024
[c65]
  - https://dblp.org/rec/conf/icassp/LinMHXSFS24
Ju Lin, Niko Moritz, Yiteng Huang, Ruiming Xie, Ming Sun, Christian Fuegen, Frank Seide:
AGADIR: Towards Array-Geometry Agnostic Directional Speech Recognition. ICASSP 2024: 11951-11955
[c64]
  - https://dblp.org/rec/conf/icassp/LakomkinWFKSF24
Egor Lakomkin, Chunyang Wu, Yassir Fathullah, Ozlem Kalinli, Michael L. Seltzer, Christian Fuegen:
End-to-End Speech Recognition Contextualization with Large Language Models. ICASSP 2024: 12406-12410
[c63]
  - https://dblp.org/rec/conf/icassp/GuoMMSWMKFS24
Jinxi Guo, Niko Moritz, Yingyi Ma, Frank Seide, Chunyang Wu, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Effective Internal Language Model Training and Fusion for Factorized Transducer Model. ICASSP 2024: 12687-12691
[c62]
  - https://dblp.org/rec/conf/icassp/FathullahWLJSLG24
Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Ke Li, Jinxi Guo, Wenhan Xiong, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Prompting Large Language Models with Speech Recognition Abilities. ICASSP 2024: 13351-13355
[c61]
  - https://dblp.org/rec/conf/naacl/FathullahWLLJSM24
Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Ke Li, Junteng Jia, Yuan Shangguan, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
AudioChatLlama: Towards General-Purpose Speech Abilities for LLMs. NAACL-HLT 2024: 5522-5532
[i36]
  - https://dblp.org/rec/journals/corr/abs-2401-10411
Ju Lin, Niko Moritz, Yiteng Huang, Ruiming Xie, Ming Sun, Christian Fuegen, Frank Seide:
AGADIR: Towards Array-Geometry Agnostic Directional Speech Recognition. CoRR abs/2401.10411 (2024)
[i35]
  - https://dblp.org/rec/journals/corr/abs-2404-01716
Jinxi Guo, Niko Moritz, Yingyi Ma, Frank Seide, Chunyang Wu, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Effective internal language model training and fusion for factorized transducer model. CoRR abs/2404.01716 (2024)
[i34]
  - https://dblp.org/rec/journals/corr/abs-2412-15415
Niko Moritz, Ruiming Xie, Yashesh Gaur, Ke Li, Simone Merello, Zeeshan Ahmed, Frank Seide, Christian Fuegen:
Transcribing and Translating, Fast and Slow: Joint Speech Translation and Recognition. CoRR abs/2412.15415 (2024)
2023
[c60]
  - https://dblp.org/rec/conf/cvpr/LiuLV0CXDMKPPF23
Xubo Liu, Egor Lakomkin, Konstantinos Vougioukas, Pingchuan Ma, Honglie Chen, Ruiming Xie, Morrie Doulaty, Niko Moritz, Jáchym Kolár, Stavros Petridis, Maja Pantic, Christian Fuegen:
SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision. CVPR 2023: 18806-18815
[c59]
  - https://dblp.org/rec/conf/interspeech/0001MPFP23
Pingchuan Ma, Niko Moritz, Stavros Petridis, Christian Fuegen, Maja Pantic:
Streaming Audio-Visual Speech Recognition with Alignment Regularization. INTERSPEECH 2023: 1598-1602
[c58]
  - https://dblp.org/rec/conf/interspeech/LinMXKFS23
Ju Lin, Niko Moritz, Ruiming Xie, Kaustubh Kalgaonkar, Christian Fuegen, Frank Seide:
Directional Speech Recognition for Speaker Disambiguation and Cross-talk Suppression. INTERSPEECH 2023: 3522-3526
[i33]
  - https://dblp.org/rec/journals/corr/abs-2303-17200
Xubo Liu, Egor Lakomkin, Konstantinos Vougioukas, Pingchuan Ma, Honglie Chen, Ruiming Xie, Morrie Doulaty, Niko Moritz, Jáchym Kolár, Stavros Petridis, Maja Pantic, Christian Fuegen:
SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision. CoRR abs/2303.17200 (2023)
[i32]
  - https://dblp.org/rec/journals/corr/abs-2307-11795
Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Ke Li, Jinxi Guo, Wenhan Xiong, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Prompting Large Language Models with Speech Recognition Abilities. CoRR abs/2307.11795 (2023)
[i31]
  - https://dblp.org/rec/journals/corr/abs-2309-10917
Egor Lakomkin, Chunyang Wu, Yassir Fathullah, Ozlem Kalinli, Michael L. Seltzer, Christian Fuegen:
End-to-End Speech Recognition Contextualization with Large Language Models. CoRR abs/2309.10917 (2023)
[i30]
  - https://dblp.org/rec/journals/corr/abs-2311-06753
Yassir Fathullah, Chunyang Wu, Egor Lakomkin, Junteng Jia, Yuan Shangguan, Jay Mahadeokar, Ozlem Kalinli, Christian Fuegen, Mike Seltzer:
Towards General-Purpose Speech Abilities for Large Language Models Using Unpaired Data. CoRR abs/2311.06753 (2023)
2022
[c57]
  - https://dblp.org/rec/conf/cvpr/GraumanWBCFGH0L22
Kristen Grauman, Andrew Westbury, Eugene Byrne, Zachary Chavis, Antonino Furnari, Rohit Girdhar, Jackson Hamburger, Hao Jiang, Miao Liu, Xingyu Liu, Miguel Martin, Tushar Nagarajan, Ilija Radosavovic, Santhosh Kumar Ramakrishnan, Fiona Ryan, Jayant Sharma, Michael Wray, Mengmeng Xu, Eric Zhongcong Xu, Chen Zhao, Siddhant Bansal, Dhruv Batra, Vincent Cartillier, Sean Crane, Tien Do, Morrie Doulaty, Akshay Erapalli, Christoph Feichtenhofer, Adriano Fragomeni, Qichen Fu, Abrham Gebreselasie, Cristina González, James Hillis, Xuhua Huang, Yifei Huang, Wenqi Jia, Weslie Khoo, Jáchym Kolár, Satwik Kottur, Anurag Kumar, Federico Landini, Chao Li, Yanghao Li, Zhenqiang Li, Karttikeya Mangalam, Raghava Modhugu, Jonathan Munro, Tullie Murrell, Takumi Nishiyasu, Will Price, Paola Ruiz Puentes, Merey Ramazanova, Leda Sari, Kiran K. Somasundaram, Audrey Southerland, Yusuke Sugano, Ruijie Tao, Minh Vo, Yuchen Wang, Xindi Wu, Takuma Yagi, Ziwei Zhao, Yunyi Zhu, Pablo Arbeláez, David Crandall, Dima Damen, Giovanni Maria Farinella, Christian Fuegen, Bernard Ghanem, Vamsi Krishna Ithapu, C. V. Jawahar, Hanbyul Joo, Kris Kitani, Haizhou Li, Richard A. Newcombe, Aude Oliva, Hyun Soo Park, James M. Rehg, Yoichi Sato, Jianbo Shi, Mike Zheng Shou, Antonio Torralba, Lorenzo Torresani, Mingfei Yan, Jitendra Malik:
Ego4D: Around the World in 3,000 Hours of Egocentric Video. CVPR 2022: 18973-18990
[c56]
  - https://dblp.org/rec/conf/interspeech/KimLZSAZFKS22
Suyoun Kim, Duc Le, Weiyi Zheng, Tarun Singh, Abhinav Arora, Xiaoyu Zhai, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric. INTERSPEECH 2022: 3978-3982
[c55]
  - https://dblp.org/rec/conf/interspeech/ZhengXKL0FKSM22
Weiyi Zheng, Alex Xiao, Gil Keren, Duc Le, Frank Zhang, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Abdelrahman Mohamed:
Scaling ASR Improves Zero and Few Shot Learning. INTERSPEECH 2022: 5135-5139
[c54]
  - https://dblp.org/rec/conf/slt/MoritzSLMF22
Niko Moritz, Frank Seide, Duc Le, Jay Mahadeokar, Christian Fuegen:
An Investigation of Monotonic Transducers for Large-Scale Automatic Speech Recognition. SLT 2022: 324-330
[i29]
  - https://dblp.org/rec/journals/corr/abs-2204-08858
Niko Moritz, Frank Seide, Duc Le, Jay Mahadeokar, Christian Fuegen:
An Investigation of Monotonic Transducers for Large-Scale Automatic Speech Recognition. CoRR abs/2204.08858 (2022)
[i28]
  - https://dblp.org/rec/journals/corr/abs-2211-02133
Pingchuan Ma, Niko Moritz, Stavros Petridis, Christian Fuegen, Maja Pantic:
Streaming Audio-Visual Speech Recognition with Alignment Regularization. CoRR abs/2211.02133 (2022)
2021
[c53]
  - https://dblp.org/rec/conf/icassp/XiaoFM21
Alex Xiao, Christian Fuegen, Abdelrahman Mohamed:
Contrastive Semi-Supervised Learning for ASR. ICASSP 2021: 3870-3874
[c52]
  - https://dblp.org/rec/conf/icassp/LinWKKZF21
Ju Lin, Yun Wang, Kaustubh Kalgaonkar, Gil Keren, Didi Zhang, Christian Fuegen:
A Time-Domain Convolutional Recurrent Network for Packet Loss Concealment. ICASSP 2021: 7148-7152
[c51]
  - https://dblp.org/rec/conf/icassp/KimSMBFSL21
Suyoun Kim, Yuan Shangguan, Jay Mahadeokar, Antoine Bruguier, Christian Fuegen, Michael L. Seltzer, Duc Le:
Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer. ICASSP 2021: 7333-7337
[c50]
  - https://dblp.org/rec/conf/icassp/VenkateshVMSFSC21
Ganesh Venkatesh, Alagappan Valliappan, Jay Mahadeokar, Yuan Shangguan, Christian Fuegen, Michael L. Seltzer, Vikas Chandra:
Memory-Efficient Speech Recognition on Smart Devices. ICASSP 2021: 8368-8372
[c49]
  - https://dblp.org/rec/conf/interspeech/WuXSKFKH21
Chunyang Wu, Zhiping Xiu, Yangyang Shi, Ozlem Kalinli, Christian Fuegen, Thilo Köhler, Qing He:
Transformer-Based Acoustic Modeling for Streaming Speech Synthesis. Interspeech 2021: 146-150
[c48]
  - https://dblp.org/rec/conf/interspeech/0003WIF21
Anurag Kumar, Yun Wang, Vamsi Krishna Ithapu, Christian Fuegen:
Do Sound Event Representations Generalize to Other Audio Tasks? A Case Study in Audio Transfer Learning. Interspeech 2021: 1214-1218
[c47]
  - https://dblp.org/rec/conf/interspeech/LinWKKZF21
Ju Lin, Yun Wang, Kaustubh Kalgaonkar, Gil Keren, Didi Zhang, Christian Fuegen:
A Two-Stage Approach to Speech Bandwidth Extension. Interspeech 2021: 1689-1693
[c46]
  - https://dblp.org/rec/conf/interspeech/LeJKKSMCSFKSS21
Duc Le, Mahaveer Jain, Gil Keren, Suyoun Kim, Yangyang Shi, Jay Mahadeokar, Julian Chan, Yuan Shangguan, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Michael L. Seltzer:
Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion. Interspeech 2021: 1772-1776
[c45]
  - https://dblp.org/rec/conf/interspeech/KimALYFKS21
Suyoun Kim, Abhinav Arora, Duc Le, Ching-Feng Yeh, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding. Interspeech 2021: 1977-1981
[c44]
  - https://dblp.org/rec/conf/interspeech/ShiNWMLPXYCFKS21
Yangyang Shi, Varun Nagaraja, Chunyang Wu, Jay Mahadeokar, Duc Le, Rohit Prabhavalkar, Alex Xiao, Ching-Feng Yeh, Julian Chan, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Dynamic Encoder Transducer: A Flexible Solution for Trading Off Accuracy for Latency. Interspeech 2021: 2042-2046
[c43]
  - https://dblp.org/rec/conf/interspeech/MahadeokarSSWXS21
Jay Mahadeokar, Yangyang Shi, Yuan Shangguan, Chunyang Wu, Alex Xiao, Hang Su, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Flexi-Transducer: Optimizing Latency, Accuracy and Compute for Multi-Domain On-Device Scenarios. Interspeech 2021: 2107-2111
[c42]
  - https://dblp.org/rec/conf/interspeech/ShangguanPSMSZW21
Yuan Shangguan, Rohit Prabhavalkar, Hang Su, Jay Mahadeokar, Yangyang Shi, Jiatong Zhou, Chunyang Wu, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Dissecting User-Perceived Latency of On-Device E2E Speech Recognition. Interspeech 2021: 4553-4557
[c41]
  - https://dblp.org/rec/conf/slt/MahadeokarSLKSL21
Jay Mahadeokar, Yuan Shangguan, Duc Le, Gil Keren, Hang Su, Thong Le, Ching-Feng Yeh, Christian Fuegen, Michael L. Seltzer:
Alignment Restricted Streaming Recurrent Neural Network Transducer. SLT 2021: 52-59
[c40]
  - https://dblp.org/rec/conf/slt/LeKCMFS21
Duc Le, Gil Keren, Julian Chan, Jay Mahadeokar, Christian Fuegen, Michael L. Seltzer:
Deep Shallow Fusion for RNN-T Personalization. SLT 2021: 251-257
[i27]
  - https://dblp.org/rec/journals/corr/abs-2102-11531
Ganesh Venkatesh, Alagappan Valliappan, Jay Mahadeokar, Yuan Shangguan, Christian Fuegen, Michael L. Seltzer, Vikas Chandra:
Memory-efficient Speech Recognition on Smart Devices. CoRR abs/2102.11531 (2021)
[i26]
  - https://dblp.org/rec/journals/corr/abs-2103-05149
Alex Xiao, Christian Fuegen, Abdelrahman Mohamed:
Contrastive Semi-supervised Learning for ASR. CoRR abs/2103.05149 (2021)
[i25]
  - https://dblp.org/rec/journals/corr/abs-2104-02138
Suyoun Kim, Abhinav Arora, Duc Le, Ching-Feng Yeh, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Semantic Distance: A New Metric for ASR Performance Analysis Towards Spoken Language Understanding. CoRR abs/2104.02138 (2021)
[i24]
  - https://dblp.org/rec/journals/corr/abs-2104-02176
Yangyang Shi, Varun Nagaraja, Chunyang Wu, Jay Mahadeokar, Duc Le, Rohit Prabhavalkar, Alex Xiao, Ching-Feng Yeh, Julian Chan, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Dynamic Encoder Transducer: A Flexible Solution For Trading Off Accuracy For Latency. CoRR abs/2104.02176 (2021)
[i23]
  - https://dblp.org/rec/journals/corr/abs-2104-02194
Duc Le, Mahaveer Jain, Gil Keren, Suyoun Kim, Yangyang Shi, Jay Mahadeokar, Julian Chan, Yuan Shangguan, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Michael L. Seltzer:
Contextualized Streaming End-to-End Speech Recognition with Trie-Based Deep Biasing and Shallow Fusion. CoRR abs/2104.02194 (2021)
[i22]
  - https://dblp.org/rec/journals/corr/abs-2104-02207
Yuan Shangguan, Rohit Prabhavalkar, Hang Su, Jay Mahadeokar, Yangyang Shi, Jiatong Zhou, Chunyang Wu, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Dissecting User-Perceived Latency of On-Device E2E Speech Recognition. CoRR abs/2104.02207 (2021)
[i21]
  - https://dblp.org/rec/journals/corr/abs-2104-02232
Jay Mahadeokar, Yangyang Shi, Yuan Shangguan, Chunyang Wu, Alex Xiao, Hang Su, Duc Le, Ozlem Kalinli, Christian Fuegen, Michael L. Seltzer:
Flexi-Transducer: Optimizing Latency, Accuracy and Compute for Multi-Domain On-Device Scenarios. CoRR abs/2104.02232 (2021)
[i20]
  - https://dblp.org/rec/journals/corr/abs-2106-11335
Anurag Kumar, Yun Wang, Vamsi Krishna Ithapu, Christian Fuegen:
Do sound event representations generalize to other audio tasks? A case study in audio transfer learning. CoRR abs/2106.11335 (2021)
[i19]
  - https://dblp.org/rec/journals/corr/abs-2110-05376
Suyoun Kim, Duc Le, Weiyi Zheng, Tarun Singh, Abhinav Arora, Xiaoyu Zhai, Christian Fuegen, Ozlem Kalinli, Michael L. Seltzer:
Evaluating User Perception of Speech Recognition System Quality with Semantic Distance Metric. CoRR abs/2110.05376 (2021)
[i18]
  - https://dblp.org/rec/journals/corr/abs-2110-07058
Kristen Grauman, Andrew Westbury, Eugene Byrne, Zachary Chavis, Antonino Furnari, Rohit Girdhar, Jackson Hamburger, Hao Jiang, Miao Liu, Xingyu Liu, Miguel Martin, Tushar Nagarajan, Ilija Radosavovic, Santhosh Kumar Ramakrishnan, Fiona Ryan, Jayant Sharma, Michael Wray, Mengmeng Xu, Eric Zhongcong Xu, Chen Zhao, Siddhant Bansal, Dhruv Batra, Vincent Cartillier, Sean Crane, Tien Do, Morrie Doulaty, Akshay Erapalli, Christoph Feichtenhofer, Adriano Fragomeni, Qichen Fu, Christian Fuegen, Abrham Gebreselasie, Cristina González, James Hillis, Xuhua Huang, Yifei Huang, Wenqi Jia, Weslie Khoo, Jáchym Kolár, Satwik Kottur, Anurag Kumar, Federico Landini, Chao Li, Yanghao Li, Zhenqiang Li, Karttikeya Mangalam, Raghava Modhugu, Jonathan Munro, Tullie Murrell, Takumi Nishiyasu, Will Price, Paola Ruiz Puentes, Merey Ramazanova, Leda Sari, Kiran K. Somasundaram, Audrey Southerland, Yusuke Sugano, Ruijie Tao, Minh Vo, Yuchen Wang, Xindi Wu, Takuma Yagi, Yunyi Zhu, Pablo Arbeláez, David Crandall, Dima Damen, Giovanni Maria Farinella, Bernard Ghanem, Vamsi Krishna Ithapu, C. V. Jawahar, Hanbyul Joo, Kris Kitani, Haizhou Li, Richard A. Newcombe, Aude Oliva, Hyun Soo Park, James M. Rehg, Yoichi Sato, Jianbo Shi, Mike Zheng Shou, Antonio Torralba, Lorenzo Torresani, Mingfei Yan, Jitendra Malik:
Ego4D: Around the World in 3,000 Hours of Egocentric Video. CoRR abs/2110.07058 (2021)
[i17]
  - https://dblp.org/rec/journals/corr/abs-2111-05948
Alex Xiao, Weiyi Zheng, Gil Keren, Duc Le, Frank Zhang, Christian Fuegen, Ozlem Kalinli, Yatharth Saraf, Abdelrahman Mohamed:
Scaling ASR Improves Zero and Few Shot Learning. CoRR abs/2111.05948 (2021)
2020
[c39]
  - https://dblp.org/rec/conf/icassp/LeKFS20
Duc Le, Thilo Köhler, Christian Fuegen, Michael L. Seltzer:
G2G: TTS-Driven Pronunciation Learning for Graphemic Hybrid ASR. ICASSP 2020: 6869-6873
[c38]
  - https://dblp.org/rec/conf/icassp/WangMLLXMHTZZFZ20
Yongqiang Wang, Abdelrahman Mohamed, Duc Le, Chunxi Liu, Alex Xiao, Jay Mahadeokar, Hongzhao Huang, Andros Tjandra, Xiaohui Zhang, Frank Zhang, Christian Fuegen, Geoffrey Zweig, Michael L. Seltzer:
Transformer-Based Acoustic Modeling for Hybrid Speech Recognition. ICASSP 2020: 6874-6878
[c37]
  - https://dblp.org/rec/conf/icassp/HeLZMKF20
Weipeng He, Lu Lu, Biqiao Zhang, Jay Mahadeokar, Kaustubh Kalgaonkar, Christian Fuegen:
Spatial Attention for Far-Field Speech Recognition with Deep Beamforming Neural Networks. ICASSP 2020: 7499-7503
[c36]
  - https://dblp.org/rec/conf/icassp/KahnRZKXMKLCFLS20
Jacob Kahn, Morgane Rivière, Weiyi Zheng, Evgeny Kharitonov, Qiantong Xu, Pierre-Emmanuel Mazaré, Julien Karadayi, Vitaliy Liptchinsky, Ronan Collobert, Christian Fuegen, Tatiana Likhomanenko, Gabriel Synnaeve, Armand Joulin, Abdelrahman Mohamed, Emmanuel Dupoux:
Libri-Light: A Benchmark for ASR with Limited or No Supervision. ICASSP 2020: 7669-7673
[c35]
  - https://dblp.org/rec/conf/interspeech/SinghMXEGLFSZM20
Kritika Singh, Vimal Manohar, Alex Xiao, Sergey Edunov, Ross B. Girshick, Vitaliy Liptchinsky, Christian Fuegen, Yatharth Saraf, Geoffrey Zweig, Abdelrahman Mohamed:
Large Scale Weakly and Semi-Supervised Learning for Low-Resource Video ASR. INTERSPEECH 2020: 3770-3774
[c34]
  - https://dblp.org/rec/conf/interspeech/GaoZYKFH20
Yang Gao, Weiyi Zheng, Zhaojun Yang, Thilo Köhler, Christian Fuegen, Qing He:
Interactive Text-to-Speech System via Joint Style Analysis. INTERSPEECH 2020: 4447-4451
[c33]
  - https://dblp.org/rec/conf/interspeech/ShiWWFZLYS20
Yangyang Shi, Yongqiang Wang, Chunyang Wu, Christian Fuegen, Frank Zhang, Duc Le, Ching-Feng Yeh, Michael L. Seltzer:
Weak-Attention Suppression for Transformer Based Speech Recognition. INTERSPEECH 2020: 4996-5000
[i16]
  - https://dblp.org/rec/journals/corr/abs-2002-06758
Yang Gao, Weiyi Zheng, Zhaojun Yang, Thilo Köhler, Christian Fuegen, Qing He:
Interactive Text-to-Speech via Semi-supervised Style Transfer Learning. CoRR abs/2002.06758 (2020)
[i15]
  - https://dblp.org/rec/journals/corr/abs-2005-07850
Kritika Singh, Vimal Manohar, Alex Xiao, Sergey Edunov, Ross B. Girshick, Vitaliy Liptchinsky, Christian Fuegen, Yatharth Saraf, Geoffrey Zweig, Abdelrahman Mohamed:
Large scale weakly and semi-supervised learning for low-resource video ASR. CoRR abs/2005.07850 (2020)
[i14]
dblp key: journals/corr/abs-2005-09137
persistent URL: https://dblp.org/rec/journals/corr/abs-2005-09137
Yangyang Shi, Yongqiang Wang, Chunyang Wu, Christian Fuegen, Frank Zhang, Duc Le, Ching-Feng Yeh, Michael L. Seltzer:
Weak-Attention Suppression For Transformer Based Speech Recognition. CoRR abs/2005.09137 (2020)
[i13]
dblp key: journals/corr/abs-2010-13878
persistent URL: https://dblp.org/rec/journals/corr/abs-2010-13878
Suyoun Kim, Yuan Shangguan, Jay Mahadeokar, Antoine Bruguier, Christian Fuegen, Michael L. Seltzer, Duc Le:
Improved Neural Language Model Fusion for Streaming Recurrent Neural Network Transducer. CoRR abs/2010.13878 (2020)
[i12]
dblp key: journals/corr/abs-2011-03072
persistent URL: https://dblp.org/rec/journals/corr/abs-2011-03072
Jay Mahadeokar, Yuan Shangguan, Duc Le, Gil Keren, Hang Su, Thong Le, Ching-Feng Yeh, Christian Fuegen, Michael L. Seltzer:
Alignment Restricted Streaming Recurrent Neural Network Transducer. CoRR abs/2011.03072 (2020)
[i11]
dblp key: journals/corr/abs-2011-07754
persistent URL: https://dblp.org/rec/journals/corr/abs-2011-07754
Duc Le, Gil Keren, Julian Chan, Jay Mahadeokar, Christian Fuegen, Michael L. Seltzer:
Deep Shallow Fusion for RNN-T Personalization. CoRR abs/2011.07754 (2020)

2010 – 2019

2019
[c32]
dblp key: conf/asru/LeZZFZS19
persistent URL: https://dblp.org/rec/conf/asru/LeZZFZS19
Duc Le, Xiaohui Zhang, Weiyi Zheng, Christian Fügen, Geoffrey Zweig, Michael L. Seltzer:
From Senones to Chenones: Tied Context-Dependent Graphemes for Hybrid Speech Recognition. ASRU 2019: 457-464
[c31]
dblp key: conf/icassp/ChenJWSF19
persistent URL: https://dblp.org/rec/conf/icassp/ChenJWSF19
Zhehuai Chen, Mahaveer Jain, Yongqiang Wang, Michael L. Seltzer, Christian Fuegen:
End-to-end Contextual Speech Recognition Using Class Language Models and a Token Passing Decoder. ICASSP 2019: 6186-6190
[c30]
dblp key: conf/interspeech/ChenJWSF19
persistent URL: https://dblp.org/rec/conf/interspeech/ChenJWSF19
Zhehuai Chen, Mahaveer Jain, Yongqiang Wang, Michael L. Seltzer, Christian Fuegen:
Joint Grapheme and Phoneme Embeddings for Contextual End-to-End ASR. INTERSPEECH 2019: 3490-3494
[i10]
dblp key: journals/corr/abs-1910-01493
persistent URL: https://dblp.org/rec/journals/corr/abs-1910-01493
Duc Le, Xiaohui Zhang, Weiyi Zheng, Christian Fügen, Geoffrey Zweig, Michael L. Seltzer:
From Senones to Chenones: Tied Context-Dependent Graphemes for Hybrid Speech Recognition. CoRR abs/1910.01493 (2019)
[i9]
dblp key: journals/corr/abs-1910-09799
persistent URL: https://dblp.org/rec/journals/corr/abs-1910-09799
Yongqiang Wang, Abdelrahman Mohamed, Duc Le, Chunxi Liu, Alex Xiao, Jay Mahadeokar, Hongzhao Huang, Andros Tjandra, Xiaohui Zhang, Frank Zhang, Christian Fuegen, Geoffrey Zweig, Michael L. Seltzer:
Transformer-based Acoustic Modeling for Hybrid Speech Recognition. CoRR abs/1910.09799 (2019)
[i8]
dblp key: journals/corr/abs-1910-12612
persistent URL: https://dblp.org/rec/journals/corr/abs-1910-12612
Duc Le, Thilo Köhler, Christian Fuegen, Michael L. Seltzer:
G2G: TTS-Driven Pronunciation Learning for Graphemic Hybrid ASR. CoRR abs/1910.12612 (2019)
[i7]
dblp key: journals/corr/abs-1910-12977
persistent URL: https://dblp.org/rec/journals/corr/abs-1910-12977
Ching-Feng Yeh, Jay Mahadeokar, Kaustubh Kalgaonkar, Yongqiang Wang, Duc Le, Mahaveer Jain, Kjell Schubert, Christian Fuegen, Michael L. Seltzer:
Transformer-Transducer: End-to-End Speech Recognition with Self-Attention. CoRR abs/1910.12977 (2019)
[i6]
dblp key: journals/corr/abs-1911-01629
persistent URL: https://dblp.org/rec/journals/corr/abs-1911-01629
Mahaveer Jain, Kjell Schubert, Jay Mahadeokar, Ching-Feng Yeh, Kaustubh Kalgaonkar, Anuroop Sriram, Christian Fuegen, Michael L. Seltzer:
RNN-T For Latency Controlled ASR With Improved Beam Search. CoRR abs/1911.01629 (2019)
[i5]
dblp key: journals/corr/abs-1911-02115
persistent URL: https://dblp.org/rec/journals/corr/abs-1911-02115
Weipeng He, Lu Lu, Biqiao Zhang, Jay Mahadeokar, Kaustubh Kalgaonkar, Christian Fuegen:
Spatial Attention for Far-field Speech Recognition with Deep Beamforming Neural Networks. CoRR abs/1911.02115 (2019)
[i4]
dblp key: journals/corr/abs-1912-07875
persistent URL: https://dblp.org/rec/journals/corr/abs-1912-07875
Jacob Kahn, Morgane Rivière, Weiyi Zheng, Evgeny Kharitonov, Qiantong Xu, Pierre-Emmanuel Mazaré, Julien Karadayi, Vitaliy Liptchinsky, Ronan Collobert, Christian Fuegen, Tatiana Likhomanenko, Gabriel Synnaeve, Armand Joulin, Abdelrahman Mohamed, Emmanuel Dupoux:
Libri-Light: A Benchmark for ASR with Limited or No Supervision. CoRR abs/1912.07875 (2019)
2018
[c29]
dblp key: conf/icassp/KumarKF18
persistent URL: https://dblp.org/rec/conf/icassp/KumarKF18
Anurag Kumar, Maksim Khadkevich, Christian Fügen:
Knowledge Transfer from Weakly Labeled Audio Using Convolutional Neural Network for Sound Events and Scenes. ICASSP 2018: 326-330
[c28]
dblp key: conf/icassp/SerdyukWFKLB18
persistent URL: https://dblp.org/rec/conf/icassp/SerdyukWFKLB18
Dmitriy Serdyuk, Yongqiang Wang, Christian Fuegen, Anuj Kumar, Baiyang Liu, Yoshua Bengio:
Towards End-to-end Spoken Language Understanding. ICASSP 2018: 5754-5758
[i3]
dblp key: journals/corr/abs-1802-08395
persistent URL: https://dblp.org/rec/journals/corr/abs-1802-08395
Dmitriy Serdyuk, Yongqiang Wang, Christian Fuegen, Anuj Kumar, Baiyang Liu, Yoshua Bengio:
Towards end-to-end spoken language understanding. CoRR abs/1802.08395 (2018)
[i2]
dblp key: journals/corr/abs-1812-02142
persistent URL: https://dblp.org/rec/journals/corr/abs-1812-02142
Zhehuai Chen, Mahaveer Jain, Yongqiang Wang, Michael L. Seltzer, Christian Fuegen:
End-to-end contextual speech recognition using class language models and a token passing decoder. CoRR abs/1812.02142 (2018)
2017
[i1]
dblp key: journals/corr/abs-1711-01369
persistent URL: https://dblp.org/rec/journals/corr/abs-1711-01369
Anurag Kumar, Maksim Khadkevich, Christian Fügen:
Knowledge Transfer from Weakly Labeled Audio using Convolutional Neural Network for Sound Events and Scenes. CoRR abs/1711.01369 (2017)
2013
[c27]
dblp key: conf/interspeech/SperberNFNW13
persistent URL: https://dblp.org/rec/conf/interspeech/SperberNFNW13
Matthias Sperber, Graham Neubig, Christian Fügen, Satoshi Nakamura, Alex Waibel:
Efficient speech transcription through respeaking. INTERSPEECH 2013: 1087-1091
[c26]
dblp key: conf/interspeech/ChoFHKMMNRSSW13
persistent URL: https://dblp.org/rec/conf/interspeech/ChoFHKMMNRSSW13
Eunah Cho, Christian Fügen, Teresa Herrmann, Kevin Kilgour, Mohammed Mediani, Christian Mohr, Jan Niehues, Kay Rottmann, Christian Saam, Sebastian Stüker, Alex Waibel:
A real-world system for simultaneous translation of German lectures. INTERSPEECH 2013: 3473-3477
[c25]
dblp key: conf/iwslt/ShinSKFW13
persistent URL: https://dblp.org/rec/conf/iwslt/ShinSKFW13
Evgeniy Shin, Sebastian Stüker, Kevin Kilgour, Christian Fügen, Alex Waibel:
Maximum entropy language modeling for Russian ASR. IWSLT 2013

2000 – 2009

2009
[b1]
dblp key: phd/de/Fugen2009
persistent URL: https://dblp.org/rec/phd/de/Fugen2009
Christian Fügen:
A System for Simultaneous Translation of Lectures and Speeches. Karlsruhe Institute of Technology, 2009
[c24]
dblp key: conf/eacl/HamonFMAKWC09
persistent URL: https://dblp.org/rec/conf/eacl/HamonFMAKWC09
Olivier Hamon, Christian Fügen, Djamel Mostefa, Victoria Arranz, Muntsin Kolss, Alex Waibel, Khalid Choukri:
End-to-End Evaluation in Simultaneous Translation. EACL 2009: 345-353
2008
[j3]
dblp key: journals/spm/WaibelF08
persistent URL: https://dblp.org/rec/journals/spm/WaibelF08
Alex Waibel, Christian Fügen:
Spoken language translation. IEEE Signal Process. Mag. 25(3): 70-79 (2008)
2007
[j2]
dblp key: journals/mt/FugenWK07
persistent URL: https://dblp.org/rec/journals/mt/FugenWK07
Christian Fügen, Alex Waibel, Muntsin Kolss:
Simultaneous translation of lectures and speeches. Mach. Transl. 21(4): 209-252 (2007)
[j1]
dblp key: journals/trob/StiefelhagenEFGHKNVW07
persistent URL: https://dblp.org/rec/journals/trob/StiefelhagenEFGHKNVW07
Rainer Stiefelhagen, Hazim Kemal Ekenel, Christian Fügen, Petra Gieselmann, Hartwig Holzapfel, Florian Kraft, Kai Nickel, Michael Voit, Alex Waibel:
Enabling Multimodal Human-Robot Interaction for the Karlsruhe Humanoid Robot. IEEE Trans. Robotics 23(5): 840-851 (2007)
[c23]
dblp key: conf/eusipco/LaskowskiFS07
persistent URL: https://dblp.org/rec/conf/eusipco/LaskowskiFS07
Kornel Laskowski, Christian Fügen, Tanja Schultz:
Simultaneous multispeaker segmentation for automatic meeting recognition. EUSIPCO 2007: 1294-1298
[c22]
dblp key: conf/icassp/StukerPKFW07
persistent URL: https://dblp.org/rec/conf/icassp/StukerPKFW07
Sebastian Stüker, Matthias Paulik, Muntsin Kolss, Christian Fügen, Alex Waibel:
Speech Translation Enhanced ASR for European Parliament Speeches - On the Influence of ASR Performance on Speech Translation. ICASSP (4) 2007: 1293-1296
[c21]
dblp key: conf/interspeech/StukerFKW07
persistent URL: https://dblp.org/rec/conf/interspeech/StukerFKW07
Sebastian Stüker, Christian Fügen, Florian Kraft, Matthias Wölfel:
The ISL 2007 English speech transcription system for european parliament speeches. INTERSPEECH 2007: 2609-2612
[c20]
dblp key: conf/interspeech/FugenK07
persistent URL: https://dblp.org/rec/conf/interspeech/FugenK07
Christian Fügen, Muntsin Kolss:
The influence of utterance chunking on machine translation performance. INTERSPEECH 2007: 2837-2840
2006
[c19]
dblp key: conf/icassp/FugenKBPSVW06
persistent URL: https://dblp.org/rec/conf/icassp/FugenKBPSVW06
Christian Fügen, Muntsin Kolss, Dietmar Bernreuther, Matthias Paulik, Sebastian Stüker, Stephan Vogel, Alex Waibel:
Open Domain Speech Recognition & Translation: Lectures and Speeches. ICASSP (1) 2006: 569-572
[c18]
dblp key: conf/interspeech/FugenWMIKLOSK06
persistent URL: https://dblp.org/rec/conf/interspeech/FugenWMIKLOSK06
Christian Fügen, Matthias Wölfel, John W. McDonough, Shajith Ikbal, Florian Kraft, Kornel Laskowski, Mari Ostendorf, Sebastian Stüker, Ken'ichi Kumatani:
Advances in lecture recognition: the ISL RT-06s evaluation system. INTERSPEECH 2006
[c17]
dblp key: conf/interspeech/GehrigKMIWF06
persistent URL: https://dblp.org/rec/conf/interspeech/GehrigKMIWF06
Tobias Gehrig, Ulrich Klee, John W. McDonough, Shajith Ikbal, Matthias Wölfel, Christian Fügen:
Tracking and beamforming for multiple simultaneous speakers with probabilistic data association filters. INTERSPEECH 2006
[c16]
dblp key: conf/interspeech/StukerFBW06
persistent URL: https://dblp.org/rec/conf/interspeech/StukerFBW06
Sebastian Stüker, Christian Fügen, Susanne Burger, Matthias Wölfel:
Cross-system adaptation and combination for continuous speech recognition: the influence of phoneme set and acoustic front-end. INTERSPEECH 2006
[c15]
dblp key: conf/interspeech/WolfelFIM06
persistent URL: https://dblp.org/rec/conf/interspeech/WolfelFIM06
Matthias Wölfel, Christian Fügen, Shajith Ikbal, John W. McDonough:
Multi-source far-distance microphone selection and combination for automatic transcription of lectures. INTERSPEECH 2006
[c14]
dblp key: conf/mlmi/FugenIKKLMOSW06
persistent URL: https://dblp.org/rec/conf/mlmi/FugenIKKLMOSW06
Christian Fügen, Shajith Ikbal, Florian Kraft, Ken'ichi Kumatani, Kornel Laskowski, John W. McDonough, Mari Ostendorf, Sebastian Stüker, Matthias Wölfel:
The ISL RT-06S Speech-to-Text System. MLMI 2006: 407-418
2005
[c13]
dblp key: conf/icassp/MetzeFPW05
persistent URL: https://dblp.org/rec/conf/icassp/MetzeFPW05
Florian Metze, Christian Fügen, Yue Pan, Alex Waibel:
Automatically Transcribing Meetings using Distant Microphones. ICASSP (1) 2005: 989-992
[c12]
dblp key: conf/interspeech/KohlerFSW05
persistent URL: https://dblp.org/rec/conf/interspeech/KohlerFSW05
Thilo Köhler, Christian Fügen, Sebastian Stüker, Alex Waibel:
Rapid porting of ASR-systems to mobile devices. INTERSPEECH 2005: 233-236
[c11]
dblp key: conf/interspeech/PaulikFSSSW05
persistent URL: https://dblp.org/rec/conf/interspeech/PaulikFSSSW05
Matthias Paulik, Christian Fügen, Sebastian Stüker, Tanja Schultz, Thomas Schaaf, Alex Waibel:
Document driven machine translation enhanced ASR. INTERSPEECH 2005: 2261-2264
2004
[c10]
dblp key: conf/icassp/WaibelSVFHKRS04
persistent URL: https://dblp.org/rec/conf/icassp/WaibelSVFHKRS04
Alex Waibel, Tanja Schultz, Stephan Vogel, Christian Fügen, Matthias Honal, Muntsin Kolss, Jürgen Reichert, Sebastian Stüker:
Towards language portability in statistical speech translation. ICASSP (3) 2004: 765-768
[c9]
dblp key: conf/icassp/SoltauYMFJJ04
persistent URL: https://dblp.org/rec/conf/icassp/SoltauYMFJJ04
Hagen Soltau, Hua Yu, Florian Metze, Christian Fügen, Qin Jin, Szu-Chen Stan Jou:
The 2003 ISL rich transcription system for conversational telephony speech. ICASSP (1) 2004: 773-776
[c8]
dblp key: conf/interspeech/FugenHW04
persistent URL: https://dblp.org/rec/conf/interspeech/FugenHW04
Christian Fügen, Hartwig Holzapfel, Alex Waibel:
Tight coupling of speech recognition and dialog management - dialog-context dependent grammar weighting for speech recognition. INTERSPEECH 2004: 169-172
[c7]
dblp key: conf/interspeech/SchultzJLPMF04
persistent URL: https://dblp.org/rec/conf/interspeech/SchultzJLPMF04
Tanja Schultz, Qin Jin, Kornel Laskowski, Yue Pan, Florian Metze, Christian Fügen:
Issues in meeting transcription - the ISL meeting transcription system. INTERSPEECH 2004: 1709-1712
[c6]
dblp key: conf/iros/StiefelhagenFGHNW04
persistent URL: https://dblp.org/rec/conf/iros/StiefelhagenFGHNW04
Rainer Stiefelhagen, Christian Fügen, Petra Gieselmann, Hartwig Holzapfel, Kai Nickel, Alex Waibel:
Natural human-robot interaction using speech, head pose and gestures. IROS 2004: 2422-2427
2002
[c5]
dblp key: conf/icassp/SoltauMFW02
persistent URL: https://dblp.org/rec/conf/icassp/SoltauMFW02
Hagen Soltau, Florian Metze, Christian Fügen, Alex Waibel:
Efficient language model lookahead through polymorphic linguistic context assignment. ICASSP 2002: 709-712
[c4]
dblp key: conf/icmi/HolzapfelFDW02
persistent URL: https://dblp.org/rec/conf/icmi/HolzapfelFDW02
Hartwig Holzapfel, Christian Fügen, Matthias Denecke, Alex Waibel:
Integrating Emotional Cues into a Framework for Dialogue Management. ICMI 2002: 141-148
[c3]
dblp key: conf/interspeech/KauersVFW02
persistent URL: https://dblp.org/rec/conf/interspeech/KauersVFW02
Manuel Kauers, Stephan Vogel, Christian Fügen, Alex Waibel:
Interlingua based statistical machine translation. INTERSPEECH 2002: 1909-1912
2001
[c2]
dblp key: conf/naacl/FugenWSSW01
persistent URL: https://dblp.org/rec/conf/naacl/FugenWSSW01
Christian Fügen, Martin Westphal, Mike Schneider, Tanja Schultz, Alex Waibel:
LingWear: A Mobile Tourist Information System. HLT 2001
2000
[c1]
dblp key: conf/icassp/FugenR00
persistent URL: https://dblp.org/rec/conf/icassp/FugenR00
Christian Fügen, Ivica Rogina:
Integrating dynamic speech modalities into context decision trees. ICASSP 2000: 1277-1280
