Khe Chai Sim
2020 – today
- 2024
- [c114]Khe Chai Sim, Zhouyuan Huo, Tsendsuren Munkhdalai, Nikhil Siddhartha, Adam Stooke, Zhong Meng, Bo Li, Tara N. Sainath:
A Comparison of Parameter-Efficient ASR Domain Adaptation Methods for Universal Speech and Language Models. ICASSP 2024: 6900-6904 - [c113]Shefali Garg, Zhouyuan Huo, Khe Chai Sim, Suzan Schwartz, Mason Chua, Alëna Aksënova, Tsendsuren Munkhdalai, Levi King, Darryl Wright, Zion Mengesha, Dongseong Hwang, Tara N. Sainath, Françoise Beaufays, Pedro Moreno Mengibar:
Improving Speech Recognition for African American English with Audio Classification. ICASSP 2024: 12356-12360 - [c112]Weiran Wang, Rohit Prabhavalkar, Haozhe Shan, Zhong Meng, Dongseong Hwang, Qiujia Li, Khe Chai Sim, Bo Li, James Qin, Xingyu Cai, Adam Stooke, Chengjian Zheng, Yanzhang He, Tara N. Sainath, Pedro Moreno Mengibar:
Massive End-to-end Speech Recognition Models with Time Reduction. NAACL-HLT 2024: 6206-6217 - [i26]Tsendsuren Munkhdalai, Youzheng Chen, Khe Chai Sim, Fadi Biadsy, Tara N. Sainath, Pedro Moreno Mengibar:
Hierarchical Recurrent Adapters for Efficient Multi-Task Adaptation of Large Speech Models. CoRR abs/2403.19709 (2024) - [i25]Dongseong Hwang, Weiran Wang, Zhuoyuan Huo, Khe Chai Sim, Pedro Moreno Mengibar:
TransformerFAM: Feedback attention is working memory. CoRR abs/2404.09173 (2024) - 2023
- [c111]Gan Song, Zelin Wu, Golan Pundak, Angad Chandorkar, Kandarp Joshi, Xavier Velez, Diamantino Caseiro, Ben Haynor, Weiran Wang, Nikhil Siddhartha, Pat Rondon, Khe Chai Sim:
Contextual Spelling Correction with Large Language Models. ASRU 2023: 1-8 - [c110]Zhouyuan Huo, Khe Chai Sim, Bo Li, Dongseong Hwang, Tara N. Sainath, Trevor Strohman:
Resource-Efficient Transfer Learning from Speech Foundation Model Using Hierarchical Feature Fusion. ICASSP 2023: 1-5 - [c109]Dongseong Hwang, Khe Chai Sim, Yu Zhang, Trevor Strohman:
Comparison of Soft and Hard Target RNN-T Distillation for Large-Scale ASR. ICASSP 2023: 1-5 - [c108]Bo Li, Dongseong Hwang, Zhouyuan Huo, Junwen Bai, Guru Prakash, Tara N. Sainath, Khe Chai Sim, Yu Zhang, Wei Han, Trevor Strohman, Françoise Beaufays:
Efficient Domain Adaptation for Speech Foundation Models. ICASSP 2023: 1-5 - [c107]Zelin Wu, Tsendsuren Munkhdalai, Pat Rondon, Golan Pundak, Khe Chai Sim, Christopher Li:
Dual-Mode NAM: Effective Top-K Context Injection for End-to-End ASR. INTERSPEECH 2023: 221-225 - [c106]Zhouyuan Huo, Khe Chai Sim, Dongseong Hwang, Tsendsuren Munkhdalai, Tara N. Sainath, Pedro Moreno Mengibar:
Re-investigating the Efficient Transfer Learning of Speech Foundation Model using Feature Fusion Methods. INTERSPEECH 2023: 556-560 - [i24]Bo Li, Dongseong Hwang, Zhouyuan Huo, Junwen Bai, Guru Prakash, Tara N. Sainath, Khe Chai Sim, Yu Zhang, Wei Han, Trevor Strohman, Françoise Beaufays:
Efficient Domain Adaptation for Speech Foundation Models. CoRR abs/2302.01496 (2023) - [i23]Dongseong Hwang, Changwan Ryu, Khe Chai Sim:
Edit Distance based RL for RNNT decoding. CoRR abs/2306.01789 (2023) - [i22]Shefali Garg, Zhouyuan Huo, Khe Chai Sim, Suzan Schwartz, Mason Chua, Alëna Aksënova, Tsendsuren Munkhdalai, Levi King, Darryl Wright, Zion Mengesha, Dongseong Hwang, Tara N. Sainath, Françoise Beaufays, Pedro Moreno Mengibar:
Improving Speech Recognition for African American English With Audio Classification. CoRR abs/2309.09996 (2023) - [i21]Weiran Wang, Rohit Prabhavalkar, Dongseong Hwang, Qiujia Li, Khe Chai Sim, Bo Li, James Qin, Xingyu Cai, Adam Stooke, Zhong Meng, CJ Zheng, Yanzhang He, Tara N. Sainath, Pedro Moreno Mengibar:
Massive End-to-end Models for Short Search Queries. CoRR abs/2309.12963 (2023) - [i20]Weiran Wang, Zelin Wu, Diamantino Caseiro, Tsendsuren Munkhdalai, Khe Chai Sim, Pat Rondon, Golan Pundak, Gan Song, Rohit Prabhavalkar, Zhong Meng, Ding Zhao, Tara N. Sainath, Pedro Moreno Mengibar:
Contextual Biasing with the Knuth-Morris-Pratt Matching Algorithm. CoRR abs/2310.00178 (2023) - [i19]Liam Collins, Shanshan Wu, Sewoong Oh, Khe Chai Sim:
Profit: Benchmarking Personalization and Robustness Trade-off in Federated Prompt Tuning. CoRR abs/2310.04627 (2023) - 2022
- [j13]Yu Zhang, Daniel S. Park, Wei Han, James Qin, Anmol Gulati, Joel Shor, Aren Jansen, Yuanzhong Xu, Yanping Huang, Shibo Wang, Zongwei Zhou, Bo Li, Min Ma, William Chan, Jiahui Yu, Yongqiang Wang, Liangliang Cao, Khe Chai Sim, Bhuvana Ramabhadran, Tara N. Sainath, Françoise Beaufays, Zhifeng Chen, Quoc V. Le, Chung-Cheng Chiu, Ruoming Pang, Yonghui Wu:
BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition. IEEE J. Sel. Top. Signal Process. 16(6): 1519-1532 (2022) - [c105]Junwen Bai, Bo Li, Yu Zhang, Ankur Bapna, Nikhil Siddhartha, Khe Chai Sim, Tara N. Sainath:
Joint Unsupervised and Supervised Training for Multilingual ASR. ICASSP 2022: 6402-6406 - [c104]Dongseong Hwang, Ananya Misra, Zhouyuan Huo, Nikhil Siddhartha, Shefali Garg, David Qiu, Khe Chai Sim, Trevor Strohman, Françoise Beaufays, Yanzhang He:
Large-Scale ASR Domain Adaptation Using Self- and Semi-Supervised Learning. ICASSP 2022: 6627-6631 - [c103]Tsendsuren Munkhdalai, Khe Chai Sim, Angad Chandorkar, Fan Gao, Mason Chua, Trevor Strohman, Françoise Beaufays:
Fast Contextual Adaptation with Neural Associative Memory for On-Device Personalized Speech Recognition. ICASSP 2022: 6632-6636 - [c102]Theresa Breiner, Swaroop Ramaswamy, Ehsan Variani, Shefali Garg, Rajiv Mathews, Khe Chai Sim, Kilol Gupta, Mingqing Chen, Lara McConnaughey:
UserLibri: A Dataset for ASR Personalization Using Only Text. INTERSPEECH 2022: 694-698 - [c101]Dongseong Hwang, Khe Chai Sim, Zhouyuan Huo, Trevor Strohman:
Pseudo Label Is Better Than Human Label. INTERSPEECH 2022: 1421-1425 - [c100]Golan Pundak, Tsendsuren Munkhdalai, Khe Chai Sim:
On-the-fly ASR Corrections with Audio Exemplars. INTERSPEECH 2022: 3148-3152 - [c99]Zhouyuan Huo, Dongseong Hwang, Khe Chai Sim, Shefali Garg, Ananya Misra, Nikhil Siddhartha, Trevor Strohman, Françoise Beaufays:
Incremental Layer-Wise Self-Supervised Learning for Efficient Unsupervised Speech Domain Adaptation On Device. INTERSPEECH 2022: 4845-4849 - [c98]David Qiu, Tsendsuren Munkhdalai, Yanzhang He, Khe Chai Sim:
Context-Aware Neural Confidence Estimation for Rare Word Speech Recognition. SLT 2022: 31-37 - [c97]Tsendsuren Munkhdalai, Zelin Wu, Golan Pundak, Khe Chai Sim, Jiayang Li, Pat Rondon, Tara N. Sainath:
NAM+: Towards Scalable End-to-End Contextual Biasing for Adaptive ASR. SLT 2022: 190-196 - [c96]Adam Stooke, Khe Chai Sim, Mason Chua, Tsendsuren Munkhdalai, Trevor Strohman:
Internal Language Model Personalization of E2E Automatic Speech Recognition Using Random Encoder Features. SLT 2022: 213-220 - [i18]Dongseong Hwang, Khe Chai Sim, Zhouyuan Huo, Trevor Strohman:
Pseudo Label Is Better Than Human Label. CoRR abs/2203.12668 (2022) - [i17]Theresa Breiner, Swaroop Ramaswamy, Ehsan Variani, Shefali Garg, Rajiv Mathews, Khe Chai Sim, Kilol Gupta, Mingqing Chen, Lara McConnaughey:
UserLibri: A Dataset for ASR Personalization Using Only Text. CoRR abs/2207.00706 (2022) - [i16]Sandy Ritchie, You-Chi Cheng, Mingqing Chen, Rajiv Mathews, Daan van Esch, Bo Li, Khe Chai Sim:
Large vocabulary speech recognition for languages of Africa: multilingual modeling and self-supervised learning. CoRR abs/2208.03067 (2022) - [i15]Dongseong Hwang, Khe Chai Sim, Yu Zhang, Trevor Strohman:
Comparison of Soft and Hard Target RNN-T Distillation for Large-scale ASR. CoRR abs/2210.05793 (2022) - [i14]Zhouyuan Huo, Khe Chai Sim, Bo Li, Dongseong Hwang, Tara N. Sainath, Trevor Strohman:
Resource-Efficient Transfer Learning From Speech Foundation Model Using Hierarchical Feature Fusion. CoRR abs/2211.02712 (2022) - 2021
- [c95]Ananya Misra, Dongseong Hwang, Zhouyuan Huo, Shefali Garg, Nikhil Siddhartha, Arun Narayanan, Khe Chai Sim:
A Comparison of Supervised and Unsupervised Pre-Training of End-to-End Models. Interspeech 2021: 731-735 - [c94]Khe Chai Sim, Angad Chandorkar, Fan Gao, Mason Chua, Tsendsuren Munkhdalai, Françoise Beaufays:
Robust Continuous On-Device Personalization for Automatic Speech Recognition. Interspeech 2021: 1284-1288 - [i13]Katrin Tomanek, Françoise Beaufays, Julie Cattiau, Angad Chandorkar, Khe Chai Sim:
On-Device Personalization of Automatic Speech Recognition Models for Disordered Speech. CoRR abs/2106.10259 (2021) - [i12]Yu Zhang, Daniel S. Park, Wei Han, James Qin, Anmol Gulati, Joel Shor, Aren Jansen, Yuanzhong Xu, Yanping Huang, Shibo Wang, Zongwei Zhou, Bo Li, Min Ma, William Chan, Jiahui Yu, Yongqiang Wang, Liangliang Cao, Khe Chai Sim, Bhuvana Ramabhadran, Tara N. Sainath, Françoise Beaufays, Zhifeng Chen, Quoc V. Le, Chung-Cheng Chiu, Ruoming Pang, Yonghui Wu:
BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition. CoRR abs/2109.13226 (2021) - [i11]Zhouyuan Huo, Dongseong Hwang, Khe Chai Sim, Shefali Garg, Ananya Misra, Nikhil Siddhartha, Trevor Strohman, Françoise Beaufays:
Incremental Layer-wise Self-Supervised Learning for Efficient Speech Domain Adaptation On Device. CoRR abs/2110.00155 (2021) - [i10]Dongseong Hwang, Ananya Misra, Zhouyuan Huo, Nikhil Siddhartha, Shefali Garg, David Qiu, Khe Chai Sim, Trevor Strohman, Françoise Beaufays, Yanzhang He:
Large-scale ASR Domain Adaptation using Self- and Semi-supervised Learning. CoRR abs/2110.00165 (2021) - [i9]Tsendsuren Munkhdalai, Khe Chai Sim, Angad Chandorkar, Fan Gao, Mason Chua, Trevor Strohman, Françoise Beaufays:
Fast Contextual Adaptation with Neural Associative Memory for On-Device Personalized Speech Recognition. CoRR abs/2110.02220 (2021) - [i8]Junwen Bai, Bo Li, Yu Zhang, Ankur Bapna, Nikhil Siddhartha, Khe Chai Sim, Tara N. Sainath:
Joint Unsupervised and Supervised Training for Multilingual ASR. CoRR abs/2111.08137 (2021) - 2020
- [c93]Mary Gooneratne, Khe Chai Sim, Petr Zadrazil, Andreas Kabel, Françoise Beaufays, Giovanni Motta:
Low-Rank Gradient Approximation for Memory-Efficient on-Device Training of Deep Neural Network. ICASSP 2020: 3017-3021 - [i7]Mary Gooneratne, Khe Chai Sim, Petr Zadrazil, Andreas Kabel, Françoise Beaufays, Giovanni Motta:
Low-rank Gradient Approximation For Memory-Efficient On-device Training of Deep Neural Network. CoRR abs/2001.08885 (2020)
2010 – 2019
- 2019
- [c92]Khe Chai Sim, Leif Johnson, Giovanni Motta, Lillian Zhou, Françoise Beaufays, Arnaud Benard, Dhruv Guliani, Andreas Kabel, Nikhil Khare, Tamar Lucassen, Petr Zadrazil, Harry Zhang:
Personalization of End-to-End Speech Recognition on Mobile Devices for Named Entities. ASRU 2019: 23-30 - [c91]Jahn Heymann, Khe Chai Sim, Bo Li:
Improving CTC Using Stimulated Learning for Sequence Modeling. ICASSP 2019: 5701-5705 - [c90]Yanzhang He, Tara N. Sainath, Rohit Prabhavalkar, Ian McGraw, Raziel Alvarez, Ding Zhao, David Rybach, Anjuli Kannan, Yonghui Wu, Ruoming Pang, Qiao Liang, Deepti Bhatia, Yuan Shangguan, Bo Li, Golan Pundak, Khe Chai Sim, Tom Bagby, Shuo-Yiin Chang, Kanishka Rao, Alexander Gruenstein:
Streaming End-to-end Speech Recognition for Mobile Devices. ICASSP 2019: 6381-6385 - [c89]Khe Chai Sim, Petr Zadrazil, Françoise Beaufays:
An Investigation into On-Device Personalization of End-to-End Automatic Speech Recognition Models. INTERSPEECH 2019: 774-778 - [i6]Khe Chai Sim, Petr Zadrazil, Françoise Beaufays:
An Investigation Into On-device Personalization of End-to-end Automatic Speech Recognition Models. CoRR abs/1909.06678 (2019) - [i5]Khe Chai Sim, Françoise Beaufays, Arnaud Benard, Dhruv Guliani, Andreas Kabel, Nikhil Khare, Tamar Lucassen, Petr Zadrazil, Harry Zhang, Leif Johnson, Giovanni Motta, Lillian Zhou:
Personalization of End-to-end Speech Recognition On Mobile Devices For Named Entities. CoRR abs/1912.09251 (2019) - 2018
- [j12]Chunyang Wu, Mark J. F. Gales, Anton Ragni, Penny Karanasou, Khe Chai Sim:
Improving Interpretability and Regularization in Deep Learning. IEEE ACM Trans. Audio Speech Lang. Process. 26(2): 256-265 (2018) - [c88]Skanda Koppula, Khe Chai Sim, Kean K. Chin:
Understanding Recurrent Neural State Using Memory Signatures. ICASSP 2018: 2396-2400 - [c87]Bo Li, Tara N. Sainath, Khe Chai Sim, Michiel Bacchiani, Eugene Weinstein, Patrick Nguyen, Zhifeng Chen, Yonghui Wu, Kanishka Rao:
Multi-Dialect Speech Recognition with a Single Sequence-to-Sequence Model. ICASSP 2018: 4749-4753 - [c86]Lahiru Samarakoon, Brian Mak, Khe Chai Sim:
Learning Effective Factorized Hidden Layer Bases Using Student-Teacher Training for LSTM Acoustic Model Adaptation. ICASSP 2018: 5954-5958 - [c85]Khe Chai Sim, Arun Narayanan, Ananya Misra, Anshuman Tripathi, Golan Pundak, Tara N. Sainath, Parisa Haghani, Bo Li, Michiel Bacchiani:
Domain Adaptation Using Factorized Hidden Layer for Robust Automatic Speech Recognition. INTERSPEECH 2018: 892-896 - [c84]Arun Narayanan, Ananya Misra, Khe Chai Sim, Golan Pundak, Anshuman Tripathi, Mohamed Elfeky, Parisa Haghani, Trevor Strohman, Michiel Bacchiani:
Toward Domain-Invariant Speech Recognition via Large Scale Training. SLT 2018: 441-447 - [c83]Tom Bagby, Kanishka Rao, Khe Chai Sim:
Efficient Implementation of Recurrent Neural Network Transducer in Tensorflow. SLT 2018: 506-512 - [i4]Skanda Koppula, Khe Chai Sim, Kean K. Chin:
Understanding Recurrent Neural State Using Memory Signatures. CoRR abs/1802.03816 (2018) - [i3]Arun Narayanan, Ananya Misra, Khe Chai Sim, Golan Pundak, Anshuman Tripathi, Mohamed Elfeky, Parisa Haghani, Trevor Strohman, Michiel Bacchiani:
Toward domain-invariant speech recognition via large scale training. CoRR abs/1808.05312 (2018) - [i2]Yanzhang He, Tara N. Sainath, Rohit Prabhavalkar, Ian McGraw, Raziel Alvarez, Ding Zhao, David Rybach, Anjuli Kannan, Yonghui Wu, Ruoming Pang, Qiao Liang, Deepti Bhatia, Yuan Shangguan, Bo Li, Golan Pundak, Khe Chai Sim, Tom Bagby, Shuo-Yiin Chang, Kanishka Rao, Alexander Gruenstein:
Streaming End-to-end Speech Recognition For Mobile Devices. CoRR abs/1811.06621 (2018) - 2017
- [c82]Khe Chai Sim, Arun Narayanan, Tom Bagby, Tara N. Sainath, Michiel Bacchiani:
Improving the efficiency of forward-backward algorithm using batched computation in TensorFlow. ASRU 2017: 258-264 - [c81]Lahiru Samarakoon, Khe Chai Sim, Brian Mak:
An investigation into learning effective speaker subspaces for robust unsupervised DNN adaptation. ICASSP 2017: 5035-5039 - [c80]Bo Li, Tara N. Sainath, Arun Narayanan, Joe Caroselli, Michiel Bacchiani, Ananya Misra, Izhak Shafran, Hasim Sak, Golan Pundak, Kean K. Chin, Khe Chai Sim, Ron J. Weiss, Kevin W. Wilson, Ehsan Variani, Chanwoo Kim, Olivier Siohan, Mitchel Weintraub, Erik McDermott, Richard Rose, Matt Shannon:
Acoustic Modeling for Google Home. INTERSPEECH 2017: 399-403 - [c79]Lahiru Samarakoon, Brian Mak, Khe Chai Sim:
Learning Factorized Transforms for Unsupervised Adaptation of LSTM-RNN Acoustic Models. INTERSPEECH 2017: 744-748 - [c78]Khe Chai Sim, Arun Narayanan:
An Efficient Phone N-Gram Forward-Backward Computation Using Dense Matrix Multiplication. INTERSPEECH 2017: 1646-1650 - [p1]Khe Chai Sim, Yanmin Qian, Gautam Mantena, Lahiru Samarakoon, Souvik Kundu, Tian Tan:
Adaptation of Deep Neural Network Acoustic Models for Robust Automatic Speech Recognition. New Era for Robust Speech Recognition, Exploiting Deep Learning 2017: 219-243 - [i1]Bo Li, Tara N. Sainath, Khe Chai Sim, Michiel Bacchiani, Eugene Weinstein, Patrick Nguyen, Zhifeng Chen, Yonghui Wu, Kanishka Rao:
Multi-Dialect Speech Recognition With A Single Sequence-To-Sequence Model. CoRR abs/1712.01541 (2017) - 2016
- [j11]Khe Chai Sim:
Sensitivity-Characterised Activity Neurogram (SCAN) for Visualising and Understanding the Inner Workings of Deep Neural Network. IEICE Trans. Inf. Syst. 99-D(10): 2423-2430 (2016) - [j10]Lahiru Samarakoon, Khe Chai Sim:
Factorized Hidden Layer Adaptation for Deep Neural Network Based Acoustic Modeling. IEEE ACM Trans. Audio Speech Lang. Process. 24(12): 2241-2250 (2016) - [c77]Souvik Kundu, Gautam Mantena, Yanmin Qian, Tian Tan, Marc Delcroix, Khe Chai Sim:
Joint acoustic factor learning for robust deep neural network based automatic speech recognition. ICASSP 2016: 5025-5029 - [c76]Lahiru Samarakoon, Khe Chai Sim:
On combining i-vectors and discriminative adaptation methods for unsupervised speaker normalization in DNN acoustic models. ICASSP 2016: 5275-5279 - [c75]Tian Tan, Yanmin Qian, Dong Yu, Souvik Kundu, Liang Lu, Khe Chai Sim, Xiong Xiao, Yu Zhang:
Speaker-aware training of LSTM-RNNs for acoustic modelling. ICASSP 2016: 5280-5284 - [c74]Shawn Tan, Khe Chai Sim:
Towards implicit complexity control using variable-depth deep neural networks for automatic speech recognition. ICASSP 2016: 5965-5969 - [c73]Chunyang Wu, Penny Karanasou, Mark J. F. Gales, Khe Chai Sim:
Stimulated Deep Neural Network for Speech Recognition. INTERSPEECH 2016: 400-404 - [c72]Lahiru Samarakoon, Khe Chai Sim:
Subspace LHUC for Fast Adaptation of Deep Neural Network Acoustic Models. INTERSPEECH 2016: 1593-1597 - [c71]Souvik Kundu, Khe Chai Sim, Mark J. F. Gales:
Incorporating a Generative Front-End Layer to Deep Neural Network for Noise Robust Automatic Speech Recognition. INTERSPEECH 2016: 2359-2363 - [c70]Lahiru Samarakoon, Khe Chai Sim:
Multi-Attribute Factorized Hidden Layer Adaptation for DNN Acoustic Models. INTERSPEECH 2016: 3484-3488 - [c69]Animesh Prasad, Khe Chai Sim:
Microphone Distance Adaptation Using Cluster Adaptive Training for Robust Far Field Speech Recognition. INTERSPEECH 2016: 3823-3827 - [c68]Shawn Tan, Khe Chai Sim:
Learning utterance-level normalisation using Variational Autoencoders for robust automatic speech recognition. SLT 2016: 43-49 - [c67]Lahiru Samarakoon, Khe Chai Sim:
Low-rank bases for factorized hidden layer adaptation of DNN acoustic models. SLT 2016: 652-658 - [c66]Gautam Mantena, Khe Chai Sim:
Entropy-based pruning of hidden units to reduce DNN parameters. SLT 2016: 672-679 - 2015
- [c65]Khe Chai Sim:
On constructing and analysing an interpretable brain model for the DNN based on hidden activity patterns. ASRU 2015: 22-29 - [c64]Lahiru Samarakoon, Khe Chai Sim:
Learning factorized feature transforms for speaker normalization. ASRU 2015: 145-152 - [c63]Shawn Tan, Khe Chai Sim, Mark J. F. Gales:
Improving the interpretability of deep neural networks with stimulated learning. ASRU 2015: 617-623 - [c62]Hengguan Huang, Khe Chai Sim:
An investigation of augmenting speaker representations to improve speaker normalisation for DNN-based speech recognition. ICASSP 2015: 4610-4613 - 2014
- [j9]Shilin Liu, Khe Chai Sim:
Temporally Varying Weight Regression: A Semi-Parametric Trajectory Model for Automatic Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 22(1): 151-160 (2014) - [j8]Bo Li, Khe Chai Sim:
A Spectral Masking Approach to Noise-Robust Speech Recognition Using Deep Neural Networks. IEEE ACM Trans. Audio Speech Lang. Process. 22(8): 1296-1305 (2014) - [j7]Guangsen Wang, Khe Chai Sim:
Regression-Based Context-Dependent Modeling of Deep Neural Networks for Speech Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 22(11): 1660-1669 (2014) - [c61]Xuancong Wang, Hwee Tou Ng, Khe Chai Sim:
A Beam-Search Decoder for Disfluency Detection. COLING 2014: 1457-1467 - [c60]Xuancong Wang, Khe Chai Sim, Hwee Tou Ng:
Combining Punctuation and Disfluency Prediction: An Empirical Study. EMNLP 2014: 121-130 - [c59]Shilin Liu, Khe Chai Sim:
On combining DNN and GMM with unsupervised speaker adaptation for robust automatic speech recognition. ICASSP 2014: 195-199 - [c58]Bo Li, Khe Chai Sim:
An ideal hidden-activation mask for deep neural networks based noise-robust speech recognition. ICASSP 2014: 200-204 - [c57]Suliang Bu, Yanmin Qian, Khe Chai Sim, Yongbin You, Kai Yu:
Second order vector Taylor series based robust speech recognition. ICASSP 2014: 1769-1773 - [c56]Guangsen Wang, Khe Chai Sim:
Refinements of regression-based context-dependent modelling of deep neural networks for automatic speech recognition. ICASSP 2014: 3022-3026 - [c55]Bo Li, Khe Chai Sim:
Modeling long temporal contexts for robust DNN-based speech recognition. INTERSPEECH 2014: 353-357 - [c54]Shilin Liu, Khe Chai Sim:
Joint adaptation and adaptive training of TVWR for robust automatic speech recognition. INTERSPEECH 2014: 636-640 - [c53]Khe Chai Sim:
A multimodal stroke-based predictive input for efficient Chinese text entry on mobile devices. SLT 2014: 448-453 - 2013
- [c52]Zhiyan Duan, Haotian Fang, Bo Li, Khe Chai Sim, Ye Wang:
The NUS sung and spoken lyrics corpus: A quantitative comparison of singing and speech. APSIPA 2013: 1-9 - [c51]Guangsen Wang, Khe Chai Sim:
Context dependent acoustic keyword spotting using deep neural network. APSIPA 2013: 1-10 - [c50]Bo Li, Khe Chai Sim:
Improving robustness of deep neural networks via spectral masking for automatic speech recognition. ASRU 2013: 279-284 - [c49]Guangsen Wang, Khe Chai Sim:
Context-dependent modelling of deep neural network using logistic regression. ASRU 2013: 338-343 - [c48]Shilin Liu, Khe Chai Sim:
Multi-stream temporally varying weight regression for cross-lingual speech recognition. ASRU 2013: 434-439 - [c47]Khe Chai Sim:
Approximated Parallel Model Combination for efficient noise-robust speech recognition. ICASSP 2013: 7383-7387 - [c46]Bo Li, Khe Chai Sim:
Noise adaptive front-end normalization based on Vector Taylor Series for Deep Neural Networks in robust speech recognition. ICASSP 2013: 7408-7412 - [c45]Shilin Liu, Khe Chai Sim:
Parameter clustering for temporally varying weight regression for automatic speech recognition. INTERSPEECH 2013: 1796-1800 - [c44]Xiaoxuan Wang, Khe Chai Sim:
Integrating conditional random fields and joint multi-gram model with syllabic features for grapheme-to-phone conversion. INTERSPEECH 2013: 2321-2325 - [c43]Shilin Liu, Khe Chai Sim:
An investigation of temporally varying weight regression for noise robust speech recognition. INTERSPEECH 2013: 2963-2967 - [c42]Bo Li, Yu Tsao, Khe Chai Sim:
An investigation of spectral restoration algorithms for deep neural networks based noise robust speech recognition. INTERSPEECH 2013: 3002-3006 - 2012
- [c41]Khe Chai Sim:
Probabilistic Integration of Partial Lexical Information for Noise Robust Haptic Voice Recognition. ACL (1) 2012: 31-39 - [c40]Guangsen Wang, Khe Chai Sim:
An investigation of tied-mixture GMM based triphone state clustering. ICASSP 2012: 4717-4720 - [c39]Shilin Liu, Khe Chai Sim:
Implicit trajectory modelling using temporally varying weight regression for automatic speech recognition. ICASSP 2012: 4761-4764 - [c38]Khe Chai Sim, Shengdong Zhao, Kai Yu, Hank Liao:
ICMI'12 grand challenge: haptic voice recognition. ICMI 2012: 363-370 - [c37]Seungwhan Moon, Khe Chai Sim:
Design and implementation of the note-taking style haptic voice recognition for mobile devices. ICMI 2012: 533-538 - [c36]Guangsen Wang, Bo Li, Shilin Liu, Xuancong Wang, Xiaoxuan Wang, Khe Chai Sim:
Improving Mandarin predictive text input by augmenting pinyin initials with speech and tonal information. ICMI 2012: 545-550 - [c35]Khe Chai Sim:
Speak-as-you-swipe (SAYS): a multimodal interface combining speech and gesture keyboard synchronously for continuous mobile text entry. ICMI 2012: 555-560 - [c34]Xuancong Wang, Hwee Tou Ng, Khe Chai Sim:
Dynamic Conditional Random Fields for Joint Sentence Boundary and Punctuation Prediction. INTERSPEECH 2012: 1384-1387 - [c33]Bo Li, Khe Chai Sim:
A Two-stage Speaker Adaptation Approach for Subspace Gaussian Mixture Model based Nonnative Speech Recognition. INTERSPEECH 2012: 1772-1775 - [c32]Aisha S. Azim, Xiaoxuan Wang, Khe Chai Sim:
A Weighted Combination of Speech with Text-based Models for Arabic Diacritization. INTERSPEECH 2012: 2334-2337 - [c31]Yinsheng Zhou, Khe Chai Sim, Patsy Tan, Ye Wang:
MOGAT: mobile games with auditory training for children with cochlear implants. ACM Multimedia 2012: 429-438 - 2011
- [j6]Kong Aik Lee, Chang Huai You, Haizhou Li, Tomi Kinnunen, Khe Chai Sim:
Using Discrete Probabilities With Bhattacharyya Measure for SVM-Based Speaker Verification. IEEE ACM Trans. Audio Speech Lang. Process. 19(4): 861-870 (2011) - [c30]Khe Chai Sim, Minh-Thang Luong:
A Trajectory-based Parallel Model Combination with a unified static and dynamic parameter compensation for noisy speech recognition. ASRU 2011: 107-112 - [c29]Guangsen Wang, Khe Chai Sim:
Sequential Classification Criteria for NNs in Automatic Speech Recognition. INTERSPEECH 2011: 441-444 - [c28]Guangsen Wang, Khe Chai Sim:
Comparison of Smoothing Techniques for Robust Context Dependent Acoustic Modelling in Hybrid NN/HMM Systems. INTERSPEECH 2011: 457-460 - 2010
- [j5]Tee Kiah Chia, Khe Chai Sim, Haizhou Li, Hwee Tou Ng:
Statistical lattice-based spoken document retrieval. ACM Trans. Inf. Syst. 28(1): 2:1-2:30 (2010) - [j4]Namunu Chinthaka Maddage, Khe Chai Sim, Haizhou Li:
Word level automatic alignment of music and lyrics using vocal synthesis. ACM Trans. Multim. Comput. Commun. Appl. 6(3): 19:1-19:16 (2010) - [c27]Kong-Aik Lee, Haizhou Li, Chang Huai You, Tomi Kinnunen, Khe Chai Sim:
Discrete expected likelihood kernel for SVM-based speaker verification. EUSIPCO 2010: 591-595 - [c26]Khe Chai Sim:
A minimum variance asynchronous Detection Error Trade-off performance analysis for multi-class detection problems. ICASSP 2010: 4458-4461 - [c25]Khe Chai Sim, Kong-Aik Lee:
Adaptive score fusion using Weighted Logistic Linear Regression for spoken language recognition. ICASSP 2010: 5018-5021 - [c24]Khe Chai Sim:
Probabilistic state clustering using conditional random field for context-dependent acoustic modelling. INTERSPEECH 2010: 70-73 - [c23]Bo Li, Khe Chai Sim:
Comparison of discriminative input and output transformations for speaker adaptation in the hybrid NN/HMM systems. INTERSPEECH 2010: 526-529 - [c22]Bo Li, Khe Chai Sim:
Hidden logistic linear regression for support vector machine based phone verification. INTERSPEECH 2010: 2614-2617 - [c21]Khe Chai Sim, Shilin Liu:
Semi-parametric trajectory modelling using temporally varying feature mapping for speech recognition. INTERSPEECH 2010: 2982-2985 - [c20]Khe Chai Sim:
Haptic Voice Recognition: Augmenting speech modality with touch events for efficient speech recognition. SLT 2010: 73-78
2000 – 2009
- 2009
- [c19]Khe Chai Sim:
Discriminative Product-of-Expert acoustic mapping for cross-lingual phone recognition. ASRU 2009: 546-551 - [c18]Haizhou Li, Bin Ma, Kong-Aik Lee, Hanwu Sun, Donglai Zhu, Khe Chai Sim, Changhuai You, Rong Tong, Ismo Kärkkäinen, Chien-Lin Huang, Vladimir Pervouchine, Wu Guo, Yijie Li, Li-Rong Dai, Mohaddeseh Nosratighods, Tharmarajah Thiruvaran, Julien Epps, Eliathamby Ambikairajah, Chng Eng Siong, Tanja Schultz, Qin Jin:
The I4U system in NIST 2008 speaker recognition evaluation. ICASSP 2009: 4201-4204 - [c17]Khe Chai Sim, Haizhou Li:
Stream-based context-sensitive phone mapping for cross-lingual speech recognition. INTERSPEECH 2009: 3019-3022 - [c16]Khe Chai Sim:
Improving phone verification using state-level posterior features and support vector machine for automatic mispronunciation detection. SLaTE 2009: 133-136 - 2008
- [j3]Khe Chai Sim, Haizhou Li:
On Acoustic Diversification Front-End for Spoken Language Identification. IEEE Trans. Speech Audio Process. 16(5): 1029-1037 (2008) - [c15]Khe Chai Sim, Haizhou Li:
Robust phone set mapping using decision tree clustering for cross-lingual phone recognition. ICASSP 2008: 4309-4312 - [c14]Khe Chai Sim, Haizhou Li:
Context-sensitive probabilistic phone mapping model for cross-lingual speech recognition. INTERSPEECH 2008: 2715-2718 - [c13]Haizhou Li, Bin Ma, Kong-Aik Lee, Khe Chai Sim, Hanwu Sun, Rong Tong, Donglai Zhu, Changhuai You:
NIST 2007 Language Recognition Evaluation: From the Perspective of IIR. PACLIC 2008: 46-57 - [c12]Tee Kiah Chia, Khe Chai Sim, Haizhou Li, Hwee Tou Ng:
A lattice-based approach to query-by-example spoken document retrieval. SIGIR 2008: 363-370 - 2007
- [j2]Khe Chai Sim, Mark J. F. Gales:
Discriminative semi-parametric trajectory model for speech recognition. Comput. Speech Lang. 21(4): 669-687 (2007) - [c11]Haizhou Li, Khe Chai Sim, Jin-Shea Kuo, Minghui Dong:
Semantic Transliteration of Personal Names. ACL 2007 - [c10]Marcus Tomalin, Mark J. F. Gales, X. Andrew Liu, Khe Chai Sim, Rohit Sinha, Lan Wang, Philip C. Woodland, Kai Yu:
Improving Speech Transcription for Mandarin-English Translation. ICASSP (4) 2007: 97-100 - [c9]Khe Chai Sim, William J. Byrne, Mark J. F. Gales, Hichem Sahbi, Philip C. Woodland:
Consensus Network Decoding for Statistical Machine Translation System Combination. ICASSP (4) 2007: 105-108 - [c8]Khe Chai Sim, Haizhou Li:
Fusion of contrastive acoustic models for parallel phonotactic spoken language identification. INTERSPEECH 2007: 170-173 - 2006
- [j1]Khe Chai Sim, Mark J. F. Gales:
Minimum phone error training of precision matrix models. IEEE Trans. Speech Audio Process. 14(3): 882-889 (2006) - [c7]Rohit Sinha, Mark J. F. Gales, Do Yeong Kim, X. Andrew Liu, Khe Chai Sim, Philip C. Woodland:
The CU-HTK Mandarin Broadcast News Transcription System. ICASSP (1) 2006: 1077-1080 - 2005
- [c6]Khe Chai Sim, Mark J. F. Gales:
Adaptation of Precision Matrix Models on Large Vocabulary Continuous Speech Recognition. ICASSP (1) 2005: 97-100 - [c5]Mark J. F. Gales, Bin Jia, X. Andrew Liu, Khe Chai Sim, Philip C. Woodland, Kai Yu:
Development of the CUHTK 2004 Mandarin Conversational Telephone Speech Transcription System. ICASSP (1) 2005: 841-844 - [c4]Xunying Liu, Mark J. F. Gales, Khe Chai Sim, Kai Yu:
Investigation of Acoustic Modeling Techniques for LVCSR Systems. ICASSP (1) 2005: 849-852 - [c3]Do Yeong Kim, Ho Yin Chan, Gunnar Evermann, Mark J. F. Gales, David Mrva, Khe Chai Sim, Philip C. Woodland:
Development of the CU-HTK 2004 Broadcast News Transcription Systems. ICASSP (1) 2005: 861-864 - [c2]Khe Chai Sim, Mark J. F. Gales:
Temporally varying model parameters for large vocabulary continuous speech recognition. INTERSPEECH 2005: 2137-2140 - 2004
- [c1]Khe Chai Sim, Mark J. F. Gales:
Basis superposition precision matrix modelling for large vocabulary continuous speech recognition. ICASSP (1) 2004: 801-804
last updated on 2024-08-30 20:43 CEST by the dblp team
all metadata released as open data under CC0 1.0 license