


Остановите войну!
for scientists:


default search action
Chng Eng Siong
Engsiong Chng – Eng Siong Chng
Person information

- affiliation: Nanyang Technological University, Singapore
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2023
- [c232]Chen Chen, Yuchen Hu, Qiang Zhang, Heqing Zou, Beier Zhu, Eng Siong Chng:
Leveraging Modality-Specific Representations for Audio-Visual Speech Recognition via Reinforcement Learning. AAAI 2023: 12607-12615 - [c231]Changsong Liu, Thi-Nga Ho, Eng Siong Chng:
An Empirical Study on Punctuation Restoration for English, Mandarin, and Code-Switching Speech. ACIIDS (2) 2023: 286-296 - [c230]Chaiyasait Prachaseree, Kshitij Gupta, Thi-Nga Ho, Yizhou Peng, Kyaw Zin Tun, Eng Siong Chng, G. S. S. Chalapthi:
Adapting Code-Switching Language Models with Statistical-Based Text Augmentation. ACIIDS (2) 2023: 310-322 - [c229]Heqing Zou, Meng Shen, Chen Chen, Yuchen Hu, Deepu Rajan, Eng Siong Chng:
UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning. ACL (Findings) 2023: 659-672 - [c228]Yuchen Hu, Chen Chen, Ruizhe Li, Heqing Zou, Eng Siong Chng:
MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition. ACL (1) 2023: 11610-11625 - [c227]Yuchen Hu, Ruizhe Li, Chen Chen, Chengwei Qin, Qiu-Shi Zhu, Eng Siong Chng:
Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition. ACL (1) 2023: 15213-15232 - [c226]Yuchen Hu, Ruizhe Li, Chen Chen, Heqing Zou, Qiushi Zhu, Eng Siong Chng:
Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition. IJCAI 2023: 5076-5084 - [c225]Yachao Guo, Zhibin Qiu, Hao Huang, Chng Eng Siong:
Improved Keyword Recognition Based on Aho-Corasick Automaton. IJCNN 2023: 1-7 - [c224]Yuke Si, Yan Zhang, Yuhang Li, Xiaobao Wang, Longbiao Wang, Jianwu Dang, Eng Siong Chng, Haizhou Li:
Local and Global Context Modeling with Relation Matching Task for Dialog Act Recognition. IJCNN 2023: 1-8 - [c223]Tanmay Khandelwal, Rohan Kumar Das, Andrew Koh, Eng Siong Chng:
Leveraging Audio-Tagging Assisted Sound Event Detection using Weakified Strong Labels and Frequency Dynamic Convolutions. SSP 2023: 329-333 - [i72]Shangeth Rajaa, Kriti Anandan, Swaraj Dalmia, Tarun Gupta, Eng Siong Chng:
Improving Spoken Language Identification with Map-Mix. CoRR abs/2302.08229 (2023) - [i71]Alexey Sholokhov, Nikita Kuzmin, Kong Aik Lee, Eng Siong Chng:
Probabilistic Back-ends for Online Speaker Recognition and Clustering. CoRR abs/2302.09523 (2023) - [i70]Yuchen Hu, Chen Chen, Heqing Zou, Xionghu Zhong, Eng Siong Chng:
Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation. CoRR abs/2302.11131 (2023) - [i69]Yuchen Hu, Chen Chen, Ruizhe Li, Qiushi Zhu, Eng Siong Chng:
Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition. CoRR abs/2302.11362 (2023) - [i68]Chen Chen, Yuchen Hu, Heqing Zou, Linhui Sun, Eng Siong Chng:
Unsupervised Noise adaptation using Data Simulation. CoRR abs/2302.11981 (2023) - [i67]Chen Chen, Yuchen Hu, Weiwei Weng, Eng Siong Chng:
Metric-oriented Speech Enhancement using Diffusion Probabilistic Model. CoRR abs/2302.11989 (2023) - [i66]Dianwen Ng, Ruixi Zhang, Jia Qi Yip, Zhao Yang, Jinjie Ni, Chong Zhang, Yukun Ma, Chongjia Ni, Eng Siong Chng, Bin Ma:
deHuBERT: Disentangling Noise in a Self-supervised Model for Robust Speech Recognition. CoRR abs/2302.14597 (2023) - [i65]Yuchen Hu, Chen Chen, Qiushi Zhu, Eng Siong Chng:
Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR. CoRR abs/2304.04974 (2023) - [i64]Dianwen Ng, Ruixi Zhang, Jia Qi Yip, Chong Zhang, Yukun Ma, Trung Hieu Nguyen, Chongjia Ni, Eng Siong Chng, Bin Ma:
Contrastive Speech Mixup for Low-resource Keyword Spotting. CoRR abs/2305.01170 (2023) - [i63]Yuchen Hu, Ruizhe Li, Chen Chen, Heqing Zou, Qiushi Zhu, Eng Siong Chng:
Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition. CoRR abs/2305.09212 (2023) - [i62]Heqing Zou, Meng Shen, Chen Chen, Yuchen Hu, Deepu Rajan, Eng Siong Chng:
UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning. CoRR abs/2305.09299 (2023) - [i61]Zizheng Zhang, Chen Chen, Xiang Liu, Yuchen Hu, Eng Siong Chng:
Noise-aware Speech Separation with Contrastive Learning. CoRR abs/2305.10761 (2023) - [i60]Jia Qi Yip, Tuan Truong, Dianwen Ng, Chong Zhang
, Yukun Ma, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Eng Siong Chng, Bin Ma:
ACA-Net: Towards Lightweight Speaker Verification using Asymmetric Cross Attention. CoRR abs/2305.12121 (2023) - [i59]Leander Melroy Maben, Zixun Guo, Chen Chen, Utkarsh Chudiwal, Chng Eng Siong:
Study of GANs for Noisy Speech Simulation from Clean Speech. CoRR abs/2305.12460 (2023) - [i58]Chen Chen, Chao-Han Huck Yang, Kai Li, Yuchen Hu, Pin-Jui Ku, Eng Siong Chng:
A Neural State-Space Model Approach to Efficient Speech Separation. CoRR abs/2305.16932 (2023) - [i57]Yuchen Hu, Ruizhe Li, Chen Chen, Chengwei Qin, Qiushi Zhu, Eng Siong Chng:
Hearing Lips in Noise: Universal Viseme-Phoneme Mapping and Transfer for Robust Audio-Visual Speech Recognition. CoRR abs/2306.10563 (2023) - [i56]Yuchen Hu, Chen Chen, Ruizhe Li, Heqing Zou, Eng Siong Chng:
MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition. CoRR abs/2306.10567 (2023) - [i55]Yuchen Hu, Chen Chen, Ruizhe Li, Qiushi Zhu, Eng Siong Chng:
Noise-aware Speech Enhancement using Diffusion Probabilistic Model. CoRR abs/2307.08029 (2023) - [i54]Jia Qi Yip, Dianwen Ng, Bin Ma, Chng Eng Siong:
Analysis of Speech Separation Performance Degradation on Emotional Speech Mixtures. CoRR abs/2309.07458 (2023) - [i53]Ansh Mishra, Jia Qi Yip, Eng Siong Chng:
Codec Data Augmentation for Time-domain Heart Sound Classification. CoRR abs/2309.07466 (2023) - [i52]Dianwen Ng, Chong Zhang, Ruixi Zhang, Yukun Ma, Fabian Ritter Gutierrez, Trung Hieu Nguyen, Chongjia Ni, Shengkui Zhao, Eng Siong Chng, Bin Ma:
Are Soft Prompts Good Zero-shot Learners for Speech Recognition? CoRR abs/2309.09413 (2023) - 2022
- [j36]Hexin Liu
, Leibny Paola García-Perera
, Andy W. H. Khong
, Eng Siong Chng, Suzy J. Styles, Sanjeev Khudanpur
:
Efficient Self-Supervised Learning Representations for Spoken Language Identification. IEEE J. Sel. Top. Signal Process. 16(6): 1296-1307 (2022) - [j35]Lili Guo
, Longbiao Wang
, Jianwu Dang, Eng Siong Chng, Seiichi Nakagawa:
Learning affective representations based on magnitude and dynamic relative phase information for speech emotion recognition. Speech Commun. 136: 118-127 (2022) - [c222]Yang Xiao, Xubo Liu, James A. King, Arshdeep Singh, Eng Siong Chng, Mark D. Plumbley, Wenwu Wang:
Continual Learning for On-Ddevice Environmental Sound Classification. DCASE 2022 - [c221]Dianwen Ng, Yunqi Chen, Biao Tian, Qiang Fu, Eng Siong Chng:
Convmixer: Feature Interactive Convolution with Curriculum Learning for Small Footprint and Noisy Far-Field Keyword Spotting. ICASSP 2022: 3603-3607 - [c220]Chen Chen, Yuchen Hu, Nana Hou, Xiaofeng Qi, Heqing Zou, Eng Siong Chng:
Self-Critical Sequence Training for Automatic Speech Recognition. ICASSP 2022: 3688-3692 - [c219]Chen Chen, Nana Hou, Yuchen Hu, Shashank Shirol, Eng Siong Chng:
Noise-Robust Speech Recognition With 10 Minutes Unparalleled In-Domain Data. ICASSP 2022: 4298-4302 - [c218]Yuchen Hu, Nana Hou, Chen Chen, Eng Siong Chng:
Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition. ICASSP 2022: 6292-6296 - [c217]Fuzhao Xue, Aixin Sun, Hao Zhang
, Jinjie Ni, Eng Siong Chng:
An Embarrassingly Simple Model for Dialogue Relation Extraction. ICASSP 2022: 6707-6711 - [c216]Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li:
L-SpEx: Localized Target Speaker Extraction. ICASSP 2022: 7287-7291 - [c215]Heqing Zou, Yuke Si, Chen Chen, Deepu Rajan, Eng Siong Chng:
Speech Emotion Recognition with Co-Attention Based Multi-Level Acoustic Information. ICASSP 2022: 7367-7371 - [c214]Andrew Koh, Fuzhao Xue, Chng Eng Siong:
Automated Audio Captioning Using Transfer Learning and Reconstruction Latent Space Similarity Regularization. ICASSP 2022: 7722-7726 - [c213]Yizhou Peng, Jicheng Zhang, Haihua Xu, Hao Huang, Eng Siong Chng:
Minimum Word Error Training For Non-Autoregressive Transformer-Based Code-Switching ASR. ICASSP 2022: 7807-7811 - [c212]Tarun Gupta, Duc-Tuan Truong, Tran The Anh, Eng Siong Chng:
Estimation of speaker age and height from speech signal using bi-encoder transformer mixture model. INTERSPEECH 2022: 1978-1982 - [c211]Chen Chen, Nana Hou, Yuchen Hu, Heqing Zou, Xiaofeng Qi, Eng Siong Chng:
Interactive Auido-text Representation for Automated Audio Captioning with Contrastive Learning. INTERSPEECH 2022: 2773-2777 - [c210]Yang Xiao, Nana Hou, Eng Siong Chng:
Rainbow Keywords: Efficient Incremental Learning for Online Spoken Keyword Spotting. INTERSPEECH 2022: 3764-3768 - [c209]Zixun Guo, Chen Chen, Eng Siong Chng:
DENT-DDSP: Data-efficient noisy speech generator using differentiable digital signal processors for explicit distortion modelling and noise-robust speech recognition. INTERSPEECH 2022: 3799-3803 - [c208]Ao Zhang, Fan Yu, Kaixun Huang, Lei Xie, Longbiao Wang, Eng Siong Chng, Hui Bu, Binbin Zhang, Wei Chen, Xin Xu:
The ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge (ICSRC): Dataset, Tracks, Baseline and Results. ISCSLP 2022: 507-511 - [i51]Dianwen Ng, Yunqi Chen, Biao Tian, Qiang Fu, Eng Siong Chng:
ConvMixer: Feature Interactive Convolution with Curriculum Learning for Small Footprint and Noisy Far-field Keyword Spotting. CoRR abs/2201.05863 (2022) - [i50]Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li:
L-SpEx: Localized Target Speaker Extraction. CoRR abs/2202.09995 (2022) - [i49]Tarun Gupta, Duc-Tuan Truong, Tran The Anh, Chng Eng Siong:
Estimation of speaker age and height from speech signal using bi-encoder transformer mixture model. CoRR abs/2203.11774 (2022) - [i48]Yuchen Hu, Nana Hou, Chen Chen, Eng Siong Chng:
Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition. CoRR abs/2203.14838 (2022) - [i47]Chen Chen, Nana Hou, Yuchen Hu, Shashank Shirol, Eng Siong Chng:
Noise-robust Speech Recognition with 10 Minutes Unparalleled In-domain Data. CoRR abs/2203.15321 (2022) - [i46]Heqing Zou, Yuke Si, Chen Chen, Deepu Rajan, Eng Siong Chng:
Speech Emotion Recognition with Co-Attention based Multi-level Acoustic Information. CoRR abs/2203.15326 (2022) - [i45]Chen Chen, Nana Hou, Yuchen Hu, Heqing Zou, Xiaofeng Qi, Eng Siong Chng:
Interactive Audio-text Representation for Automated Audio Captioning with Contrastive Learning. CoRR abs/2203.15526 (2022) - [i44]Yang Xiao, Nana Hou, Eng Siong Chng:
Rainbow Keywords: Efficient Incremental Learning for Online Spoken Keyword Spotting. CoRR abs/2203.16361 (2022) - [i43]Dianwen Ng, Jin Hui Pang, Yang Xiao, Biao Tian, Qiang Fu, Eng Siong Chng:
Small Footprint Multi-channel ConvMixer for Keyword Spotting with Centroid Based Awareness. CoRR abs/2204.05445 (2022) - [i42]Chen Chen, Yuchen Hu, Nana Hou, Xiaofeng Qi, Heqing Zou, Eng Siong Chng:
Self-critical Sequence Training for Automatic Speech Recognition. CoRR abs/2204.06260 (2022) - [i41]Andrew Koh, Soham Tiwari, Chng Eng Siong:
Automated Audio Captioning with Epochal Difficult Captions for Curriculum Learning. CoRR abs/2206.01918 (2022) - [i40]Andrew Koh, Eng Siong Chng:
Language-Based Audio Retrieval with Converging Tied Layers and Contrastive Loss. CoRR abs/2206.14659 (2022) - [i39]Yizhou Peng, Yufei Liu, Jicheng Zhang, Haihua Xu, Yi He, Hao Huang, Eng Siong Chng:
Internal Language Model Estimation based Language Model Fusion for Cross-Domain Code-Switching Speech Recognition. CoRR abs/2207.04176 (2022) - [i38]Jicheng Zhang, Yizhou Peng, Haihua Xu, Yi He, Eng Siong Chng, Hao Huang:
Intermediate-layer output Regularization for Attention-based Speech Recognition with Shared Decoder. CoRR abs/2207.04177 (2022) - [i37]Yang Xiao, Xubo Liu, James A. King, Arshdeep Singh, Eng Siong Chng, Mark D. Plumbley, Wenwu Wang:
Continual Learning For On-Device Environmental Sound Classification. CoRR abs/2207.07429 (2022) - [i36]Zixun Guo, Chen Chen, Eng Siong Chng:
DENT-DDSP: Data-efficient noisy speech generator using differentiable digital signal processors for explicit distortion modelling and noise-robust speech recognition. CoRR abs/2208.00987 (2022) - [i35]Dianwen Ng, Jia Qi Yip, Tanmay Surana, Zhao Yang, Chong Zhang, Yukun Ma, Chongjia Ni, Eng Siong Chng, Bin Ma:
I2CR: Improving Noise Robustness on Keyword Spotting Using Inter-Intra Contrastive Regularization. CoRR abs/2209.06360 (2022) - [i34]Yuhang Yang, Haihua Xu, Hao Huang, Eng Siong Chng, Sheng Li
:
Speech-text based multi-modal training with bidirectional attention for improved speech recognition. CoRR abs/2211.00325 (2022) - [i33]Ao Zhang, Fan Yu, Kaixun Huang, Lei Xie, Longbiao Wang, Eng Siong Chng, Hui Bu, Binbin Zhang, Wei Chen, Xin Xu:
The ISCSLP 2022 Intelligent Cockpit Speech Recognition Challenge (ICSRC): Dataset, Tracks, Baseline and Results. CoRR abs/2211.01585 (2022) - [i32]Chen Chen, Yuchen Hu, Qiang Zhang, Heqing Zou, Beier Zhu, Eng Siong Chng:
Leveraging Modality-specific Representations for Audio-visual Speech Recognition via Reinforcement Learning. CoRR abs/2212.05301 (2022) - [i31]Abhinav Rao, Thi-Nga Ho, Eng Siong Chng:
Punctuation Restoration for Singaporean Spoken Languages: English, Malay, and Mandarin. CoRR abs/2212.05356 (2022) - 2021
- [c207]Fuzhao Xue, Aixin Sun, Hao Zhang
, Eng Siong Chng:
GDPNet: Refining Latent Multi-View Graph for Relation Extraction. AAAI 2021: 14194-14202 - [c206]Manav Kaushik, Van Tung Pham, Tran The Anh, Eng Siong Chng:
End-to-End Speaker Age and Height Estimation using Attention Mechanism and Triplet Loss. APSIPA ASC 2021: 1-8 - [c205]Duo Ma, Nana Hou, Van Tung Pham, Haihua Xu, Eng Siong Chng:
Multitask-based joint learning approach to robust ASR for radio communication speech. APSIPA ASC 2021: 497-502 - [c204]Chen Chen, Nana Hou, Duo Ma, Eng Siong Chng:
Time Domain Speech Enhancement With Attentive Multi-scale Approach. APSIPA ASC 2021: 679-683 - [c203]Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Aishan Wumaier, Eng Siong Chng:
Enriching Under-Represented Named Entities for Improved Speech Recognition. APSIPA ASC 2021: 1021-1025 - [c202]Yizhou Peng, Jicheng Zhang, Haobo Zhang, Haihua Xu, Hao Huang, Sheng Li, Eng Siong Chng:
Multilingual Approach to Joint Speech and Accent Recognition with DNN-HMM Framework. APSIPA ASC 2021: 1043-1048 - [c201]Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
A Unified Speaker Adaptation Approach for ASR. EMNLP (1) 2021: 9339-9349 - [c200]Nana Hou, Chenglin Xu, Eng Siong Chng, Haizhou Li:
Learning Disentangled Feature Representations for Speech Enhancement Via Adversarial Training. ICASSP 2021: 666-670 - [c199]Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li:
Multi-Stage Speaker Extraction with Utterance and Frame-Level Reference Signals. ICASSP 2021: 6109-6113 - [c198]Lili Guo, Longbiao Wang, Chenglin Xu, Jianwu Dang, Eng Siong Chng, Haizhou Li:
Representation Learning with Spectro-Temporal-Channel Attention for Speech Emotion Recognition. ICASSP 2021: 6304-6308 - [c197]Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
Preventing Early Endpointing for Online Automatic Speech Recognition. ICASSP 2021: 6813-6817 - [c196]Jicheng Zhang, Yizhou Peng, Van Tung Pham, Haihua Xu, Hao Huang, Eng Siong Chng:
E2E-Based Multi-Task Learning Approach to Joint Speech and Accent Recognition. Interspeech 2021: 1519-1523 - [c195]Weiguang Chen, Van Tung Pham, Eng Siong Chng, Xionghu Zhong:
Overlapped Speech Detection Based on Spectral and Spatial Feature Fusion. Interspeech 2021: 4189-4193 - [c194]Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Eng Siong Chng:
Approaches to Improving Recognition of Underrepresented Named Entities in Hybrid ASR Systems. ISCSLP 2021: 1-5 - [c193]Zhiping Zeng, Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Eng Siong Chng, Chongjia Ni, Bin Ma:
Leveraging Text Data Using Hybrid Transformer-LSTM Based End-to-End ASR in Transfer Learning. ISCSLP 2021: 1-5 - [i30]Manav Kaushik, Van Tung Pham, Eng Siong Chng:
End-to-End Speaker Height and age estimation using Attention Mechanism with LSTM-RNN. CoRR abs/2101.05056 (2021) - [i29]Duo Ma, Nana Hou, Van Tung Pham, Haihua Xu, Eng Siong Chng:
Multitask-Based Joint Learning Approach To Robust ASR For Radio Communication Speech. CoRR abs/2107.10701 (2021) - [i28]Andrew Koh, Fuzhao Xue, Eng Siong Chng:
Automated Audio Captioning using Transfer Learning and Reconstruction Latent Space Similarity Regularization. CoRR abs/2108.04692 (2021) - [i27]Yuchen Hu, Nana Hou, Chen Chen, Eng Siong Chng:
Interactive Feature Fusion for End-to-End Noise-Robust Speech Recognition. CoRR abs/2110.05267 (2021) - [i26]Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
A Unified Speaker Adaptation Approach for ASR. CoRR abs/2110.08545 (2021) - [i25]Shangeth Rajaa, Van Tung Pham, Chng Eng Siong:
Learning Speaker Representation with Semi-supervised Learning approach for Speaker Profiling. CoRR abs/2110.13653 (2021) - 2020
- [j34]Chenglin Xu
, Wei Rao
, Eng Siong Chng
, Haizhou Li
:
SpEx: Multi-Scale Time Domain Speaker Extraction Network. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1370-1384 (2020) - [c192]Boon Peng Yap, Andrew Koh, Eng Siong Chng:
Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences. EMNLP (Findings) 2020: 41-46 - [c191]Xiang Hao, Chenglin Xu, Nana Hou, Lei Xie, Eng Siong Chng, Haizhou Li:
Time-Domain Neural Network Approach for Speech Bandwidth Extension. ICASSP 2020: 866-870 - [c190]Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Zhiping Zeng, Eng Siong Chng, Chongjia Ni, Bin Ma, Haizhou Li:
Independent Language Modeling Architecture for End-To-End ASR. ICASSP 2020: 7059-7063 - [c189]Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
Speech Transformer with Speaker Aware Persistent Memory. INTERSPEECH 2020: 1261-1265 - [c188]Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li:
SpEx+: A Complete Time Domain Speaker Extraction Network. INTERSPEECH 2020: 1406-1410 - [c187]Haobo Zhang, Haihua Xu, Van Tung Pham, Hao Huang, Eng Siong Chng:
Monolingual Data Selection Analysis for English-Mandarin Hybrid Code-Switching Speech Recognition. INTERSPEECH 2020: 2392-2396 - [c186]Nana Hou, Chenglin Xu, Van Tung Pham, Joey Tianyi Zhou, Eng Siong Chng, Haizhou Li:
Speaker and Phoneme-Aware Speech Bandwidth Extension with Residual Dual-Path Network. INTERSPEECH 2020: 4064-4068 - [c185]Nana Hou, Chenglin Xu, Joey Tianyi Zhou, Eng Siong Chng, Haizhou Li:
Multi-Task Learning for End-to-End Noise-Robust Bandwidth Extension. INTERSPEECH 2020: 4069-4073 - [c184]Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
Universal Speech Transformer. INTERSPEECH 2020: 5021-5025 - [c183]Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq R. Joty, Eng Siong Chng, Bin Ma:
Cross Attention with Monotonic Alignment for Speech Transformer. INTERSPEECH 2020: 5031-5035 - [i24]Chenglin Xu, Wei Rao, Eng Siong Chng, Haizhou Li:
SpEx: Multi-Scale Time Domain Speaker Extraction Network. CoRR abs/2004.08326 (2020) - [i23]Chenglin Xu, Wei Rao, Eng Siong Chng, Haizhou Li:
Time-domain speaker extraction network. CoRR abs/2004.14762 (2020) - [i22]Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li:
SpEx+: A Complete Time Domain Speaker Extraction Network. CoRR abs/2005.04686 (2020) - [i21]Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Eng Siong Chng:
Approaches to Improving Recognition of Underrepresented Named Entities in Hybrid ASR Systems. CoRR abs/2005.08742 (2020) - [i20]Zhiping Zeng, Van Tung Pham, Haihua Xu, Yerbolat Khassanov, Eng Siong Chng, Chongjia Ni, Bin Ma:
Leveraging Text Data Using Hybrid Transformer-LSTM Based End-to-End ASR in Transfer Learning. CoRR abs/2005.10407 (2020) - [i19]Boon Peng Yap, Andrew Koh, Eng Siong Chng:
Adapting BERT for Word Sense Disambiguation with Gloss Selection Objective and Example Sentences. CoRR abs/2009.11795 (2020) - [i18]Yizhou Peng, Jicheng Zhang, Haobo Zhang, Haihua Xu, Hao Huang, Eng Siong Chng:
A multilingual approach to joint Speech and Accent Recognition with DNN-HMM framework. CoRR abs/2010.11483 (2020) - [i17]Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Aishan Wumaier, Eng Siong Chng:
Enriching Under-Represented Named-Entities To Improve Speech Recognition Performance. CoRR abs/2010.12143 (2020) - [i16]Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li:
Multi-stage Speaker Extraction with Utterance and Frame-Level Reference Signals. CoRR abs/2011.09624 (2020) - [i15]Fuzhao Xue, Aixin Sun, Hao Zhang, Eng Siong Chng:
GDPNet: Refining Latent Multi-View Graph for Relation Extraction. CoRR abs/2012.06780 (2020) - [i14]Fuzhao Xue, Aixin Sun, Hao Zhang, Eng Siong Chng:
An Embarrassingly Simple Model for Dialogue Relation Extraction. CoRR abs/2012.13873 (2020)
2010 – 2019
- 2019
- [c182]Thi-Ly Vu, Zhiping Zeng, Haihua Xu, Eng Siong Chng:
Audio Codec Simulation based Data Augmentation for Telephony Speech Recognition. APSIPA 2019: 198-203 - [c181]Karan Makhija, Thi-Nga Ho, Eng Siong Chng:
Transfer Learning for Punctuation Prediction. APSIPA 2019: 268-273 - [c180]Nana Hou, Chenglin Xu, Eng Siong Chng, Haizhou Li:
Domain Adversarial Training for Speech Enhancement. APSIPA 2019: 667-672 - [c179]Duo Ma, Guanyu Li, Haihua Xu, Eng Siong Chng:
Improving code-switching speech recognition with data augmentation and system combination. APSIPA 2019: 1308-1312 - [c178]Chenglin Xu, Wei Rao, Eng Siong Chng, Haizhou Li:
Time-Domain Speaker Extraction Network. ASRU 2019: 327-334 - [c177]Chenglin Xu, Wei Rao, Eng Siong Chng
, Haizhou Li
:
Optimization of Speaker Extraction Neural Network with Magnitude and Temporal Spectrum Approximation Loss. ICASSP 2019: 6990-6994 - [c176]