default search action
Cong-Thanh Do
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [i6]Cong-Thanh Do, Shuhei Imai, Rama Doddipatla, Thomas Hain:
Improving Accented Speech Recognition using Data Augmentation based on Unsupervised Text-to-Speech Synthesis. CoRR abs/2407.04047 (2024) - [i5]Mohan Li, Cong-Thanh Do, Simon Keizer, Youmna Farag, Svetlana Stoyanchev, Rama Doddipatla:
WHISMA: A Speech-LLM to Perform Zero-shot Spoken Language Understanding. CoRR abs/2408.16423 (2024) - 2023
- [c22]Mohan Li, Catalin Zorila, Cong-Thanh Do, Rama Doddipatla:
Towards a Unified End-to-End Language Understanding System for Speech and Text Inputs. ASRU 2023: 1-8 - [c21]Mohan Li, Cong-Thanh Do, Rama Doddipatla:
Cumulative Attention Based Streaming Transformer ASR with Internal Language Model Joint Training and Rescoring. ICASSP 2023: 1-5 - [c20]Cong-Thanh Do, Rama Doddipatla, Mohan Li, Thomas Hain:
Domain Adaptive Self-supervised Training of Automatic Speech Recognition. INTERSPEECH 2023: 4389-4393 - 2022
- [j8]Cong-Thanh Do, Tran Thien Dat Nguyen, Hoa Van Nguyen:
Robust multi-sensor generalized labeled multi-Bernoulli filter. Signal Process. 192: 108368 (2022) - [j7]Cong-Thanh Do, Tran Thien Dat Nguyen, Diluka Moratuwage, Changbeom Shim, Yon Dohn Chung:
Multi-object tracking with an adaptive generalized labeled multi-Bernoulli filter. Signal Process. 196: 108532 (2022) - [c19]Tran Thien Dat Nguyen, Cong-Thanh Do, Hoa Van Nguyen:
An Adaptive Multi-Sensor Generalised Labelled Multi-Bernoulli Filter for Linear Gaussian Models. ICCAIS 2022: 84-89 - [c18]Cong-Thanh Do, Mohan Li, Rama Doddipatla:
Multiple-hypothesis RNN-T Loss for Unsupervised Fine-tuning and Self-training of Neural Transducer. INTERSPEECH 2022: 4446-4450 - [i4]Cong-Thanh Do, Mohan Li, Rama Doddipatla:
Multiple-hypothesis RNN-T Loss for Unsupervised Fine-tuning and Self-training of Neural Transducer. CoRR abs/2207.14736 (2022) - 2021
- [c17]Shucong Zhang, Cong-Thanh Do, Rama Doddipatla, Erfan Loweimi, Peter Bell, Steve Renals:
Train Your Classifier First: Cascade Neural Networks Training from Upper Layers to Lower Layers. ICASSP 2021: 2750-2754 - [c16]Cong-Thanh Do, Rama Doddipatla, Thomas Hain:
Multiple-Hypothesis CTC-Based Semi-Supervised Adaptation of End-to-End Speech Recognition. ICASSP 2021: 6978-6982 - [c15]Jonah Ong, Du Yong Kim, Cong-Thanh Do:
A Tractable Multi-target Detection Model for Line-of-Sight Measurements. ICCAIS 2021: 147-152 - [i3]Shucong Zhang, Cong-Thanh Do, Rama Doddipatla, Erfan Loweimi, Peter Bell, Steve Renals:
Train your classifier first: Cascade Neural Networks Training from upper layers to lower layers. CoRR abs/2102.04697 (2021) - [i2]Cong-Thanh Do, Rama Doddipatla, Thomas Hain:
Multiple-hypothesis CTC-based semi-supervised adaptation of end-to-end speech recognition. CoRR abs/2103.15515 (2021) - 2020
- [c14]Cong-Thanh Do, Shucong Zhang, Thomas Hain:
Selective Adaptation of End-to-End Speech Recognition using Hybrid CTC/Attention Architecture for Noise Robustness. EUSIPCO 2020: 321-325 - [c13]Shucong Zhang, Cong-Thanh Do, Rama Doddipatla, Steve Renals:
Learning Noise Invariant Features Through Transfer Learning For Robust End-to-End Speech Recognition. ICASSP 2020: 7024-7028
2010 – 2019
- 2019
- [j6]Cong-Thanh Do, Hoa Van Nguyen:
Tracking Multiple Targets from Multistatic Doppler Radar with Unknown Probability of Detection. Sensors 19(7): 1672 (2019) - [j5]Cong-Thanh Do, Tran Thien Dat Nguyen, Weifeng Liu:
Tracking Multiple Marine Ships via Multiple Sensors with Unknown Backgrounds. Sensors 19(22): 5025 (2019) - [c12]Cong-Thanh Do:
Subband Temporal Envelope Features and Data Augmentation for End-to-end Recognition of Distant Conversational Speech. ICASSP 2019: 6251-6255 - [c11]Cong-Thanh Do, Tran Thien Dat Nguyen:
Multiple marine ships tracking from multistatic Doppler data with unknown clutter rate. ICCAIS 2019: 1-6 - [i1]Cong-Thanh Do:
End-to-End Speech Recognition with High-Frame-Rate Features Extraction. CoRR abs/1907.01957 (2019) - 2018
- [c10]Cong-Thanh Do, Hoa Van Nguyen:
Multistatic Doppler-Based Marine Ships Tracking. ICCAIS 2018: 151-156 - [c9]Cong-Thanh Do, Yannis Stylianou:
Weighting Time-Frequency Representation of Speech Using Auditory Saliency for Automatic Speech Recognition. INTERSPEECH 2018: 1591-1595 - 2017
- [c8]Cong-Thanh Do, Yannis Stylianou:
Improved Automatic Speech Recognition Using Subband Temporal Envelope Features and Time-Delay Neural Network Denoising Autoencoder. INTERSPEECH 2017: 3832-3836 - 2014
- [j4]Achintya Kumar Sarkar, Cong-Thanh Do, Viet Bac Le, Claude Barras:
Combination of Cepstral and Phonetically Discriminative Features for Speaker Verification. IEEE Signal Process. Lett. 21(9): 1040-1044 (2014) - [c7]Cong-Thanh Do, Marc Evrard, A. Leman, Christophe d'Alessandro, Albert Rilliard, J.-L. Crebouw:
Objective evaluation of HMM-based speech synthesis system using kullback-leibler divergence. INTERSPEECH 2014: 2952-2956 - [c6]Cong-Thanh Do, Lori Lamel, Jean-Luc Gauvain:
Speech-to-text development for Slovak, a low-resourced language. SLTU 2014: 176-182 - 2013
- [c5]Cong-Thanh Do, Claude Barras, Viet Bac Le, Achintya Kumar Sarkar:
Augmenting short-term cepstral features with long-term discriminative features for speaker verification of telephone data. INTERSPEECH 2013: 2484-2488 - 2012
- [j3]Cong-Thanh Do, Dominique Pastor, André Goalic:
A novel framework for noise robust ASR using cochlear implant-like spectrally reduced speech. Speech Commun. 54(1): 119-133 (2012) - [c4]Cong-Thanh Do, Claude Barras:
Cochlear implant-like processing of speech signal for speaker verification. SAPA@INTERSPEECH 2012: 17-21 - [c3]Cong-Thanh Do, Mohammad Javad Taghizadeh, Philip N. Garner:
Combining cepstral normalization and cochlear implant-like speech processing for microphone array-based speech recognition. SLT 2012: 137-142 - 2010
- [j2]Cong-Thanh Do, Dominique Pastor, André Goalic:
On the Recognition of Cochlear Implant-Like Spectrally Reduced Speech With MFCC and HMM-Based ASR. IEEE Trans. Speech Audio Process. 18(5): 1065-1068 (2010) - [j1]Cong-Thanh Do, Dominique Pastor, André Goalic:
On Normalized MSE Analysis of Speech Fundamental Frequency in the Cochlear Implant-Like Spectrally Reduced Speech. IEEE Trans. Biomed. Eng. 57(3): 572-577 (2010) - [c2]Cong-Thanh Do, Dominique Pastor, Gaël Le Lan, André Goalic:
Recognizing cochlear implant-like spectrally reduced speech with HMM-based ASR: experiments with MFCCs and PLP coefficients. INTERSPEECH 2010: 2634-2637
2000 – 2009
- 2009
- [c1]Cong-Thanh Do, Abdeldjalil Aïssa-El-Bey, Dominique Pastor, André Goalic:
Area of mouth opening estimation from speech acoustics using blind deconvolution technique. AVSP 2009: 80-85
Coauthor Index
aka: Rama Doddipatla
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-09-30 00:09 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint