default search action
Tamás Gábor Csapó
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [i22]Milán András Fodor, Tamás Gábor Csapó, Frigyes Viktor Arthur:
Towards Decoding Brain Activity During Passive Listening of Speech. CoRR abs/2402.16996 (2024) - 2023
- [j9]Ali Raheem Mandeel, Mohammed Salah Al-Radhi, Tamás Gábor Csapó:
Investigations on speaker adaptation using a continuous vocoder within recurrent neural network based text-to-speech synthesis. Multim. Tools Appl. 82(10): 15635-15649 (2023) - [j8]Bruce Denby, Tamás Gábor Csapó, Michael Wand:
Future Speech Interfaces with Sensors and Machine Intelligence. Sensors 23(4): 1971 (2023) - [c53]Peter Mayer, Katharina Werner, Mohammed Salah Al-Radhi, Tamás Gábor Csapó, Bálint Czeba, Géza Németh, Ana Patrícia Rocha, Ilídio Castro Oliveira, Samuel S. Silva, Melinda Szeker, António J. S. Teixeira, Paul Panek:
Concept and Pictogram-Based User-Interface Design of a Helper Tool for People with Aphasia. dHealth 2023: 77-82 - [c52]Kang You, Bo Liu, Kele Xu, Yunsheng Xiong, Qisheng Xu, Ming Feng, Tamás Gábor Csapó, Boqing Zhu:
Raw Ultrasound-Based Phonetic Segments Classification Via Mask Modeling. ICASSP 2023: 1-5 - [c51]Tamás Gábor Csapó, Frigyes Viktor Arthur, Péter Nagy, Ádám Boncz:
Towards Ultrasound Tongue Image prediction from EEG during speech production. INTERSPEECH 2023: 1164-1168 - [c50]László Tóth, Amin Honarmandi Shandiz, Gábor Gosztolya, Tamás Gábor Csapó:
Adaptation of Tongue Ultrasound-Based Silent Speech Interfaces Using Spatial Transformer Networks. INTERSPEECH 2023: 1169-1173 - [c49]Tamás Gábor Csapó, Frigyes Viktor Arthur, Péter Nagy, Ádám Boncz:
Comparison of acoustic-to-articulatory and brain-to-articulatory mapping during speech production using ultrasound tongue imaging and EEG. SMM 2023 - [c48]Mohammed Salah Al-Radhi, Omnia Ibrahim, Ali Raheem Mandeel, Tamás Gábor Csapó, Géza Németh:
Advancing Limited Data Text-to-Speech Synthesis: Non-Autoregressive Transformer for High-Quality Parallel Synthesis. SpeD 2023: 152-157 - [c47]Ali Raheem Mandeel, Mohammed Salah Al-Radhi, Tamás Gábor Csapó:
Enhancing End-to-End Speech Synthesis by Modeling Interrogative Sentences with Speaker Adaptation. SpeD 2023: 158-163 - [c46]Ali Raheem Mandeel, Mohammed Salah Al-Radhi, Tamás Gábor Csapó:
Modeling Irregular Voice in End-to-End Speech Synthesis via Speaker Adaptation. SpeD 2023: 170-175 - [c45]Mohammed Salah Al-Radhi, Tamás Gábor Csapó, Géza Németh:
Nonparallel Expressive TTS for Unseen Target Speaker using Style-Controlled Adaptive Layer and Optimized Pitch Embedding. SpeD 2023: 176-181 - [c44]Ibrahim Ibrahimov, Gábor Gosztolya, Tamás Gábor Csapó:
Data Augmentation Methods on Ultrasound Tongue Images for Articulation-to-Speech Synthesis. SSW 2023: 230-235 - [i21]László Tóth, Amin Honarmandi Shandiz, Gábor Gosztolya, Tamás Gábor Csapó:
Adaptation of Tongue Ultrasound-Based Silent Speech Interfaces Using Spatial Transformer Networks. CoRR abs/2305.19130 (2023) - [i20]Tamás Gábor Csapó, Frigyes Viktor Arthur, Péter Nagy, Ádám Boncz:
Towards Ultrasound Tongue Image prediction from EEG during speech production. CoRR abs/2306.05374 (2023) - 2022
- [j7]Tamás Gábor Csapó, Gábor Gosztolya, László Tóth, Amin Honarmandi Shandiz, Alexandra Markó:
Optimizing the Ultrasound Tongue Image Representation for Residual Network-Based Articulatory-to-Acoustic Mapping. Sensors 22(22): 8601 (2022) - [c43]Mohammed Salah Al-Radhi, Tamás Gábor Csapó, Csaba Zainkó, Géza Németh:
Towards Parametric Speech Synthesis Using Gaussian-Markov Model of Spectral Envelope and Wavelet-Based Decomposition of F0. EUSIPCO 2022: 1150-1154 - [i19]Mohammed Salah Al-Radhi, Tamás Gábor Csapó, Csaba Zainkó, Géza Németh:
Towards Parametric Speech Synthesis Using Gaussian-Markov Model of Spectral Envelope and Wavelet-Based Decomposition of F0. CoRR abs/2208.07122 (2022) - 2021
- [j6]Mohammed Salah Al-Radhi, Tamás Gábor Csapó, Géza Németh:
Noise and acoustic modeling with waveform generator in text-to-speech and neutral speech conversion. Multim. Tools Appl. 80(2): 1969-1994 (2021) - [c42]Amin Honarmandi Shandiz, László Tóth, Gábor Gosztolya, Alexandra Markó, Tamás Gábor Csapó:
Improving Neural Silent Speech Interface Models by Adversarial Training. AICV 2021: 430-440 - [c41]Frigyes Viktor Arthur, Tamás Gábor Csapó:
Towards a Practical Lip-to-Speech Conversion System Using Deep Neural Networks and Mobile Application Frontend. AICV 2021: 441-450 - [c40]Amin Honarmandi Shandiz, László Tóth, Gábor Gosztolya, Alexandra Markó, Tamás Gábor Csapó:
Neural Speaker Embeddings for Ultrasound-Based Silent Speech Interfaces. Interspeech 2021: 1932-1936 - [c39]Mohammed Salah Al-Radhi, Tamás Gábor Csapó, Csaba Zainkó, Géza Németh:
Continuous Wavelet Vocoder-Based Decomposition of Parametric Speech Waveform Synthesis. Interspeech 2021: 2212-2216 - [c38]Ali Raheem Mandeel, Mohammed Salah Al-Radhi, Tamás Gábor Csapó:
Speaker Adaptation with Continuous Vocoder-Based DNN-TTS. SPECOM 2021: 407-416 - [c37]Pengyu Dai, Mohammed Salah Al-Radhi, Tamás Gábor Csapó:
Effects of F0 Estimation Algorithms on Ultrasound-Based Silent Speech Interfaces. SpeD 2021: 47-51 - [c36]Tamás Gábor Csapó:
Extending Text-to-Speech Synthesis with Articulatory Movement Prediction using Ultrasound Tongue Imaging. SSW 2021: 7-12 - [c35]Tamás Gábor Csapó, László Tóth, Gábor Gosztolya, Alexandra Markó:
Speech Synthesis from Text and Ultrasound Tongue Image-based Articulatory Input. SSW 2021: 31-36 - [c34]Csaba Zainkó, László Tóth, Amin Honarmandi Shandiz, Gábor Gosztolya, Alexandra Markó, Géza Németh, Tamás Gábor Csapó:
Adaptation of Tacotron2-based Text-To-Speech for Articulatory-to-Acoustic Mapping using Ultrasound Tongue Imaging. SSW 2021: 54-59 - [i18]Kele Xu, Tamás Gábor Csapó, Ming Feng:
Convolutional Neural Network-Based Age Estimation Using B-Mode Ultrasound Tongue Image. CoRR abs/2101.11245 (2021) - [i17]Amin Honarmandi Shandiz, László Tóth, Gábor Gosztolya, Alexandra Markó, Tamás Gábor Csapó:
Improving Neural Silent Speech Interface Models by Adversarial Training. CoRR abs/2104.11601 (2021) - [i16]Frigyes Viktor Arthur, Tamás Gábor Csapó:
Towards a practical lip-to-speech conversion system using deep neural networks and mobile application frontend. CoRR abs/2104.14467 (2021) - [i15]Amin Honarmandi Shandiz, László Tóth, Gábor Gosztolya, Alexandra Markó, Tamás Gábor Csapó:
Neural Speaker Embeddings for Ultrasound-based Silent Speech Interfaces. CoRR abs/2106.04552 (2021) - [i14]Mohammed Salah Al-Radhi, Tamás Gábor Csapó, Csaba Zainkó, Géza Németh:
Continuous Wavelet Vocoder-based Decomposition of Parametric Speech Waveform Synthesis. CoRR abs/2106.06863 (2021) - [i13]Mohammed Salah Al-Radhi, Tamás Gábor Csapó, Géza Németh:
Advances in Speech Vocoding for Text-to-Speech with Continuous Parameters. CoRR abs/2106.10481 (2021) - [i12]Tamás Gábor Csapó, László Tóth, Gábor Gosztolya, Alexandra Markó:
Speech Synthesis from Text and Ultrasound Tongue Image-based Articulatory Input. CoRR abs/2107.02003 (2021) - [i11]Tamás Gábor Csapó:
Extending Text-to-Speech Synthesis with Articulatory Movement Prediction using Ultrasound Tongue Imaging. CoRR abs/2107.05550 (2021) - [i10]Csaba Zainkó, László Tóth, Amin Honarmandi Shandiz, Gábor Gosztolya, Alexandra Markó, Géza Németh, Tamás Gábor Csapó:
Adaptation of Tacotron2-based Text-To-Speech for Articulatory-to-Acoustic Mapping using Ultrasound Tongue Imaging. CoRR abs/2107.12051 (2021) - [i9]Ali Raheem Mandeel, Mohammed Salah Al-Radhi, Tamás Gábor Csapó:
Speaker Adaptation with Continuous Vocoder-based DNN-TTS. CoRR abs/2108.01154 (2021) - 2020
- [j5]Mohammed Salah Al-Radhi, Omnia Abdo, Tamás Gábor Csapó, Sherif M. Abdou, Géza Németh, Mervat Fashal:
A continuous vocoder for statistical parametric speech synthesis and its evaluation using an audio-visual phonetically annotated Arabic corpus. Comput. Speech Lang. 60 (2020) - [j4]Mohammed Salah Al-Radhi, Tamás Gábor Csapó, Géza Németh:
Continuous Noise Masking Based Vocoder for Statistical Parametric Speech Synthesis. IEICE Trans. Inf. Syst. 103-D(5): 1099-1107 (2020) - [c33]Tamás Gábor Csapó:
Speaker Dependent Articulatory-to-Acoustic Mapping Using Real-Time MRI of the Vocal Tract. INTERSPEECH 2020: 2722-2726 - [c32]Tamás Gábor Csapó, Csaba Zainkó, László Tóth, Gábor Gosztolya, Alexandra Markó:
Ultrasound-Based Articulatory-to-Acoustic Mapping with WaveGlow Speech Synthesis. INTERSPEECH 2020: 2727-2731 - [c31]Tamás Gábor Csapó:
Speaker Dependent Acoustic-to-Articulatory Inversion Using Real-Time MRI of the Vocal Tract. INTERSPEECH 2020: 3720-3724 - [c30]Tamás Gábor Csapó, Kele Xu:
Quantification of Transducer Misalignment in Ultrasound Tongue Imaging. INTERSPEECH 2020: 3735-3739 - [i8]Tamás Gábor Csapó:
Speaker dependent articulatory-to-acoustic mapping using real-time MRI of the vocal tract. CoRR abs/2008.00889 (2020) - [i7]Tamás Gábor Csapó:
Speaker dependent acoustic-to-articulatory inversion using real-time MRI of the vocal tract. CoRR abs/2008.02098 (2020) - [i6]Tamás Gábor Csapó, Kele Xu:
Quantification of Transducer Misalignment in Ultrasound Tongue Imaging. CoRR abs/2008.02470 (2020) - [i5]Tamás Gábor Csapó, Csaba Zainkó, László Tóth, Gábor Gosztolya, Alexandra Markó:
Ultrasound-based Articulatory-to-Acoustic Mapping with WaveGlow Speech Synthesis. CoRR abs/2008.03152 (2020)
2010 – 2019
- 2019
- [j3]Mohammed Salah Al-Radhi, Tamás Gábor Csapó, Géza Németh:
Continuous vocoder applied in deep neural network based voice conversion. Multim. Tools Appl. 78(23): 33549-33572 (2019) - [c29]Mohammed Salah Al-Radhi, Tamás Gábor Csapó, Géza Németh:
RNN-based speech synthesis using a continuous sinusoidal model. IJCNN 2019: 1-8 - [c28]Gábor Gosztolya, Ádám Pintér, László Tóth, Tamás Grósz, Alexandra Markó, Tamás Gábor Csapó:
Autoencoder-Based Articulatory-to-Acoustic Mapping for Ultrasound Silent Speech Interfaces. IJCNN 2019: 1-8 - [c27]Dagoberto Porras, Alexander Sepúlveda-Sepúlveda, Tamás Gábor Csapó:
DNN-based Acoustic-to-Articulatory Inversion using Ultrasound Tongue Imaging. IJCNN 2019: 1-8 - [c26]Tamás Gábor Csapó, Mohammed Salah Al-Radhi, Géza Németh, Gábor Gosztolya, Tamás Grósz, László Tóth, Alexandra Markó:
Ultrasound-Based Silent Speech Interface Built on a Continuous Vocoder. INTERSPEECH 2019: 894-898 - [c25]Andrea Deme, Márton Bartók, Tekla Etelka Gráczi, Tamás Gábor Csapó, Alexandra Markó:
V-to-V Coarticulation Induced Acoustic and Articulatory Variability of Vowels: The Effect of Pitch-Accent. INTERSPEECH 2019: 3317-3321 - [c24]Alexandra Markó, Márton Bartók, Tamás Gábor Csapó, Tekla Etelka Gráczi, Andrea Deme:
Articulatory Analysis of Transparent Vowel /iː/ in Harmonic and Antiharmonic Hungarian Stems: Is There a Difference? INTERSPEECH 2019: 3327-3331 - [c23]Mohammed Salah Al-Radhi, Tamás Gábor Csapó, Géza Németh:
Parallel Voice Conversion Based on a Continuous Sinusoidal Model. SpeD 2019: 1-6 - [i4]Gábor Gosztolya, Ádám Pintér, László Tóth, Tamás Grósz, Alexandra Markó, Tamás Gábor Csapó:
Autoencoder-Based Articulatory-to-Acoustic Mapping for Ultrasound Silent Speech Interfaces. CoRR abs/1904.05259 (2019) - [i3]Mohammed Salah Al-Radhi, Tamás Gábor Csapó, Géza Németh:
RNN-based speech synthesis using a continuous sinusoidal model. CoRR abs/1904.06075 (2019) - [i2]Dagoberto Porras, Alexander Sepúlveda-Sepúlveda, Tamás Gábor Csapó:
DNN-based Acoustic-to-Articulatory Inversion using Ultrasound Tongue Imaging. CoRR abs/1904.06083 (2019) - [i1]Tamás Gábor Csapó, Mohammed Salah Al-Radhi, Géza Németh, Gábor Gosztolya, Tamás Grósz, László Tóth, Alexandra Markó:
Ultrasound-based Silent Speech Interface Built on a Continuous Vocoder. CoRR abs/1906.09885 (2019) - 2018
- [c22]Tamás Grósz, Gábor Gosztolya, László Tóth, Tamás Gábor Csapó, Alexandra Markó:
F0 Estimation for DNN-Based Ultrasound Silent Speech Interfaces. ICASSP 2018: 291-295 - [c21]László Tóth, Gábor Gosztolya, Tamás Grósz, Alexandra Markó, Tamás Gábor Csapó:
Multi-Task Learning of Speech Recognition and Speech Synthesis Parameters for Ultrasound-based Silent Speech Interfaces. INTERSPEECH 2018: 3172-3176 - [c20]Mohammed Salah Al-Radhi, Tamás Gábor Csapó, Géza Németh:
A Continuous Vocoder Using Sinusoidal Model for Statistical Parametric Speech Synthesis. SPECOM 2018: 11-20 - 2017
- [c19]Mohammed Salah Al-Radhi, Tamás Gábor Csapó, Géza Németh:
Time-Domain Envelope Modulating the Noise Component of Excitation in a Continuous Residual-Based Vocoder for Statistical Parametric Speech Synthesis. INTERSPEECH 2017: 434-438 - [c18]Tamás Gábor Csapó, Tamás Grósz, Gábor Gosztolya, László Tóth, Alexandra Markó:
DNN-Based Ultrasound-to-Speech Conversion for a Silent Speech Interface. INTERSPEECH 2017: 3672-3676 - [c17]Alexandra Markó, Andrea Deme, Márton Bartók, Tekla Etelka Gráczi, Tamás Gábor Csapó:
Word-Initial Irregular Phonation as a Function of Speech Rate and Vowel Quality in Hungarian. ISSP 2017: 134-145 - [c16]Mohammed Salah Al-Radhi, Tamás Gábor Csapó, Géza Németh:
Deep Recurrent Neural Networks in Speech Synthesis Using a Continuous Vocoder. SPECOM 2017: 282-291 - 2016
- [c15]Tamás Gábor Csapó, Géza Németh, Milos Cernak, Philip N. Garner:
Modeling unvoiced sounds in statistical parametric speech synthesis with a continuous vocoder. EUSIPCO 2016: 1338-1342 - [c14]Bálint Pál Tóth, Tamás Gábor Csapó:
Continuous fundamental frequency prediction with deep neural networks. EUSIPCO 2016: 1348-1352 - [c13]Milan Secujski, Branislav Gerazov, Tamás Gábor Csapó, Vlado Delic, Philip N. Garner, Aleksandar Gjoreski, David Guennec, Zoran A. Ivanovski, Aleksandar Melov, Géza Németh, Ana Stojkovic, György Szaszák:
Design of a Speech Corpus for Research on Cross-Lingual Prosody Transfer. SPECOM 2016: 199-206 - 2015
- [c12]Tamás Gábor Csapó, Géza Németh:
Automatic transformation of irregular to regular voice by residual analysis and synthesis. INTERSPEECH 2015: 613-617 - [c11]Kálmán Abari, Tamás Gábor Csapó, Bálint Pál Tóth, Gábor Olaszy:
From text to formants - indirect model for trajectory prediction based on a multi-speaker parallel speech database. INTERSPEECH 2015: 623-627 - [c10]Tamás Gábor Csapó, Steven M. Lulich:
Error analysis of extracted tongue contours from 2d ultrasound images. INTERSPEECH 2015: 2157-2161 - [c9]Tamás Gábor Csapó, Géza Németh, Milos Cernak:
Residual-Based Excitation with Continuous F0 Modeling in HMM-Based Speech Synthesis. SLSP 2015: 27-38 - 2014
- [b1]Tamás Gábor Csapó:
A gépi beszéd-előállítás természetességének növelése rejtett Markov-modell alapú szövegfelolvasó rendszerben. Budapest University of Technology and Economics, Hungary, 2014 - [j2]Tamás Gábor Csapó, Géza Németh:
Statistical parametric speech synthesis with a novel codebook-based excitation model. Intell. Decis. Technol. 8(4): 289-299 (2014) - [j1]Tamás Gábor Csapó, Géza Németh:
Modeling Irregular Voice in Statistical Parametric Speech Synthesis With Residual Codebook Based Excitation. IEEE J. Sel. Top. Signal Process. 8(2): 209-220 (2014) - 2013
- [c8]António J. S. Teixeira, Annika Hämäläinen, Jairo Avelar, Nuno Almeida, Géza Németh, Tibor Fegyó, Csaba Zainkó, Tamás Gábor Csapó, Bálint Tóth, André Oliveira, Miguel Sales Dias:
Speech-centric Multimodal Interaction for Easy-to-access Online Services - A Personal Life Assistant for the Elderly. DSAI 2013: 389-397 - [c7]Tamás Gábor Csapó, Géza Németh:
A novel irregular voice model for HMM-based speech synthesis. SSW 2013: 229-234 - 2012
- [c6]Tamás Gábor Csapó, Géza Németh:
A novel codebook-based excitation model for use in speech synthesis. CogInfoCom 2012: 661-665 - [c5]Éva Székely, Tamás Gábor Csapó, Bálint Tóth, Péter Mihajlik, Julie Carson-Berndsen:
Synthesizing expressive speech from amateur audiobook recordings. SLT 2012: 297-302 - 2011
- [c4]Tekla Etelka Gráczi, Steven M. Lulich, Tamás Gábor Csapó, András Beke:
Context and Speaker Dependency in the Relation of Vowel Formants and Subglottal Resonances - Evidence from Hungarian. INTERSPEECH 2011: 1901-1904 - 2010
- [c3]Csaba Zainkó, Tamás Gábor Csapó, Géza Németh:
Special Speech Synthesis for Social Network Websites. TSD 2010: 455-463
2000 – 2009
- 2009
- [c2]Tamás Gábor Csapó, Zsuzsanna Bárkányi, Tekla Etelka Gráczi, Tamás Bohm, Steven M. Lulich:
Relation of formants and subglottal resonances in Hungarian vowels. INTERSPEECH 2009: 484-487 - 2007
- [c1]Géza Németh, Márk Fék, Tamás Gábor Csapó:
Increasing prosodic variability of text-to-speech synthesizers. INTERSPEECH 2007: 474-477
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-07 21:21 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint