default search action

combined dblp search
author search
venue search
publication search

ask others

Zoltán Tüske

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2024
[c60]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/AlharbiATDAATIA24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/AlharbiATDAATIA24
Sadeen Alharbi, Areeb Alowisheq, Zoltán Tüske, Kareem Darwish, Abdullah Alrajeh, Abdulmajeed Alrowithi, Aljawharah Bin Tamran, Asma Ibrahim, Raghad Aloraini, Raneem Alnajim, Ranya Alkahtani, Renad Almuasaad, Sara Alrasheed, Shaykhah Alsubaie, Yaser Alonaizan:
SADA: Saudi Audio Dataset for Arabic. ICASSP 2024: 10286-10290
[i15]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-15594
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-15594
Jintao Jiang, Yingbo Gao, Mohammad Zeineldeen, Zoltán Tüske:
Alternating Weak Triphone/BPE Alignment Supervision from Hybrid Model Improves End-to-End ASR. CoRR abs/2402.15594 (2024)
2023
[c59]
- view
  authority control:
- export record
  dblp key:
  - conf/iwslt/BaharWIGMT23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwslt/BaharWIGMT23
Parnia Bahar, Patrick Wilken, Javier Iranzo-Sánchez, Mattia Di Gangi, Evgeny Matusov, Zoltán Tüske:
Speech Translation with Style: AppTek's Submissions to the IWSLT Subtitling and Formality Tracks in 2023. IWSLT@ACL 2023: 251-260
[i14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-14835
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-14835
Jintao Jiang, Yingbo Gao, Zoltán Tüske:
Weak Alignment Supervision from Hybrid Model Improves End-to-end ASR. CoRR abs/2311.14835 (2023)
2022
[c58]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/KuoTTKS22
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/KuoTTKS22
Hong-Kwang Jeff Kuo, Zoltán Tüske, Samuel Thomas, Brian Kingsbury, George Saon:
Improving End-to-end Models for Set Prediction in Spoken Language Understanding. ICASSP 2022: 7162-7166
[i13]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2201-12105
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-12105
Hong-Kwang Jeff Kuo, Zoltán Tüske, Samuel Thomas, Brian Kingsbury, George Saon:
Improving End-to-End Models for Set Prediction in Spoken Language Understanding. CoRR abs/2201.12105 (2022)
2021
[c57]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SaonTBK21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SaonTBK21
George Saon, Zoltán Tüske, Daniel Bolaños, Brian Kingsbury:
Advancing RNN Transducer Technology for Speech Recognition. ICASSP 2021: 5654-5658
[c56]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/MoraisK0TK21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/MoraisK0TK21
Edmilson da Silva Morais, Hong-Kwang Jeff Kuo, Samuel Thomas, Zoltán Tüske, Brian Kingsbury:
End-to-End Spoken Language Understanding Using Transformer Networks and Self-Supervised Pre-Trained Features. ICASSP 2021: 7483-7487
[c55]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/0001KSTKKKH21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/0001KSTKKKH21
Samuel Thomas, Hong-Kwang Jeff Kuo, George Saon, Zoltán Tüske, Brian Kingsbury, Gakuto Kurata, Zvi Kons, Ron Hoory:
RNN Transducer Models for Spoken Language Understanding. ICASSP 2021: 7493-7497
[c54]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Ganhotra0KJSTK21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Ganhotra0KJSTK21
Jatin Ganhotra, Samuel Thomas, Hong-Kwang Jeff Kuo, Sachindra Joshi, George Saon, Zoltán Tüske, Brian Kingsbury:
Integrating Dialog History into End-to-End Spoken Language Understanding Systems. Interspeech 2021: 1254-1258
[c53]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/CuiKSHT21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/CuiKSHT21
Xiaodong Cui, Brian Kingsbury, George Saon, David Haws, Zoltán Tüske:
Reducing Exposure Bias in Training Recurrent Neural Network Transducers. Interspeech 2021: 1802-1806
[c52]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KurataSKHT21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KurataSKHT21
Gakuto Kurata, George Saon, Brian Kingsbury, David Haws, Zoltán Tüske:
Improving Customization of Neural Transducers by Mitigating Acoustic Mismatch of Synthesized Audio. Interspeech 2021: 2027-2031
[c51]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TuskeSK21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TuskeSK21
Zoltán Tüske, George Saon, Brian Kingsbury:
On the Limit of English Conversational Speech Recognition. Interspeech 2021: 2062-2066
[c50]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/FasoliCSSWVSCK021
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/FasoliCSSWVSCK021
Andrea Fasoli, Chia-Yu Chen, Mauricio J. Serrano, Xiao Sun, Naigang Wang, Swagath Venkataramani, George Saon, Xiaodong Cui, Brian Kingsbury, Wei Zhang, Zoltán Tüske, Kailash Gopalakrishnan:
4-Bit Quantization of LSTM-Based Speech Recognition Models. Interspeech 2021: 2586-2590
[i12]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2103-09935
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-09935
George Saon, Zoltán Tüske, Daniel Bolaños, Brian Kingsbury:
Advancing RNN Transducer Technology for Speech Recognition. CoRR abs/2103.09935 (2021)
[i11]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2104-03842
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2104-03842
Samuel Thomas, Hong-Kwang Jeff Kuo, George Saon, Zoltán Tüske, Brian Kingsbury, Gakuto Kurata, Zvi Kons, Ron Hoory:
RNN Transducer Models For Spoken Language Understanding. CoRR abs/2104.03842 (2021)
[i10]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2105-00982
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2105-00982
Zoltán Tüske, George Saon, Brian Kingsbury:
On the limit of English conversational speech recognition. CoRR abs/2105.00982 (2021)
[i9]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2108-08405
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2108-08405
Jatin Ganhotra, Samuel Thomas, Hong-Kwang Jeff Kuo, Sachindra Joshi, George Saon, Zoltán Tüske, Brian Kingsbury:
Integrating Dialog History into End-to-End Spoken Language Understanding Systems. CoRR abs/2108.08405 (2021)
[i8]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2108-10803
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2108-10803
Xiaodong Cui, Brian Kingsbury, George Saon, David Haws, Zoltán Tüske:
Reducing Exposure Bias in Training Recurrent Neural Network Transducers. CoRR abs/2108.10803 (2021)
[i7]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2108-12074
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2108-12074
Andrea Fasoli, Chia-Yu Chen, Mauricio J. Serrano, Xiao Sun, Naigang Wang, Swagath Venkataramani, George Saon, Xiaodong Cui, Brian Kingsbury, Wei Zhang, Zoltán Tüske, Kailash Gopalakrishnan:
4-bit Quantization of LSTM-based Speech Recognition Models. CoRR abs/2108.12074 (2021)
2020
[b1]
- view
  authority control:
- export record
  dblp key:
  - phd/dnb/Tuske20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/phd/dnb/Tuske20
Zoltán Tüske:
Discriminative feature modeling for statistical speech recognition. RWTH Aachen University, Germany, 2020
[c49]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SaonTA20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SaonTA20
George Saon, Zoltán Tüske, Kartik Audhkhasi:
Alignment-Length Synchronous Decoding for RNN Transducer. ICASSP 2020: 7804-7808
[c48]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TuskeSAK20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TuskeSAK20
Zoltán Tüske, George Saon, Kartik Audhkhasi, Brian Kingsbury:
Single Headed Attention Based Sequence-to-Sequence Model for State-of-the-Art Results on Switchboard. INTERSPEECH 2020: 551-555
[c47]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/KuoT0HAKKKHL20
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/KuoT0HAKKKHL20
Hong-Kwang Jeff Kuo, Zoltán Tüske, Samuel Thomas, Yinghui Huang, Kartik Audhkhasi, Brian Kingsbury, Gakuto Kurata, Zvi Kons, Ron Hoory, Luis A. Lastras:
End-to-End Spoken Language Understanding Without Full Transcripts. INTERSPEECH 2020: 906-910
[i6]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2001-07263
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2001-07263
Zoltán Tüske, George Saon, Kartik Audhkhasi, Brian Kingsbury:
Single headed attention based sequence-to-sequence model for state-of-the-art results on Switchboard-300. CoRR abs/2001.07263 (2020)
[i5]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2009-14386
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-14386
Hong-Kwang Jeff Kuo, Zoltán Tüske, Samuel Thomas, Yinghui Huang, Kartik Audhkhasi, Brian Kingsbury, Gakuto Kurata, Zvi Kons, Ron Hoory, Luis A. Lastras:
End-to-End Spoken Language Understanding Without Full Transcripts. CoRR abs/2009.14386 (2020)
[i4]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-2011-08238
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-08238
Edmilson da Silva Morais, Hong-Kwang Jeff Kuo, Samuel Thomas, Zoltán Tüske, Brian Kingsbury:
End-to-end spoken language understanding using transformer networks and self-supervised pre-trained features. CoRR abs/2011.08238 (2020)

2010 – 2019

see FAQ

What is the meaning of the colors in the publication lists?

2019
[c46]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/SaonTAKPT19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/SaonTAKPT19
George Saon, Zoltán Tüske, Kartik Audhkhasi, Brian Kingsbury, Michael Picheny, Samuel Thomas:
Simplified LSTMS for Speech Recognition. ASRU 2019: 547-553
[c45]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/HuangTSTSP19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/HuangTSTSP19
Yinghui Huang, Samuel Thomas, Masayuki Suzuki, Zoltán Tüske, Larry Sansone, Michael Picheny:
Semi-Supervised Training and Data Augmentation for Adaptation of Automatic Broadcast News Captioning Systems. ASRU 2019: 867-874
[c44]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/SaonTAK19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/SaonTAK19
George Saon, Zoltán Tüske, Kartik Audhkhasi, Brian Kingsbury:
Sequence Noise Injected Training for End-to-end Speech Recognition. ICASSP 2019: 6261-6265
[c43]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ThomasSHKTSKPDK19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ThomasSHKTSKPDK19
Samuel Thomas, Masayuki Suzuki, Yinghui Huang, Gakuto Kurata, Zoltán Tüske, George Saon, Brian Kingsbury, Michael Picheny, Tom Dibert, Alice Kaiser-Schatzlein, Bern Samko:
English Broadcast News Speech Recognition by Humans and Machines. ICASSP 2019: 6455-6459
[c42]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/PichenyTKACS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/PichenyTKACS19
Michael Picheny, Zoltán Tüske, Brian Kingsbury, Kartik Audhkhasi, Xiaodong Cui, George Saon:
Challenging the Boundaries of Speech Recognition: The MALACH Corpus. INTERSPEECH 2019: 326-330
[c41]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/AudhkhasiSTKP19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/AudhkhasiSTKP19
Kartik Audhkhasi, George Saon, Zoltán Tüske, Brian Kingsbury, Michael Picheny:
Forget a Bit to Learn Better: Soft Forgetting for CTC-Based Automatic Speech Recognition. INTERSPEECH 2019: 2618-2622
[c40]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ThomasATHP19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ThomasATHP19
Samuel Thomas, Kartik Audhkhasi, Zoltán Tüske, Yinghui Huang, Michael Picheny:
Detection and Recovery of OOVs for Improved English Broadcast News Captioning. INTERSPEECH 2019: 2973-2977
[c39]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TuskeAS19
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TuskeAS19
Zoltán Tüske, Kartik Audhkhasi, George Saon:
Advancing Sequence-to-Sequence Based Speech Recognition. INTERSPEECH 2019: 3780-3784
[i3]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1904-13258
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1904-13258
Samuel Thomas, Masayuki Suzuki, Yinghui Huang, Gakuto Kurata, Zoltán Tüske, George Saon, Brian Kingsbury, Michael Picheny, Tom Dibert, Alice Kaiser-Schatzlein, Bern Samko:
English Broadcast News Speech Recognition by Humans and Machines. CoRR abs/1904.13258 (2019)
[i2]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1908-03455
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1908-03455
Michael Picheny, Zoltán Tüske, Brian Kingsbury, Kartik Audhkhasi, Xiaodong Cui, George Saon:
Challenging the Boundaries of Speech Recognition: The MALACH Corpus. CoRR abs/1908.03455 (2019)
2018
[c38]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TuskeSN18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TuskeSN18
Zoltán Tüske, Ralf Schlüter, Hermann Ney:
Acoustic Modeling of Speech Waveform Based on Multi-Resolution, Neural Network Signal Processing. ICASSP 2018: 4859-4863
[c37]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TuskeSN18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TuskeSN18
Zoltán Tüske, Ralf Schlüter, Hermann Ney:
Investigation on LSTM Recurrent N-gram Language Models for Speech Recognition. INTERSPEECH 2018: 3358-3362
[c36]
- view
- export record
  dblp key:
  - conf/nips/CuiZTP18
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/CuiZTP18
Xiaodong Cui, Wei Zhang, Zoltán Tüske, Michael Picheny:
Evolutionary Stochastic Gradient Descent for Optimization of Deep Neural Networks. NeurIPS 2018: 6051-6061
[i1]
- view
  - electronic edition @ arxiv.org (open access)
  - references & citations
- export record
  dblp key:
  - journals/corr/abs-1810-06773
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1810-06773
Xiaodong Cui, Wei Zhang, Zoltán Tüske, Michael Picheny:
Evolutionary Stochastic Gradient Descent for Optimization of Deep Neural Networks. CoRR abs/1810.06773 (2018)
2017
[c35]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TuskeMSN17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TuskeMSN17
Zoltán Tüske, Wilfried Michel, Ralf Schlüter, Hermann Ney:
Parallel Neural Network Features for Improved Tandem Acoustic Modeling. INTERSPEECH 2017: 1651-1655
[c34]
- view
  authority control:
- export record
  dblp key:
  - conf/specom/GolikTIBSN17
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/specom/GolikTIBSN17
Pavel Golik, Zoltán Tüske, Kazuki Irie, Eugen Beck, Ralf Schlüter, Hermann Ney:
The 2016 RWTH Keyword Search System for Low-Resource Languages. SPECOM 2017: 719-730
2016
[c33]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TuskeISN16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TuskeISN16
Zoltán Tüske, Kazuki Irie, Ralf Schlüter, Hermann Ney:
Investigation on log-linear interpolation of multi-domain neural network language model. ICASSP 2016: 6005-6009
[c32]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/IrieTASN16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/IrieTASN16
Kazuki Irie, Zoltán Tüske, Tamer Alkhouli, Ralf Schlüter, Hermann Ney:
LSTM, GRU, Highway and a Bit of Attention: An Empirical Overview for Language Modeling in Speech Recognition. INTERSPEECH 2016: 3519-3523
[c31]
- view
  - electronic edition @ aclanthology.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iwslt/MichelTSSN16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwslt/MichelTSSN16
Wilfried Michel, Zoltán Tüske, M. Ali Basha Shaik, Ralf Schlüter, Hermann Ney:
The RWTH Aachen LVCSR system for IWSLT-2016 German Skype conversation recognition task. IWSLT 2016
[c30]
- view
  authority control:
- export record
  dblp key:
  - conf/specom/SchluterDGKMITZ16
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/specom/SchluterDGKMITZ16
Ralf Schlüter, Patrick Doetsch, Pavel Golik, Markus Kitza, Tobias Menne, Kazuki Irie, Zoltán Tüske, Albert Zeyer:
Automatic Speech Recognition Based on Neural Networks. SPECOM 2016: 3-17
2015
[c29]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/CuiKRSACKMNPTGS15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/CuiKRSACKMNPTGS15
Jia Cui, Brian Kingsbury, Bhuvana Ramabhadran, Abhinav Sethy, Kartik Audhkhasi, Xiaodong Cui, Ellen Kislal, Lidia Mangu, Markus Nußbaum-Thom, Michael Picheny, Zoltán Tüske, Pavel Golik, Ralf Schlüter, Hermann Ney, Mark J. F. Gales, Kate M. Knill, Anton Ragni, Haipeng Wang, Philip C. Woodland:
Multilingual representations for low resource speech recognition and keyword search. ASRU 2015: 259-266
[c28]
- view
  authority control:
- export record
  dblp key:
  - conf/asru/TuskeGSN15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/asru/TuskeGSN15
Zoltán Tüske, Pavel Golik, Ralf Schlüter, Hermann Ney:
Speaker adaptive joint training of Gaussian mixture models and bottleneck features. ASRU 2015: 596-603
[c27]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TuskeTSN15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TuskeTSN15
Zoltán Tüske, Muhammad Ali Tahir, Ralf Schlüter, Hermann Ney:
Integrating Gaussian mixtures into deep neural networks: Softmax layer with hidden variables. ICASSP 2015: 4285-4289
[c26]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GolikTSN15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GolikTSN15
Pavel Golik, Zoltán Tüske, Ralf Schlüter, Hermann Ney:
Convolutional neural networks for acoustic modeling of raw time signal in LVCSR. INTERSPEECH 2015: 26-30
[c25]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GolikTSN15a
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GolikTSN15a
Pavel Golik, Zoltán Tüske, Ralf Schlüter, Hermann Ney:
Multilingual features based keyword search for very low-resource languages. INTERSPEECH 2015: 1260-1264
[c24]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ShaikTTNSN15
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ShaikTTNSN15
M. Ali Basha Shaik, Zoltán Tüske, Muhammad Ali Tahir, Markus Nußbaum-Thom, Ralf Schlüter, Hermann Ney:
Improvements in RWTH LVCSR evaluation systems for Polish, Portuguese, English, urdu, and Arabic. INTERSPEECH 2015: 3154-3158
2014
[c23]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/WieslerITSN14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/WieslerITSN14
Simon Wiesler, Kazuki Irie, Zoltán Tüske, Ralf Schlüter, Hermann Ney:
The RWTH English lecture recognition system. ICASSP 2014: 3286-3290
[c22]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TuskeNSN14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TuskeNSN14
Zoltán Tüske, David Nolden, Ralf Schlüter, Hermann Ney:
Multilingual MRASTA features for low-resource keyword search and speech recognition systems. ICASSP 2014: 7854-7858
[c21]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/SundermeyerTSN14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/SundermeyerTSN14
Martin Sundermeyer, Zoltán Tüske, Ralf Schlüter, Hermann Ney:
Lattice decoding and rescoring with long-Span neural network language models. INTERSPEECH 2014: 661-665
[c20]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TuskeGSN14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TuskeGSN14
Zoltán Tüske, Pavel Golik, Ralf Schlüter, Hermann Ney:
Acoustic modeling with deep neural networks using raw time signal for LVCSR. INTERSPEECH 2014: 890-894
[c19]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/ShaikTTNSN14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/ShaikTTNSN14
M. Ali Basha Shaik, Zoltán Tüske, Muhammad Ali Tahir, Markus Nußbaum-Thom, Ralf Schlüter, Hermann Ney:
RWTH LVCSR systems for quaero and EU-bridge: German, Polish, Spanish and Portuguese. INTERSPEECH 2014: 973-977
[c18]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TuskeGNSN14
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TuskeGNSN14
Zoltán Tüske, Pavel Golik, David Nolden, Ralf Schlüter, Hermann Ney:
Data augmentation, feature combination, and multilingual neural networks to improve ASR and KWS performance for low-resource languages. INTERSPEECH 2014: 1420-1424
2013
[c17]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TuskeSN13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TuskeSN13
Zoltán Tüske, Ralf Schlüter, Hermann Ney:
Deep hierarchical bottleneck MRASTA features for LVCSR. ICASSP 2013: 6970-6974
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TuskePWS13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TuskePWS13
Zoltán Tüske, Joel Pinto, Daniel Willett, Ralf Schlüter:
Investigation on cross- and multilingual MLP features under matched and mismatched acoustical conditions. ICASSP 2013: 7349-7353
[c15]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TuskeSN13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TuskeSN13
Zoltán Tüske, Ralf Schlüter, Hermann Ney:
Multilingual hierarchical MRASTA features for ASR. INTERSPEECH 2013: 2222-2226
[c14]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/GolikTSN13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/GolikTSN13
Pavel Golik, Zoltán Tüske, Ralf Schlüter, Hermann Ney:
Development of the RWTH transcription system for slovenian. INTERSPEECH 2013: 3107-3111
[c13]
- view
  - electronic edition @ aclanthology.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/iwslt/ShaikTWNPSN13
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iwslt/ShaikTWNPSN13
M. Ali Basha Shaik, Zoltán Tüske, Simon Wiesler, Markus Nußbaum-Thom, Stephan Peitz, Ralf Schlüter, Hermann Ney:
The RWTH Aachen German and English LVCSR systems for IWSLT-2013. IWSLT (Evaluation Campaign) 2013
2012
[c12]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TuskeSN12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TuskeSN12
Zoltán Tüske, Ralf Schlüter, Hermann Ney:
Comparison and combination of different CRBE based MLP features for LVCSR. ICASSP 2012: 4081-4084
[c11]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TuskeSNS12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TuskeSNS12
Zoltán Tüske, Ralf Schlüter, Hermann Ney, Martin Sundermeyer:
Context-Dependent MLPs for LVCSR: TANDEM, Hybrid or Both? INTERSPEECH 2012: 18-21
[c10]
- view
  - electronic edition @ isca-archive.org (open access)
  - no references & citations available
- export record
  dblp key:
  - conf/interspeech/TuskeDS12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TuskeDS12
Zoltán Tüske, Friedhelm R. Drepper, Ralf Schlüter:
Non-stationary signal processing and its application in speech recognition. SAPA@INTERSPEECH 2012: 34-39
[c9]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/Nussbaum-ThomTHSN12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/Nussbaum-ThomTHSN12
Markus Nußbaum-Thom, Zoltán Tüske, Georg Heigold, Ralf Schlüter, Hermann Ney:
Posterior-Scaled MPE: Novel Discriminative Training Criteria. INTERSPEECH 2012: 2614-2617
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/ssp/TuskeDS12
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/ssp/TuskeDS12
Zoltán Tüske, Friedhelm R. Drepper, Ralf Schlüter:
Phase difference of filter-stable part-tones as acoustic feature. SSP 2012: 365-368
2011
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/TuskeGSD11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/TuskeGSD11
Zoltán Tüske, Pavel Golik, Ralf Schlüter, Friedhelm R. Drepper:
Non-stationary feature extraction for automatic speech recognition. ICASSP 2011: 5204-5207
[c6]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TuskePS11
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TuskePS11
Zoltán Tüske, Christian Plahl, Ralf Schlüter:
A Study on Speaker Normalized MLP Features in LVCSR. INTERSPEECH 2011: 1089-1092
2010
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/taslp/MihajlikTTNF10
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/taslp/MihajlikTTNF10
Péter Mihajlik, Zoltán Tüske, Balázs Tarján, Bottyán Németh, Tibor Fegyó:
Improved Recognition of Spontaneous Hungarian Speech - Morphological and Acoustic Modeling Techniques for a Less Resourced Task. IEEE Trans. Speech Audio Process. 18(6): 1588-1600 (2010)

2000 – 2009

see FAQ

What is the meaning of the colors in the publication lists?

2009
[c5]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MihajlikTTF09
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MihajlikTTF09
Péter Mihajlik, Balázs Tarján, Zoltán Tüske, Tibor Fegyó:
Investigation of morph-based speech recognition improvements across speech genres. INTERSPEECH 2009: 2687-2690
2007
[c4]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MihajlikFTI07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MihajlikFTI07
Péter Mihajlik, Tibor Fegyó, Zoltán Tüske, Pavel Ircing:
A morpho-graphemic approach for the recognition of spontaneous speech in agglutinative languages - like Hungarian. INTERSPEECH 2007: 1497-1500
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/tsd/MihajlikFNTT07
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/tsd/MihajlikFNTT07
Péter Mihajlik, Tibor Fegyó, Bottyán Németh, Zoltán Tüske, Viktor Trón:
Towards Automatic Transcription of Large Spoken Archives in Agglutinating Languages - Hungarian ASR for the MALACH Project. TSD 2007: 342-349
2005
[c2]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/TuskeMTF05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/TuskeMTF05
Zoltán Tüske, Péter Mihajlik, Zoltán Tobler, Tibor Fegyó:
Robust voice activity detection based on the entropy of noise-suppressed spectrum. INTERSPEECH 2005: 245-248
[c1]
- view
  - electronic edition via DOI (open access)
  - references & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/MihajlikTTG05
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/MihajlikTTG05
Péter Mihajlik, Zoltán Tobler, Zoltán Tüske, Géza Gordos:
Evaluation and optimization of noise robust front-end technologies for the automatic recognition of Hungarian telephone speech. INTERSPEECH 2005: 2677-2680

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.