


default search action
27th SPECOM 2025: Szeged, Hungary - Part II
- Alexey Karpov

, Gábor Gosztolya
:
Speech and Computer - 27th International Conference, SPECOM 2025, Szeged, Hungary, October 13-15, 2025, Proceedings, Part II. Lecture Notes in Computer Science 16188, Springer 2026, ISBN 978-3-032-07958-9
Automatic Speech Recognition
- Jarod Duret, Salima Mdhaffar, Gaëlle Laperrière, Ryan Whetten, Audrey Galametz, Catherine Kobus, Marion-Cécile Martin, Jo Oleiwan, Yannick Estève:

In-Domain SSL Pre-training and Streaming ASR: Application to Air Traffic Control Communications. 3-12 - Sara M. Pearsell

, Oliver Niebuhr
, Samuel Schmück
:
Evaluating the Performance of Several ASR Systems in Environmental and Industrial Noise. 13-28 - Anton Polevoi, Alexander Kragin, Natalia V. Loukachevitch

:
Ground Truth-Free WER Prediction for ASR via Audio Quality and Model Confidence Features. 29-44 - Yunus Emre Ozkose, Ali Haznedaroglu:

Enhancing Speech Recognition Through Text-to-Speech and Voice Conversion Augmentation. 45-59 - Gergely Dobsinszki, Péter Mihajlik, Mate S. Kadar, Tibor Fegyó, Katalin Mády:

Best Data is more Supervised Data - Even for Hungarian ASR. 60-69 - Branislav Gerazov

, Marcello Politi
, Sébastien Bratières
:
Arabic ASR on the SADA Large-Scale Arabic Speech Corpus with Transformer-Based Models. 70-84
Speech Processing for Under-Resourced Languages
- Vishwa Gupta, Gilles Boulianne:

Effect of Increased Temporal Resolution on Speech Recognition for French Quebec Using Features from Speech Self-supervised Learning Models. 87-103 - Irina S. Kipyatkova

, Kseniia Kiseleva, Mikhail Dolgushin
, Ildar Kagirov
:
Modeling Intra-word Code-Switching for Karelian ASR. 104-117 - Vuk Stanojev

, Tijana V. Nosek
, Sinisa Suzic
, Darko Pekar
, Vlado Delic
, Milan Secujski
:
Improving Whisper-Based Serbian ASR Using Synthetic Speech. 118-129 - Anton Legchenko, Ivan Bondarenko:

Domain Knowledge and Language Embeddings for Low-Resource Multilingual Phoneme ASR. 130-143 - Alejandro López-García

, María Alfaro-Contreras
, Julien Meyer
, Jose J. Valero-Mas
:
Whistler Identification in Whistled Spanish (Silbo): A Case Study. 144-158
Digital Speech Processing
- Zhiyuan Xu, Joshua D. Reiss:

PinkVocalTransformer: Neural Acoustic-to-Articulatory Inversion Based on the Pink Trombone. 161-173 - Alexander Zaburdaev, Denis Ivanko

, Dmitry Ryumin
:
CrossMP-SENet: Transformer-Based Cross-Attention for Joint Magnitude-Phase Speech Enhancement. 174-188 - Jia-Lien Hsu

, Pei-Wen Chien:
Adaptive Singing Voice Enhancement for Live Stages. 189-202 - Anastasia Ananeva, Anton Tomilov

, Marina Volkova:
Revealing the Hidden Temporal Structure of HubertSoft Embeddings Based on the Russian Phonetic Corpus. 203-215
Natural Language Processing
- Philine Kowol, Stefan Hillmann

:
Analyzing Web-Scraped and Generated Inputs for Automatic and Scalable Intent Classification. 219-230 - Danil Tirskikh, Olesia Koroteeva

, Yuri Matveev
, Ekaterina Brovkina, Larisa Gonchar
:
Enhancing Retrieval Performance via LLM Hard-Negative Filtering. 231-241 - José Luis Vázquez Noguera

, Carlos U. Valdez, Marvin M. Agüero
, Julio César Mello Román
, José D. Colbes, Sebastián A. Grillo
:
Sector-Wise Backpropagation for Low-Resource Text Classification in Deep Models. 242-256 - Natalia Bogdanova-Beglarian

, Olga Blinova
, Mariya Khokhlova
, Tatiana Y. Sherstinova, Tatiana I. Popova
:
High-Frequency Multiword Units and the Typological Distribution of Multiword Units in Spoken Russian. 257-270 - Vladimir V. Bochkarev

, Andrey Achkeev
, Anna V. Shevlyakova
:
Estimation of the Genre Composition of the English Subcorpus of the Google Books Ngram. 271-285
Multimodal Systems
- Jason Clarke, Yoshihiko Gotoh, Stefan Goetze:

Ensembling Synchronisation-Based and Face-Voice Association Paradigms for Robust Active Speaker Detection in Egocentric Recordings. 289-301 - Vera Evdokimova

, Maria Maksimova
:
Phonetic and Visual Characteristics of Cognitive Load. 302-317 - Rodmonga Potapova

, Vsevolod Potapov
, Ekaterina Karimova
, Diana Smolskaya, Nikolay Bobrov
, Leonid Motovskikh
, Iurii Pozhilov
:
Cognitive Humor Processing in the Russian and English Internet Meme Chatting: EEG Study. 318-330 - Ali Alhejab

, Tomas Zelezný, Lamya Alkanhal, Ivan Gruber
, Yazeed Alharbi
, Jakub Straka
, Vaclav Javorek, Marek Hrúz
, Badriah Alkalifah, Ahmed Ali
:
Saudi Sign Language Translation Using T5. 331-343

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














