


27th SPECOM 2025: Szeged, Hungary - Part I
- Alexey Karpov, Gábor Gosztolya: Speech and Computer - 27th International Conference, SPECOM 2025, Szeged, Hungary, October 13-15, 2025, Proceedings, Part I. Lecture Notes in Computer Science 16187, Springer 2026, ISBN 978-3-032-07955-8
Invited Paper
- Heysem Kaya, Gizem Sogancioglu: Towards Responsible Multimodal Modeling for Mental Healthcare. 3-22
Speech Perception and Synthesis
- Shree Harsha Bokkahalli Satish, Gustav Eje Henter, Éva Székely: When Voice Matters: Evidence of Gender Disparity in Positional Bias of SpeechLLMs. 25-38
- George Close, Kris Y. Hong, Thomas Hain, Stefan Goetze: WhiSQA: Non-intrusive Speech Quality Prediction Using Whisper Encoder Features. 39-51
- Mohammed Salah Al-Radhi, Sadi Mahmud Shurid, Géza Németh: Prompting the Mind: EEG-to-Text Translation with Multimodal LLMs and Semantic Control. 52-66
- Anastasiia Sherban, Uliana E. Kochetkova: Effectiveness of Tacotron2 for Intonation Model Synthesis in Russian. 67-82
- Sasangi Nayanathara, Inuri Harischandra, Thamira Weerakoon, Randil Pushpananda: Enhancing Sinhala Text-to-Speech with End-to-End VITS Architecture. 83-98
Computational Paralinguistics
- Dániel Halmai, Gábor Gosztolya: Spoken Emotion Recognition Using Soft Labels. 101-112
- Kunjan Gajre, Rajnidhi Gupta, Ravindrakumar M. Purohit, Hemant A. Patil: NAMTalk: From Muscle Vibrations to Emotional Speech. 113-128
- Olga Mitrofanova, Polina Iurevtseva, Maxim Bakaev: What Do LLMs Know About Human Emotions? The Russian Case Study. 129-144
- Egor Kleshnev, Elena E. Lyakso: Emotions Manifestation by Adolescents with Intellectual Disabilities. 145-156
- Abdelkader Seif El Islem Rahmani, Yasser Yahiaoui, Abdelghani Bouziane: Retention-Augmented Voice Assistant: A Lightweight Architecture for Stateful Interaction with Comprehensive Evaluation and Privacy-Preserving Design. 157-169
Speech Processing for Healthcare
- Mikhail Dolgushin, Daria Guseva, Alexey Karpov: Investigation of Explainable Multimodal Methods for Detecting Mental Disorders. 173-187
- Elena E. Lyakso, Olga V. Frolova, Anton Matveev, Petr Shabanov, Andrei Lebedev, Aleksandr Nikolaev, Egor Kleshnev, Severin Grechanyi, Ruban Nersisson: Attention Deficit Hyperactivity Disorder: Identifying Approaches for Early Diagnosis, a Pilot Study. 188-202
- Wing-Zin Leung, Heidi Christensen, Stefan Goetze: Text-to-Dysarthric-Speech Generation for Dysarthric Automatic Speech Recognition: Is Purely Synthetic Data Enough? 203-216
- Anna V. Shevlyakova, Vladimir V. Bochkarev, Stanislav Khristoforov: Colour Preferences in Schizophrenic Speech. 217-227
- Evgeny Kostyuchenko: Automated Assessment of Phrase Intelligibility for Russian Speech Based on Esophageal Voice. 228-237
Speech and Language Resources
- Marie Fongaro, Barbara Gili Fivela, Maud Pélissier, Gabriel Hévr: Subtle Changes in L1 Stops of Late Salento Italian-French Bilinguals: An Acoustic Study Using AutoVOT Adapted for Italian and French. 241-255
- Rodmonga Potapova, Vsevolod Potapov, Tsend-Ayush Ganbaatar, Leonid Motovskikh, Nikolay Bobrov: Sound and Colour in Phonosemantics: Perceptual and Acoustic Correlates of Mongolian Vowels. 256-266
- Anna Borzykh, Tatiana Shevchenko: Rhythmic Diglossia Based on Discourse Types and Dialects of English: Australian and New Zealand Corpora. 267-277
- Aleksandra S. Maslenikova, Tatiana I. Popova: Automatic Annotation of Discourse and Speech Formulas in Internet Communication: A Telegram Comment Corpus. 278-292
Speaker Recognition
- Mohammed Hamzah Alsalihi, Dávid Sztahó: Effect of Spoof Speech on Forensic Voice Comparison Using Deep Speaker Embeddings. 295-306
- Marina Volkova, Artem Chirkovskiy, Egor Ausev, Ekaterina Shangina: Source Vendor Tracing of Audio Deepfakes. 307-321
- Anton Yakovenko, Evgeny Bessonnitsyn, Valeria Efimova, Mark Zaslavskiy: Language-Specific Adaptation Strategies for Speaker Recognition Using MobileNet. 322-332
- Sule Bekiryazici, Cemal Hanilçi, Neyir Ozcan: Enhancing Audio Replay Attack Detection with Silence-Based Blind Channel Impulse Response Estimation. 333-344















