


23rd SPECOM 2021: St. Petersburg, Russia
- Alexey Karpov, Rodmonga Potapova:
Speech and Computer - 23rd International Conference, SPECOM 2021, St. Petersburg, Russia, September 27-30, 2021, Proceedings. Lecture Notes in Computer Science 12997, Springer 2021, ISBN 978-3-030-87801-6
- Jahangir Alam, Abderrahim Fathan, Woo Hyun Kang:
Text-Independent Speaker Verification Employing CNN-LSTM-TDNN Hybrid Networks. 1-13
- Jahangir Alam, Abderrahim Fathan, Woo Hyun Kang:
End-to-End Voice Spoofing Detection Employing Time Delay Neural Networks and Higher Order Statistics. 14-25
- Nuno Almeida, Conceição Cunha, Samuel S. Silva, António Teixeira:
Assessing Velar Gestures Timing in European Portuguese Nasal Vowels with RT-MRI Data. 26-35
- Nuno Almeida, Diogo Cunha, Samuel S. Silva, António Teixeira:
Designing and Deploying an Interaction Modality for Articulatory-Based Audiovisual Speech Synthesis. 36-49
- Arash Amani, Mohammad MohammadAmini, Hadi Veisi:
Kurdish Spoken Dialect Recognition Using X-Vector Speaker Embedding. 50-57
- Yu Bai, Cristian Tejedor García, Ferdy Hubers, Catia Cucchiarini, Helmer Strik:
An ASR-Based Tutor for Learning to Read: How to Optimize Feedback to First Graders. 58-69
- Peter Birkholz, Christian Kleiner:
Velocity Differences Between Velum Raising and Lowering Movements. 70-80
- Natalia Bogdanova-Beglarian, Olga Blinova, Tatiana Y. Sherstinova, Tatiana Sulimova:
Pragmatic Markers of Russian Everyday Speech: Invariants in Dialogue and Monologue. 81-90
- Vincent Brignatz, Jarod Duret, Driss Matrouf, Mickael Rouvier:
Language Adaptation for Speaker Recognition Systems Using Contrastive Learning. 91-99
- Pierre Champion, Denis Jouvet, Anthony Larcher:
Evaluating X-Vector-Based Speaker Anonymization Under White-Box Assessment. 100-111
- Myrsini Christidou, Alexandra Vioni, Nikolaos Ellinas, Georgios Vamvoukakis, Konstantinos Markopoulos, Panos Kakoulidis, June Sig Sung, Hyoungmin Park, Aimilios Chalamandaris, Pirros Tsiakoulis:
Improved Prosodic Clustering for Multispeaker and Speaker-Independent Phoneme-Level Prosody Control. 112-123
- Adam Chýlek, Jan Svec, Lubos Smídl:
Initial Experiments on Question Answering from the Intrinsic Structure of Oral History Archives. 124-133
- Debadatta Dash, Paul Ferrari, Karinne Berstis, Jun Wang:
Imagined, Intended, and Spoken Speech Envelope Synthesis from Neuromagnetic Signals. 134-145
- Maria Dayter, Elena I. Riekhakaynen:
What Causes Phonetic Reduction in Russian Speech: New Evidence from Machine Learning Algorithms. 146-156
- Mikhail Dolgushin, Dayana Ismakova, Yuliya Bidulya, Igor Krupkin, Galina Barskaya, Anastasiya Lesiv:
Toxic Comment Classification Service in Social Network. 157-165
- Denis Dresvyanskiy, Wolfgang Minker, Alexey Karpov:
Deep Learning Based Engagement Recognition in Highly Imbalanced Data. 166-178
- Anna Dunashova:
Intraspeaker Variability of a Professional Lecturer: Ageing, Genre, Pragmatics vs. Voice Acting (Case Study). 179-189
- Abderrahim Fathan, Jahangir Alam, Woo Hyun Kang:
An Ensemble Approach for the Diagnosis of COVID-19 from Speech and Cough Sounds. 190-201
- Sahar Ghannay, Antoine Caubrière, Salima Mdhaffar, Gaëlle Laperrière, Bassam Jabaian, Yannick Estève:
Where Are We in Semantic Concept Extraction for Spoken Language Understanding? 202-213
- Parismita Gogoi, Sishir Kalita, Wendy Lalhminghlui, Priyankoo Sarmah, S. R. M. Prasanna:
Learning Mizo Tones from F0 Contours Using 1D-CNN. 214-225
- Ivan Gruber, Marek Hrúz, Pavel Ircing, Petr Neduchal, Tomás Zítka, Miroslav Hlavác, Zbynek Zajíc, Jan Svec, Martin Bulín:
OCR Improvements for Images of Multi-page Historical Documents. 226-237
- Ivan Gruber, Marek Hrúz, Milos Zelezný, Alexey Karpov:
X-Bridge: Image-to-Image Translation with Reconstruction Capabilities. 238-249
- Hien Thi Ha, Ales Horák:
Who is Selling to Whom - Feature Evaluation for Multi-block Classification in Invoice Information Extraction. 250-261
- Abner Hernandez, Seung Hee Yang:
Multimodal Corpus Analysis of Autoblog 2020: Lecture Videos in Machine Learning. 262-270
- Juan Hussain, Christian Huber, Sebastian Stüker, Alexander Waibel:
Text and Synthetic Data for Domain Adaptation in End-to-End Speech Recognition. 271-278
- Anosha Ignatius, Uthayasanker Thayasivam:
Speaker-Invariant Speech-to-Intent Classification for Low-Resource Languages. 279-290
- Denis Ivanko, Dmitry Ryumin, Alexandr Axyonov, Alexey M. Kashevnik:
Speaker-Dependent Visual Command Recognition in Vehicle Cabin: Methodology and Evaluation. 291-302
- Joshua Jansen van Vueren, Thomas Niesler:
Optimised Code-Switched Language Model Data Augmentation in Four Under-Resourced South African Languages. 303-316
- Virender Kadyan, Hemant Kumar Kathania, Prajjval Govil, Mikko Kurimo:
Synthesis Speech Based Data Augmentation for Low Resource Children ASR. 317-326
- Irina S. Kipyatkova:
End-to-End Russian Speech Recognition Models with Multi-head Attention. 327-335
- Konstantinos Klapsas, Nikolaos Ellinas, June Sig Sung, Hyoungmin Park, Spyros Raptis:
Word-Level Style Control for Expressive, Non-attentive Speech Synthesis. 336-347
- Liliya Komalova, Diana Kulagina:
Perceiving Speech Aggression with and without Textual Context on Twitter Social Network Site. 348-359
- Roman Korostik, Javier Latorre, Sivanand Achanta, Yannis Stylianou:
Assessing Speaker Interpolation in Neural Text-to-Speech. 360-371
- Denis Likhachov, Maxim Vashkevich, Elias Azarov, Katsiaryna Malhina, Yuliya Rushkevich:
A Mobile Application for Detection of Amyotrophic Lateral Sclerosis via Voice Analysis. 372-383
- Elena E. Lyakso, Olga V. Frolova, Nersisson Ruban, A. Mary Mekala:
Child's Emotional Speech Classification by Human Across Two Languages: Russian & Tamil. 384-396
- Olesia Makhnytkina, Aleksey Grigorev, Aleksander Nikolaev:
Analysis of Dialogues of Typically Developing Children, Children with Down Syndrome and ASD Using Machine Learning Methods. 397-406
- Ali Raheem Mandeel, Mohammed Salah Al-Radhi, Tamás Gábor Csapó:
Speaker Adaptation with Continuous Vocoder-Based DNN-TTS. 407-416
- Yuri Matveev, Anton Matveev, Olga V. Frolova, Elena E. Lyakso:
Automatic Recognition of the Psychoneurological State of Children: Autism Spectrum Disorders, Down Syndrome, Typical Development. 417-425
- Salima Mdhaffar, Marc Tommasi, Yannick Estève:
Study on Acoustic Model Personalization in a Context of Collaborative Learning Constrained by Privacy Preservation. 426-436
- Muhammadjon Musaev, Saida Mussakhojayeva, Ilyos Khujayorov, Yerbolat Khassanov, Mannon Ochilov, Huseyin Atakan Varol:
USC: An Open-Source Uzbek Speech Corpus and Initial Speech Recognition Experiments. 437-447
- Saida Mussakhojayeva, Yerbolat Khassanov, Huseyin Atakan Varol:
A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and English. 448-459
- Sergis Nicolaou, Lambros Mavrides, Georgina Tryfou, Kyriakos Tolias, Konstantinos P. Panousis, Sotirios Chatzis, Sergios Theodoridis:
Dialog Speech Sentiment Classification for Imbalanced Datasets. 460-471
- Tijana V. Nosek, Sinisa Suzic, Mia Vujovic, Darko Pekar, Milan Secujski, Vlado Delic:
Explicit Control of the Level of Expressiveness in DNN-Based Speech Synthesis by Embedding Interpolation. 472-482
- Dariya Novokhrestova, Evgeny Kostuchenko, Ilya A. Hodashinsky, Lidiya N. Balatskaya:
Experimental Analysis of Expert and Quantitative Estimates of Syllable Recordings in the Process of Speech Rehabilitation. 483-491
- Edvin Pakoci, Branislav M. Popovic:
Methods for Using Class Based N-gram Language Models in the Kaldi Toolkit. 492-503
- Ankur T. Patil, Harsh Kotta, Rajul Acharya, Hemant A. Patil:
Spectral Root Features for Replay Spoof Detection in Voice Assistants. 504-515
- Rodmonga Potapova, Tatyana Agibalova, Vsevolod Potapov, Olga Tuchina:
Influence of the Aggressive Internet Environment on Cognitive Personality Disorders (in Relation to the Russian Young Generation of Users). 516-527
- Rodmonga Potapova, Vsevolod Potapov, Nataliya Lebedeva, Ekaterina Karimova, Nikolay Bobrov:
Media Content vs Nature Stimuli Influence on Human Brain Activity. 528-539
- Valeriya Prokaeva, Elena I. Riekhakaynen, Vladislav I. Zubov:
Can Your Eyes Tell Us Why You Hesitate? Comparing Reading Aloud in Russian as L1 and Japanese as L2. 540-552
- Josef V. Psutka, Ales Prazák, Jan Vanek:
Recognition of Heavily Accented and Emotional Speech of English and Czech Holocaust Survivors Using Various DNN Architectures. 553-564
- Mathias Quillot, Richard Dufour, Jean-François Bonastre:
Assessing Speaker-Independent Character Information for Acted Voices. 565-576
- Mathias Quillot, Jarod Duret, Richard Dufour, Mickael Rouvier, Jean-François Bonastre:
Influence of Speaker Pre-training on Character Voice Representation. 577-588
- Ilyos Rabbimov, Sami Kobilov, Iosif Mporas:
Opinion Classification via Word and Emoji Embedding Models with LSTM. 589-601
- Aku Rouhe, Astrid Van Camp, Mittul Singh, Hugo Van hamme, Mikko Kurimo:
An Equal Data Setting for Attention-Based Encoder-Decoder and HMM/DNN Models: A Case Study in Finnish ASR. 602-613
- Lyudmila V. Savchenko, Andrey V. Savchenko:
Speaker-Aware Training of Speech Emotion Classifier with Speaker Recognition. 614-625
- Andrey V. Savinkov, Vladimir V. Bochkarev, Anna V. Shevlyakova, Stanislav Khristoforov:
Neural Network Recognition of Russian Noun and Adjective Cases in the Google Books Ngram Corpus. 626-637
- Vered Silber-Varod, Mária Gósy, Anat Lerner:
Is It a Filler or a Pause? A Quantitative Analysis of Filled Pauses in Hebrew. 638-648
- Shrishti Singh, Kuldeep Khoria, Hemant A. Patil:
Modified Group Delay Function Using Different Spectral Smoothing Techniques for Voice Liveness Detection. 649-659
- Tatiana Sokoreva, Tatiana Shevchenko, Mariya Chyrvonaya:
Complex Rhythm Adjustments in Multilingual Code-Switching Across Mandarin, English and Russian. 660-669
- Mohammad Soleymanpour, Michael T. Johnson, Jeffrey Berry:
Increasing the Precision of Dysarthric Speech Intelligibility and Severity Level Estimate. 670-679
- Lauri Tavi, Tomi Kinnunen, Einar Meister, Rosa González Hautamäki, Anton Malmi:
Articulation During Voice Disguise: A Pilot Study. 680-691
- Elena Timofeeva, Elena Evseeva, Valeriia Zaluskaia, Vlada Kapranova, Sergei Astapov, Vladimir Kabarov:
Improvement of Speaker Number Estimation by Applying an Overlapped Speech Detector. 692-703
- Paras Tiwari, Sawan Rai:
Mind Your Tweet: Abusive Tweet Detection. 704-715
- Marián Trnka, Sakhia Darjaa, Milan Rusko, Meilin Schaper, Tim H. Stelkens-Kobsch:
Speaker Authorization for Air Traffic Control Security. 716-725
- Ana Rita Valente, Catarina Oliveira, Luciana Albuquerque, António Teixeira, Plínio A. Barbosa:
Prosodic Changes with Age: A Longitudinal Study on a Famous European Portuguese Native Speaker. 726-736
- Loes van Bemmel, Wieke Harmsen, Catia Cucchiarini, Helmer Strik:
Automatic Selection of the Most Characterizing Features for Detecting COPD in Speech. 737-748
- Ewald van der Westhuizen, Trideba Padhi, Thomas Niesler:
Multilingual Training Set Selection for ASR in Under-Resourced Malian Languages. 749-760
- Jan Volín, Markéta Rezácková, Jindrich Matousek:
Human and Transformer-Based Prosodic Phrasing in Two Speech Genres. 761-772
- Roman Vygon, Nikolay Mikhaylovskiy:
Learning Efficient Representations for Keyword Spotting with Triplet Loss. 773-785
- Tobias Watzel, Ludwig Kürzinger, Lujun Li, Gerhard Rigoll:
Regularized Forward-Backward Decoder for Attention Models. 786-794
- Tobias Watzel, Ludwig Kürzinger, Lujun Li, Gerhard Rigoll:
Induced Local Attention for Transformer Models in Speech Recognition. 795-806
- Zbynek Zajíc, Marie Kunesová, Ludek Müller:
Applying EEND Diarization to Telephone Recordings from a Call Center. 807-817
- Svetlana Zimina, Vera Evdokimova:
Acoustic Characteristics of Speech Entrainment in Dialogues in Similar Phonetic Sequences. 818-825
- Ismail Rasim Ülgen, Mustafa Erden, Levent M. Arslan:
Predicting Biometric Error Behaviour from Speaker Embeddings and a Fast Score Normalization Scheme. 826-836















