Paavo Alku
2020 – today
- 2024
- [j94]Manila Kodali, Sudarsana Reddy Kadiri, Paavo Alku:
Automatic classification of the severity level of Parkinson's disease: A comparison of speaking tasks, features, and classifiers. Comput. Speech Lang. 83: 101548 (2024) - [j93]Sudarsana Reddy Kadiri, Farhad Javanmardi, Paavo Alku:
Investigation of self-supervised pre-trained models for classification of voice quality from speech and neck surface accelerometer signals. Comput. Speech Lang. 83: 101550 (2024) - [j92]Farhad Javanmardi, Sudarsana Reddy Kadiri, Paavo Alku:
A comparison of data augmentation methods in voice pathology detection. Comput. Speech Lang. 83: 101552 (2024) - [j91]Paavo Alku, Manila Kodali, Laura Laaksonen, Sudarsana Reddy Kadiri:
AVID: A speech database for machine learning studies on vocal intensity. Speech Commun. 157: 103039 (2024) - [j90]Yagnavajjula Madhu Keerthana, Mittapalle Kiran Reddy, Paavo Alku, K. Sreenivasa Rao, Pabitra Mitra:
Automatic classification of neurological voice disorders using wavelet scattering features. Speech Commun. 157: 103040 (2024) - [j89]Farhad Javanmardi, Sudarsana Reddy Kadiri, Paavo Alku:
Pre-trained models for detection and severity level classification of dysarthria from speech. Speech Commun. 158: 103047 (2024) - [j88]Farhad Javanmardi, Sudarsana Reddy Kadiri, Paavo Alku:
Exploring the Impact of Fine-Tuning the Wav2vec2 Model in Database-Independent Detection of Dysarthric Speech. IEEE J. Biomed. Health Informatics 28(8): 4951-4962 (2024) - 2023
- [j87]Sudarsana Reddy Kadiri, Paavo Alku, B. Yegnanarayana:
Analysis of Instantaneous Frequency Components of Speech Signals for Epoch Extraction. Comput. Speech Lang. 78: 101443 (2023) - [j86]Paavo Alku, Sudarsana Reddy Kadiri, Dhananjaya Gowda:
Refining a deep learning-based formant tracker using linear prediction methods. Comput. Speech Lang. 81: 101515 (2023) - [j85]Mittapalle Kiran Reddy, Yagnavajjula Madhu Keerthana, Paavo Alku:
Classification of functional dysphonia using the tunable Q wavelet transform. Speech Commun. 155: 102989 (2023) - [j84]Yuanyuan Liu, Mittapalle Kiran Reddy, Nelly Penttilä, Tiina Ihalainen, Paavo Alku, Okko Räsänen:
Automatic Assessment of Parkinson's Disease Using Speech Representations of Phonation and Articulation. IEEE ACM Trans. Audio Speech Lang. Process. 31: 242-255 (2023) - [j83]Mittapalle Kiran Reddy, Paavo Alku:
Exemplar-Based Sparse Representations for Detection of Parkinson's Disease From Speech. IEEE ACM Trans. Audio Speech Lang. Process. 31: 1386-1396 (2023) - [c161]Farhad Javanmardi, Saska Tirronen, Manila Kodali, Sudarsana Reddy Kadiri, Paavo Alku:
Wav2vec-Based Detection and Severity Level Classification of Dysarthria From Speech. ICASSP 2023: 1-5 - [c160]Manila Kodali, Sudarsana Reddy Kadiri, Laura Laaksonen, Paavo Alku:
Automatic Classification of Vocal Intensity Category from Speech. ICASSP 2023: 1-5 - [c159]Saska Tirronen, Farhad Javanmardi, Manila Kodali, Sudarsana Reddy Kadiri, Paavo Alku:
Utilizing Wav2Vec In Database-Independent Voice Disorder Detection. ICASSP 2023: 1-5 - [c158]Sudarsana Reddy Kadiri, Manila Kodali, Paavo Alku:
Severity Classification of Parkinson's Disease from Speech using Single Frequency Filtering-based Features. INTERSPEECH 2023: 2393-2397 - [c157]Manila Kodali, Sudarsana Reddy Kadiri, Paavo Alku:
Classification of Vocal Intensity Category from Speech using the Wav2vec2 and Whisper Embeddings. INTERSPEECH 2023: 4134-4138 - [i15]Sudarsana Reddy Kadiri, Farhad Javanmardi, Paavo Alku:
Investigation of Self-supervised Pre-trained Models for Classification of Voice Quality from Speech and Neck Surface Accelerometer Signals. CoRR abs/2308.03226 (2023) - [i14]Sudarsana Reddy Kadiri, Manila Kodali, Paavo Alku:
Severity Classification of Parkinson's Disease from Speech using Single Frequency Filtering-based Features. CoRR abs/2308.09042 (2023) - [i13]Paavo Alku, Sudarsana Reddy Kadiri, Dhananjaya Gowda:
Refining a Deep Learning-based Formant Tracker using Linear Prediction Methods. CoRR abs/2308.09051 (2023) - [i12]Dhananjaya Gowda, Sudarsana Reddy Kadiri, Brad H. Story, Paavo Alku:
Time-Varying Quasi-Closed-Phase Analysis for Accurate Formant Tracking in Speech Signals. CoRR abs/2308.16540 (2023) - [i11]Sudarsana Reddy Kadiri, Paavo Alku:
Analysis and Detection of Pathological Voice using Glottal Source Features. CoRR abs/2309.14080 (2023) - [i10]Farhad Javanmardi, Saska Tirronen, Manila Kodali, Sudarsana Reddy Kadiri, Paavo Alku:
Wav2vec-based Detection and Severity Level Classification of Dysarthria from Speech. CoRR abs/2309.14107 (2023) - 2022
- [j82]Sudarsana Reddy Kadiri, Paavo Alku:
Subjective Evaluation of Basic Emotions from Audio-Visual Data. Sensors 22(13): 4931 (2022) - [j81]Hemant Kumar Kathania, Sudarsana Reddy Kadiri, Paavo Alku, Mikko Kurimo:
A formant modification method for improved ASR of children's speech. Speech Commun. 136: 98-106 (2022) - [j80]Mittapalle Kiran Reddy, Hilla Pohjalainen, Pyry Helkkula, Kasimir Kaitue, Mikko Minkkinen, Heli Tolppanen, Tuomo Nieminen, Paavo Alku:
Glottal flow characteristics in vowels produced by speakers with heart failure. Speech Commun. 137: 35-43 (2022) - [j79]Mittapalle Kiran Reddy, Yagnavajjula Madhu Keerthana, Paavo Alku:
End-to-End Pathological Speech Detection Using Wavelet Scattering Network. IEEE Signal Process. Lett. 29: 1863-1867 (2022) - [c156]Farhad Javanmardi, Sudarsana Reddy Kadiri, Manila Kodali, Paavo Alku:
Comparing 1-dimensional and 2-dimensional spectral feature representations in voice pathology detection using machine learning and deep learning classifiers. INTERSPEECH 2022: 2173-2177 - [c155]Sudarsana Reddy Kadiri, Farhad Javanmardi, Paavo Alku:
Convolutional Neural Networks for Classification of Voice Qualities from Speech and Neck Surface Accelerometer Signals. INTERSPEECH 2022: 5253-5257 - [i9]Dhananjaya Gowda, Bajibabu Bollepalli, Sudarsana Reddy Kadiri, Paavo Alku:
Formant Tracking Using Quasi-Closed Phase Forward-Backward Linear Prediction Analysis and Deep Neural Networks. CoRR abs/2201.01525 (2022) - 2021
- [j78]Mittapalle Kiran Reddy, Paavo Alku:
A Comparison of Cepstral Features in the Detection of Pathological Voices by Varying the Input and Filterbank of the Cepstrum Computation. IEEE Access 9: 135953-135963 (2021) - [j77]Dhananjaya N. Gowda, Bajibabu Bollepalli, Sudarsana Reddy Kadiri, Paavo Alku:
Formant Tracking Using Quasi-Closed Phase Forward-Backward Linear Prediction Analysis and Deep Neural Networks. IEEE Access 9: 151631-151640 (2021) - [j76]N. P. Narendra, Paavo Alku:
Automatic assessment of intelligibility in speakers with dysarthria from coded telephone speech using glottal features. Comput. Speech Lang. 65: 101117 (2021) - [j75]Mittapalle Kiran Reddy, Pyry Helkkula, Yagnavajjula Madhu Keerthana, Kasimir Kaitue, Mikko Minkkinen, Heli Tolppanen, Tuomo Nieminen, Paavo Alku:
The automatic detection of heart failure using speech signals. Comput. Speech Lang. 69: 101205 (2021) - [j74]Sudarsana Reddy Kadiri, Paavo Alku:
Glottal features for classification of phonation type from speech and neck surface accelerometer signals. Comput. Speech Lang. 70: 101232 (2021) - [j73]Sudarsana Reddy Kadiri, Paavo Alku, Bayya Yegnanarayana:
Extraction and Utilization of Excitation Information of Speech: A Review. Proc. IEEE 109(12): 1920-1941 (2021) - [j72]N. P. Narendra, Björn W. Schuller, Paavo Alku:
The Detection of Parkinson's Disease From Speech Using Voice Source Information. IEEE ACM Trans. Audio Speech Lang. Process. 29: 1925-1936 (2021) - [c154]Hemant Kumar Kathania, Sudarsana Reddy Kadiri, Paavo Alku, Mikko Kurimo:
Spectral modification for recognition of children's speech under mismatched conditions. NoDaLiDa 2021: 94-100 - 2020
- [j71]Mittapalle Kiran Reddy, Paavo Alku, Krothapalli Sreenivasa Rao:
Detection of Specific Language Impairment in Children Using Glottal Source Features. IEEE Access 8: 15273-15279 (2020) - [j70]Sudarsana Reddy Kadiri, Paavo Alku:
Excitation Features of Speech for Speaker-Specific Emotion Detection. IEEE Access 8: 60382-60391 (2020) - [j69]N. P. Narendra, Paavo Alku:
Glottal Source Information for Pathological Voice Detection. IEEE Access 8: 67745-67755 (2020) - [j68]Rashmi Kethireddy, Sudarsana Reddy Kadiri, Paavo Alku, Suryakanth V. Gangashetty:
Mel-Weighted Single Frequency Filtering Spectrogram for Dialect Identification. IEEE Access 8: 174871-174879 (2020) - [j67]Xin Wang, Junichi Yamagishi, Massimiliano Todisco, Héctor Delgado, Andreas Nautsch, Nicholas W. D. Evans, Md. Sahidullah, Ville Vestman, Tomi Kinnunen, Kong Aik Lee, Lauri Juvela, Paavo Alku, Yu-Huai Peng, Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Sébastien Le Maguer, Markus Becker, Zhen-Hua Ling:
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech. Comput. Speech Lang. 64: 101114 (2020) - [j66]Sudarsana Reddy Kadiri, P. Gangamohan, Suryakanth V. Gangashetty, Paavo Alku, B. Yegnanarayana:
Excitation Features of Speech for Emotion Recognition Using Neutral Speech as Reference. Circuits Syst. Signal Process. 39(9): 4459-4481 (2020) - [j65]Sudarsana Reddy Kadiri, Paavo Alku:
Analysis and Detection of Pathological Voice Using Glottal Source Features. IEEE J. Sel. Top. Signal Process. 14(2): 367-379 (2020) - [j64]Sudarsana Reddy Kadiri, Paavo Alku, B. Yegnanarayana:
Analysis and classification of phonation types in speech and singing voice. Speech Commun. 118: 33-47 (2020) - [j63]N. P. Narendra, Paavo Alku:
Automatic intelligibility assessment of dysarthric speech using glottal parameters. Speech Commun. 123: 1-9 (2020) - [j62]Krishna Gurugubelli, Anil Kumar Vuppala, N. P. Narendra, Paavo Alku:
Duration of the rhotic approximant /ɹ/ in spastic dysarthria of different severity levels. Speech Commun. 125: 61-68 (2020) - [j61]Dhananjaya N. Gowda, Sudarsana Reddy Kadiri, Brad H. Story, Paavo Alku:
Time-Varying Quasi-Closed-Phase Analysis for Accurate Formant Tracking in Speech Signals. IEEE ACM Trans. Audio Speech Lang. Process. 28: 1901-1914 (2020) - [c153]Sudarsana Reddy Kadiri, Paavo Alku, B. Yegnanarayana:
Comparison of Glottal Closure Instants Detection Algorithms for Emotional Speech. ICASSP 2020: 7379-7383 - [c152]Hemant Kumar Kathania, Sudarsana Reddy Kadiri, Paavo Alku, Mikko Kurimo:
Study of Formant Modification for Children ASR. ICASSP 2020: 7429-7433 - [c151]Sudarsana Reddy Kadiri, Rashmi Kethireddy, Paavo Alku:
Parkinson's Disease Detection from Speech Using Single Frequency Filtering Cepstral Coefficients. INTERSPEECH 2020: 4971-4975
2010 – 2019
- 2019
- [j60]Shreyas Seshadri, Lauri Juvela, Okko Räsänen, Paavo Alku:
Vocal Effort Based Speaking Style Conversion Using Vocoder Features and Parallel Learning. IEEE Access 7: 17230-17246 (2019) - [j59]Emma Jokinen, Rahim Saeidi, Tomi Kinnunen, Paavo Alku:
Vocal effort compensation for MFCC feature extraction in a shouted versus normal speaker recognition task. Comput. Speech Lang. 53: 1-11 (2019) - [j58]N. P. Narendra, Manu Airaksinen, Brad H. Story, Paavo Alku:
Estimation of the glottal source from coded telephone speech using deep neural networks. Speech Commun. 106: 95-104 (2019) - [j57]Paavo Alku, Tiina Murtola, Jarmo Malinen, Juha Kuortti, Brad H. Story, Manu Airaksinen, Mika Salmi, Erkki Vilkman, Ahmed Geneid:
OPENGLOT - An open environment for the evaluation of glottal inverse filtering. Speech Commun. 107: 38-47 (2019) - [j56]Tiina Murtola, Jarmo Malinen, Ahmed Geneid, Paavo Alku:
Analysis of phonation onsets in vowel production, using information from glottal area and flow estimate. Speech Commun. 109: 55-65 (2019) - [j55]N. P. Narendra, Paavo Alku:
Dysarthric speech classification from coded telephone speech using glottal features. Speech Commun. 110: 47-55 (2019) - [j54]Bajibabu Bollepalli, Lauri Juvela, Manu Airaksinen, Cassia Valentini-Botinhao, Paavo Alku:
Normal-to-Lombard adaptation of speech synthesis using long short-term memory recurrent neural networks. Speech Commun. 110: 64-75 (2019) - [j53]Lauri Juvela, Bajibabu Bollepalli, Vassilis Tsiaras, Paavo Alku:
GlotNet - A Raw Waveform Model for the Glottal Excitation in Statistical Parametric Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 27(6): 1019-1030 (2019) - [c150]Manu Airaksinen, Lauri Juvela, Paavo Alku, Okko Räsänen:
Data Augmentation Strategies for Neural Network F0 Estimation. ICASSP 2019: 6485-6489 - [c149]Shreyas Seshadri, Lauri Juvela, Junichi Yamagishi, Okko Räsänen, Paavo Alku:
Cycle-consistent Adversarial Networks for Non-parallel Vocal Effort Based Speaking Style Conversion. ICASSP 2019: 6835-6839 - [c148]Lauri Juvela, Bajibabu Bollepalli, Junichi Yamagishi, Paavo Alku:
Waveform Generation for Text-to-speech Synthesis Using Pitch-synchronous Multi-scale Generative Adversarial Networks. ICASSP 2019: 6915-6919 - [c147]Lauri Juvela, Bajibabu Bollepalli, Junichi Yamagishi, Paavo Alku:
GELP: GAN-Excited Linear Prediction for Speech Synthesis from Mel-Spectrogram. INTERSPEECH 2019: 694-698 - [c146]Sudarsana Reddy Kadiri, Paavo Alku:
Mel-Frequency Cepstral Coefficients of Voice Source Waveforms for Classification of Phonation Types in Speech. INTERSPEECH 2019: 2508-2512 - [c145]Bajibabu Bollepalli, Lauri Juvela, Paavo Alku:
Lombard Speech Synthesis Using Transfer Learning in a Tacotron Text-to-Speech System. INTERSPEECH 2019: 2833-2837 - [c144]Shreyas Seshadri, Lauri Juvela, Paavo Alku, Okko Räsänen:
Augmented CycleGANs for Continuous Scale Normal-to-Lombard Speaking Style Conversion. INTERSPEECH 2019: 2838-2842 - [i8]Bajibabu Bollepalli, Lauri Juvela, Paavo Alku:
Generative adversarial network-based glottal waveform model for statistical parametric speech synthesis. CoRR abs/1903.05955 (2019) - [i7]Lauri Juvela, Bajibabu Bollepalli, Junichi Yamagishi, Paavo Alku:
GELP: GAN-Excited Linear Prediction for Speech Synthesis from Mel-spectrogram. CoRR abs/1904.03976 (2019) - [i6]Xin Wang, Junichi Yamagishi, Massimiliano Todisco, Héctor Delgado, Andreas Nautsch, Nicholas W. D. Evans, Md. Sahidullah, Ville Vestman, Tomi Kinnunen, Kong Aik Lee, Lauri Juvela, Paavo Alku, Yu-Huai Peng, Hsin-Te Hwang, Yu Tsao, Hsin-Min Wang, Sébastien Le Maguer, Markus Becker, Fergus Henderson, Rob Clark, Yu Zhang, Quan Wang, Ye Jia, Kai Onuma, Koji Mushika, Takashi Kaneda, Yuan Jiang, Li-Juan Liu, Yi-Chiao Wu, Wen-Chin Huang, Tomoki Toda, Kou Tanaka, Hirokazu Kameoka, Ingmar Steiner, Driss Matrouf, Jean-François Bonastre, Avashna Govender, Srikanth Ronanki, Jing-Xuan Zhang, Zhen-Hua Ling:
The ASVspoof 2019 database. CoRR abs/1911.01601 (2019) - [i5]Thomas Drugman, Paavo Alku, Abeer Alwan, Bayya Yegnanarayana:
Glottal Source Processing: from Analysis to Applications. CoRR abs/1912.12604 (2019) - 2018
- [j52]Tiina Murtola, Paavo Alku, Jarmo Malinen, Ahmed Geneid:
Parameterization of a computational physical model for glottal flow using inverse filtering and high-speed videoendoscopy. Speech Commun. 96: 67-80 (2018) - [j51]Ville Vestman, Dhananjaya N. Gowda, Md. Sahidullah, Paavo Alku, Tomi Kinnunen:
Speaker recognition from whispered speech: A tutorial survey and an application of time-varying linear prediction. Speech Commun. 99: 62-79 (2018) - [j50]Sofoklis Kakouros, Okko Räsänen, Paavo Alku:
Comparison of spectral tilt measures for sentence prominence in speech - Effects of dimensionality and adverse noise conditions. Speech Commun. 103: 11-26 (2018) - [j49]Parham Mokhtari, Brad H. Story, Paavo Alku, Hiroshi Ando:
Estimation of the glottal flow from speech pressure signals: Evaluation of three variants of iterative adaptive inverse filtering using computational physical modelling of voice production. Speech Commun. 104: 24-38 (2018) - [j48]Manu Airaksinen, Lauri Juvela, Bajibabu Bollepalli, Junichi Yamagishi, Paavo Alku:
A Comparison Between STRAIGHT, Glottal, and Sinusoidal Vocoding in Statistical Parametric Speech Synthesis. IEEE ACM Trans. Audio Speech Lang. Process. 26(9): 1658-1670 (2018) - [c143]Lauri Juvela, Bajibabu Bollepalli, Xin Wang, Hirokazu Kameoka, Manu Airaksinen, Junichi Yamagishi, Paavo Alku:
Speech Waveform Synthesis from MFCC Sequences with Generative Adversarial Networks. ICASSP 2018: 5679-5683 - [c142]Manu Airaksinen, Lauri Juvela, Okko Räsänen, Paavo Alku:
Time-regularized Linear Prediction for Noise-robust Extraction of the Spectral Envelope of Speech. INTERSPEECH 2018: 701-705 - [c141]Lauri Juvela, Vassilis Tsiaras, Bajibabu Bollepalli, Manu Airaksinen, Junichi Yamagishi, Paavo Alku:
Speaker-independent Raw Waveform Model for Glottal Excitation. INTERSPEECH 2018: 2012-2016 - [c140]N. P. Narendra, Paavo Alku:
Dysarthric Speech Classification Using Glottal Features Computed from Non-words, Words and Sentences. INTERSPEECH 2018: 3403-3407 - [i4]Lauri Juvela, Bajibabu Bollepalli, Xin Wang, Hirokazu Kameoka, Manu Airaksinen, Junichi Yamagishi, Paavo Alku:
Speech waveform synthesis from MFCC sequences with generative adversarial networks. CoRR abs/1804.00920 (2018) - [i3]Lauri Juvela, Vassilis Tsiaras, Bajibabu Bollepalli, Manu Airaksinen, Junichi Yamagishi, Paavo Alku:
Speaker-independent raw waveform model for glottal excitation. CoRR abs/1804.09593 (2018) - [i2]Bajibabu Bollepalli, Lauri Juvela, Paavo Alku:
Speaking style adaptation in Text-To-Speech synthesis using Sequence-to-sequence models with attention. CoRR abs/1810.12051 (2018) - [i1]Lauri Juvela, Bajibabu Bollepalli, Junichi Yamagishi, Paavo Alku:
Waveform generation for text-to-speech synthesis using pitch-synchronous multi-scale generative adversarial networks. CoRR abs/1810.12598 (2018) - 2017
- [j47]Dong Liu, Elina Kankare, Anne-Maria Laukkanen, Paavo Alku:
Comparison of parametrization methods of electroglottographic and inverse filtered acoustic speech pressure signals in distinguishing between phonation types. Biomed. Signal Process. Control. 36: 183-193 (2017) - [j46]Manu Airaksinen, Bajibabu Bollepalli, Jouni Pohjalainen, Paavo Alku:
Glottal Vocoding With Frequency-Warped Time-Weighted Linear Prediction. IEEE Signal Process. Lett. 24(4): 446-450 (2017) - [j45]Manu Airaksinen, Tom Bäckström, Paavo Alku:
Quadratic Programming Approach to Glottal Inverse Filtering by Joint Norm-1 and Norm-2 Optimization. IEEE ACM Trans. Audio Speech Lang. Process. 25(5): 929-939 (2017) - [j44]Paavo Alku, Rahim Saeidi:
The Linear Predictive Modeling of Speech From Higher-Lag Autocorrelation Coefficients Applied to Noise-Robust Speaker Recognition. IEEE ACM Trans. Audio Speech Lang. Process. 25(8): 1606-1617 (2017) - [j43]Emma Jokinen, Ulpu Remes, Paavo Alku:
Intelligibility Enhancement of Telephone Speech Using Gaussian Process Regression for Normal-to-Lombard Spectral Tilt Conversion. IEEE ACM Trans. Audio Speech Lang. Process. 25(10): 1985-1996 (2017) - [c139]Ana Ramírez López, Rahim Saeidi, Lauri Juvela, Paavo Alku:
Normal-to-shouted speech spectral mapping for speaker recognition under vocal effort mismatch. ICASSP 2017: 4940-4944 - [c138]Bajibabu Bollepalli, Manu Airaksinen, Paavo Alku:
Lombard speech synthesis using long short-term memory recurrent neural networks. ICASSP 2017: 5505-5509 - [c137]Tomi Kinnunen, Lauri Juvela, Paavo Alku, Junichi Yamagishi:
Non-parallel voice conversion using i-vector PLDA: towards unifying speaker verification and transformation. ICASSP 2017: 5535-5539 - [c136]Manu Airaksinen, Bajibabu Bollepalli, Jouni Pohjalainen, Paavo Alku:
Frequency-warped time-weighted linear prediction for glottal vocoding. ICASSP 2017: 5630-5634 - [c135]Ana Ramírez López, Shreyas Seshadri, Lauri Juvela, Okko Räsänen, Paavo Alku:
Speaking Style Conversion from Normal to Lombard Speech Using a Glottal Vocoder and Bayesian GMMs. INTERSPEECH 2017: 1363-1367 - [c134]Lauri Juvela, Bajibabu Bollepalli, Junichi Yamagishi, Paavo Alku:
Reducing Mismatch in Training of DNN-Based Glottal Excitation Models in a Statistical Parametric Text-to-Speech System. INTERSPEECH 2017: 1368-1372 - [c133]Ville Vestman, Dhananjaya N. Gowda, Md. Sahidullah, Paavo Alku, Tomi Kinnunen:
Time-Varying Autoregressions for Speaker Verification in Reverberant Conditions. INTERSPEECH 2017: 1512-1516 - [c132]Sofoklis Kakouros, Okko Räsänen, Paavo Alku:
Evaluation of Spectral Tilt Measures for Sentence Prominence Under Different Noise Conditions. INTERSPEECH 2017: 3211-3215 - [c131]Bajibabu Bollepalli, Lauri Juvela, Paavo Alku:
Generative Adversarial Network-Based Glottal Waveform Model for Statistical Parametric Speech Synthesis. INTERSPEECH 2017: 3394-3398 - [c130]N. P. Narendra, Manu Airaksinen, Paavo Alku:
Glottal Source Estimation from Coded Telephone Speech Using a Deep Neural Network. INTERSPEECH 2017: 3931-3935 - [c129]Manu Airaksinen, Paavo Alku:
Effects of Training Data Variety in Generating Glottal Pulses from Acoustic Features with DNNs. INTERSPEECH 2017: 3946-3950 - 2016
- [j42]Ulpu Remes, Ana Ramírez López, Lauri Juvela, Kalle J. Palomäki, Guy J. Brown, Paavo Alku, Mikko Kurimo:
Comparing human and automatic speech recognition in a perceptual restoration experiment. Comput. Speech Lang. 35: 14-31 (2016) - [j41]Maria Hakonen, Patrick J. C. May, Jussi Alho, Paavo Alku, Emma Jokinen, Iiro P. Jääskeläinen, Hannu Tiitinen:
Previous exposure to intact speech increases intelligibility of its digitally degraded counterpart as a function of stimulus complexity. NeuroImage 125: 131-143 (2016) - [j40]Tuomo Raitio, Lauri Juvela, Antti Suni, Martti Vainio, Paavo Alku:
Phase perception of the glottal excitation and its relevance in statistical parametric speech synthesis. Speech Commun. 81: 104-119 (2016) - [j39]Emma Jokinen, Hannu Pulakka, Paavo Alku:
Phase modification for increasing the intelligibility of telephone speech in near-end noise conditions - evaluation of two methods. Speech Commun. 83: 64-80 (2016) - [j38]Rahim Saeidi, Paavo Alku, Tom Bäckström:
Feature Extraction Using Power-Law Adjusted Linear Prediction With Application to Speaker Recognition Under Severe Vocal Effort Mismatch. IEEE ACM Trans. Audio Speech Lang. Process. 24(1): 42-53 (2016) - [c128]Dhananjaya N. Gowda, Manu Airaksinen, Paavo Alku:
Quasi closed phase analysis of speech signals using time varying weighted linear prediction for accurate formant tracking. ICASSP 2016: 4980-4984 - [c127]Lauri Juvela, Bajibabu Bollepalli, Manu Airaksinen, Paavo Alku:
High-pitched excitation generation for glottal vocoding in statistical parametric speech synthesis using a deep neural network. ICASSP 2016: 5120-5124 - [c126]Johannes Abel, Magdalena Kaniewska, Cyril Guillaume, Wouter Tirry, Hannu Pulakka, Ville Myllylä, Jari Sjoberg, Paavo Alku, Itai Katsir, David Malah, Israel Cohen, M. A. Tugtekin Turan, Engin Erzin, Thomas Schlien, Peter Vary, Amr H. Nour-Eldin, Peter Kabal, Tim Fingscheidt:
A subjective listening test of six different artificial bandwidth extension approaches in English, Chinese, German, and Korean. ICASSP 2016: 5915-5919 - [c125]Lauri Juvela, Hirokazu Kameoka, Manu Airaksinen, Junichi Yamagishi, Paavo Alku:
Majorisation-Minimisation Based Optimisation of the Composite Autoregressive System with Application to Glottal Inverse Filtering. INTERSPEECH 2016: 968-972 - [c124]Manu Airaksinen, Lauri Juvela, Tom Bäckström, Paavo Alku:
Automatic Glottal Inverse Filtering with Non-Negative Matrix Factorization. INTERSPEECH 2016: 1039-1043 - [c123]Dhananjaya N. Gowda, Paavo Alku:
Time-Varying Quasi-Closed-Phase Weighted Linear Prediction Analysis of Speech for Accurate Formant Detection and Tracking. INTERSPEECH 2016: 1760-1764 - [c122]Rahim Saeidi, Ilkka Huhtakallio, Paavo Alku:
Analysis of Face Mask Effect on Speaker Recognition. INTERSPEECH 2016: 1800-1804 - [c121]Lauri Juvela, Xin Wang, Shinji Takaki, Manu Airaksinen, Junichi Yamagishi, Paavo Alku:
Using Text and Acoustic Features in Predicting Glottal Excitation Waveforms for Parametric Speech Synthesis with Recurrent Neural Networks. INTERSPEECH 2016: 2283-2287 - [c120]Manu Airaksinen, Bajibabu Bollepalli, Lauri Juvela, Zhizheng Wu, Simon King, Paavo Alku:
GlottDNN - A Full-Band Glottal Vocoder for Statistical Parametric Speech Synthesis. INTERSPEECH 2016: 2473-2477 - [c119]Emma Jokinen, Paavo Alku:
Intelligibility Enhancement at the Receiving End of the Speech Transmission System - Effects of Far-End Noise Reduction. INTERSPEECH 2016: 2498-2502 - [c118]Emma Jokinen, Ulpu Remes, Paavo Alku:
The Use of Read versus Conversational Lombard Speech in Spectral Tilt Modeling for Intelligibility Enhancement in Near-End Noise Conditions. INTERSPEECH 2016: 2771-2775 - 2015
- [c117]Manu Airaksinen, Tuomo Raitio, Paavo Alku:
Noise robust estimation of the voice source using a deep neural network. ICASSP 2015: 5137-5141 - [c116]Katri Jähi, Paavo Alku, Maija S. Peltola:
Does interest in language learning affect the non-native phoneme production in elderly learners? ICPhS 2015 - [c115]Kimmo Peltola, Henna Tamminen, Paavo Alku, Maija S. Peltola:
Non-native production training with an acoustic model and orthographic or transcription cues. ICPhS 2015 - [c114]Antti Saloranta, Henna Tamminen, Paavo Alku, Maija S. Peltola:
Learning of a non-native vowel through instructed production training. ICPhS 2015 - [c113]Emma Jokinen, Ulpu Remes, Paavo Alku:
Comparison of Gaussian process regression and Gaussian mixture models in spectral tilt modelling for intelligibility enhancement of telephone speech. INTERSPEECH 2015: 85-89 - [c112]Tuomo Raitio, Lauri Juvela, Antti Suni, Martti Vainio, Paavo Alku:
Phase perception of the glottal excitation of vocoded speech. INTERSPEECH 2015: 254-258 - [c111]Rahim Saeidi, Tuija Niemi, Hanna Karppelin, Jouni Pohjalainen, Tomi Kinnunen, Paavo Alku:
Speaker recognition for speech under face cover. INTERSPEECH 2015: 1012-1016 - [c110]Dhananjaya N. Gowda, Rahim Saeidi, Paavo Alku:
AM-FM based filter bank analysis for estimation of spectro-temporal envelopes and its application for speaker recognition in noisy reverberant environments. INTERSPEECH 2015: 1166-1170 - [c109]Manu Airaksinen, Tom Bäckström, Paavo Alku:
Glottal inverse filtering based on quadratic programming. INTERSPEECH 2015: 2342-2346 - [c108]Hannu Pulakka, Ville Myllylä, Anssi Rämö, Paavo Alku:
Speech quality evaluation of artificial bandwidth extension: comparing subjective judgments and instrumental predictions. INTERSPEECH 2015: 2583-2587 - [c107]Rahim Saeidi, Paavo Alku:
Accounting for uncertainty of i-vectors in speaker recognition using uncertainty propagation and modified imputation. INTERSPEECH 2015: 3546-3550 - [c106]Rizwan Ishaq, Dhananjaya N. Gowda, Paavo Alku, Begonya Garcia-Zapirain:
Vowel Enhancement in Early Stage Spanish Esophageal Speech Using Natural Glottal Flow Pulse and Vocal Tract Frequency Warping. SLPAT@Interspeech 2015: 55-59 - 2014
- [j37]Ricardo Teixeira Sousa, Aníbal J. S. Ferreira, Paavo Alku:
The harmonic and noise information of the glottal pulses in speech. Biomed. Signal Process. Control. 10: 137-143 (2014) - [j36]Emma Jokinen, Marko Takanen, Martti Vainio, Paavo Alku:
An adaptive post-filtering method producing an artificial Lombard-like effect for intelligibility enhancement of narrowband telephone speech. Comput. Speech Lang. 28(2): 619-628 (2014) - [j35]Tuomo Raitio, Antti Suni, Martti Vainio, Paavo Alku:
Synthesis and perception of breathy, normal, and Lombard speech in the presence of noise. Comput. Speech Lang. 28(2): 648-664 (2014) - [j34]Thomas Drugman, Paavo Alku, Abeer Alwan, Bayya Yegnanarayana:
Glottal source processing: From analysis to applications. Comput. Speech Lang. 28(5): 1117-1138 (2014) - [j33]Harri Auvinen, Tuomo Raitio, Manu Airaksinen, Samuli Siltanen, Brad H. Story, Paavo Alku:
Automatic glottal inverse filtering with the Markov chain Monte Carlo method. Comput. Speech Lang. 28(5): 1139-1155 (2014) - [j32]Jouni Pohjalainen, Cemal Hanilçi, Tomi Kinnunen, Paavo Alku:
Mixture Linear Prediction in Speaker Verification Under Vocal Effort Mismatch. IEEE Signal Process. Lett. 21(12): 1516-1520 (2014) - [j31]Manu Airaksinen, Tuomo Raitio, Brad H. Story, Paavo Alku:
Quasi Closed Phase Glottal Inverse Filtering Analysis With Weighted Linear Prediction. IEEE ACM Trans. Audio Speech Lang. Process. 22(3): 596-607 (2014) - [c105]Tuomo Raitio, Heng Lu, John Kane, Antti Suni, Martti Vainio, Simon King, Paavo Alku:
Voice source modelling using deep neural networks for statistical parametric speech synthesis. EUSIPCO 2014: 2290-2294 - [c104]Jouni Pohjalainen, Paavo Alku:
Multi-scale modulation filtering in automatic detection of emotions in telephone speech. ICASSP 2014: 980-984 - [c103]Emma Jokinen, Marko Takanen, Paavo Alku:
Comparison of post-processing methods for intelligibility enhancement of narrowband speech in a mobile phone framework. ICASSP 2014: 4643-4647 - [c102]Jouni Pohjalainen, Paavo Alku:
Gaussian mixture linear prediction. ICASSP 2014: 6285-6289 - [c101]Manu Airaksinen, Paavo Alku:
Parameterization of the glottal source with the phase plane plot. INTERSPEECH 2014: 96-100 - [c100]Manu Airaksinen, Tom Bäckström, Paavo Alku:
Automatic estimation of the lip radiation effect in glottal inverse filtering. INTERSPEECH 2014: 398-402 - [c99]Jouni Pohjalainen, Paavo Alku:
Filtering and subspace selection for spectral features in detecting speech under physical stress. INTERSPEECH 2014: 432-436 - [c98]Emma Jokinen, Marko Takanen, Hannu Pulakka, Paavo Alku:
Enhancement of speech intelligibility in near-end noise conditions with phase modification. INTERSPEECH 2014: 1643-1647 - [c97]Tuomo Raitio, Antti Suni, Lauri Juvela, Martti Vainio, Paavo Alku:
Deep neural network based trainable voice source model for synthesis of speech with varying vocal effort. INTERSPEECH 2014: 1969-1973 - [c96]Emma Jokinen, Ulpu Remes, Marko Takanen, Kalle J. Palomäki, Mikko Kurimo, Paavo Alku:
Spectral tilt modelling with GMMs for intelligibility enhancement of narrowband telephone speech. INTERSPEECH 2014: 2036-2040 - [c95]Hannu Pulakka, Anssi Rämö, Ville Myllylä, Henri Toukomaa, Paavo Alku:
Subjective voice quality evaluation of artificial bandwidth extension: comparing different audio bandwidths and speech codecs. INTERSPEECH 2014: 2804-2808 - [c94]Emma Jokinen, Ulpu Remes, Marko Takanen, Kalle J. Palomäki, Mikko Kurimo, Paavo Alku:
Spectral tilt modelling with extrapolated GMMs for intelligibility enhancement of narrowband telephone speech. IWAENC 2014: 164-168 - 2013
- [c93]Jouni Pohjalainen, Paavo Alku:
Automatic detection of anger in telephone speech with robust autoregressive modulation filtering. ICASSP 2013: 7537-7541 - [c92]Tuomo Raitio, Antti Suni, Martti Vainio, Paavo Alku:
Comparing glottal-flow-excited statistical parametric speech synthesis methods. ICASSP 2013: 7830-7834 - [c91]Cemal Hanilçi, Tomi Kinnunen, Rahim Saeidi, Jouni Pohjalainen, Paavo Alku, Figen Ertas:
Speaker identification from shouted speech: Analysis and compensation. ICASSP 2013: 8027-8031 - [c90]Dhananjaya N. Gowda, Jouni Pohjalainen, Mikko Kurimo, Paavo Alku:
Robust formant detection using group delay function and stabilized weighted linear prediction. INTERSPEECH 2013: 49-53 - [c89]Manu Airaksinen, Brad H. Story, Paavo Alku:
Quasi closed phase analysis for glottal inverse filtering. INTERSPEECH 2013: 143-147 - [c88]Bajibabu Bollepalli, Tuomo Raitio, Paavo Alku:
Effect of MPEG audio compression on HMM-based speech synthesis. INTERSPEECH 2013: 1062-1066 - [c87]Emma Jokinen, Marko Takanen, Paavo Alku:
Frequency-adaptive post-filtering for intelligibility enhancement of narrowband telephone speech. INTERSPEECH 2013: 1179-1183 - [c86]Tuomo Raitio, Antti Suni, Jouni Pohjalainen, Manu Airaksinen, Martti Vainio, Paavo Alku:
Analysis and synthesis of shouted speech. INTERSPEECH 2013: 1544-1548 - [c85]Jouni Pohjalainen, Paavo Alku:
Extended weighted linear prediction using the autocorrelation snapshot - a robust speech analysis method and its application to recognition of vocal emotions. INTERSPEECH 2013: 1931-1935 - [c84]Padmanabhan Rajan, Tomi Kinnunen, Cemal Hanilçi, Jouni Pohjalainen, Paavo Alku:
Using group delay functions from all-pole models for speaker recognition. INTERSPEECH 2013: 2489-2493 - [c83]Cemal Hanilçi, Tomi Kinnunen, Padmanabhan Rajan, Jouni Pohjalainen, Paavo Alku, Figen Ertas:
Comparison of spectrum estimators in speaker verification: mismatch conditions induced by vocal effort. INTERSPEECH 2013: 2881-2885 - [c82]Sebastian Möller, Emilia Kelaidi, Friedemann Köster, Nicolas Côté, Patrick Bauer, Tim Fingscheidt, Thomas Schlien, Hannu Pulakka, Paavo Alku:
Speech quality prediction for artificial bandwidth extension algorithms. INTERSPEECH 2013: 3439-3443 - [c81]Antti Suni, Reima Karhila, Tuomo Raitio, Mikko Kurimo, Martti Vainio, Paavo Alku:
Lombard modified text-to-speech synthesis for improved intelligibility: submission for the Hurricane Challenge 2013. INTERSPEECH 2013: 3562-3566 - [c80]Dhananjaya N. Gowda, Jouni Pohjalainen, Paavo Alku, Mikko Kurimo:
Robust spectral representation using group delay function and stabilized weighted linear prediction for additive noise degradations. SpeD 2013: 1-7 - [c79]Antti Suni, Daniel Aalto, Tuomo Raitio, Paavo Alku, Martti Vainio:
Wavelets for intonation modeling in HMM speech synthesis. SSW 2013: 285-290 - 2012
- [j30]Ismo Miettinen, Paavo Alku, Santeri Yrttiaho, Patrick J. C. May, Hannu Tiitinen:
Cortical processing of degraded speech sounds: Effects of distortion type and continuity. NeuroImage 60(2): 1036-1045 (2012) - [j29]Cemal Hanilçi, Tomi Kinnunen, Figen Ertas, Rahim Saeidi, Jouni Pohjalainen, Paavo Alku:
Regularized All-Pole Models for Speaker Verification Under Noisy Environments. IEEE Signal Process. Lett. 19(3): 163-166 (2012) - [j28]Hannu Pulakka, Laura Laaksonen, Ville Myllylä, Santeri Yrttiaho, Paavo Alku:
Conversational Evaluation of Speech Bandwidth Extension Using a Mobile Handset. IEEE Signal Process. Lett. 19(4): 203-206 (2012) - [j27]Hannu Pulakka, Ulpu Remes, Santeri Yrttiaho, Kalle J. Palomäki, Mikko Kurimo, Paavo Alku:
Bandwidth Extension of Telephone Speech to Low Frequencies Using Sinusoidal Synthesis and a Gaussian Mixture Model. IEEE Trans. Speech Audio Process. 20(8): 2219-2231 (2012) - [c78]Antti Suni, Tuomo Raitio, Martti Vainio, Paavo Alku:
The GlottHMM Entry for Blizzard Challenge 2012: Hybrid Approach. Blizzard Challenge 2012 - [c77]Emma Jokinen, Paavo Alku, Martti Vainio:
Comparison of post-filtering methods for intelligibility enhancement of telephone speech. EUSIPCO 2012: 2333-2337 - [c76]Tuomo Raitio, Marko Takanen, Olli Santala, Antti Suni, Martti Vainio, Paavo Alku:
On measuring the intelligibility of synthetic speech in noise - Do we need a realistic noise environment? ICASSP 2012: 4025-4028 - [c75]Hannu Pulakka, Laura Laaksonen, Ville Myllylä, Santeri Yrttiaho, Paavo Alku:
Conversational evaluation of artificial bandwidth extension of telephone speech using a mobile handset. ICASSP 2012: 4069-4072 - [c74]Jouni Pohjalainen, Paavo Alku:
Robust speech analysis by lag-weighted linear prediction. ICASSP 2012: 4453-4456 - [c73]Cemal Hanilçi, Tomi Kinnunen, Rahim Saeidi, Jouni Pohjalainen, Paavo Alku, Figen Ertas, Johan Sandberg, Maria Hansson-Sandsten:
Comparing spectrum estimators in speaker verification under additive noise degradation. ICASSP 2012: 4769-4772 - [c72]Emma Jokinen, Paavo Alku, Martti Vainio:
Utilization of the Lombard effect in post-filtering for intelligibility enhancement of telephone speech. INTERSPEECH 2012: 591-594 - [c71]Martti Vainio, Daniel Aalto, Antti Suni, Anja Arnhold, Tuomo Raitio, Henri Seijo, Juhani Järvikivi, Paavo Alku:
Effect of noise type and level on focus related fundamental frequency changes. INTERSPEECH 2012: 671-674 - [c70]Jouni Pohjalainen, Tuomo Raitio, Hannu Pulakka, Paavo Alku:
Automatic Detection of High Vocal Effort in Telephone Speech. INTERSPEECH 2012: 691-694 - [c69]Tuomo Raitio, Antti Suni, Martti Vainio, Paavo Alku:
Wideband Parametric Speech Synthesis Using Warped Linear Prediction. INTERSPEECH 2012: 1420-1423 - [c68]Alan Pinheiro, Tuomo Raitio, Danyane Gomes, Paavo Alku:
Voice source analysis using biomechanical modeling and glottal inverse filtering. INTERSPEECH 2012: 1604-1607 - [c67]Paavo Alku, Jouni Pohjalainen, Martti Vainio, Anne-Maria Laukkanen, Brad H. Story:
Improved formant frequency estimation from high-pitched vowels by downgrading the contribution of the glottal source with weighted linear prediction. INTERSPEECH 2012: 1612-1615 - [c66]Jaime Lorenzo-Trueba, Roberto Barra-Chicote, Tuomo Raitio, Nicolas Obin, Paavo Alku, Junichi Yamagishi, Juan Manuel Montero:
Towards Glottal Source Controllability in Expressive Speech Synthesis. INTERSPEECH 2012: 1620-1623 - [c65]Harri Auvinen, Tuomo Raitio, Samuli Siltanen, Paavo Alku:
Utilizing Markov Chain Monte Carlo (MCMC) Method for Improved Glottal Inverse Filtering. INTERSPEECH 2012: 1640-1643 - [c64]Cemal Hanilçi, Tomi Kinnunen, Rahim Saeidi, Jouni Pohjalainen, Paavo Alku, Figen Ertas:
Regularization of all-pole models for speaker verification under additive noise. Odyssey 2012: 236-242 - 2011
- [j26]Santeri Yrttiaho, Patrick J. C. May, Hannu Tiitinen, Paavo Alku:
Cortical encoding of aperiodic and periodic speech sounds: Evidence for distinct neural populations. NeuroImage 55(3): 1252-1259 (2011) - [j25]Tuomo Raitio, Antti Suni, Junichi Yamagishi, Hannu Pulakka, Jani Nurminen, Martti Vainio, Paavo Alku:
HMM-Based Speech Synthesis Utilizing Glottal Inverse Filtering. IEEE Trans. Speech Audio Process. 19(1): 153-165 (2011) - [j24]Hannu Pulakka, Paavo Alku:
Bandwidth Extension of Telephone Speech Using a Neural Network and a Filter Bank Implementation for Highband Mel Spectrum. IEEE Trans. Speech Audio Process. 19(7): 2170-2183 (2011) - [c63]Antti Suni, Tuomo Raitio, Martti Vainio, Paavo Alku:
The GlottHMM Speech Synthesis Entry for Blizzard Challenge 2011: Utilizing Source Unit Selection in HMM-Based Speech Synthesis for Improved Excitation Generation. Blizzard Challenge 2011 - [c62]Tuomo Raitio, Antti Suni, Hannu Pulakka, Martti Vainio, Paavo Alku:
Utilizing glottal source pulse library for generating improved excitation signal for HMM-based speech synthesis. ICASSP 2011: 4564-4567 - [c61]Jouni Pohjalainen, Paavo Alku, Tomi Kinnunen:
Shout detection in noise. ICASSP 2011: 4968-4971 - [c60]Hannu Pulakka, Ulpu Remes, Kalle J. Palomäki, Mikko Kurimo, Paavo Alku:
Speech bandwidth extension using Gaussian mixture model-based estimation of the highband mel spectrum. ICASSP 2011: 5100-5103 - [c59]George P. Kafentzis, Yannis Stylianou, Paavo Alku:
Glottal inverse filtering using stabilised weighted linear prediction. ICASSP 2011: 5408-5411 - [c58]Hannu Pulakka, Ulpu Remes, Santeri Yrttiaho, Kalle J. Palomäki, Mikko Kurimo, Paavo Alku:
Low-Frequency Bandwidth Extension of Telephone Speech Using Sinusoidal Synthesis and Gaussian Mixture Model. INTERSPEECH 2011: 1181-1184 - [c57]Sami Keronen, Jouni Pohjalainen, Paavo Alku, Mikko Kurimo:
Noise Robust Feature Extraction Based on Extended Weighted Linear Prediction in LVCSR. INTERSPEECH 2011: 1265-1268 - [c56]Jouni Pohjalainen, Tuomo Raitio, Paavo Alku:
Detection of Shouted Speech in the Presence of Ambient Noise. INTERSPEECH 2011: 2621-2624 - [c55]Tuomo Raitio, Antti Suni, Martti Vainio, Paavo Alku:
Analysis of HMM-Based Lombard Speech Synthesis. INTERSPEECH 2011: 2781-2784 - [c54]Ricardo Teixeira Sousa, Aníbal J. S. Ferreira, Paavo Alku:
Estimation of harmonic and noise components of the glottal excitation. MAVEBA 2011: 115-118 - 2010
- [j23]Rahim Saeidi, Jouni Pohjalainen, Tomi Kinnunen, Paavo Alku:
Temporally Weighted Linear Prediction Features for Tackling Additive Noise in Speaker Verification. IEEE Signal Process. Lett. 17(6): 599-602 (2010) - [c53]Antti Suni, Tuomo Raitio, Martti Vainio, Paavo Alku:
The GlottHMM Speech Synthesis Entry for Blizzard Challenge 2010. Blizzard Challenge 2010 - [c52]Hannu Pulakka, Ville Myllylä, Laura Laaksonen, Paavo Alku:
Bandwidth extension of telephone speech using a filter bank implementation for highband MEL spectrum. EUSIPCO 2010: 979-983 - [c51]Martti Vainio, Matti Airas, Juhani Järvikivi, Paavo Alku:
Laryngeal voice quality in the expression of focus. INTERSPEECH 2010: 921-924 - [c50]Jouni Pohjalainen, Rahim Saeidi, Tomi Kinnunen, Paavo Alku:
Extended weighted linear prediction (XLP) analysis of speech and its application to speaker verification in adverse conditions. INTERSPEECH 2010: 1477-1480 - [c49]Rahim Saeidi, Jouni Pohjalainen, Tomi Kinnunen, Paavo Alku:
Temporally Weighted Linear Prediction Features for Speaker Verification in Additive Noise. Odyssey 2010: 8 - [c48]Tuomo Raitio, Antti Suni, Hannu Pulakka, Martti Vainio, Paavo Alku:
Comparison of formant enhancement methods for HMM-based speech synthesis. SSW 2010: 334-339
2000 – 2009
- 2009
- [j22]Carlo Magi, Jouni Pohjalainen, Tom Bäckström, Paavo Alku:
Stabilised weighted linear prediction. Speech Commun. 51(5): 401-411 (2009) - [j21]Laura Laaksonen, Hannu Pulakka, Ville Myllylä, Paavo Alku:
Development, evaluation and implementation of an artificial bandwidth extension method of telephone speech in mobile terminal. IEEE Trans. Consumer Electron. 55(2): 780-787 (2009) - [c47]Tomi Kinnunen, Paavo Alku:
On separating glottal source and vocal tract information in telephony speaker verification. ICASSP 2009: 4545-4548 - [c46]Jouni Pohjalainen, Heikki Kallasjoki, Kalle J. Palomäki, Mikko Kurimo, Paavo Alku:
Weighted linear prediction for speech analysis in noisy conditions. INTERSPEECH 2009: 1315-1318 - [c45]Martti Vainio, Antti Suni, Tuomo Raitio, Jani Nurminen, Juhani Järvikivi, Paavo Alku:
New method for delexicalization and its application to prosodic tagging for text-to-speech synthesis. INTERSPEECH 2009: 1703-1706 - [c44]Atte Aalto, Paavo Alku, Jarmo Malinen:
A LF-pulse from a simple glottal flow model. MAVEBA 2009: 199-202 - 2008
- [j20]Carlo Magi, Tom Bäckström, Paavo Alku:
Simple proofs of root locations of two symmetric linear prediction models. Signal Process. 88(7): 1894-1897 (2008) - [j19]Hannu Pulakka, Laura Laaksonen, Martti Vainio, Jouni Pohjalainen, Paavo Alku:
Evaluation of an Artificial Speech Bandwidth Extension Method in Three Languages. IEEE Trans. Speech Audio Process. 16(6): 1124-1137 (2008) - [c43]Tuomo Raitio, Antti Suni, Hannu Pulakka, Martti Vainio, Paavo Alku:
HMM-based Finnish text-to-speech system utilizing glottal inverse filtering. INTERSPEECH 2008: 1881-1884 - [c42]Paavo Alku, Carlo Magi, Tom Bäckström:
DC-constrained linear prediction for glottal inverse filtering. INTERSPEECH 2008: 2861-2864 - 2007
- [j18]Tom Bäckström, Carlo Magi, Paavo Alku:
Minimum Separation of Line Spectral Frequencies. IEEE Signal Process. Lett. 14(2): 145-147 (2007) - [j17]Juho Kontio, Laura Laaksonen, Paavo Alku:
Neural Network-Based Artificial Bandwidth Expansion of Speech. IEEE Trans. Speech Audio Process. 15(3): 873-881 (2007) - [c41]Carlo Magi, Tom Bäckström, Paavo Alku:
Stabilised weighted linear prediction - a robust all-pole method for speech processing. INTERSPEECH 2007: 522-525 - [c40]Matti Airas, Paavo Alku:
Comparison of multiple voice source parameters in different phonation types. INTERSPEECH 2007: 1410-1413 - [c39]Hannu Pulakka, Paavo Alku, Laura Laaksonen, Päivi Valve:
The effect of highband harmonic structure in the artificial bandwidth expansion of telephone speech. INTERSPEECH 2007: 2497-2500 - [c38]Matti Airas, Paavo Alku, Martti Vainio:
Laryngeal voice quality changes in expression of prominence in continuous speech. MAVEBA 2007: 135-138 - 2006
- [j16]Matti Airas, Paavo Alku:
Emotions in Vowel Segments of Continuous Speech: Analysis of the Glottal Flow Using the Normalised Amplitude Quotient. Phonetica 63(1): 26-46 (2006) - [c37]Hannu Pulakka, Laura Laaksonen, Paavo Alku:
Quality improvement of telephone speech by artificial bandwidth expansion - listening tests in three languages. INTERSPEECH 2006 - 2005
- [c36]Laura Laaksonen, Juho Kontio, Paavo Alku:
Artificial Bandwidth Expansion Method to Improve Intelligibility and Quality of AMR-Coded Narrowband Speech. ICASSP (1) 2005: 809-812 - [c35]Tom Bäckström, Matti Airas, Laura Lehto, Paavo Alku:
Objective Quality Measures for Glottal Inverse Filtering of Speech Pressure Signals. ICASSP (1) 2005: 897-900 - [c34]Paavo Alku, Matti Airas, Tom Bäckström, Hannu Pulakka:
Group delay function as a means to assess quality of glottal inverse filtering. INTERSPEECH 2005: 1053-1056 - [c33]Eva Björkner, Johan Sundberg, Paavo Alku:
Subglottal pressure and NAQ variation in voice production of classically trained baritone singers. INTERSPEECH 2005: 1057-1060 - [c32]Matti Airas, Hannu Pulakka, Tom Bäckström, Paavo Alku:
A toolkit for voice inverse filtering and parametrisation. INTERSPEECH 2005: 2145-2148 - [c31]Paavo Alku, Jaromír Horácek, Matti Airas, Anne-Maria Laukkanen:
Assessment of glottal inverse filtering by using aeroelastic modelling of phonation and FE modelling of vocal tract. MAVEBA 2005: 73-76 - 2004
- [j15]Paavo Alku, Tom Bäckström:
Linear predictive method for improved spectral modeling of lower frequencies of speech with small prediction orders. IEEE Trans. Speech Audio Process. 12(2): 93-99 (2004) - [j14]Tom Bäckström, Paavo Alku, Tuomas Paatero, W. Bastiaan Kleijn:
A time-domain interpretation for the LSP decomposition. IEEE Trans. Speech Audio Process. 12(6): 554-560 (2004) - [c30]Matti Airas, Paavo Alku:
Emotions in Short Vowel Segments: Effects of the Glottal Flow as Reflected by the Normalized Amplitude Quotient. ADS 2004: 13-24 - [c29]Paavo Alku, Matti Airas, Brad H. Story:
Evaluation of an inverse filtering technique using physical modeling of voice production. INTERSPEECH 2004: 497-500 - [c28]Hannu Pulakka, Paavo Alku, Svante Granqvist, Stellan Hertegard, Hans Larsson, Anne-Maria Laukkanen, Per-Ake Lindestad, Erkki Vilkman:
Analysis of the voice source in different phonation types: simultaneous high-speed imaging of the vocal fold vibration and glottal inverse filtering. INTERSPEECH 2004: 1121-1124 - 2003
- [j13]Tom Bäckström, Paavo Alku:
A constrained linear predictive model with the minimum-phase property. Signal Process. 83(10): 2259-2264 (2003) - [j12]W. Bastiaan Kleijn, Tom Bäckström, Paavo Alku:
On line spectral frequencies. IEEE Signal Process. Lett. 10(3): 75-77 (2003) - [j11]Tom Bäckström, Paavo Alku:
All-pole modeling technique based on weighted sum of LSP polynomials. IEEE Signal Process. Lett. 10(6): 180-183 (2003) - [c27]Paavo Alku, Tom Bäckström:
All-pole modeling of wide-band speech with symmetric linear prediction. ICASSP (1) 2003: 152-155 - [c26]Tom Bäckström, Paavo Alku:
On the stability of constrained linear predictive models. ICASSP (6) 2003: 285-288 - [c25]Paavo Alku, Tom Bäckström:
Linear predictive method with low-frequency emphasis. INTERSPEECH 2003: 433-436 - 2002
- [j10]Anna Mari Mäkelä, Paavo Alku, Ville Mäkinen, Jussi Valtonen, Patrick J. C. May, Hannu Tiitinen:
Human Cortical Dynamics Determined by Speech Fundamental Frequency. NeuroImage 17(3): 1300-1305 (2002) - [j9]Paavo Alku, Juha Vintturi, Erkki Vilkman:
Measuring the effect of fundamental frequency raising as a strategy for increasing vocal intensity in soft, normal and loud phonation. Speech Commun. 38(3-4): 321-334 (2002) - [j8]Tom Bäckström, Paavo Alku, Erkki Vilkman:
Time-domain parameterization of the closing phase of glottal airflow waveform from voices over a large intensity range. IEEE Trans. Speech Audio Process. 10(3): 186-192 (2002) - [c24]Tom Bäckström, Paavo Alku, W. Bastiaan Kleijn:
A time domain reformulation of linear prediction equivalent to the LSP decomposition. ICASSP 2002: 661-664 - [c23]Paavo Alku, Tom Bäckström:
All-pole modeling technique based on the Weighted Sum of the LSP polynomials. ICASSP 2002: 665-668 - [c22]Paavo Alku, Tom Bäckström:
All-pole modeling of wide-band speech using weighted sum of the LSP polynomials. INTERSPEECH 2002: 977-980 - 2001
- [j7]Friedemann Pulvermüller, Teija Kujala, Yury Shtyrov, Jaana Simola, Hannu Tiitinen, Paavo Alku, Kimmo Alho, Sami Martinkauppi, Risto J. Ilmoniemi, Risto Näätänen:
Memory Traces for Words as Revealed by the Mismatch Negativity. NeuroImage 14(3): 607-616 (2001) - [c21]Federico Avanzini, Paavo Alku, Matti Karjalainen:
One-delayed-mass model for efficient synthesis of glottal flow. INTERSPEECH 2001: 51-54 - [c20]Paavo Alku, Juha Vintturi, Erkki Vilkman:
The use of fundamental frequency raising as a strategy for increasing vocal intensity in soft, normal, and loud phonation. INTERSPEECH 2001: 919-922 - 2000
- [c19]Susanna Varho, Paavo Alku:
All-pole spectral modelling of voiced speech with a highly compressed set of parameters. EUSIPCO 2000: 1-4 - [c18]Paavo Alku, Hannu Tiitinen, Kalle J. Palomäki, Päivi Sivonen:
MEG-measurements of brain activity reveal the link between human speech production and perception. INTERSPEECH 2000: 11-14 - [c17]Kalle J. Palomäki, Paavo Alku, Ville Mäkinen, Patrick J. C. May, Hannu Tiitinen:
Neuromagnetic study on localization of speech sounds. INTERSPEECH 2000: 462-465 - [c16]Paavo Alku, Jan G. Svec, Erkki Vilkman, Frantisek Sram:
Analysis of voice production in breathy, normal and pressed phonation by comparing inverse filtering and videokymography. INTERSPEECH 2000: 885-888 - [c15]Susanna Varho, Paavo Alku:
A linear predictive method for highly compressed presentation of speech spectra. ISCAS 2000: 57-60
1990 – 1999
- 1999
- [j6]Paavo Alku, Juha Vintturi, Erkki Vilkman:
On the linearity of the relationship between the sound pressure level and the negative peak amplitude of the differentiated glottal flow in vowel production. Speech Commun. 28(4): 269-281 (1999) - [c14]Susanna Varho, Paavo Alku:
A new predictive method for all-pole modelling of speech spectra with a compressed set of parameters. ISCAS (3) 1999: 126-129 - 1998
- [j5]Susanna Varho, Paavo Alku:
Separated Linear Prediction - A new all-pole modelling technique for speech analysis. Speech Commun. 24(2): 111-121 (1998) - [j4]Paavo Alku, Erkki Vilkman, Anne-Maria Laukkanen:
Estimation of amplitude features of the glottal flow by inverse filtering speech pressure signals. Speech Commun. 24(2): 123-132 (1998) - [c13]Susanna Varho, Paavo Alku:
Spectral estimation of voiced speech with regressive linear prediction. EUSIPCO 1998: 1-4 - [c12]Paavo Alku, Susanna Varho:
A new linear predictive method for compression of speech signals. ICSLP 1998 - [c11]Paavo Alku, Juha Vintturi, Erkki Vilkman:
Analyzing the effect of secondary excitations of the vocal tract on vocal intensity in different loudness conditions. ICSLP 1998 - 1997
- [j3]Paavo Alku, Helmer Strik, Erkki Vilkman:
Parabolic spectral parameter - A new method for quantification of the glottal flow. Speech Commun. 22(1): 67-79 (1997) - 1996
- [j2]Paavo Alku, Erkki Vilkman:
Amplitude domain quotient for characterization of the glottal volume velocity waveform estimated by inverse filtering. Speech Commun. 18(2): 131-138 (1996) - [c10]Paavo Alku, Erkki Vilkman:
A frequency domain method for parametrization of the voice source. ICSLP 1996: 1569-1572 - 1994
- [c9]Paavo Alku, Erkki Vilkman:
Estimation of the glottal pulseform based on discrete all-pole modeling. ICSLP 1994: 1619-1622 - 1992
- [j1]Paavo Alku:
Glottal wave analysis with Pitch Synchronous Iterative Adaptive Inverse Filtering. Speech Commun. 11(2-3): 109-118 (1992) - [c8]Paavo Alku:
An automatic method to estimate the time-based parameters of the glottal pulseform. ICASSP 1992: 29-32 - [c7]Paavo Alku:
Inverse filtering of the glottal waveform using the Itakura-Saito distortion measure. ICSLP 1992: 847-850 - 1991
- [c6]Paavo Alku:
Glottal wave analysis with pitch synchronous iterative adaptive inverse filtering. EUROSPEECH 1991: 1081-1084 - 1990
- [c5]Paavo Alku:
Glottal-LPC based coding of telephone band vowels with simple all-pole excitation. ICSLP 1990: 89-92 - [c4]Paavo Alku, Erkki Vilkman, Unto K. Laine:
A comparison of EGG and a new automatic inverse filtering method in phonation change from breathy to normal. ICSLP 1990: 197-200
1980 – 1989
- 1989
- [c3]Matti Karjalainen, Toomas Altosaar, Paavo Alku, Lauri Lehtinen, Seppo Helle:
Speech processing in the object-oriented DSP environment QuickSig. EUROSPEECH 1989: 1450-1453 - [c2]Paavo Alku, Unto K. Laine:
A new glottal LPC method of low complexity for speech analysis and coding. EUROSPEECH 1989: 2031-2034 - 1988
- [c1]Matti Karjalainen, Toomas Altosaar, Paavo Alku:
QuickSig - an object-oriented signal processing environment. ICASSP 1988: 1682-1685
last updated on 2024-10-07 21:17 CEST by the dblp team
all metadata released as open data under CC0 1.0 license