


Ryo Aihara
2020 – today
- 2024
- [j9] Ryoichi Takashima, Yuya Sawa, Ryo Aihara, Tetsuya Takiguchi, Yoshie Imai:
Dysarthric Speech Recognition Using Pseudo-Labeling, Self-Supervised Feature Learning, and a Joint Multi-Task Learning Approach. IEEE Access 12: 36990-36999 (2024)
- [c29] Ryoichi Takashima, Takeru Otani, Ryo Aihara, Tetsuya Takiguchi, Shinya Taguchi:
Self-supervised learning using unlabeled speech with multiple types of speech disorder for disordered speech recognition. ASSETS 2024: 101:1-101:5
- [c28] Ryoichi Takashima, Fumiya Nakamura, Ryo Aihara, Tetsuya Takiguchi, Yusuke Itani:
Generation of Colored Subtitle Images Based on Emotional Information of Speech Utterances. EUSIPCO 2024: 536-540
- 2022
- [c27] Ryota Tsunoda, Ryo Aihara, Ryoichi Takashima, Tetsuya Takiguchi, Yoshie Imai:
Speaker-Targeted Audio-Visual Speech Recognition Using a Hybrid CTC/Attention Model with Interference Loss. ICASSP 2022: 251-255
- 2021
- [j8] Yuki Takashima, Ryoichi Takashima, Ryota Tsunoda, Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki, Nobuaki Motoyama:
Unsupervised domain adaptation for lip reading based on cross-modal knowledge distillation. EURASIP J. Audio Speech Music. Process. 2021(1): 44 (2021)
2010 – 2019
- 2019
- [c26] Ryo Aihara, Toshiyuki Hanazawa, Yohei Okato, Gordon Wichern, Jonathan Le Roux:
Teacher-student Deep Clustering for Low-delay Single Channel Speech Separation. ICASSP 2019: 690-694
- 2018
- [c25] Taiki Izumi, Takanobu Uramoto, Shingo Uenohara, Ken'ichi Furuya, Ryo Aihara, Toshiyuki Hanazawa, Yohei Okato:
Multichannel NMF with Reduced Computational Complexity for Speech Recognition. APSIPA 2018: 192-195
- [c24] Taiki Izumi, Ryo Aihara, Toshiyuki Hanazawa, Yohei Okato, Takanobu Uramoto, Shingo Uenohara, Ken'ichi Furuya:
Reducing Computational Complexity of Multichannel Nonnegative Matrix Factorization Using Initial Value Setting for Speech Recognition. CISIS 2018: 893-900
- 2017
- [c23] Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki:
Phoneme-Discriminative Features for Dysarthric Speech Conversion. INTERSPEECH 2017: 3374-3378
- [c22] Rina Ra, Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki:
Visual-to-speech conversion based on maximum likelihood estimation. MVA 2017: 518-521
- 2016
- [j7] Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki:
Multiple Non-Negative Matrix Factorization for Many-to-Many Voice Conversion. IEEE ACM Trans. Audio Speech Lang. Process. 24(7): 1175-1184 (2016)
- [c21] Yuichiro Kataoka, Toru Nakashika, Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki:
Selection of an optimum random matrix using a genetic algorithm for acoustic feature extraction. ICIS 2016: 1-6
- [c20] Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki:
Semi-non-negative matrix factorization using alternating direction method of multipliers for voice conversion. ICASSP 2016: 5170-5174
- [c19] Yuki Takashima, Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki, Nobuyuki Mitani, Kiyohiro Omori, Kaoru Nakazono:
Audio-Visual Speech Recognition Using Bimodal-Trained Bottleneck Features for a Person with Severe Hearing Loss. INTERSPEECH 2016: 277-281
- [c18] Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki:
Parallel Dictionary Learning for Voice Conversion Using Discriminative Graph-Embedded Non-Negative Matrix Factorization. INTERSPEECH 2016: 292-296
- 2015
- [j6] Kenta Masaka, Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki:
Multimodal voice conversion based on non-negative matrix factorization. EURASIP J. Audio Speech Music. Process. 2015: 24 (2015)
- [j5] Ryo Aihara, Takao Fujii, Toru Nakashika, Tetsuya Takiguchi, Yasuo Ariki:
Small-parallel exemplar-based voice conversion in noisy environments using affine non-negative matrix factorization. EURASIP J. Audio Speech Music. Process. 2015: 32 (2015)
- [j4] Yuki Takashima, Yasuhiro Kakihara, Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki, Nobuyuki Mitani, Kiyohiro Omori, Kaoru Nakazono:
Audio-Visual Speech Recognition Using Convolutive Bottleneck Networks for a Person with Severe Hearing Loss. IPSJ Trans. Comput. Vis. Appl. 7: 64-68 (2015)
- [j3] Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki:
Individuality-Preserving Voice Conversion for Articulation Disorders Using Phoneme-Categorized Exemplars. ACM Trans. Access. Comput. 6(4): 13:1-13:17 (2015)
- [c17] Ryo Aihara, Takao Fujii, Toru Nakashika, Tetsuya Takiguchi, Yasuo Ariki:
Noise-robust voice conversion using a small parallel data based on non-negative matrix factorization. EUSIPCO 2015: 315-319
- [c16] Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki:
Activity-mapping non-negative matrix factorization for exemplar-based voice conversion. ICASSP 2015: 4899-4903
- [c15] Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki:
Many-to-many voice conversion based on multiple non-negative matrix factorization. INTERSPEECH 2015: 2749-2753
- [c14] Reina Ueda, Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki:
Individuality-Preserving Spectrum Modification for Articulation Disorders Using Phone Selective Synthesis. SLPAT@Interspeech 2015: 118-123
- [c13] Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki:
Many-to-one voice conversion using exemplar-based sparse representation. WASPAA 2015: 1-5
- 2014
- [j2] Ryo Aihara, Ryoichi Takashima, Tetsuya Takiguchi, Yasuo Ariki:
A preliminary demonstration of exemplar-based voice conversion for articulation disorders using an individuality-preserving dictionary. EURASIP J. Audio Speech Music. Process. 2014: 5 (2014)
- [j1] Ryo Aihara, Ryoichi Takashima, Tetsuya Takiguchi, Yasuo Ariki:
Noise-Robust Voice Conversion Based on Sparse Spectral Mapping Using Non-negative Matrix Factorization. IEICE Trans. Inf. Syst. 97-D(6): 1411-1418 (2014)
- [c12] Ryo Aihara, Reina Ueda, Tetsuya Takiguchi, Yasuo Ariki:
Exemplar-based emotional voice conversion using non-negative matrix factorization. APSIPA 2014: 1-7
- [c11] Kenta Masaka, Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki:
Multimodal voice conversion using non-negative matrix factorization in noisy environments. ICASSP 2014: 1542-1546
- [c10] Ryo Aihara, Toru Nakashika, Tetsuya Takiguchi, Yasuo Ariki:
Voice conversion based on Non-negative matrix factorization using phoneme-categorized dictionary. ICASSP 2014: 7894-7898
- [c9] Kenta Masaka, Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki:
Multimodal exemplar-based voice conversion using lip features in noisy environments. INTERSPEECH 2014: 1159-1163
- [c8] E. Byambakhishig, Katsuyuki Tanaka, Ryo Aihara, Toru Nakashika, Tetsuya Takiguchi, Yasuo Ariki:
Error correction of automatic speech recognition based on normalized web distance. INTERSPEECH 2014: 2852-2856
- [c7] Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki:
Individuality-preserving Voice Conversion for Articulation Disorders Using Dictionary Selective Non-negative Matrix Factorization. SLPAT@ACL 2014: 29-37
- 2013
- [c6] Ryo Aihara, Ryoichi Takashima, Tetsuya Takiguchi, Yasuo Ariki:
Individuality-preserving voice conversion for articulation disorders based on non-negative matrix factorization. ICASSP 2013: 8037-8040
- [c5] Ryo Aihara, Ryoichi Takashima, Tetsuya Takiguchi, Yasuo Ariki:
Exemplar-based individuality-preserving voice conversion for articulation disorders in noisy environments. INTERSPEECH 2013: 3637-3641
- [c4] Takao Fujii, Ryo Aihara, Ryoichi Takashima, Tetsuya Takiguchi, Yasuo Ariki:
Voice conversion based on Non-negative Matrix Factorization in noisy environments. SII 2013: 495-498
- [c3] Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki:
Individuality-Preserving Voice Conversion for Articulation Disorders Using Locality-Constrained NMF. SLPAT 2013: 3-8
- [c2] Ryoichi Takashima, Ryo Aihara, Tetsuya Takiguchi, Yasuo Ariki:
Noise-robust voice conversion based on spectral mapping on sparse space. SSW 2013: 71-75
- 2012
- [c1] Ryo Aihara, Ryoichi Takashima, Tetsuya Takiguchi, Yasuo Ariki:
Consonant enhancement for articulation disorders based on non-negative matrix factorization. APSIPA 2012: 1-4
last updated on 2024-11-07 20:31 CET by the dblp team
all metadata released as open data under CC0 1.0 license