


default search action
Nobuaki Minematsu
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c247]Haopeng Geng, Daisuke Saito, Nobuaki Minematsu:
A Pilot Study of Applying Sequence-to-Sequence Voice Conversion to Evaluate the Intelligibility of L2 Speech Using a Native Speaker's Shadowings. APSIPA 2024: 1-6 - [c246]Shuting Hao, Daisuke Saito, Nobuaki Minematsu:
Enhancing Acoustic Scene Classification with Layer-wise Fine-Tuning on the SSAST Model. APSIPA 2024: 1-6 - [c245]Joonyong Park, Daisuke Saito, Nobuaki Minematsu:
Analytic Study of Text-Free Speech Synthesis for Raw Audio using a Self-Supervised Learning Model. APSIPA 2024: 1-6 - [i7]Kentaro Onda, Joonyong Park, Nobuaki Minematsu, Daisuke Saito:
A Pilot Study of GSLM-based Simulation of Foreign Accentuation Only Using Native Speech Corpora. CoRR abs/2407.11370 (2024) - [i6]Haopeng Geng, Daisuke Saito, Nobuaki Minematsu:
Simulating Native Speaker Shadowing for Nonnative Speech Assessment with Latent Speech Representations. CoRR abs/2409.11742 (2024) - [i5]Haopeng Geng, Daisuke Saito, Nobuaki Minematsu:
A Pilot Study of Applying Sequence-to-Sequence Voice Conversion to Evaluate the Intelligibility of L2 Speech Using a Native Speaker's Shadowings. CoRR abs/2410.02239 (2024) - [i4]Joonyong Park, Daisuke Saito, Nobuaki Minematsu:
Analytic Study of Text-Free Speech Synthesis for Raw Audio using a Self-Supervised Learning Model. CoRR abs/2412.03074 (2024) - 2023
- [c244]Lifan Zhong, Erica Cooper, Junichi Yamagishi, Nobuaki Minematsu:
Exploring Isolated Musical Notes as Pre-training Data for Predominant Instrument Recognition in Polyphonic Music. APSIPA ASC 2023: 2312-2319 - [c243]Yurun He, Nobuaki Minematsu, Daisuke Saito:
Multiple Acoustic Features Speech Emotion Recognition Using Cross-Attention Transformer. ICASSP 2023: 1-5 - [c242]Qianying Liu, Zhuo Gong, Zhengdong Yang, Yuhang Yang, Sheng Li
, Chenchen Ding, Nobuaki Minematsu, Hao Huang, Fei Cheng, Chenhui Chu, Sadao Kurohashi:
Hierarchical Softmax for End-To-End Low-Resource Multilingual Speech Recognition. ICASSP 2023: 1-5 - [c241]Yingxiang Gao, Jaehyun Choi, Nobuaki Minematsu, Noriko Nakanishi, Daisuke Saito:
Automatic Prediction of Language Learners' Listenability Using Speech and Text Features Extracted from Listening Drills. INTERSPEECH 2023: 979-983 - [c240]Nobuaki Minematsu, Noriko Nakanishi, Yingxiang Gao, Haitong Sun:
A Unified Framework to Improve Learners' Skills of Perception and Production Based on Speech Shadowing and Overlapping. INTERSPEECH 2023: 3667-3668 - [c239]Haitong Sun, Yingxiang Gao, Yusuke Shozui, Tong Ma, Nobuaki Minematsu:
Sensitivity to Phonemic Contrasts and Insensitivity to Non-phonemic Contrasts of Various Speech Representations Tested for L2 Speech Assessment. SLaTE 2023: 31-35 - [c238]Chihiro Shoda, Yingxiang Gao, Yurun He, Nobuaki Minematsu, Noriko Nakanishi, Daisuke Saito:
Learners' Prosodic Control in the Task of Expressive Storytelling and Predicted Native Listeners' Impressions of the Learners' Speech. SLaTE 2023: 46-50 - [c237]Yusuke Shozui, Nobuaki Minematsu, Noriko Nakanishi, Daisuke Saito:
Density and Entropy of Spoken Syllables in American English and Japanese English Estimated with Acoustic Word Embeddings. SLaTE 2023: 131-135 - [i3]Lifan Zhong, Erica Cooper, Junichi Yamagishi, Nobuaki Minematsu:
Exploring Isolated Musical Notes as Pre-training Data for Predominant Instrument Recognition in Polyphonic Music. CoRR abs/2306.08850 (2023) - 2022
- [j28]Gaku Kotani
, Daisuke Saito, Nobuaki Minematsu
:
Voice Conversion Based on Deep Neural Networks for Time-Variant Linear Transformations. IEEE ACM Trans. Audio Speech Lang. Process. 30: 2981-2992 (2022) - [c236]Eisuke Konno, Daisuke Saito, Nobuaki Minematsu:
Quantifying Discriminability between NMF Bases. ICASSP 2022: 691-695 - [c235]Takuya Kunihara, Chuanbo Zhu, Nobuaki Minematsu, Noriko Nakanishi:
Gradual Improvements Observed in Learners' Perception and Production of L2 Sounds Through Continuing Shadowing Practices on a Daily Basis. INTERSPEECH 2022: 1303-1307 - [c234]Takeru Gorai, Daisuke Saito, Nobuaki Minematsu:
Text-to-speech synthesis using spectral modeling based on non-negative autoencoder. INTERSPEECH 2022: 1621-1625 - [c233]Takuya Kunihara, Chuanbo Zhu, Daisuke Saito, Nobuaki Minematsu, Noriko Nakanishi:
Detection of Learners' Listening Breakdown with Oral Dictation and Its Use to Model Listening Skill Improvement Exclusively Through Shadowing. INTERSPEECH 2022: 4461-4465 - [c232]Zhuo Gong, Daisuke Saito, Longfei Yang, Takahiro Shinozaki, Sheng Li
, Hisashi Kawai, Nobuaki Minematsu:
Self-Adaptive Multilingual ASR Rescoring with Language Identification and Unified Language Model. Odyssey 2022: 415-420 - [c231]Chuanbo Zhu, Takuya Kunihara, Daisuke Saito, Nobuaki Minematsu, Noriko Nakanishi:
Automatic Prediction of Intelligibility of Words and Phonemes Produced Orally by Japanese Learners of English. SLT 2022: 1029-1036 - [i2]Qianying Liu, Yuhang Yang, Zhuo Gong, Sheng Li
, Chenchen Ding, Nobuaki Minematsu, Hao Huang, Fei Cheng, Sadao Kurohashi:
Hierarchical Softmax for End-to-End Low-resource Multilingual Speech Recognition. CoRR abs/2204.03855 (2022) - 2021
- [c230]Ruiyan Chen, Tazuko Nishimura, Nobuaki Minematsu, Daisuke Saito:
Acoustic Simulation of Body-conducted Speech and Its Use to Convert One's Recorded Voices to One's Own Voices. APSIPA ASC 2021: 821-828 - [c229]Chuanbo Zhu, Ryo Hakoda, Daisuke Saito, Nobuaki Minematsu, Noriko Nakanishi, Tazuko Nishimura:
Multi-Granularity Annotation of Instantaneous Intelligibility of Learners' Utterances Based on Shadowing Techniques. ASRU 2021: 1071-1078 - [c228]Shintaro Ando, Nobuaki Minematsu, Daisuke Saito:
Lexical Density Analysis of Word Productions in Japanese English Using Acoustic Word Embeddings. Interspeech 2021: 4433-4437 - [c227]Yang Shen, Ayano Yasukagawa, Daisuke Saito, Nobuaki Minematsu, Kazuya Saito:
Optimized Prediction of Fluency of L2 English Based on Interpretable Network Using Quantity of Phonation and Quality of Pronunciation. SLT 2021: 698-704 - 2020
- [j27]Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose:
Tensor Factor Analysis for Arbitrary Speaker Conversion. IEICE Trans. Inf. Syst. 103-D(6): 1395-1405 (2020) - [c226]Shintaro Ando, Masayuki Suzuki, Nobuyasu Itoh, Gakuto Kurata, Nobuaki Minematsu:
Converting Written Language to Spoken Language with Neural Machine Translation for Language Modeling. ICASSP 2020: 8124-8128 - [c225]Zhenchao Lin, Ryo Takashima, Daisuke Saito, Nobuaki Minematsu, Noriko Nakanishi:
Shadowability Annotation with Fine Granularity on L2 Utterances and its Improvement with Native Listeners' Script-Shadowing. INTERSPEECH 2020: 3865-3869 - [c224]Yuma Shirahata, Daisuke Saito, Nobuaki Minematsu:
Discriminative Method to Extract Coarse Prosodic Structure and its Application for Statistical Phrase/Accent Command Estimation. INTERSPEECH 2020: 4427-4431
2010 – 2019
- 2019
- [j26]Tetsuya Hashimoto
, Daisuke Saito, Nobuaki Minematsu:
Many-to-Many and Completely Parallel-Data-Free Voice Conversion Based on Eigenspace DNN. IEEE ACM Trans. Audio Speech Lang. Process. 27(2): 332-341 (2019) - [c223]Gaku Kotani, Hitoshi Suda, Daisuke Saito, Nobuaki Minematsu:
Experimental investigation on the efficacy of Affine-DTW in the quality of voice conversion. APSIPA 2019: 119-124 - [c222]Shunsuke Goto, Daisuke Saito, Nobuaki Minematsu:
DNN-based Statistical Parametric Speech Synthesis Incorporating Non-negative Matrix Factorization. APSIPA 2019: 148-153 - [c221]Daisuke Saito, So Suzuki, Nobuaki Minematsu:
Speech representation based on tensor factor analysis and its application to speaker recognition and language identification. APSIPA 2019: 402-406 - [c220]Shunsuke Goto, Yuma Shirahata, Gaku Kotani, Hitoshi Suda, Daisuke Saito, Nobuaki Minematsu:
The UTokyo speech synthesis system for Blizzard Challenge 2019. Blizzard Challenge 2019 - [c219]Yusaku Korematsu, Daisuke Saito, Nobuaki Minematsu:
Cooking State Recognition based on Acoustic Event Detection. CEA@ICMR 2019: 41-44 - [c218]Tasavat Trisitichoke, Shintaro Ando, Daisuke Saito, Nobuaki Minematsu:
Analysis of Native Listeners' Facial Microexpressions While Shadowing Non-Native Speech - Potential of Shadowers' Facial Expressions for Comprehensibility Prediction. INTERSPEECH 2019: 1861-1865 - [c217]Shintaro Ando, Zhenchao Lin, Tasavat Trisitichoke, Yusuke Inoue, Fuki Yoshizawa, Daisuke Saito, Nobuaki Minematsu:
A Large Collection of Sentences Read Aloud by Vietnamese Learners of Japanese and Native Speaker's Reverse Shadowings. O-COCOSDA 2019: 1-6 - [c216]Adriana Guevara-Rukoz, Alexander Martin
, Yutaka Yamauchi, Nobuaki Minematsu:
Prototyping a web-based phonetic training game to improve /r/-/l/ identification by Japanese learners of English. SLaTE 2019: 20-24 - [c215]Zhenchao Lin, Yusuke Inoue, Tasavat Trisitichoke, Shintaro Ando, Daisuke Saito, Nobuaki Minematsu:
Native Listeners' Shadowing of Non-native Utterances as Spoken Annotation Representing Comprehensibility of the Utterances. SLaTE 2019: 43-47 - [c214]Satoshi Kobashikawa, Atushi Odakura, Takao Nakamura, Takeshi Mori, Kimitaka Endo, Takafumi Moriya, Ryo Masumura, Yushi Aono, Nobuaki Minematsu:
Does Speaking Training Application with Speech Recognition Motivate Junior High School Students in Actual Classroom? - A Case Study. SLaTE 2019: 119-123 - [c213]Hitoshi Suda, Daisuke Saito, Nobuaki Minematsu:
Voice Conversion without Explicit Separation of Source and Filter Components Based on Non-negative Matrix Factorization. SSW 2019: 69-74 - [c212]Yuma Shirahata, Daisuke Saito, Nobuaki Minematsu:
Generative Modeling of F0 Contours Leveraged by Phrase Structure and Its Application to Statistical Focus Control. SSW 2019: 228-233 - 2018
- [j25]Yi Zhao
, Shinji Takaki, Hieu-Thi Luong, Junichi Yamagishi, Daisuke Saito, Nobuaki Minematsu:
Wasserstein GAN and Waveform Loss-Based Acoustic Model Training for Multi-Speaker Text-to-Speech Synthesis Systems Using a WaveNet Vocoder. IEEE Access 6: 60478-60488 (2018) - [c211]Yasuhito Ohsugi, Daisuke Saito, Nobuaki Minematsu:
A Comparative Study of Statistical Conversion of Face to Voice Based on Their Subjective Impressions. INTERSPEECH 2018: 1001-1005 - [c210]Yusuke Inoue, Suguru Kabashima, Daisuke Saito, Nobuaki Minematsu, Kumi Kanamura, Yutaka Yamauchi:
A Study of Objective Measurement of Comprehensibility through Native Speakers' Shadowing of Learners' Utterances. INTERSPEECH 2018: 1651-1655 - [c209]Suguru Kabashima, Yusuke Inoue, Daisuke Saito, Nobuaki Minematsu:
DNN-Based Scoring of Language Learners' Proficiency Using Learners' Shadowings and Native Listeners' Responsive Shadowings. SLT 2018: 971-978 - [i1]Yi Zhao, Shinji Takaki, Hieu-Thi Luong, Junichi Yamagishi, Daisuke Saito, Nobuaki Minematsu:
Wasserstein GAN and Waveform Loss-based Acoustic Model Training for Multi-speaker Text-to-Speech Synthesis Systems Using a WaveNet Vocoder. CoRR abs/1807.11679 (2018) - 2017
- [j24]Masayuki Suzuki, Ryo Kuroiwa, Keisuke Innami, Shumpei Kobayashi, Shinya Shimizu, Nobuaki Minematsu, Keikichi Hirose:
Accent Sandhi Estimation of Tokyo Dialect of Japanese Using Conditional Random Fields. IEICE Trans. Inf. Syst. 100-D(4): 655-661 (2017) - [j23]Nobuaki Minematsu, Ibuki Nakamura, Masayuki Suzuki, Hiroko Hirano, Chieko Nakagawa, Noriko Nakamura, Yukinori Tagawa, Keikichi Hirose, Hiroya Hashimoto:
Development and Evaluation of Online Infrastructure to Aid Teaching and Learning of Japanese Prosody. IEICE Trans. Inf. Syst. 100-D(4): 662-669 (2017) - [c208]Gaku Kotani, Daisuke Saito, Nobuaki Minematsu:
Voice conversion based on deep neural networks for time-variant linear transformations. APSIPA 2017: 1259-1262 - [c207]Shinnosuke Takamichi, Daisuke Saito, Hiroshi Saruwatari, Nobuaki Minematsu:
The UTokyo speech synthesis system for Blizzard Challenge 2017. Blizzard Challenge 2017 - [c206]Shohei Toyama, Daisuke Saito, Nobuaki Minematsu:
Use of Global and Acoustic Features Associated with Contextual Factors to Adapt Language Models for Spontaneous Speech Recognition. INTERSPEECH 2017: 543-547 - [c205]Hidetsugu Uchida, Daisuke Saito, Nobuaki Minematsu:
Acoustic-to-Articulatory Mapping Based on Mixture of Probabilistic Canonical Correlation Analysis. INTERSPEECH 2017: 989-993 - [c204]Tetsuya Hashimoto, Hidetsugu Uchida, Daisuke Saito, Nobuaki Minematsu:
Parallel-Data-Free Many-to-Many Voice Conversion Based on DNN Integrated with Eigenspace Using a Non-Parallel Speech Corpus. INTERSPEECH 2017: 1278-1282 - [c203]Junwei Yue, Fumiya Shiozawa, Shohei Toyama, Yutaka Yamauchi, Kayoko Ito, Daisuke Saito, Nobuaki Minematsu:
Automatic Scoring of Shadowing Speech Based on DNN Posteriors and Their DTW. INTERSPEECH 2017: 1422-1426 - [c202]Yutaka Yamauchi, Junwei Yue, Kayoko Ito, Nobuaki Minematsu:
Investigation of teacher-selected sentences and machine-suggested sentences in terms of correlation between human ratings and GOP-based machine scores. SLaTE 2017: 30-35 - [c201]Nobuaki Minematsu, Daisuke Saito:
New Features and Effectiveness of Suzuki-kun, the First and Only Prosodic Reading Tutor of Tokyo Japanese. SLaTE 2017: 188 - [c200]Junwei Yue, Daisuke Saito, Nobuaki Minematsu, Yutaka Yamauchi, Kayoko Ito:
Development and Maintenance of Practical and In-service Systems for Recording Shadowing Utterances and Their Assessment. SLaTE 2017: 189 - 2016
- [j22]Josef R. Novak, Nobuaki Minematsu, Keikichi Hirose:
Phonetisaurus: Exploring grapheme-to-phoneme conversion with joint n-gram models in the WFST framework. Nat. Lang. Eng. 22(6): 907-938 (2016) - [c199]Tetsuya Hashimoto, Daisuke Saito, Nobuaki Minematsu:
Arbitrary speaker conversion based on speaker space bases constructed by deep neural networks. APSIPA 2016: 1-4 - [c198]Yi Zhao, Xiu You, Daisuke Saito, Nobuaki Minematsu:
The UTokyo System for Blizzard Challenge 2016. Blizzard Challenge 2016 - [c197]Yosuke Kashiwagi, Congying Zhang, Daisuke Saito, Nobuaki Minematsu:
Divergence estimation based on deep neural networks and its use for language identification. ICASSP 2016: 5435-5439 - [c196]Yi Yang, Hidetsugu Uchida, Daisuke Saito, Nobuaki Minematsu:
Voice Conversion Based on Matrix Variate Gaussian Mixture Model Using Multiple Frame Features. INTERSPEECH 2016: 302-306 - [c195]Hidetsugu Uchida, Daisuke Saito, Nobuaki Minematsu:
Prediction of the Articulatory Movements of Unseen Phonemes of a Speaker Using the Speech Structure of Another Speaker. INTERSPEECH 2016: 450-454 - [c194]Yi Zhao, Daisuke Saito, Nobuaki Minematsu:
Speaker Representations for Speaker Adaptation in Multiple Speakers' BLSTM-RNN-Based Speech Synthesis. INTERSPEECH 2016: 2268-2272 - [c193]Shuju Shi, Yosuke Kashiwagi, Shohei Toyama, Junwei Yue, Yutaka Yamauchi, Daisuke Saito, Nobuaki Minematsu:
Automatic Assessment and Error Detection of Shadowing Speech: Case of English Spoken by Japanese Learners. INTERSPEECH 2016: 3142-3146 - [c192]Shuju Shi, Chiharu Tsurutani, Xiaoli Feng, Jinsong Zhang, Nobuaki Minematsu:
Acoustic correlates and gender effects in production and perception of Japanese polite speech. ISCSLP 2016: 1-5 - [c191]Fumiya Shiozawa, Daisuke Saito, Nobuaki Minematsu:
Improved prediction of the accent gap between speakers of English for individual-based clustering of World Englishes. SLT 2016: 129-135 - [c190]Nobuaki Minematsu, Daisuke Saito, Nobuyuki Nishizawa:
Prosodic Reading Tutor of Japanese, Suzuki-kun: The first and only educational tool to teach the formal Japanese. SSW 2016: 122 - 2015
- [j21]Masayuki Suzuki, Gakuto Kurata, Masafumi Nishimura, Nobuaki Minematsu:
Discriminative re-ranking for automatic speech recognition by leveraging invariant structures. Speech Commun. 72: 208-217 (2015) - [j20]Greg Short, Keikichi Hirose, Mariko Kondo, Nobuaki Minematsu:
Automatic recognition of Japanese vowel length accounting for speaking rate and motivated by perception analysis. Speech Commun. 73: 47-63 (2015) - [c189]Tianze Shi, Shun Kasahara, Teeraphon Pongkittiphan, Nobuaki Minematsu, Daisuke Saito, Keikichi Hirose:
A measure of phonetic similarity to quantify pronunciation variation by using ASR technology. ICPhS 2015 - [c188]Hidetsugu Uchida, Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose:
Statistical acoustic-to-articulatory mapping unified with speaker normalization based on voice conversion. INTERSPEECH 2015: 588-592 - [c187]Yuichi Sato, Yosuke Kashiwagi, Nobuaki Minematsu, Daisuke Saito, Keikichi Hirose:
Noise-robust and stress-free visualization of pronunciation diversity of World Englishes using a learner's self-centered viewpoint. O-COCOSDA/CASLRE 2015: 1-6 - [c186]Teeraphon Pongkittiphan, Nobuaki Minematsu, Takehiko Makino, Daisuke Saito, Keikichi Hirose:
Automatic prediction of intelligibility of English words spoken with Japanese accents - comparative study of features and models used for prediction. SLaTE 2015: 19-22 - [c185]Nobuaki Minematsu, Hiroya Hashimoto, Hiroko Hirano, Daisuke Saito:
Development of a prosodic reading tutor of Japanese - effective use of TTS and F0 contour modeling techniques for CALL. SLaTE 2015: 189 - 2014
- [c184]Congying Zhang, Masayuki Suzuki, Gakuto Kurata, Masafumi Nishimura, Nobuaki Minematsu:
Leveraging phonetic context dependent invariant structure for continuous speech recognition. ChinaSIP 2014: 52-56 - [c183]Yi Luan, Daisuke Saito, Yosuke Kashiwagi, Nobuaki Minematsu, Keikichi Hirose:
Semi-supervised noise dictionary adaptation for exemplar-based noise robust speech recognition. ICASSP 2014: 1745-1748 - [c182]Shun Kasahara, S. Kitahara, Nobuaki Minematsu, Han-Ping Shen, Takehiko Makino, Daisuke Saito, K. Hiorse:
Improved and robust prediction of pronunciation distance for individual-basis clustering of World Englishes pronunciation. ICASSP 2014: 3216-3220 - [c181]Daisuke Saito, Hidenobu Doi, Nobuaki Minematsu, Keikichi Hirose:
Application of matrix variate Gaussian mixture model to statistical voice conversion. INTERSPEECH 2014: 2504-2508 - [c180]Yuji Kawase, Nobuaki Minematsu, Daisuke Saito, Keikichi Hirose:
Visualization of pronunciation diversity of world Englishes from a speaker's self-centered viewpoint. O-COCOSDA 2014: 1-5 - [c179]Nobuaki Minematsu:
Keynote 2: Perceptual and structural analysis of pronunciation diversity of World Englishes. O-COCOSDA 2014: 1-2 - [c178]Nobuaki Minematsu, Shun Kasahara, Takehiko Makino, Daisuke Saito, Keikichi Hirose:
Speaker-basis Accent Clustering Using Invariant Structure Analysis and the Speech Accent Archive. Odyssey 2014: 158-165 - 2013
- [j19]Yu Qiao, Dean Luo, Nobuaki Minematsu:
Unsupervised optimal phoneme segmentation: theory and experimental evaluation. IET Signal Process. 7(7): 577-586 (2013) - [j18]Greg Short, Keikichi Hirose, Nobuaki Minematsu:
Japanese lexical accent recognition for a CALL system by deriving classification equations with perceptual experiments. Speech Commun. 55(10): 1064-1080 (2013) - [j17]Masayuki Suzuki, Takuya Yoshioka, Shinji Watanabe
, Nobuaki Minematsu, Keikichi Hirose:
Feature Enhancement With Joint Use of Consecutive Corrupted and Noise Feature Vectors With Discriminative Region Weighting. IEEE Trans. Speech Audio Process. 21(10): 2172-2181 (2013) - [c177]Nobuaki Minematsu, Yousuke Ozaki, Keikichi Hirose, Donna Erickson:
Speaker-invariant and rhythm-sensitive representation of spoken words. APSIPA 2013: 1-9 - [c176]Han-Ping Shen, Nobuaki Minematsu, Takehiko Makino, Steven H. Weinberger, Teeraphon Pongkittiphan, Chung-Hsien Wu
:
Automatic pronunciation clustering using a World English archive and pronunciation structure analysis. ASRU 2013: 222-227 - [c175]Yosuke Kashiwagi, Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose:
Discriminative piecewise linear transformation based on deep learning for noise robust automatic speech recognition. ASRU 2013: 350-355 - [c174]Chengshuo Wang, Masayuki Suzuki, Nobuaki Minematsu, Kyoko Sakuraba, Keikichi Hirose:
Improved estimation of femininity using GMM supervectors and SVR for voice therapy of Gender Identity Disorder Clients. ICASSP 2013: 7751-7754 - [c173]Oraphan Krityakien, Keikichi Hirose, Nobuaki Minematsu:
Generation of fundamental frequency contours for Thai speech synthesis using tone nucleus model. INTERSPEECH 2013: 1037-1041 - [c172]Josef R. Novak, Nobuaki Minematsu, Keikichi Hirose:
Failure transitions for joint n-gram models and G2p conversion. INTERSPEECH 2013: 1821-1825 - [c171]Hiroko Hirano, Ibuki Nakamura, Nobuaki Minematsu, Masayuki Suzuki, Chieko Nakagawa, Noriko Nakamura, Yukinori Tagawa, Keikichi Hirose, Hiroya Hashimoto:
A free online accent and intonation dictionary for teachers and learners of Japanese. INTERSPEECH 2013: 1875-1876 - [c170]Ibuki Nakamura, Nobuaki Minematsu, Masayuki Suzuki, Hiroko Hirano, Chieko Nakagawa, Noriko Nakamura, Yukinori Tagawa, Keikichi Hirose, Hiroya Hashimoto:
Development of a web framework for teaching and learning Japanese prosody: OJAD (online Japanese accent dictionary). INTERSPEECH 2013: 2554-2558 - [c169]Nguyen Duc Duy, Masayuki Suzuki, Nobuaki Minematsu, Keikichi Hirose:
Artificial bandwidth extension based on regularized piecewise linear mapping with discriminative region weighting and long-Span features. INTERSPEECH 2013: 3453-3457 - [c168]Hiroko Hirano, Ibuki Nakamura, Nobuaki Minematsu, Masayuki Suzuki, Chieko Nakagawa, Noriko Nakamura, Yukinori Tagawa, Keikichi Hirose, Hiroya Hashimoto:
OJAD: a free online accent and intonation dictionary for teachers and learners of Japanese. SLaTE 2013: 94 - [c167]Teeraphon Pongkittiphan, Nobuaki Minematsu, Takehiko Makino, Keikichi Hirose:
Automatic detection of the words that will become unintelligible through Japanese accented pronunciation of English. SLaTE 2013: 109-111 - [c166]Greg Short, Keikichi Hirose, Nobuaki Minematsu:
Automatic recognition of vowel length in Japanese for a CALL system motivated by perceptual experiments. SLaTE 2013: 178-183 - [c165]Han-Ping Shen, Nobuaki Minematsu, Takehiko Makino, Steven H. Weinberger, Teeraphon Pongkittiphan, Chung-Hsien Wu:
Speaker-based accented English clustering using a world English archive. SLaTE 2013: 184-188 - [c164]Hiroya Hashimoto, Keikichi Hirose, Nobuaki Minematsu:
Context labels based on "bunsetsu" for HMM-based speech synthesis of Japanese. SSW 2013: 35-39 - 2012
- [j16]Qinghua Sun, Keikichi Hirose, Nobuaki Minematsu:
A method for generation of Mandarin F0 contours based on tone nucleus model and superpositional model. Speech Commun. 54(8): 932-945 (2012) - [j15]Daisuke Saito, Shinji Watanabe
, Atsushi Nakamura, Nobuaki Minematsu:
Statistical Voice Conversion Based on Noisy Channel Model. IEEE Trans. Speech Audio Process. 20(6): 1784-1794 (2012) - [c163]Josef R. Novak, Nobuaki Minematsu, Keikichi Hirose:
WFST-Based Grapheme-to-Phoneme Conversion: Open Source tools for Alignment, Model-Building and Decoding. FSMNLP 2012: 45-49 - [c162]Masayuki Suzuki, Takuya Yoshioka, Shinji Watanabe
, Nobuaki Minematsu, Keikichi Hirose:
MFCC enhancement using joint corrupted and noise feature space for highly non-stationary noise environments. ICASSP 2012: 4109-4112 - [c161]Keigo Chijiiwa, Masayuki Suzuki, Nobuaki Minematsu, Keikichi Hirose:
Unseen noise robust speech recognition using adaptive piecewise linear transformation. ICASSP 2012: 4289-4292 - [c160]Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose:
Effects of Speaker Adaptive Training on Tensor-based Arbitrary Speaker Conversion. INTERSPEECH 2012: 98-101 - [c159]Hiroya Hashimoto, Keikichi Hirose, Nobuaki Minematsu:
Improved Automatic Extraction of Generation Process Model Commands and Its use for Generating Fundamental Frequency Contours for Training HMM-based Speech Synthesis. INTERSPEECH 2012: 458-461 - [c158]Masayuki Suzuki, Gakuto Kurata, Masafumi Nishimura, Nobuaki Minematsu:
Discriminative Reranking for LVCSR Leveraging Invariant Structure. INTERSPEECH 2012: 563-566 - [c157]Josef R. Novak, Nobuaki Minematsu, Keikichi Hirose:
Dynamic Grammars with Lookahead Composition for WFST-based Speech Recognition. INTERSPEECH 2012: 1079-1082 - [c156]Josef R. Novak, Nobuaki Minematsu, Keikichi Hirose, Chiori Hori, Hideki Kashioka, Paul R. Dixon:
Improving WFST-based G2P Conversion with Alignment Constraints and RNNLM N-best Rescoring. INTERSPEECH 2012: 2526-2529 - [c155]Nobuaki Minematsu, Shumpei Kobayashi, Shinya Shimizu, Keikichi Hirose:
Improved Prediction of Japanese Word Accent Sandhi Using CRF. INTERSPEECH 2012: 2562-2565 - [c154]Yosuke Kashiwagi, Masayuki Suzuki, Nobuaki Minematsu, Keikichi Hirose:
Audio-visual feature integration based on piecewise linear transformation for noise robust automatic speech recognition. SLT 2012: 149-152 - [c153]Yi Luan, Masayuki Suzuki, Yutaka Yamauchi, Nobuaki Minematsu, Shuhei Kato, Keikichi Hirose:
Performance improvement of automatic pronunciation assessment in a noisy classroom. SLT 2012: 428-431 - [c152]Tongmu Zhao, Akemi Hoshino, Masayuki Suzuki, Nobuaki Minematsu, Keikichi Hirose:
Automatic Chinese pronunciation error detection using SVM trained with structural features. SLT 2012: 473-478 - 2011
- [j14]Dean Luo, Yu Qiao, Nobuaki Minematsu, Keikichi Hirose:
Regularized Maximum Likelihood Linear Regression Adaptation for Computer-Assisted Language Learning Systems. IEICE Trans. Inf. Syst. 94-D(2): 308-316 (2011) - [c151]Yu Qiao, Masayuki Suzuki, Nobuaki Minematsu, Keikichi Hirose:
Structure-constrained distribution matching using quadratic programming and its application to pronunciation evaluation. ACPR 2011: 350-354 - [c150]Di Lu, Takuya Nishimoto, Nobuaki Minematsu:
Decision of response timing for incremental speech recognition with reinforcement learning. ASRU 2011: 467-472 - [c149]Josef R. Novak, Nobuaki Minematsu, Keikichi Hirose:
Open Source WFST Tools for LVCSR Cascade Development. FSMNLP 2011: 65-73 - [c148]Aki Kunikoshi, Yao Qian, Frank K. Soong, Nobuaki Minematsu:
Improved F0 modeling and generation in voice conversion. ICASSP 2011: 4568-4571 - [c147]Daisuke Saito, Shinji Watanabe
, Atsushi Nakamura, Nobuaki Minematsu:
High accurate model-integration-based voice conversion using dynamic features and model structure optimization. ICASSP 2011: 4576-4579 - [c146]Daisuke Saito, Keisuke Yamamoto, Nobuaki Minematsu, Keikichi Hirose:
One-to-Many Voice Conversion Based on Tensor Representation of Speaker Space. INTERSPEECH 2011: 653-656 - [c145]Yu Qiao, Tong Tong, Nobuaki Minematsu:
A Study on Bag of Gaussian Model with Application to Voice Conversion. INTERSPEECH 2011: 657-660 - [c144]Masayuki Suzuki, Gakuto Kurata, Masafumi Nishimura, Nobuaki Minematsu:
Continuous Digits Recognition Leveraging Invariant Structure. INTERSPEECH 2011: 993-996 - [c143]Nobuaki Minematsu, Koji Okabe, Keisuke Ogaki, Keikichi Hirose:
Measurement of Objective Intelligibility of Japanese Accented English Using ERJ (English Read by Japanese) Database. INTERSPEECH 2011: 1481-1484 - [c142]Josef R. Novak, Nobuaki Minematsu, Keikichi Hirose:
Painless WFST Cascade Construction for LVCSR - Transducersaurus. INTERSPEECH 2011: 1537-1540 - [c141]Keikichi Hirose, Keiko Ochi, Ryusuke Mihara, Hiroya Hashimoto, Daisuke Saito, Nobuaki Minematsu:
Adaptation of Prosody in Speech Synthesis by Changing Command Values of the Generation Process Model of Fundamental Frequency. INTERSPEECH 2011: 2793-2796 - [c140]Miaomiao Wen, Miaomiao Wang, Keikichi Hirose, Nobuaki Minematsu:
Prosody Conversion for Emotional Mandarin Speech Synthesis Using the Tone Nucleus Model. INTERSPEECH 2011: 2797-2800 - [c139]Aki Kunikoshi, Yu Qiao, Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose:
Gesture Design of Hand-to-Speech Converter Derived from Speech-to-Hand Converter Based on Probabilistic Integration Model. INTERSPEECH 2011: 3025-3028 - [c138]Keikichi Hirose, Tatsuya Matsuda, Hiroya Hashimoto, Nobuaki Minematsu:
Representing fundamental frequency contours generated by HMM-based speech synthesis using generation process model. MLSP 2011: 1-6 - [c137]Greg Short, Keikichi Hirose, Nobuaki Minematsu:
Rule-based method for pitch level classification for a Japanese pitch accent CALL system. SLaTE 2011: 45-48 - [c136]Shuhei Kato, Greg Short, Nobuaki Minematsu, Chiharu Tsurutani, Keikichi Hirose:
Comparison of native and non-native evaluations of the naturalness of Japanesewords with prosody modified through voice morphing. SLaTE 2011: 145-148 - 2010
- [j13]Nobuaki Minematsu, Satoshi Asakawa, Masayuki Suzuki, Yu Qiao:
Speech Structure and Its Application to Robust Speech Processing. New Gener. Comput. 28(3): 299-319 (2010) - [j12]Yu Qiao, Nobuaki Minematsu:
A study on invariance of f-divergence and its application to speech recognition. IEEE Trans. Signal Process. 58(7): 3884-3890 (2010) - [c135]Keikichi Hirose, Keiko Ochi, Miaomiao Wang, Tatsuya Matsuda, Miaomiao Wen, Nobuaki Minematsu:
Using FO Contour Generation Process Model for Improved and Flexible Control of Prosodie Features in HMM-based Speech Synthesis. ESSV 2010: 84-93 - [c134]Nobuaki Minematsu:
Human Speech Model based on Information Separation. ESSV 2010: 273-280 - [c133]Greg Short, Keikichi Hirose, Nobuaki Minematsu:
Pitch Pattern Recognition of Isolated Words for the Development of a Japanese Language Call System. ESSV 2010: 304-313 - [c132]Yu Qiao, Daisuke Saito, Nobuaki Minematsu:
HMM-based sequence-to-frame mapping for voice conversion. ICASSP 2010: 4830-4833 - [c131]Masayuki Suzuki, Yu Qiao, Nobuaki Minematsu, Keikichi Hirose:
Integration of multilayer regression analysis with structure-based pronunciation assessment. INTERSPEECH 2010: 586-589 - [c130]Dean Luo, Yu Qiao, Nobuaki Minematsu, Yutaka Yamauchi, Keikichi Hirose:
Regularized-MLLR speaker adaptation for computer-assisted language learning system. INTERSPEECH 2010: 594-597 - [c129]Daisuke Saito, Shinji Watanabe, Atsushi Nakamura, Nobuaki Minematsu:
Probabilistic integration of joint density model and speaker model for voice conversion. INTERSPEECH 2010: 1728-1731 - [c128]Miaomiao Wang, Miaomiao Wen, Keikichi Hirose, Nobuaki Minematsu:
Improved generation of fundamental frequency in HMM-based speech synthesis using generation process model. INTERSPEECH 2010: 2166-2169 - [c127]Miaomiao Wen, Miaomiao Wang, Keikichi Hirose, Nobuaki Minematsu:
Improving Mandarin segmental duration prediction with automatically extracted syntax features. INTERSPEECH 2010: 2178-2181 - [c126]Miaomiao Wang, Miaomiao Wen, Keikichi Hirose, Nobuaki Minematsu:
A method for modeling and generating Mandarin tone contour with phrase intonation based on the generation process model. ISCSLP 2010: 153-156 - [c125]Xuebin Ma, Ruiyuan Xu, Nobuaki Minematsu, Yu Qiao, Keikichi Hirose, Aijun Li:
Dialect-based speaker classification using speaker-invariant dialect features. ISCSLP 2010: 171-176 - [c124]Nobuaki Minematsu:
Human speech model based on information separation and its application to speech processing. ISCSLP 2010: 477-482 - [c123]Miaomiao Wang, Miaomiao Wen, Daisuke Saito, Keikichi Hirose, Nobuaki Minematsu:
Improved generation of prosodic features in HMM-based Mandarin speech synthesis. SSW 2010: 359-364
2000 – 2009
- 2009
- [j11]Yu Qiao, Wei Wang, Nobuaki Minematsu, Jianzhuang Liu, Mitsou Takeda, Xiaoou Tang:
A Theory of Phase Singularities for Image Representation and its Applications to Object Tracking and Image Matching. IEEE Trans. Image Process. 18(10): 2153-2166 (2009) - [c122]Yu Qiao, Masayuki Suzuki, Nobuaki Minematsu:
A study on Hidden Structural Model and its application to labeling sequences. ASRU 2009: 118-123 - [c121]Masayuki Suzuki, Nobuaki Minematsu, Dean Luo, Keikichi Hirose:
Sub-structure-based estimation of pronunciation proficiency and classification of learners. ASRU 2009: 574-579 - [c120]Yu Qiao, Nobuaki Minematsu:
Mixture of Probabilistic Linear Regressions: A unified view of GMM-based mapping techiques. ICASSP 2009: 3913-3916 - [c119]Keiko Ochi
, Keikichi Hirose, Nobuaki Minematsu:
Control of prosodic focus in corpus-based generation of fundamental frequency contours of Japanese based on the generation process model. ICASSP 2009: 4257-4260 - [c118]Yu Qiao
, Masayuki Suzuki, Nobuaki Minematsu:
Affine invariant features and their application to speech recognition. ICASSP 2009: 4629-4632 - [c117]Aki Kunikoshi, Yu Qiao, Nobuaki Minematsu, Keikichi Hirose:
Speech generation from hand gestures based on space mapping. INTERSPEECH 2009: 308-311 - [c116]Antonio Rui Ferreira Rebordão, Shaikh Mostafa Al Masum, Keikichi Hirose, Nobuaki Minematsu:
How to improve TTS systems for emotional expressivity. INTERSPEECH 2009: 524-527 - [c115]Dean Luo, Yu Qiao, Nobuaki Minematsu, Yutaka Yamauchi, Keikichi Hirose:
Analysis and utilization of MLLR speaker adaptation technique for learners' pronunciation evaluation. INTERSPEECH 2009: 608-611 - [c114]Daisuke Saito, Yu Qiao, Nobuaki Minematsu, Keikichi Hirose:
Optimal event search using a structural cost function - improvement of structure to speech conversion. INTERSPEECH 2009: 2047-2050 - [c113]Xuebin Ma, Akira Nemoto, Nobuaki Minematsu, Yu Qiao, Keikichi Hirose:
Structural analysis of dialects, sub-dialects and sub-sub-dialects of Chinese. INTERSPEECH 2009: 2219-2222 - [c112]Yu Qiao, Nobuaki Minematsu, Keikichi Hirose:
On invariant structural representation for speech recognition: theoretical validation and experimental improvement. INTERSPEECH 2009: 3055-3058 - [c111]Dean Luo, Nobuaki Minematsu, Yutaka Yamauchi:
Development of a CALL system to enhance ESL/EFL learners' skills of shadowing and reading aloud. SLaTE 2009 - [c110]Dean Luo, Nobuaki Minematsu, Yutaka Yamauchi, Keikichi Hirose:
Analysis and comparison of automatic language proficiency assessment between shadowed sentences and read sentences. SLaTE 2009: 37-40 - [c109]Nobuaki Minematsu, Masayuki Suzuki:
Structure-based pronunciation assessment. SLaTE 2009 - [c108]Masayuki Suzuki, Dean Luo, Nobuaki Minematsu, Keikichi Hirose:
Improved structure-based automatic estimation of pronunciation proficiency. SLaTE 2009: 137-140 - 2008
- [j10]Toshiaki Kamada, Nobuaki Minematsu, Takashi Osanai, Hisanori Makinae, Masumi Tanimoto:
Speaker Verification in Realistic Noisy Environment in Forensic Science. IEICE Trans. Inf. Syst. 91-D(3): 558-566 (2008) - [j9]Xiaodong Wang, Keikichi Hirose, Jinsong Zhang, Nobuaki Minematsu:
Tone Recognition of Continuous Mandarin Speech Based on Tone Nucleus Model and Neural Network. IEICE Trans. Inf. Syst. 91-D(6): 1748-1755 (2008) - [j8]Michiko Watanabe, Keikichi Hirose, Yasuharu Den, Nobuaki Minematsu:
Filled pauses as cues to the complexity of upcoming phrases for native and non-native listeners. Speech Commun. 50(2): 81-94 (2008) - [c107]Yu Qiao, Wei Wang, Nobuaki Minematsu, Jianzhuang Liu, Xiaoou Tang:
Phase singularities for image representation and matching. ICASSP 2008: 885-888 - [c106]Yu Qiao
, Naoya Shimomura, Nobuaki Minematsu:
Unsupervised optimal phoneme segmentation: Objectives, algorithm and comparisons. ICASSP 2008: 3989-3992 - [c105]Satoshi Asakawa, Nobuaki Minematsu, Keikichi Hirose:
Multi-stream parameterization for structural speech recognition. ICASSP 2008: 4097-4100 - [c104]Daisuke Saito, Ryo Matsuura, Satoshi Asakawa, Nobuaki Minematsu, Keikichi Hirose:
Directional dependency of cepstrum on vocal tract length. ICASSP 2008: 4485-4488 - [c103]Yu Qiao, Nobuaki Minematsu:
Metric learning for unsupervised phoneme segmentation. INTERSPEECH 2008: 1060-1063 - [c102]Keiko Ochi, Keikichi Hirose, Nobuaki Minematsu:
Control of prosodic focus in corpus-based generation of fundamental frequency based on the generation process model. INTERSPEECH 2008: 1216 - [c101]Yu Qiao, Nobuaki Minematsu:
f-divergence is a generalized invariant measure between distributions. INTERSPEECH 2008: 1349-1352 - [c100]Daisuke Saito, Nobuaki Minematsu, Keikichi Hirose:
Decomposition of rotational distortion caused by VTL difference using eigenvalues of its transformation matrix. INTERSPEECH 2008: 1361-1364 - [c99]Daisuke Saito, Satoshi Asakawa, Nobuaki Minematsu, Keikichi Hirose:
Structure to speech conversion - speech generation based on infant-like vocal imitation. INTERSPEECH 2008: 1837-1840 - [c98]Md. Khademul Islam Molla, Keikichi Hirose, Nobuaki Minematsu:
Robust voiced/unvoiced speech classification using empirical mode decomposition and periodic correlation model. INTERSPEECH 2008: 2530-2533 - [c97]Dean Luo, Naoya Shimomura, Nobuaki Minematsu, Yutaka Yamauchi, Keikichi Hirose:
Automatic pronunciation evaluation of language learners' utterances generated through shadowing. INTERSPEECH 2008: 2807-2810 - [c96]Dean Luo, Nobuaki Minematsu, Yutaka Yamauchi, Keikichi Hirose:
Automatic Assessment of Language Proficiency through Shadowing. ISCSLP 2008: 41-44 - [c95]Keikichi Hirose, Qinghua Sun, Nobuaki Minematsu:
Corpus-based synthesis of Mandarin speech with F0 contours generated by superposing tone components on rule-generated phrase components. SLT 2008: 33-36 - 2007
- [c94]Yu Qiao, Satoshi Asakawa, Nobuaki Minematsu:
Random discriminant structure analysis for automatic recognition of connected vowels. ASRU 2007: 576-581 - [c93]Erhan Deger, Md. Khademul Islam Molla, Keikichi Hirose, Nobuaki Minematsu, Md. Kamrul Hasan:
Speech enhancement using soft thresholding with DCT-EMD based hybrid algorithm. EUSIPCO 2007: 75-79 - [c92]Nobuaki Minematsu, Kazutaka Maruyama, Kyoko Sakuraba, Keikichi Hirose, Niro Tayama, Satoshi Imaizumi, Toshio Yamauchi:
Development of a Femininity Estimator using Speaker Recognition Techniques for Voice Therapy of Gender Identity Disorder Clients. ICASSP (4) 2007: 297-300 - [c91]Michiko Watanabe, Yasuharu Den, Keikichi Hirose, Shusaku Miwa, Nobuaki Minematsu:
Features of pauses and conjunctions at syntactic and discourse boundaries in Japanese monologues. INTERSPEECH 2007: 118-121 - [c90]Nobuaki Minematsu, K. Kamata, Satoshi Asakawa, Takehiko Makino, Tazuko Nishimura, Keikichi Hirose:
Structural assessment of language learners' pronunciation. INTERSPEECH 2007: 210-213 - [c89]Erhan Deger, Md. Khademul Islam Molla, Keikichi Hirose, Nobuaki Minematsu, Md. Kamrul Hasan:
EMD based soft-thresholding for speech enhancement. INTERSPEECH 2007: 810-813 - [c88]Satoshi Asakawa, Nobuaki Minematsu, Keikichi Hirose:
Automatic recognition of connected vowels only using speaker-invariant representation of speech dynamics. INTERSPEECH 2007: 890-893 - [c87]Keikichi Hirose, Keiko Ochi, Nobuaki Minematsu:
Corpus-based generation of prosodic features from text based on generation process model. INTERSPEECH 2007: 1274-1277 - [c86]Seiya Takada, Yuji Yagi, Keikichi Hirose, Nobuaki Minematsu:
A framework of reply speech generation for concept-to-speech conversion in spoken dialogue systems. INTERSPEECH 2007: 1286-1289 - [c85]Md. Khademul Islam Molla, Keikichi Hirose, Nobuaki Minematsu, Md. Kamrul Hasan:
Pitch estimation of noisy speech signals using empirical mode decomposition. INTERSPEECH 2007: 1645-1648 - [c84]Hiroko Hirano, Keikichi Hirose, Goh Kawai, Wentao Gu, Nobuaki Minematsu:
F0 models show Chinese speakers of Japanese insert intonational boundaries and drop pitch. INTERSPEECH 2007: 1885-1888 - [c83]Nobuaki Minematsu, Tazuko Nishimura:
Consideration of Infants' Vocal Imitation Through Modeling Speech as Timbre-Based Melody. JSAI 2007: 26-39 - [c82]Nobuaki Minematsu:
Are learners myna birds to the averaged distributions of native speakers? - a note ofwarning from a serious speech engineer -. SLaTE 2007: 100-103 - [c81]Nobuaki Minematsu, K. Kamata, Satoshi Asakawa, Takehiko Makino, Keikichi Hirose:
Structural representation of pronunciation and its application for classifying Japanese learners of English. SLaTE 2007: 116-119 - [c80]Nobuaki Minematsu, Kyoko Sakuraba:
Development of a Femininity Estimator for Voice Therapy of Gender Identity Disorder Clients. Speaker Classification (2) 2007: 22-33 - [c79]Nobuaki Minematsu, Ryo Kuroiwa, Keikichi Hirose, Michiko Watanabe:
CRF-based statistical learning of Japanese accent sandhi for developing Japanese text-to-speech synthesis systems. SSW 2007: 148-153 - [c78]Qinghua Sun, Keikichi Hirose, Nobuaki Minematsu:
Two-step generation of Mandarin F0 contours based on tone nucleus and superpositional models. SSW 2007: 154-159 - 2006
- [j7]M. Khademul Islam Molla, Keikichi Hirose, Nobuaki Minematsu:
Separation of Mixed Audio Signals by Decomposing Hilbert Spectrum with Modified EMD. IEICE Trans. Fundam. Electron. Commun. Comput. Sci. 89-A(3): 727-734 (2006) - [c77]Michiko Watanabe, Keikichi Hirose, Yasuharu Den, Shusaku Miwa, Nobuaki Minematsu:
Factors influencing ratios of filled pauses at clause boundaries in Japanese. ExLing 2006: 253-256 - [c76]M. Khademul Islam Molla, Keikichi Hirose, Nobuaki Minematsu:
Separation of Mixed Audio Signals by Source Localization and Binary Masking with Hilbert Spectrum. ICA 2006: 641-648 - [c75]Md. Khademul Islam Molla, Keikichi Hirose, Nobuaki Minematsu:
Localization Based Separation of Mixed Audio Signals with Binary Masking of Hilbert Spectrum. ICASSP (5) 2006: 85-88 - [c74]Nobuaki Minematsu, Satoshi Asakawa, Keikichi Hirose:
Para-Linguistic Information Represented as Distortion of the Acoustic Universal Structure In Speech. ICASSP (1) 2006: 261-264 - [c73]Hiroko Hirano, Goh Kawai, Keikichi Hirose, Nobuaki Minematsu:
Unfilled pauses in Japanese sentences read aloud by non-native learners. INTERSPEECH 2006 - [c72]Keikichi Hirose, Yasufumi Asano, Nobuaki Minematsu:
Corpus-based generation of fundamental frequency contours using generation process model and considering emotional focuses. INTERSPEECH 2006 - [c71]Keikichi Hirose, Hui Hu, Xiaodong Wang, Nobuaki Minematsu:
Tone recognition of continuous speech of standard Chinese using neural network and tone nucleus model. INTERSPEECH 2006 - [c70]Chiharu Tsurutani, Yutaka Yamauchi, Nobuaki Minematsu, Dean Luo, Kazutaka Maruyama, Keikichi Hirose:
Development of a program for self assessment of Japanese pronunciation by English learners. INTERSPEECH 2006 - [c69]Michiko Watanabe, Yasuharu Den, Keikichi Hirose, Shusaku Miwa, Nobuaki Minematsu:
Factors affecting speakers² choice of fillers in Japanese presentations. INTERSPEECH 2006 - [c68]M. Khademul Islam Molla, Keikichi Hirose, Nobuaki Minematsu:
Localization based audio source separation by sub-band beamforming. ISCAS 2006 - [c67]Nobuaki Minematsu, Satoshi Asakawa, Keikichi Hirose:
Structural Representation of the pronunciation and its Use for Call. SLT 2006: 126-129 - 2005
- [j6]Keikichi Hirose, Kentaro Sato, Yasufumi Asano, Nobuaki Minematsu:
Synthesis of F0 contours using generation process model parameters predicted from unlabeled corpora: application to emotional speech synthesis. Speech Commun. 46(3-4): 385-404 (2005) - [c66]Yuji Yagi, Keikichi Hirose, Seiya Takada, Nobuaki Minematsu:
Improved concept-to-speech generation in a dialogue system on road guidance. CW 2005: 429-436 - [c65]Michiko Watanabe, Yasuharu Den, Keikichi Hirose, Nobuaki Minematsu:
The effects of filled pauses on native and non-native listeners2 speech processing. DiSS 2005: 169-172 - [c64]Nobuaki Minematsu:
Mathematical Evidence of the Acoustic Universal Structure in Speech. ICASSP (1) 2005: 889-892 - [c63]Michiko Watanabe, Keikichi Hirose, Yasuharu Den, Nobuaki Minematsu:
Filled pauses as cues to the complexity of following phrases. INTERSPEECH 2005: 37-40 - [c62]Satoshi Asakawa, Nobuaki Minematsu, Toshiko Isei-Jaakkola, Keikichi Hirose:
Structural representation of the non-native pronunciations. INTERSPEECH 2005: 165-168 - [c61]Md. Khademul Islam Molla, Keikichi Hirose, Nobuaki Minematsu:
Multi-band approach of audio source discrimination with empirical mode decomposition. INTERSPEECH 2005: 673-676 - [c60]Takao Murakami, Kazutaka Maruyama, Nobuaki Minematsu, Keikichi Hirose:
Japanese vowel recognition based on structural representation of speech. INTERSPEECH 2005: 1261-1264 - [c59]Keikichi Hirose, Yusuke Furuyama, Nobuaki Minematsu:
Corpus-based extraction of F0 contour generation process model parameters. INTERSPEECH 2005: 3257-3260 - [c58]Qinghua Sun, Keikichi Hirose, Wentao Gu, Nobuaki Minematsu:
Generation of fundamental frequency contours for Mandarin speech synthesis based on tone nucleus model. INTERSPEECH 2005: 3265-3268 - [c57]M. Khademul Islam Molla, Keikichi Hirose, Nobuaki Minematsu:
Audio source separation by source localization with Hilbert spectrum. ISCAS (6) 2005: 5734-5737 - 2004
- [j5]Nobuaki Minematsu, Bungo Matsuoka, Keikichi Hirose:
Prosodic Analysis and Modeling of Nagauta Singing to Generate Prosodic Contours from Standard Scores. IEICE Trans. Inf. Syst. 87-D(5): 1093-1101 (2004) - [j4]Shinya Kiriyama, Keikichi Hirose, Nobuaki Minematsu:
A spoken dialogue system for document information retrieval utilizing topic knowledge. Syst. Comput. Jpn. 35(5): 67-82 (2004) - [c56]Nobuaki Minematsu:
Yet another acoustic representation of speech sounds. ICASSP (1) 2004: 585-588 - [c55]Sungyup Chung, Keikichi Hirose, Nobuaki Minematsu:
N-gram language modeling of Japanese using bunsetsu boundaries. INTERSPEECH 2004: 993-996 - [c54]Nobuaki Minematsu:
Pronunciation assessment based upon the compatibility between a learner's pronunciation structure and the target language's lexical structure. INTERSPEECH 2004: 1317-1320 - [c53]Keikichi Hirose, Nobuaki Minematsu:
Use of prosodic features for speech recognition. INTERSPEECH 2004: 1445-1448 - [c52]Nobuaki Minematsu:
Pronunciation assessment based upon the phonological distortions observed in language learners' utterances. INTERSPEECH 2004: 1669-1672 - [c51]Md. Khademul Islam Molla, Keikichi Hirose, Nobuaki Minematsu:
Audio source separation from the mixture using empirical mode decomposition with independent subspace analysis. INTERSPEECH 2004: 2449-2452 - [c50]Michiko Watanabe, Yasuharu Den, Keikichi Hirose, Nobuaki Minematsu:
Clause types and filed pauses in Japanese spontaneous monologues. INTERSPEECH 2004: 2981-2984 - [c49]Keikichi Hirose, Kentaro Sato, Nobuaki Minematsu:
Corpus-based synthesis of fundamental frequency contours with various speaking styles from text using F0 contour generation process model. SSW 2004: 161-166 - [p1]Shinichi Kawamoto, Hiroshi Shimodaira, Tsuneo Nitta, Takuya Nishimoto, Satoshi Nakamura, Katsunobu Itou, Shigeo Morishima, Tatsuo Yotsukura, Atsuhiko Kai, Akinobu Lee, Yoichi Yamashita, Takao Kobayashi, Keiichi Tokuda, Keikichi Hirose, Nobuaki Minematsu, Atsushi Yamada, Yasuharu Den, Takehito Utsuro, Shigeki Sagayama:
Galatea: Open-Source Software for Developing Anthropomorphic Spoken Dialog Agents. Life-like characters 2004: 187-212 - 2003
- [j3]Atsuhiro Sakurai, Keikichi Hirose, Nobuaki Minematsu:
Data-driven generation of F0 contours using a superpositional model. Speech Commun. 40(4): 535-549 (2003) - [j2]Carlos Toshinori Ishi, Keikichi Hirose, Nobuaki Minematsu:
Mora F0 representation for accent type identification in continuous speech and considerations on its relation with perceived pitch values. Speech Commun. 41(2-3): 441-453 (2003) - [c48]Keikichi Hirose, Yusuke Furuyama, Shuichi Narusawa, Nobuaki Minematsu, Hiroya Fujisaki:
Use of linguistic information for automatic extraction of f_0 contour generation process model parameters. INTERSPEECH 2003: 141-144 - [c47]Keikichi Hirose, Takayuki Ono, Nobuaki Minematsu:
Corpus-based synthesis of fundamental frequency contours of Japanese using automatically-generated prosodic corpus and generation process model. INTERSPEECH 2003: 333-336 - [c46]Nobuaki Minematsu, Bungo Matsuoka, Keikichi Hirose:
Prosodic analysis and modeling of the NAGAUTA singing to synthesize its prosodic patterns from the standard notation. INTERSPEECH 2003: 385-388 - [c45]Nobuyuki Nishizawa, Keikichi Hirose, Nobuaki Minematsu:
Estimation of resonant characteristics based on AR-HMM modeling and spectral envelope conversion of vowel sounds. INTERSPEECH 2003: 885-888 - [c44]Keikichi Hirose, Junji Tago, Nobuaki Minematsu:
Speech generation from concept for realizing conversation with an agent in a virtual room. INTERSPEECH 2003: 1693-1696 - [c43]Nobuaki Minematsu, Changchen Guo, Keikichi Hirose:
CART-based factor analysis of intelligibility reduction in Japanese English. INTERSPEECH 2003: 2069-2072 - [c42]Nobuaki Minematsu, Koichi Osaki, Keikichi Hirose:
Improvement of non-native speech recognition by effectively modeling frequently observed pronunciation habits. INTERSPEECH 2003: 2597-2600 - [c41]Nobuaki Minematsu, Keita Yamauchi, Keikichi Hirose:
Automatic estimation of perceptual age using speaker modeling techniques. INTERSPEECH 2003: 3005-3008 - [c40]Keikichi Hirose, Frédéric Gendrin, Nobuaki Minematsu:
A pronunciation training system for Japanese lexical accents with corrective feedback in learner's voice. INTERSPEECH 2003: 3149-3152 - [c39]Taro Mouri, Keikichi Hirose, Nobuaki Minematsu:
Considerations on vowel durations for Japanese CALL system. INTERSPEECH 2003: 3153-3156 - 2002
- [c38]Nobuaki Minematsu, Mariko Sekiguchi, Keikichi Hirose:
Automatic estimation of one's age with his/her speech based upon acoustic modeling techniques of speakers. ICASSP 2002: 137-140 - [c37]Shuichi Narusawa, Nobuaki Minematsu, Keikichi Hirose, Hiroya Fujisaki:
A method for automatic extraction of model parameters from fundamental frequency contours of speech. ICASSP 2002: 509-512 - [c36]Nobuaki Minematsu, Gakuto Kurata, Keikichi Hirose:
Integration of MLLR adaptation with pronunciation proficiency adaptation for non-native speech recognition. INTERSPEECH 2002: 529-532 - [c35]Nobuaki Minematsu, Satoshi Kobashikawa, Keikichi Hirose, Donna Erickson:
Acoustic modeling of sentence stress using differential features between syllables for English rhythm learning system development. INTERSPEECH 2002: 745-748 - [c34]Keikichi Hirose, Nobuaki Minematsu, Makoto Terao:
Statistical language modeling with prosodic boundaries and its use for continuous speech recognition. INTERSPEECH 2002: 937-940 - [c33]Nobuaki Minematsu, Gakuto Kurata, Keikichi Hirose:
Corpus-based analysis of English spoken by Japanese students in view of the entire phonemic system of English. INTERSPEECH 2002: 1213-1216 - [c32]Baojie Li, Keikichi Hirose, Nobuaki Minematsu:
Robust speech recognition using inter-speaker and intra-speaker adaptation. INTERSPEECH 2002: 1397-1400 - [c31]Nobuyuki Nishizawa, Keikichi Hirose, Nobuaki Minematsu:
Separation of voiced source characteristics and vocal tract transfer function characteristics for speech sounds by iterative analysis based on AR-HMM model. INTERSPEECH 2002: 1721-1724 - [c30]Shuichi Narusawa, Nobuaki Minematsu, Keikichi Hirose, Hiroya Fujisaki:
Automatic extraction of model parameters from fundamental frequency contours of English utterances. INTERSPEECH 2002: 1725-1728 - [c29]Keikichi Hirose, Masaya Eto, Nobuaki Minematsu:
Improved corpus-based synthesis of fundamental frequency contours using generation process model. INTERSPEECH 2002: 2085-2088 - [c28]Nobuaki Minematsu, Yoshihiro Tomiyama, Kei Yoshimoto, Katsumasa Shimizu, Seiichi Nakagawa, Masatake Dantsuji, Shozo Makino:
English Speech Database Read by Japanese Learners for CALL System Development. LREC 2002 - 2001
- [c27]Atsuhiro Sakurai, Keikichi Hirose, Nobuaki Minematsu:
Generation of F0 contours using a model-constrained data-driven method. ICASSP 2001: 817-820 - [c26]Shinya Kiriyama, Keikichi Hirose, Nobuaki Minematsu:
Use of topic knowledge in spoken dialogue information retrieval system for academic documents. INTERSPEECH 2001: 1315-1318 - [c25]Keikichi Hirose, Masaya Eto, Nobuaki Minematsu, Atsuhiro Sakurai:
Corpus-based synthesis of fundamental frequency contours based on a generation process model. INTERSPEECH 2001: 2255-2258 - [c24]Carlos Toshinori Ishi, Nobuaki Minematsu, Ryuji Nishide, Keikichi Hirose:
Identification of accent and intonation in sentences for CALL systems. INTERSPEECH 2001: 2455-2458 - [c23]Naoki Nakamura, Nobuaki Minematsu, Seiichi Nakagawa:
Instantaneous estimation of accentuation habits for Japanese students to learn English pronunciation. INTERSPEECH 2001: 2811-2814 - 2000
- [c22]Nobuaki Minematsu, Seiichi Nakagawa:
Instantaneous estimation of prosodic pronunciation habits for Japanese students to learn English pronunciation. INTERSPEECH 2000: 191-194 - [c21]Atsuhiro Sakurai, Nobuaki Minematsu, Keikichi Hirose:
Data-driven intonation modeling using a neural network and a command response model. INTERSPEECH 2000: 223-226 - [c20]Shi-wook Lee, Keikichi Hirose, Nobuaki Minematsu:
Efficient search strategy in large vocabulary continuous speech recognition using prosodic boundary information. INTERSPEECH 2000: 274-277 - [c19]Baojie Li, Keikichi Hirose, Nobuaki Minematsu:
Modeling phone correlation for speaker adaptive speech recognition. INTERSPEECH 2000: 350-353 - [c18]Keikichi Hirose, Nobuaki Minematsu, Hiromichi Kawanami:
Analytical and perceptual study on the role of acoustic features in realizing emotional speech. INTERSPEECH 2000: 369-372 - [c17]Tatsuya Kawahara, Akinobu Lee, Tetsunori Kobayashi, Kazuya Takeda, Nobuaki Minematsu, Shigeki Sagayama, Katsunobu Itou, Akinori Ito, Mikio Yamamoto, Atsushi Yamada, Takehito Utsuro, Kiyohiro Shikano:
Free software toolkit for Japanese large vocabulary continuous speech recognition. INTERSPEECH 2000: 476-479 - [c16]Nobuaki Minematsu, Yukiko Fujisawa, Seiichi Nakagawa:
Performance comparison among HMM, DTW, and human abilities in terms of identifying stress patterns of word utterances. INTERSPEECH 2000: 617-620 - [c15]Nobuyuki Nishizawa, Nobuaki Minematsu, Keikichi Hirose:
Development of a formant-based analysis-synthesis system and generation of high quality liquid sounds of Japanese. INTERSPEECH 2000: 725-728 - [c14]Nobuaki Minematsu, Seiichi Nakagawa:
Quality improvement of PSOLA analysis-synthesis using partial zero-phase conversion. INTERSPEECH 2000: 779-782 - [c13]Carlos Toshinori Ishi, Keikichi Hirose, Nobuaki Minematsu:
Identification of Japanese double-mora phonemes considering speaking rate for the use in CALL systems. INTERSPEECH 2000: 786-790 - [c12]Katsunobu Itou, Kiyohiro Shikano, Tatsuya Kawahara, Kazuya Takeda, Atsushi Yamada, Akinori Ito, Takehito Utsuro, Tetsunori Kobayashi, Nobuaki Minematsu, Mikio Yamamoto, Shigeki Sagayama, Akinobu Lee:
IPA Japanese Dictation Free Software Project. LREC 2000
1990 – 1999
- 1998
- [c11]Yukiko Fujisawa, Nobuaki Minematsu, Seiichi Nakagawa:
Evaluation of Japanese manners of generating word accent of English based on a stressed syllable detection technique. ICSLP 1998 - [c10]Kengo Hanai, Kazumasa Yamamoto, Nobuaki Minematsu, Seiichi Nakagawa:
Continuous speech recognition using segmental unit input HMMs with a mixture of probability density functions and context dependency. ICSLP 1998 - [c9]Tatsuya Kawahara, Tetsunori Kobayashi, Kazuya Takeda, Nobuaki Minematsu, Katsunobu Itou, Mikio Yamamoto, Atsushi Yamada, Takehito Utsuro, Kiyohiro Shikano:
Sharable software repository for Japanese large vocabulary continuous speech recognition. ICSLP 1998 - [c8]Nobuaki Minematsu, Seiichi Nakagawa:
Modeling of variations in cepstral coefficients caused by F0 changes and its application to speech processing. ICSLP 1998 - 1997
- [c7]Nobuaki Minematsu, Nariaki Ohashi, Seiichi Nakagawa:
Automatic detection of accent in English words spoken by Japanese students. EUROSPEECH 1997: 701-704 - 1996
- [c6]Nobuaki Minematsu, Seiichi Nakagawa:
Automatic detection of accent nuclei at the head of words for speech recognition. ICSLP 1996: 1620-1623 - [c5]Nobuaki Minematsu, Seiichi Nakagawa, Keikichi Hirose:
Prosodic manipulation system of speech material for perceptual experiments. ICSLP 1996: 2056-2059 - 1995
- [j1]Nobuaki Minematsu, Keikichi Hirose:
Duration Modeling with Decreased Intra-Group Temporal Variation for HMM-Based Phoneme Recognition. IEICE Trans. Inf. Syst. 78-D(6): 654-661 (1995) - 1994
- [c4]Nobuaki Minematsu, Keikichi Hirose:
Speech recognition using HMM with decreased intra-group variation in the temporal structure. ICSLP 1994: 187-190 - [c3]Nobuaki Minematsu, Keikichi Hirose:
Role of prosodic features in the human process of speech perception. ICSLP 1994: 1151-1154 - 1992
- [c2]Nobuaki Minematsu, Sumio Ohno, Keikichi Hirose, Hiroya Fujisaki:
The influence of semantic and syntactic information on spoken sentence recognition. ICSLP 1992: 153-156 - 1990
- [c1]Hiroya Fujisaki, Keikichi Hirose, Sumio Ohno, Nobuaki Minematsu:
Influence of context and knowledge on the perception of continuous speech. ICSLP 1990: 417-420
Coauthor Index
aka: M. Khademul Islam Molla

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-02-27 22:46 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint