


Jiaming Zhou 0001
Person information
- affiliation: Nankai University, College of Computer Science, Tianjin, China
Other persons with the same name
- Jiaming Zhou — disambiguation page
2020 – today
- 2025
[j1]Hui Wang, Yifan Yang, Shujie Liu, Jinyu Li, Lingwei Meng, Tie-Yan Liu, Jiaming Zhou, Haoqin Sun, Yan Lu, Yong Qin:
StreamMel: Real-Time Zero-Shot Text-to-Speech Via Interleaved Continuous Autoregressive Modeling. IEEE Signal Process. Lett. 32: 3530-3534 (2025)
[c21]Jiaming Zhou, Shiyao Wang, Shiwan Zhao, Jiabei He, Haoqin Sun, Hui Wang, Cheng Liu, Aobo Kong, Yujie Guo, Xi Yang, Yequan Wang, Yonghua Lin, Yong Qin:
ChildMandarin: A Comprehensive Mandarin Speech Dataset for Young Children Aged 3-5. ACL (1) 2025: 12524-12537
[c20]Jiabei He, Shiwan Zhao, Jiaming Zhou, Haoqin Sun, Hui Wang, Yong Qin:
Emotion-Preserving Prosody Anonymization Network for Voice Privacy Protection. ICASSP 2025: 1-5
[c19]Cheng Liu, Hui Wang, Jinghua Zhao, Shiwan Zhao, Hui Bu, Xin Xu, Jiaming Zhou, Haoqin Sun, Yong Qin:
MusicEval: A Generative Music Dataset with Expert Ratings for Automatic Text-to-Music Evaluation. ICASSP 2025: 1-5
[c18]Haoqin Sun, Shiwan Zhao, Shaokai Li, Xiangyu Kong, Xuechen Wang, Jiaming Zhou, Aobo Kong, Yong Chen, Wenjia Zeng, Yong Qin:
Enhancing Emotion Recognition in Incomplete Data: A Novel Cross-Modal Alignment, Reconstruction, and Refinement Framework. ICASSP 2025: 1-5
[c17]Xuechen Wang, Shiwan Zhao, Haoqin Sun, Hui Wang, Jiaming Zhou, Yong Qin:
Enhancing Multimodal Emotion Recognition through Multi-Granularity Cross-Modal Alignment. ICASSP 2025: 1-5
[c16]Jiaming Zhou, Shiwan Zhao, Jiabei He, Hui Wang, Wenjia Zeng, Yong Chen, Haoqin Sun, Aobo Kong, Yong Qin:
M2R-Whisper: Multi-stage and Multi-scale Retrieval Augmentation for Enhancing Whisper. ICASSP 2025: 1-5
[c15]Jiaming Zhou, Shiwan Zhao, Hui Wang, Tian-Hao Zhang, Haoqin Sun, Xuechen Wang, Yong Qin:
Improving Zero-Shot Chinese-English Code-Switching ASR with kNN-CTC and Gated Monolingual Datastores. ICASSP 2025: 1-5
[c14]Jinghua Zhao, Yuhang Jia, Shiyao Wang, Jiaming Zhou, Hui Wang, Yong Qin:
Chinese-LiPS: A Chinese Audio-Visual Speech Recognition Dataset with Lip-Reading and Presentation Slides. ICME 2025: 1-6
[c13]Haoqin Sun, Jingguang Tian, Jiaming Zhou, Hui Wang, Jiabei He, Shiwan Zhao, Xiangyu Kong, Desheng Hu, Xinkang Xu, Xinhui Hu, Yong Qin:
RA-CLAP: Relation-Augmented Emotional Speaking Style Contrastive Language-Audio Pretraining For Speech Retrieval. INTERSPEECH 2025
[c12]Shiyao Wang, Jiaming Zhou, Shiwan Zhao, Yong Qin:
A Self-Training Approach for Whisper to Enhance Long Dysarthric Speech Recognition. INTERSPEECH 2025
[c11]Hui Wang, Shujie Liu, Lingwei Meng, Jinyu Li, Yifan Yang, Shiwan Zhao, Haiyang Sun, Yanqing Liu, Haoqin Sun, Jiaming Zhou, Yan Lu, Yong Qin:
FELLE: Autoregressive Speech Synthesis with Token-Wise Coarse-to-Fine Flow Matching. ACM Multimedia 2025: 10229-10238
[c10]Aobo Kong, Shiwan Zhao, Hao Chen, Qicheng Li, Yong Qin, Jiaming Zhou, Haoqin Sun:
Self-Prompt Tuning: Enable Autonomous Role-Playing in LLMs. NLPCC (1) 2025: 427-438
[i33]Cheng Liu, Hui Wang, Jinghua Zhao, Shiwan Zhao, Hui Bu, Xin Xu, Jiaming Zhou, Haoqin Sun, Yong Qin:
MusicEval: A Generative Music Corpus with Expert Ratings for Automatic Text-to-Music Evaluation. CoRR abs/2501.10811 (2025)
[i32]Hui Wang, Shujie Liu, Lingwei Meng, Jinyu Li, Yifan Yang, Shiwan Zhao, Haiyang Sun, Yanqing Liu, Haoqin Sun, Jiaming Zhou, Yan Lu, Yong Qin:
FELLE: Autoregressive Speech Synthesis with Token-Wise Coarse-to-Fine Flow Matching. CoRR abs/2502.11128 (2025)
[i31]Jiaming Zhou, Yujie Guo, Shiwan Zhao, Haoqin Sun, Hui Wang, Jiabei He, Aobo Kong, Shiyao Wang, Xi Yang, Yequan Wang, Yonghua Lin, Yong Qin:
CS-Dialogue: A 104-Hour Dataset of Spontaneous Mandarin-English Code-Switching Dialogues for Speech Recognition. CoRR abs/2502.18913 (2025)
[i30]Yang Chen, Hui Wang, Shiyao Wang, Junyang Chen, Jiabei He, Jiaming Zhou, Xi Yang, Yequan Wang, Yonghua Lin, Yong Qin:
SeniorTalk: A Chinese Conversation Dataset with Rich Annotations for Super-Aged Seniors. CoRR abs/2503.16578 (2025)
[i29]Jinghua Zhao, Yuhang Jia, Shiyao Wang, Jiaming Zhou, Hui Wang, Yong Qin:
Chinese-LiPS: A Chinese audio-visual speech recognition dataset with Lip-reading and Presentation Slides. CoRR abs/2504.15066 (2025)
[i28]Haoqin Sun, Jingguang Tian, Jiaming Zhou, Hui Wang, Jiabei He, Shiwan Zhao, Xiangyu Kong, Desheng Hu, Xinkang Xu, Xinhui Hu, Yong Qin:
RA-CLAP: Relation-Augmented Emotional Speaking Style Contrastive Language-Audio Pretraining For Speech Retrieval. CoRR abs/2505.19437 (2025)
[i27]Haoqin Sun, Xuechen Wang, Jinghua Zhao, Shiwan Zhao, Jiaming Zhou, Hui Wang, Jiabei He, Aobo Kong, Xi Yang, Yequan Wang, Yonghua Lin, Yong Qin:
EmotionTalk: An Interactive Chinese Multimodal Emotion Dataset With Rich Annotations. CoRR abs/2505.23018 (2025)
[i26]Hui Wang, Yifan Yang, Shujie Liu, Jinyu Li, Lingwei Meng, Yanqing Liu, Jiaming Zhou, Haoqin Sun, Yan Lu, Yong Qin:
StreamMel: Real-Time Zero-shot Text-to-Speech via Interleaved Continuous Autoregressive Modeling. CoRR abs/2506.12570 (2025)
[i25]Shiyao Wang, Jiaming Zhou, Shiwan Zhao, Yong Qin:
A Self-Training Approach for Whisper to Enhance Long Dysarthric Speech Recognition. CoRR abs/2506.22810 (2025)
[i24]Jiaming Zhou, Hongjie Chen, Shiwan Zhao, Jian Kang, Jie Li, Enzhi Wang, Yujie Guo, Haoqin Sun, Hui Wang, Aobo Kong, Yong Qin, Xuelong Li:
DIFFA: Large Language Diffusion Models Can Listen and Understand. CoRR abs/2507.18452 (2025)
[i23]Enzhi Wang, Qicheng Li, Shiwan Zhao, Aobo Kong, Jiaming Zhou, Xi Yang, Yequan Wang, Yonghua Lin, Yong Qin:
RealTalk-CN: A Realistic Chinese Speech-Text Dialogue Benchmark With Cross-Modal Interaction Analysis. CoRR abs/2508.10015 (2025)
[i22]Hui Wang, Cheng Liu, Junyang Chen, Haoze Liu, Yuhang Jia, Shiwan Zhao, Jiaming Zhou, Haoqin Sun, Hui Bu, Yong Qin:
TTA-Bench: A Comprehensive Benchmark for Evaluating Text-to-Audio Models. CoRR abs/2509.02398 (2025)
[i21]Yujie Guo, Jiaming Zhou, Yuhang Jia, Shiwan Zhao, Yong Qin:
GLAD: Global-Local Aware Dynamic Mixture-of-Experts for Multi-Talker ASR. CoRR abs/2509.13093 (2025)
[i20]Shiwan Zhao, Xuyang Zhao, Jiaming Zhou, Aobo Kong, Qicheng Li, Yong Qin:
Mind the Gap: Data Rewriting for Stable Off-Policy Supervised Fine-Tuning. CoRR abs/2509.15157 (2025)
[i19]Haoqin Sun, Chenyang Lyu, Xiangyu Kong, Shiwan Zhao, Jiaming Zhou, Hui Wang, Aobo Kong, Jinghua Zhao, Longyue Wang, Weihua Luo, Kaifu Zhang, Yong Qin:
MECap-R1: Emotion-aware Policy with Reinforcement Learning for Multimodal Emotion Captioning. CoRR abs/2509.18729 (2025)
[i18]Hui Wang, Jiaming Zhou, Jiabei He, Haoqin Sun, Yong Qin:
WildElder: A Chinese Elderly Speech Dataset from the Wild with Fine-Grained Manual Annotations. CoRR abs/2510.09344 (2025)
[i17]Hui Wang, Jinghua Zhao, Cheng Liu, Yuhang Jia, Haoqin Sun, Jiaming Zhou, Yong Qin:
AudioEval: Automatic Dual-Perspective and Multi-Dimensional Evaluation of Text-to-Audio-Generation. CoRR abs/2510.14570 (2025)
[i16]Hui Wang, Jinghua Zhao, Yifan Yang, Shujie Liu, Junyang Chen, Yanzhe Zhang, Shiwan Zhao, Jinyu Li, Jiaming Zhou, Haoqin Sun, Yan Lu, Yong Qin:
SpeechLLM-as-Judges: Towards General and Interpretable Speech Quality Evaluation. CoRR abs/2510.14664 (2025)
[i15]Shiyao Wang, Shiwan Zhao, Jiaming Zhou, Yong Qin:
Zero- and One-Shot Data Augmentation for Sentence-Level Dysarthric Speech Recognition in Constrained Scenarios. CoRR abs/2510.16700 (2025)
- 2024
[c9]Tian-Hao Zhang, Dinghao Zhou, Guiping Zhong, Jiaming Zhou, Baoxiang Li:
CIF-T: A Novel CIF-Based Transducer Architecture for Automatic Speech Recognition. ICASSP 2024: 10531-10535
[c8]Jiaming Zhou, Shiwan Zhao, Yaqi Liu, Wenjia Zeng, Yong Chen, Yong Qin:
KNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels. ICASSP 2024: 11006-11010
[c7]Rong Gong, Hongfei Xue, Lezhi Wang, Xin Xu, Qisheng Li, Lei Xie, Hui Bu, Shaomei Wu, Jiaming Zhou, Yong Qin, Binbin Zhang, Jun Du, Jia Bin, Ming Li:
AS-70: A Mandarin stuttered speech dataset for automatic speech recognition and stuttering event detection. INTERSPEECH 2024
[c6]Haoqin Sun, Shiwan Zhao, Xiangyu Kong, Xuechen Wang, Hui Wang, Jiaming Zhou, Yong Qin:
Iterative Prototype Refinement for Ambiguous Speech Emotion Recognition. INTERSPEECH 2024
[c5]Shiyao Wang, Shiwan Zhao, Jiaming Zhou, Aobo Kong, Yong Qin:
Enhancing Dysarthric Speech Recognition for Unseen Speakers via Prototype-Based Adaptation. INTERSPEECH 2024
[c4]Hui Wang, Shiwan Zhao, Jiaming Zhou, Xiguang Zheng, Haoqin Sun, Xuechen Wang, Yong Qin:
Uncertainty-Aware Mean Opinion Score Prediction. INTERSPEECH 2024
[c3]Hongfei Xue, Rong Gong, Mingchen Shao, Xin Xu, Lezhi Wang, Lei Xie, Hui Bu, Jiaming Zhou, Yong Qin, Jun Du, Ming Li, Binbin Zhang, Bin Jia:
Findings of the 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge. SLT 2024: 385-392
[c2]Shiyao Wang, Jiaming Zhou, Shiwan Zhao, Yong Qin:
PB-LRDWWS System For the SLT 2024 Low-Resource Dysarthria Wake-Up Word Spotting Challenge. SLT 2024: 586-591
[i14]Jiaming Zhou, Shiwan Zhao, Hui Wang, Tian-Hao Zhang, Haoqin Sun, Xuechen Wang, Yong Qin:
Improving Zero-Shot Chinese-English Code-Switching ASR with kNN-CTC and Gated Monolingual Datastores. CoRR abs/2406.03814 (2024)
[i13]Rong Gong, Hongfei Xue, Lezhi Wang, Xin Xu, Qisheng Li, Lei Xie, Hui Bu, Shaomei Wu, Jiaming Zhou, Yong Qin, Binbin Zhang, Jun Du, Jia Bin, Ming Li:
AS-70: A Mandarin stuttered speech dataset for automatic speech recognition and stuttering event detection. CoRR abs/2406.07256 (2024)
[i12]Aobo Kong, Shiwan Zhao, Hao Chen, Qicheng Li, Yong Qin, Ruiqi Sun, Xin Zhou, Jiaming Zhou, Haoqin Sun:
Self-Prompt Tuning: Enable Autonomous Role-Playing in LLMs. CoRR abs/2407.08995 (2024)
[i11]Haoqin Sun, Shiwan Zhao, Shaokai Li, Xiangyu Kong, Xuechen Wang, Aobo Kong, Jiaming Zhou, Yong Chen, Wenjia Zeng, Yong Qin:
Enhancing Emotion Recognition in Incomplete Data: A Novel Cross-Modal Alignment, Reconstruction, and Refinement Framework. CoRR abs/2407.09029 (2024)
[i10]Shiyao Wang, Shiwan Zhao, Jiaming Zhou, Aobo Kong, Yong Qin:
Enhancing Dysarthric Speech Recognition for Unseen Speakers via Prototype-Based Adaptation. CoRR abs/2407.18461 (2024)
[i9]Haoqin Sun, Shiwan Zhao, Xiangyu Kong, Xuechen Wang, Hui Wang, Jiaming Zhou, Yong Qin:
Iterative Prototype Refinement for Ambiguous Speech Emotion Recognition. CoRR abs/2408.00325 (2024)
[i8]Hui Wang, Shiwan Zhao, Jiaming Zhou, Xiguang Zheng, Haoqin Sun, Xuechen Wang, Yong Qin:
Uncertainty-Aware Mean Opinion Score Prediction. CoRR abs/2408.12829 (2024)
[i7]Shiyao Wang, Jiaming Zhou, Shiwan Zhao, Yong Qin:
PB-LRDWWS System for the SLT 2024 Low-Resource Dysarthria Wake-Up Word Spotting Challenge. CoRR abs/2409.04799 (2024)
[i6]Hongfei Xue, Rong Gong, Mingchen Shao, Xin Xu, Lezhi Wang, Lei Xie, Hui Bu, Jiaming Zhou, Yong Qin, Jun Du, Ming Li, Binbin Zhang, Bin Jia:
Findings of the 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge. CoRR abs/2409.05430 (2024)
[i5]Jiaming Zhou, Shiwan Zhao, Jiabei He, Hui Wang, Wenjia Zeng, Yong Chen, Haoqin Sun, Aobo Kong, Yong Qin:
M2R-Whisper: Multi-stage and Multi-scale Retrieval Augmentation for Enhancing Whisper. CoRR abs/2409.11889 (2024)
[i4]Jiaming Zhou, Shiyao Wang, Shiwan Zhao, Jiabei He, Haoqin Sun, Hui Wang, Cheng Liu, Aobo Kong, Yujie Guo, Yong Qin:
ChildMandarin: A Comprehensive Mandarin Speech Dataset for Young Children Aged 3-5. CoRR abs/2409.18584 (2024)
[i3]Xuechen Wang, Shiwan Zhao, Haoqin Sun, Hui Wang, Jiaming Zhou, Yong Qin:
Enhancing Multimodal Emotion Recognition through Multi-Granularity Cross-Modal Alignment. CoRR abs/2412.20821 (2024)
- 2023
[c1]Jiaming Zhou, Shiwan Zhao, Ning Jiang, Guoqing Zhao, Yong Qin:
MADI: Inter-Domain Matching and Intra-Domain Discrimination for Cross-Domain Speech Recognition. ICASSP 2023: 1-5
[i2]Jiaming Zhou, Shiwan Zhao, Ning Jiang, Guoqing Zhao, Yong Qin:
MADI: Inter-domain Matching and Intra-domain Discrimination for Cross-domain Speech Recognition. CoRR abs/2302.11224 (2023)
[i1]Jiaming Zhou, Shiwan Zhao, Yaqi Liu, Wenjia Zeng, Yong Chen, Yong Qin:
kNN-CTC: Enhancing ASR via Retrieval of CTC Pseudo Labels. CoRR abs/2312.13560 (2023)
last updated on 2026-02-08 23:15 CET by the dblp team
all metadata released as open data under CC0 1.0 license