default search action
Yi Ren 0006
Person information
- affiliation: Zhejiang University, China
Other persons with the same name
- Yi Ren — disambiguation page
- Yi Ren 0001 (aka: Edwin Yi Ren) — University of East Anglia, School of Computing Science, Norwich, UK (and 2 more)
- Yi Ren 0002 — Chongqing University of Posts and Telecommunications, School of Electronic Engineering, China (and 3 more)
- Yi Ren 0003 — Beihang University, School of Reliability and Systems Engineering, Beijing, China
- Yi Ren 0004 — University of Pittsburgh, Department of Biostatistics, Graduate School of Public Health, PA, USA
- Yi Ren 0005 — Technion - Israel Institute of Technology, Computer Science Department, Haifa, Israel
- Yi Ren 0007 — Columbia University, NY, USA
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j3]Can Jiang, Yi Ren, Bo Yang, Hong Peng, Xiaohui Luo:
Retinal vessels segmentation method based on dynamic threshold neural P systems with orientation feedback. J. Membr. Comput. 6(4): 266-277 (2024) - [j2]Mingyang Zhang, Yi Zhou, Yi Ren, Chen Zhang, Xiang Yin, Haizhou Li:
RefXVC: Cross-Lingual Voice Conversion With Enhanced Reference Leveraging. IEEE ACM Trans. Audio Speech Lang. Process. 32: 4146-4156 (2024) - [j1]Chen Zhang, Yi Ren, Kejun Zhang, Shuicheng Yan:
SDMuse: Stochastic Differential Music Editing and Generation via Hybrid Representation. IEEE Trans. Multim. 26: 1681-1689 (2024) - [c58]Rui Liu, Yifan Hu, Yi Ren, Xiang Yin, Haizhou Li:
Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling. AAAI 2024: 18698-18706 - [c57]Rongjie Huang, Mingze Li, Dongchao Yang, Jiatong Shi, Xuankai Chang, Zhenhui Ye, Yuning Wu, Zhiqing Hong, Jiawei Huang, Jinglin Liu, Yi Ren, Yuexian Zou, Zhou Zhao, Shinji Watanabe:
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head. AAAI 2024: 23802-23804 - [c56]Ziyue Jiang, Jinglin Liu, Yi Ren, Jinzheng He, Zhenhui Ye, Shengpeng Ji, Qian Yang, Chen Zhang, Pengfei Wei, Chunfeng Wang, Xiang Yin, Zejun Ma, Zhou Zhao:
Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis. ICLR 2024 - [c55]Zhenhui Ye, Tianyun Zhong, Yi Ren, Jiaqi Yang, Weichuang Li, Jiawei Huang, Ziyue Jiang, Jinzheng He, Rongjie Huang, Jinglin Liu, Chen Zhang, Xiang Yin, Zejun Ma, Zhou Zhao:
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis. ICLR 2024 - [c54]Rui Liu, Yifan Hu, Yi Ren, Xiang Yin, Haizhou Li:
Generative Expressive Conversational Speech Synthesis. ACM Multimedia 2024: 4187-4196 - [i63]Zhenhui Ye, Tianyun Zhong, Yi Ren, Jiaqi Yang, Weichuang Li, Jiawei Huang, Ziyue Jiang, Jinzheng He, Rongjie Huang, Jinglin Liu, Chen Zhang, Xiang Yin, Zejun Ma, Zhou Zhao:
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis. CoRR abs/2401.08503 (2024) - [i62]Lingdong Kong, Shaoyuan Xie, Hanjiang Hu, Yaru Niu, Wei Tsang Ooi, Benoit R. Cottereau, Lai Xing Ng, Yuexin Ma, Wenwei Zhang, Liang Pan, Kai Chen, Ziwei Liu, Weichao Qiu, Wei Zhang, Xu Cao, Hao Lu, Ying-Cong Chen, Caixin Kang, Xinning Zhou, Chengyang Ying, Wentao Shang, Xingwei Wang, Yinpeng Dong, Bo Yang, Shengyin Jiang, Zeliang Ma, Dengyi Ji, Haiwen Li, Xingliang Huang, Yu Tian, Genghua Kou, Fan Jia, Yingfei Liu, Tiancai Wang, Ying Li, Xiaoshuai Hao, Yifan Yang, Hui Zhang, Mengchuan Wei, Yi Zhou, Haimei Zhao, Jing Zhang, Jinke Li, Xiao He, Xiaoqiang Cheng, Bingyang Zhang, Lirong Zhao, Dianlei Ding, Fangsheng Liu, Yixiang Yan, Hongming Wang, Nanfei Ye, Lun Luo, Yubo Tian, Yiwei Zuo, Zhe Cao, Yi Ren, Yunfan Li, Wenjie Liu, Xun Wu, Yifan Mao, Ming Li, Jian Liu, Jiayang Liu, Zihan Qin, Cunxi Chu, Jialei Xu, Wenbo Zhao, Junjun Jiang, Xianming Liu, Ziyan Wang, Chiwei Li, Shilong Li, Chendong Yuan, Songyue Yang, Wentao Liu, Peng Chen, Bin Zhou, Yubo Wang, Chi Zhang, Jianhang Sun, Hai Chen, Xiao Yang, Lizhong Wang, Dongyi Fu, Yongchun Lin, Huitong Yang, Haoang Li, Yadan Luo, Xianjing Cheng, Yong Xu:
The RoboDrive Challenge: Drive Anytime Anywhere in Any Condition. CoRR abs/2405.08816 (2024) - [i61]Rui Liu, Yifan Hu, Yi Ren, Xiang Yin, Haizhou Li:
Generative Expressive Conversational Speech Synthesis. CoRR abs/2407.21491 (2024) - [i60]Jiawei Huang, Chen Zhang, Yi Ren, Ziyue Jiang, Zhenhui Ye, Jinglin Liu, Jinzheng He, Xiang Yin, Zhou Zhao:
MulliVC: Multi-lingual Voice Conversion With Cycle Consistency. CoRR abs/2408.04708 (2024) - [i59]Zhenhui Ye, Tianyun Zhong, Yi Ren, Ziyue Jiang, Jiawei Huang, Rongjie Huang, Jinglin Liu, Jinzheng He, Chen Zhang, Zehan Wang, Xize Chen, Xiang Yin, Zhou Zhao:
MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes. CoRR abs/2410.06734 (2024) - 2023
- [c53]Rongjie Huang, Yi Ren, Ziyue Jiang, Chenye Cui, Jinglin Liu, Zhou Zhao:
FastDiff 2: Revisiting and Incorporating GANs and Diffusion Models in High-Fidelity Speech Synthesis. ACL (Findings) 2023: 6994-7009 - [c52]Rongjie Huang, Chunlei Zhang, Yi Ren, Zhou Zhao, Dong Yu:
Prosody-TTS: Improving Prosody with Masked Autoencoder and Conditional Diffusion Model For Expressive Text-to-Speech. ACL (Findings) 2023: 8018-8034 - [c51]Rongjie Huang, Huadai Liu, Xize Cheng, Yi Ren, Linjun Li, Zhenhui Ye, Jinzheng He, Lichao Zhang, Jinglin Liu, Xiang Yin, Zhou Zhao:
AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation. ACL (1) 2023: 8590-8604 - [c50]Zhenhui Ye, Rongjie Huang, Yi Ren, Ziyue Jiang, Jinglin Liu, Jinzheng He, Xiang Yin, Zhou Zhao:
CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-Training. ACL (1) 2023: 9317-9331 - [c49]Ziyue Jiang, Qian Yang, Jialong Zuo, Zhenhui Ye, Rongjie Huang, Yi Ren, Zhou Zhao:
FluentSpeech: Stutter-Oriented Automatic Speech Editing with Context-Aware Diffusion Models. ACL (Findings) 2023: 11655-11671 - [c48]Chenye Cui, Zhou Zhao, Yi Ren, Jinglin Liu, Rongjie Huang, Feiyang Chen, Zhefeng Wang, Baoxing Huai, Fei Wu:
VarietySound: Timbre-Controllable Video to Sound Generation Via Unsupervised Information Disentanglement. ICASSP 2023: 1-5 - [c47]Qinglin Zhang, Chong Deng, Jiaqing Liu, Hai Yu, Qian Chen, Wen Wang, Zhijie Yan, Jinglin Liu, Yi Ren, Zhou Zhao:
Overview of the ICASSP 2023 General Meeting Understanding and Generation Challenge (MUG). ICASSP 2023: 1-2 - [c46]Qinglin Zhang, Chong Deng, Jiaqing Liu, Hai Yu, Qian Chen, Wen Wang, Zhijie Yan, Jinglin Liu, Yi Ren, Zhou Zhao:
MUG: A General Meeting Understanding and Generation Benchmark. ICASSP 2023: 1-5 - [c45]Yi Ren, Chen Zhang, Shuicheng Yan:
Bag of Tricks for Unsupervised Text-to-Speech. ICLR 2023 - [c44]Rongjie Huang, Jinglin Liu, Huadai Liu, Yi Ren, Lichao Zhang, Jinzheng He, Zhou Zhao:
TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation. ICLR 2023 - [c43]Zhenhui Ye, Ziyue Jiang, Yi Ren, Jinglin Liu, Jinzheng He, Zhou Zhao:
GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis. ICLR 2023 - [c42]Rongjie Huang, Jiawei Huang, Dongchao Yang, Yi Ren, Luping Liu, Mingze Li, Zhenhui Ye, Jinglin Liu, Xiang Yin, Zhou Zhao:
Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models. ICML 2023: 13916-13932 - [c41]Kun Song, Yi Ren, Yi Lei, Chunfeng Wang, Kun Wei, Lei Xie, Xiang Yin, Zejun Ma:
StyleS2ST: Zero-shot Style Transfer for Direct Speech-to-speech Translation. INTERSPEECH 2023: 42-46 - [c40]Yahuan Cong, Haoyu Zhang, Haopeng Lin, Shichao Liu, Chunfeng Wang, Yi Ren, Xiang Yin, Zejun Ma:
GenerTTS: Pronunciation Disentanglement for Timbre and Style Generalization in Cross-Lingual Text-to-Speech. INTERSPEECH 2023: 5486-5490 - [c39]Pengfei Wei, Lingdong Kong, Xinghua Qu, Yi Ren, Zhiqiang Xu, Jing Jiang, Xiang Yin:
Unsupervised Video Domain Adaptation for Action Recognition: A Disentanglement Perspective. NeurIPS 2023 - [i58]Rongjie Huang, Jiawei Huang, Dongchao Yang, Yi Ren, Luping Liu, Mingze Li, Zhenhui Ye, Jinglin Liu, Xiang Yin, Zhou Zhao:
Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models. CoRR abs/2301.12661 (2023) - [i57]Zhenhui Ye, Ziyue Jiang, Yi Ren, Jinglin Liu, Jinzheng He, Zhou Zhao:
GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis. CoRR abs/2301.13430 (2023) - [i56]Qinglin Zhang, Chong Deng, Jiaqing Liu, Hai Yu, Qian Chen, Wen Wang, Zhijie Yan, Jinglin Liu, Yi Ren, Zhou Zhao:
Overview of the ICASSP 2023 General Meeting Understanding and Generation Challenge (MUG). CoRR abs/2303.13932 (2023) - [i55]Qinglin Zhang, Chong Deng, Jiaqing Liu, Hai Yu, Qian Chen, Wen Wang, Zhijie Yan, Jinglin Liu, Yi Ren, Zhou Zhao:
MUG: A General Meeting Understanding and Generation Benchmark. CoRR abs/2303.13939 (2023) - [i54]Rongjie Huang, Mingze Li, Dongchao Yang, Jiatong Shi, Xuankai Chang, Zhenhui Ye, Yuning Wu, Zhiqing Hong, Jiawei Huang, Jinglin Liu, Yi Ren, Zhou Zhao, Shinji Watanabe:
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head. CoRR abs/2304.12995 (2023) - [i53]Zhenhui Ye, Jinzheng He, Ziyue Jiang, Rongjie Huang, Jiawei Huang, Jinglin Liu, Yi Ren, Xiang Yin, Zejun Ma, Zhou Zhao:
GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking Face Generation. CoRR abs/2305.00787 (2023) - [i52]Zhenhui Ye, Rongjie Huang, Yi Ren, Ziyue Jiang, Jinglin Liu, Jinzheng He, Xiang Yin, Zhou Zhao:
CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-training. CoRR abs/2305.10763 (2023) - [i51]Ziyue Jiang, Qian Yang, Jialong Zuo, Zhenhui Ye, Rongjie Huang, Yi Ren, Zhou Zhao:
FluentSpeech: Stutter-Oriented Automatic Speech Editing with Context-Aware Diffusion Models. CoRR abs/2305.13612 (2023) - [i50]Rongjie Huang, Huadai Liu, Xize Cheng, Yi Ren, Linjun Li, Zhenhui Ye, Jinzheng He, Lichao Zhang, Jinglin Liu, Xiang Yin, Zhou Zhao:
AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation. CoRR abs/2305.15403 (2023) - [i49]Kun Song, Yi Ren, Yi Lei, Chunfeng Wang, Kun Wei, Lei Xie, Xiang Yin, Zejun Ma:
StyleS2ST: Zero-shot Style Transfer for Direct Speech-to-speech Translation. CoRR abs/2305.17732 (2023) - [i48]Jiawei Huang, Yi Ren, Rongjie Huang, Dongchao Yang, Zhenhui Ye, Chen Zhang, Jinglin Liu, Xiang Yin, Zejun Ma, Zhou Zhao:
Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation. CoRR abs/2305.18474 (2023) - [i47]Luping Liu, Zijian Zhang, Yi Ren, Rongjie Huang, Xiang Yin, Zhou Zhao:
Detector Guidance for Multi-Object Text-to-Image Generation. CoRR abs/2306.02236 (2023) - [i46]Zhenhui Ye, Ziyue Jiang, Yi Ren, Jinglin Liu, Chen Zhang, Xiang Yin, Zejun Ma, Zhou Zhao:
Ada-TTA: Towards Adaptive High-Quality Text-to-Talking Avatar Synthesis. CoRR abs/2306.03504 (2023) - [i45]Ziyue Jiang, Yi Ren, Zhenhui Ye, Jinglin Liu, Chen Zhang, Qian Yang, Shengpeng Ji, Rongjie Huang, Chunfeng Wang, Xiang Yin, Zejun Ma, Zhou Zhao:
Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias. CoRR abs/2306.03509 (2023) - [i44]Yahuan Cong, Haoyu Zhang, Haopeng Lin, Shichao Liu, Chunfeng Wang, Yi Ren, Xiang Yin, Zejun Ma:
GenerTTS: Pronunciation Disentanglement for Timbre and Style Generalization in Cross-Lingual Text-to-Speech. CoRR abs/2306.15304 (2023) - [i43]Ziyue Jiang, Jinglin Liu, Yi Ren, Jinzheng He, Chen Zhang, Zhenhui Ye, Pengfei Wei, Chunfeng Wang, Xiang Yin, Zejun Ma, Zhou Zhao:
Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech Prompts. CoRR abs/2307.07218 (2023) - [i42]Longbin Ji, Pengfei Wei, Yi Ren, Jinglin Liu, Chen Zhang, Xiang Yin:
C2G2: Controllable Co-speech Gesture Generation with Latent Diffusion Model. CoRR abs/2308.15016 (2023) - [i41]Rui Liu, Yifan Hu, Yi Ren, Xiang Yin, Haizhou Li:
Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling. CoRR abs/2312.11947 (2023) - [i40]Bo Han, Yi Ren, Hao Peng, Teng Zhang, Zeyu Ling, Xiang Yin, Feilin Han:
EnchantDance: Unveiling the Potential of Music-Driven Dance Movement. CoRR abs/2312.15946 (2023) - 2022
- [c38]Jinzheng He, Zhou Zhao, Yi Ren, Jinglin Liu, Baoxing Huai, Nicholas Jing Yuan:
Flow-Based Unconstrained Lip to Speech Generation. AAAI 2022: 843-851 - [c37]Jinglin Liu, Zhiying Zhu, Yi Ren, Wencan Huang, Baoxing Huai, Nicholas Jing Yuan, Zhou Zhao:
Parallel and High-Fidelity Text-to-Lip Generation. AAAI 2022: 1738-1746 - [c36]Jinglin Liu, Chengxi Li, Yi Ren, Feiyang Chen, Zhou Zhao:
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism. AAAI 2022: 11020-11028 - [c35]Jinglin Liu, Chengxi Li, Yi Ren, Zhiying Zhu, Zhou Zhao:
Learning the Beauty in Songs: Neural Singing Voice Beautifier. ACL (1) 2022: 7970-7983 - [c34]Yi Ren, Xu Tan, Tao Qin, Zhou Zhao, Tie-Yan Liu:
Revisiting Over-Smoothness in Text to Speech. ACL (1) 2022: 8197-8213 - [c33]Lichao Zhang, Yi Ren, Liqun Deng, Zhou Zhao:
HiFiDenoise: High-Fidelity Denoising Text to Speech with Adversarial Networks. ICASSP 2022: 7232-7236 - [c32]Yi Ren, Ming Lei, Zhiying Huang, Shiliang Zhang, Qian Chen, Zhijie Yan, Zhou Zhao:
Prosospeech: Enhancing Prosody with Quantized Vector Pre-Training in Text-To-Speech. ICASSP 2022: 7577-7581 - [c31]Luping Liu, Yi Ren, Zhijie Lin, Zhou Zhao:
Pseudo Numerical Methods for Diffusion Models on Manifolds. ICLR 2022 - [c30]Rongjie Huang, Max W. Y. Lam, Jun Wang, Dan Su, Dong Yu, Yi Ren, Zhou Zhao:
FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis. IJCAI 2022: 4157-4163 - [c29]Zhenhui Ye, Zhou Zhao, Yi Ren, Fei Wu:
SyntaSpeech: Syntax-Aware Generative Adversarial Text-to-Speech. IJCAI 2022: 4468-4474 - [c28]Lichao Zhang, Zhou Zhao, Yi Ren, Liqun Deng:
EditSinger: Zero-Shot Text-Based Singing Voice Editing System with Diverse Prosody Modeling. IJCAI 2022: 4503-4509 - [c27]Rongjie Huang, Chenye Cui, Feiyang Chen, Yi Ren, Jinglin Liu, Zhou Zhao, Baoxing Huai, Zhefeng Wang:
SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice Generation. ACM Multimedia 2022: 2525-2535 - [c26]Rongjie Huang, Zhou Zhao, Huadai Liu, Jinglin Liu, Chenye Cui, Yi Ren:
ProDiff: Progressive Fast Diffusion Model for High-Quality Text-to-Speech. ACM Multimedia 2022: 2595-2605 - [c25]Yan Xia, Zhou Zhao, Shangwei Ye, Yang Zhao, Haoyuan Li, Yi Ren:
Video-Guided Curriculum Learning for Spoken Video Grounding. ACM Multimedia 2022: 5191-5200 - [c24]Kexun Zhang, Rui Wang, Xu Tan, Junliang Guo, Yi Ren, Tao Qin, Tie-Yan Liu:
A Study of Syntactic Multi-Modality in Non-Autoregressive Machine Translation. NAACL-HLT 2022: 1747-1757 - [c23]Rongjie Huang, Yi Ren, Jinglin Liu, Chenye Cui, Zhou Zhao:
GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-Speech. NeurIPS 2022 - [c22]Ziyue Jiang, Su Zhe, Zhou Zhao, Qian Yang, Yi Ren, Jinglin Liu:
Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech. NeurIPS 2022 - [c21]Lichao Zhang, Ruiqi Li, Shoutong Wang, Liqun Deng, Jinglin Liu, Yi Ren, Jinzheng He, Rongjie Huang, Jieming Zhu, Xiao Chen, Zhou Zhao:
M4Singer: A Multi-Style, Multi-Singer and Musical Score Provided Mandarin Singing Corpus. NeurIPS 2022 - [i39]Shoutong Wang, Jinglin Liu, Yi Ren, Zhen Wang, Changliang Xu, Zhou Zhao:
MR-SVS: Singing Voice Synthesis with Multi-Reference Encoder. CoRR abs/2201.03864 (2022) - [i38]Yi Ren, Ming Lei, Zhiying Huang, Shiliang Zhang, Qian Chen, Zhijie Yan, Zhou Zhao:
ProsoSpeech: Enhancing Prosody With Quantized Vector Pre-training in Text-to-Speech. CoRR abs/2202.07816 (2022) - [i37]Luping Liu, Yi Ren, Zhijie Lin, Zhou Zhao:
Pseudo Numerical Methods for Diffusion Models on Manifolds. CoRR abs/2202.09778 (2022) - [i36]Yi Ren, Xu Tan, Tao Qin, Zhou Zhao, Tie-Yan Liu:
Revisiting Over-Smoothness in Text to Speech. CoRR abs/2202.13066 (2022) - [i35]Jinglin Liu, Chengxi Li, Yi Ren, Zhiying Zhu, Zhou Zhao:
Learning the Beauty in Songs: Neural Singing Voice Beautifier. CoRR abs/2202.13277 (2022) - [i34]Rongjie Huang, Max W. Y. Lam, Jun Wang, Dan Su, Dong Yu, Yi Ren, Zhou Zhao:
FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis. CoRR abs/2204.09934 (2022) - [i33]Zhenhui Ye, Zhou Zhao, Yi Ren, Fei Wu:
SyntaSpeech: Syntax-Aware Generative Adversarial Text-to-Speech. CoRR abs/2204.11792 (2022) - [i32]Rongjie Huang, Yi Ren, Jinglin Liu, Chenye Cui, Zhou Zhao:
GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-Speech Synthesis. CoRR abs/2205.07211 (2022) - [i31]Rongjie Huang, Zhou Zhao, Jinglin Liu, Huadai Liu, Yi Ren, Lichao Zhang, Jinzheng He:
TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation. CoRR abs/2205.12523 (2022) - [i30]Ziyue Jiang, Su Zhe, Zhou Zhao, Qian Yang, Yi Ren, Jinglin Liu, Zhenhui Ye:
Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech. CoRR abs/2206.02147 (2022) - [i29]Kexun Zhang, Rui Wang, Xu Tan, Junliang Guo, Yi Ren, Tao Qin, Tie-Yan Liu:
A Study of Syntactic Multi-Modality in Non-Autoregressive Machine Translation. CoRR abs/2207.04206 (2022) - [i28]Rongjie Huang, Zhou Zhao, Huadai Liu, Jinglin Liu, Chenye Cui, Yi Ren:
ProDiff: Progressive Fast Diffusion Model For High-Quality Text-to-Speech. CoRR abs/2207.06389 (2022) - [i27]Yan Xia, Zhou Zhao, Shangwei Ye, Yang Zhao, Haoyuan Li, Yi Ren:
Video-Guided Curriculum Learning for Spoken Video Grounding. CoRR abs/2209.00277 (2022) - [i26]Chen Zhang, Yi Ren, Kejun Zhang, Shuicheng Yan:
SDMuse: Stochastic Differential Music Editing and Generation via Hybrid Representation. CoRR abs/2211.00222 (2022) - [i25]Chenye Cui, Yi Ren, Jinglin Liu, Rongjie Huang, Zhou Zhao:
VarietySound: Timbre-Controllable Video to Sound Generation via Unsupervised Information Disentanglement. CoRR abs/2211.10666 (2022) - [i24]Luping Liu, Yi Ren, Xize Cheng, Zhou Zhao:
Diffusion Denoising Process for Perceptron Bias in Out-of-distribution Detection. CoRR abs/2211.11255 (2022) - 2021
- [c20]Zhonghao Sheng, Kaitao Song, Xu Tan, Yi Ren, Wei Ye, Shikun Zhang, Tao Qin:
SongMASS: Automatic Song Writing with Pre-training and Alignment Constraint. AAAI 2021: 13798-13805 - [c19]Chen Zhang, Xu Tan, Yi Ren, Tao Qin, Kejun Zhang, Tie-Yan Liu:
UWSpeech: Speech to Speech Translation for Unwritten Languages. AAAI 2021: 14319-14327 - [c18]Chen Zhang, Yi Ren, Xu Tan, Jinglin Liu, Kejun Zhang, Tao Qin, Sheng Zhao, Tie-Yan Liu:
Denoispeech: Denoising Text to Speech with Frame-Level Noise Modeling. ICASSP 2021: 7063-7067 - [c17]Yi Ren, Chenxu Hu, Xu Tan, Tao Qin, Sheng Zhao, Zhou Zhao, Tie-Yan Liu:
FastSpeech 2: Fast and High-Quality End-to-End Text to Speech. ICLR 2021 - [c16]Ziyue Jiang, Yi Ren, Ming Lei, Zhou Zhao:
FedSpeech: Federated Text-to-Speech with Continual Learning. IJCAI 2021: 3829-3835 - [c15]Kexun Zhang, Yi Ren, Changliang Xu, Zhou Zhao:
WSRGlow: A Glow-Based Waveform Generative Model for Audio Super-Resolution. Interspeech 2021: 1649-1653 - [c14]Chenye Cui, Yi Ren, Jinglin Liu, Feiyang Chen, Rongjie Huang, Ming Lei, Zhou Zhao:
EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model. Interspeech 2021: 2766-2770 - [c13]Rongjie Huang, Feiyang Chen, Yi Ren, Jinglin Liu, Chenye Cui, Zhou Zhao:
Multi-Singer: Fast Multi-Singer Singing Voice Vocoder With A Large-Scale Corpus. ACM Multimedia 2021: 3945-3954 - [c12]Yi Ren, Jinglin Liu, Zhou Zhao:
PortaSpeech: Portable and High-Quality Generative Text-to-Speech. NeurIPS 2021: 13963-13974 - [i23]Jinglin Liu, Chengxi Li, Yi Ren, Feiyang Chen, Peng Liu, Zhou Zhao:
DiffSinger: Diffusion Acoustic Model for Singing Voice Synthesis. CoRR abs/2105.02446 (2021) - [i22]Kexun Zhang, Yi Ren, Changliang Xu, Zhou Zhao:
WSRGlow: A Glow-based Waveform Generative Model for Audio Super-Resolution. CoRR abs/2106.08507 (2021) - [i21]Chenye Cui, Yi Ren, Jinglin Liu, Feiyang Chen, Rongjie Huang, Ming Lei, Zhou Zhao:
EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model. CoRR abs/2106.09317 (2021) - [i20]Jinglin Liu, Zhiying Zhu, Yi Ren, Zhou Zhao:
High-Speed and High-Quality Text-to-Lip Generation. CoRR abs/2107.06831 (2021) - [i19]Yi Ren, Jinglin Liu, Zhou Zhao:
PortaSpeech: Portable and High-Quality Generative Text-to-Speech. CoRR abs/2109.15166 (2021) - [i18]Ziyue Jiang, Yi Ren, Ming Lei, Zhou Zhao:
FedSpeech: Federated Text-to-Speech with Continual Learning. CoRR abs/2110.07216 (2021) - [i17]Feiyang Chen, Rongjie Huang, Chenye Cui, Yi Ren, Jinglin Liu, Zhou Zhao, Nicholas Jing Yuan, Baoxing Huai:
SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice Generation. CoRR abs/2110.07468 (2021) - [i16]Rongjie Huang, Feiyang Chen, Yi Ren, Jinglin Liu, Chenye Cui, Zhou Zhao:
Multi-Singer: Fast Multi-Singer Singing Voice Vocoder With A Large-Scale Corpus. CoRR abs/2112.10358 (2021) - 2020
- [c11]Yi Ren, Jinglin Liu, Xu Tan, Zhou Zhao, Sheng Zhao, Tie-Yan Liu:
A Study of Non-autoregressive Model for Sequence Generation. ACL 2020: 149-159 - [c10]Yi Ren, Jinglin Liu, Xu Tan, Chen Zhang, Tao Qin, Zhou Zhao, Tie-Yan Liu:
SimulSpeech: End-to-End Simultaneous Speech to Text Translation. ACL 2020: 3787-3796 - [c9]Jinglin Liu, Yi Ren, Xu Tan, Chen Zhang, Tao Qin, Zhou Zhao, Tie-Yan Liu:
Task-Level Curriculum Learning for Non-Autoregressive Neural Machine Translation. IJCAI 2020: 3861-3867 - [c8]Mingjian Chen, Xu Tan, Yi Ren, Jin Xu, Hao Sun, Sheng Zhao, Tao Qin:
MultiSpeech: Multi-Speaker Text to Speech with Transformer. INTERSPEECH 2020: 4024-4028 - [c7]Yi Ren, Xu Tan, Tao Qin, Jian Luan, Zhou Zhao, Tie-Yan Liu:
DeepSinger: Singing Voice Synthesis with Data Mined From the Web. KDD 2020: 1979-1989 - [c6]Jin Xu, Xu Tan, Yi Ren, Tao Qin, Jian Li, Sheng Zhao, Tie-Yan Liu:
LRSpeech: Extremely Low-Resource Speech Synthesis and Recognition. KDD 2020: 2802-2812 - [c5]Yi Ren, Jinzheng He, Xu Tan, Tao Qin, Zhou Zhao, Tie-Yan Liu:
PopMAG: Pop Music Accompaniment Generation. ACM Multimedia 2020: 1198-1206 - [c4]Jinglin Liu, Yi Ren, Zhou Zhao, Chen Zhang, Baoxing Huai, Jing Yuan:
FastLR: Non-Autoregressive Lipreading Model with Integrate-and-Fire. ACM Multimedia 2020: 4328-4336 - [i15]Yi Ren, Jinglin Liu, Xu Tan, Sheng Zhao, Zhou Zhao, Tie-Yan Liu:
A Study of Non-autoregressive Model for Sequence Generation. CoRR abs/2004.10454 (2020) - [i14]Yi Ren, Chenxu Hu, Xu Tan, Tao Qin, Sheng Zhao, Zhou Zhao, Tie-Yan Liu:
FastSpeech 2: Fast and High-Quality End-to-End Text to Speech. CoRR abs/2006.04558 (2020) - [i13]Mingjian Chen, Xu Tan, Yi Ren, Jin Xu, Hao Sun, Sheng Zhao, Tao Qin:
MultiSpeech: Multi-Speaker Text to Speech with Transformer. CoRR abs/2006.04664 (2020) - [i12]Chen Zhang, Xu Tan, Yi Ren, Tao Qin, Kejun Zhang, Tie-Yan Liu:
UWSpeech: Speech to Speech Translation for Unwritten Languages. CoRR abs/2006.07926 (2020) - [i11]Yi Ren, Xu Tan, Tao Qin, Jian Luan, Zhou Zhao, Tie-Yan Liu:
DeepSinger: Singing Voice Synthesis with Data Mined From the Web. CoRR abs/2007.04590 (2020) - [i10]Jinglin Liu, Yi Ren, Xu Tan, Chen Zhang, Tao Qin, Zhou Zhao, Tie-Yan Liu:
Task-Level Curriculum Learning for Non-Autoregressive Neural Machine Translation. CoRR abs/2007.08772 (2020) - [i9]Jinglin Liu, Yi Ren, Zhou Zhao, Chen Zhang, Baoxing Huai, Jing Yuan:
FastLR: Non-Autoregressive Lipreading Model with Integrate-and-Fire. CoRR abs/2008.02516 (2020)