default search action
Zhou Zhao
This is just a disambiguation page, and is not intended to be the bibliography of an actual person. Any publication listed on this page has not been assigned to an actual author yet. If you know the true author of one of the publications listed below, you are welcome to contact us.
Person information
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2025
- [j78]Yuxuan Xiong, Zhou Zhao, Yongchao Xu, Yan Zhang, Bo Du:
Calibration Matters: Prototype-Aware Diffusion for OCT Cervical Classification With Calibration. IEEE Signal Process. Lett. 32: 396-400 (2025) - 2024
- [j77]Zhiyi Yang, Zhou Zhao, Yuliang Gu, Yongchao Xu:
Query-guided generalizable medical image segmentation. Pattern Recognit. Lett. 184: 52-58 (2024) - [j76]Zhou Zhao, Wenhao He, Zhenyu Lu:
Tactile-Based Grasping Stability Prediction Based on Human Grasp Demonstration for Robot Manipulation. IEEE Robotics Autom. Lett. 9(3): 2646-2653 (2024) - [j75]Hong Nie, Zhou Zhao, Lu Chen, Zhenyu Lu, Zhuomao Li, Jing Yang:
Smaller and Faster Robotic Grasp Detection Model via Knowledge Distillation and Unequal Feature Encoding. IEEE Robotics Autom. Lett. 9(8): 7206-7213 (2024) - [j74]Zhou Zhao, Dongyuan Zheng, Lu Chen:
Detecting Transitions from Stability to Instability in Robotic Grasping Based on Tactile Perception. Sensors 24(15): 5080 (2024) - [j73]Zhenyu Lu, Zhou Zhao, Tianqi Yue, Xu Zhu, Ning Wang:
A Bioinspired Multifunctional Tendon-Driven Tactile Sensor and Application in Obstacle Avoidance Using Reinforcement Learning. IEEE Trans. Cogn. Dev. Syst. 16(2): 407-415 (2024) - [j72]Linjun Li, Tao Jin, Wang Lin, Hao Jiang, Wenwen Pan, Jian Wang, Shuwen Xiao, Yan Xia, Weihao Jiang, Zhou Zhao:
Multi-Granularity Relational Attention Network for Audio-Visual Question Answering. IEEE Trans. Circuits Syst. Video Technol. 34(8): 7080-7094 (2024) - [j71]Shengyu Zhang, Ziqi Jiang, Jiangchao Yao, Fuli Feng, Kun Kuang, Zhou Zhao, Shuo Li, Hongxia Yang, Tat-Seng Chua, Fei Wu:
Causal Distillation for Alleviating Performance Heterogeneity in Recommender Systems. IEEE Trans. Knowl. Data Eng. 36(2): 459-474 (2024) - [j70]Wenwen Pan, Zhou Zhao, Wencan Huang, Zhu Zhang, Liyong Fu, Zhigeng Pan, Jun Yu, Fei Wu:
Video Moment Retrieval With Noisy Labels. IEEE Trans. Neural Networks Learn. Syst. 35(5): 6779-6791 (2024) - [j69]Shengyu Zhang, Tan Jiang, Kun Kuang, Fuli Feng, Jin Yu, Jianxin Ma, Zhou Zhao, Jianke Zhu, Hongxia Yang, Tat-Seng Chua, Fei Wu:
SLED: Structure Learning based Denoising for Recommendation. ACM Trans. Inf. Syst. 42(2): 43:1-43:31 (2024) - [c268]Yufeng Huang, Jiji Tang, Zhuo Chen, Rongsheng Zhang, Xinfeng Zhang, Weijie Chen, Zeng Zhao, Zhou Zhao, Tangjie Lv, Zhipeng Hu, Wen Zhang:
Structure-CLIP: Towards Scene Graph Knowledge to Enhance Multi-Modal Structured Representations. AAAI 2024: 2417-2425 - [c267]Yu Zhang, Rongjie Huang, Ruiqi Li, Jinzheng He, Yan Xia, Feiyang Chen, Xinyu Duan, Baoxing Huai, Zhou Zhao:
StyleSinger: Style Transfer for Out-of-Domain Singing Voice Synthesis. AAAI 2024: 19597-19605 - [c266]Rongjie Huang, Mingze Li, Dongchao Yang, Jiatong Shi, Xuankai Chang, Zhenhui Ye, Yuning Wu, Zhiqing Hong, Jiawei Huang, Jinglin Liu, Yi Ren, Yuexian Zou, Zhou Zhao, Shinji Watanabe:
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head. AAAI 2024: 23802-23804 - [c265]Yongqi Wang, Jionghao Bai, Rongjie Huang, Ruiqi Li, Zhiqing Hong, Zhou Zhao:
Speech-to-Speech Translation with Discrete-Unit-Based Style Transfer. ACL (Student Research Workshop) 2024: 42-49 - [c264]Zirun Guo, Tao Jin, Zhou Zhao:
Multimodal Prompt Learning with Missing Modalities for Sentiment Analysis and Emotion Recognition. ACL (1) 2024: 1726-1736 - [c263]Qian Yang, Jin Xu, Wenrui Liu, Yunfei Chu, Ziyue Jiang, Xiaohuan Zhou, Yichong Leng, Yuanjun Lv, Zhou Zhao, Chang Zhou, Jingren Zhou:
AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension. ACL (1) 2024: 1979-1998 - [c262]Huadai Liu, Rongjie Huang, Jinzheng He, Gang Sun, Ran Shen, Xize Cheng, Zhou Zhao:
Wav2SQL: Direct Generalizable Speech-To-SQL Parsing. ACL (Findings) 2024: 4230-4242 - [c261]Tao Jin, Wang Lin, Ye Wang, Linjun Li, Xize Cheng, Zhou Zhao:
Rethinking the Multimodal Correlation of Multimodal Sequential Learning via Generalizable Attentional Results Alignment. ACL (1) 2024: 5247-5265 - [c260]Zhiqing Hong, Rongjie Huang, Xize Cheng, Yongqi Wang, Ruiqi Li, Fuming You, Zhou Zhao, Zhimeng Zhang:
Text-to-Song: Towards Controllable Music Generation Incorporating Vocal and Accompaniment. ACL (1) 2024: 6248-6261 - [c259]Ruiqi Li, Yu Zhang, Yongqi Wang, Zhiqing Hong, Rongjie Huang, Zhou Zhao:
Robust Singing Voice Transcription Serves Synthesis. ACL (1) 2024: 9751-9766 - [c258]Ruiqi Li, Rongjie Huang, Yongqi Wang, Zhiqing Hong, Zhou Zhao:
Self-Supervised Singing Voice Pre-Training towards Speech-to-Singing Conversion. ACL (Findings) 2024: 9819-9831 - [c257]Xize Cheng, Rongjie Huang, Linjun Li, Zehan Wang, Tao Jin, Aoxiong Yin, Feiyang Chen, Xinyu Duan, Baoxing Huai, Zhou Zhao:
TransFace: Unit-Based Audio-Visual Speech Synthesizer for Talking Head Translation. ACL (Findings) 2024: 9973-9986 - [c256]Songju Lei, Xize Cheng, Mengjiao Lyu, Jianqiao Hu, Jintao Tan, Runlin Liu, Lingyu Xiong, Tao Jin, Xiandong Li, Zhou Zhao:
Uni-Dubbing: Zero-Shot Speech Synthesis from Visual Articulation. ACL (1) 2024: 10082-10099 - [c255]Rongjie Huang, Chunlei Zhang, Yongqi Wang, Dongchao Yang, Jinchuan Tian, Zhenhui Ye, Luping Liu, Zehan Wang, Ziyue Jiang, Xuankai Chang, Jiatong Shi, Chao Weng, Zhou Zhao, Dong Yu:
Make-A-Voice: Revisiting Voice Large Language Models as Scalable Multilingual and Multitask Learners. ACL (1) 2024: 10929-10942 - [c254]Shengpeng Ji, Ziyue Jiang, Hanting Wang, Jialong Zuo, Zhou Zhao:
MobileSpeech: A Fast and High-Fidelity Framework for Mobile Zero-Shot Text-to-Speech. ACL (1) 2024: 13588-13600 - [c253]Huadai Liu, Wenqiang Xu, Xuan Lin, Jingjing Huo, Hong Chen, Zhou Zhao:
AntCritic: Argument Mining for Free-Form and Visually-Rich Financial Comments. LREC/COLING 2024: 1306-1317 - [c252]Jimin Xu, Tianbao Wang, Tao Jin, Shengyu Zhang, Dongjie Fu, Zhe Wang, Jiangjing Lyu, Chengfei Lv, Chaoyue Niu, Zhou Yu, Zhou Zhao, Fei Wu:
MPOD123: One Image to 3D Content Generation Using Mask-Enhanced Progressive Outline-to-Detail Optimization. CVPR 2024: 10682-10692 - [c251]Yu Zhang, Ziyue Jiang, Ruiqi Li, Changhao Pan, Jinzheng He, Rongjie Huang, Chuxin Wang, Zhou Zhao:
TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control. EMNLP 2024: 1960-1975 - [c250]Aoxiong Yin, Tianyun Zhong, Haoyuan Li, Siliang Tang, Zhou Zhao:
Language Model is a Branch Predictor for Simultaneous Machine Translation. ICASSP 2024: 9976-9980 - [c249]Shengpeng Ji, Jialong Zuo, Minghui Fang, Ziyue Jiang, Feiyang Chen, Xinyu Duan, Baoxing Huai, Zhou Zhao:
TextrolSpeech: A Text Style Control Speech Corpus with Codec Language Text-to-Speech Models. ICASSP 2024: 10301-10305 - [c248]Ziyue Jiang, Jinglin Liu, Yi Ren, Jinzheng He, Zhenhui Ye, Shengpeng Ji, Qian Yang, Chen Zhang, Pengfei Wei, Chunfeng Wang, Xiang Yin, Zejun Ma, Zhou Zhao:
Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis. ICLR 2024 - [c247]Zhenhui Ye, Tianyun Zhong, Yi Ren, Jiaqi Yang, Weichuang Li, Jiawei Huang, Ziyue Jiang, Jinzheng He, Rongjie Huang, Jinglin Liu, Chen Zhang, Xiang Yin, Zejun Ma, Zhou Zhao:
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis. ICLR 2024 - [c246]Zehan Wang, Ziang Zhang, Xize Cheng, Rongjie Huang, Luping Liu, Zhenhui Ye, Haifeng Huang, Yang Zhao, Tao Jin, Peng Gao, Zhou Zhao:
FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion. ICML 2024 - [c245]Rongjie Huang, Ruofan Hu, Yongqi Wang, Zehan Wang, Xize Cheng, Ziyue Jiang, Zhenhui Ye, Dongchao Yang, Luping Liu, Peng Gao, Zhou Zhao:
InstructSpeech: Following Speech Editing Instructions via Large Language Models. ICML 2024 - [c244]Wang Lin, Jingyuan Chen, Jiaxin Shi, Yichen Zhu, Chen Liang, Junzhong Miao, Tao Jin, Zhou Zhao, Fei Wu, Shuicheng Yan, Hanwang Zhang:
Non-confusing Generation of Customized Concepts in Diffusion Models. ICML 2024 - [c243]Dongchao Yang, Jinchuan Tian, Xu Tan, Rongjie Huang, Songxiang Liu, Haohan Guo, Xuankai Chang, Jiatong Shi, Sheng Zhao, Jiang Bian, Zhou Zhao, Xixin Wu, Helen M. Meng:
UniAudio: Towards Universal Audio Generation with Large Language Models. ICML 2024 - [c242]Ye Wang, Jiahao Xun, Minjie Hong, Jieming Zhu, Tao Jin, Wang Lin, Haoyuan Li, Linjun Li, Yan Xia, Zhou Zhao, Zhenhua Dong:
EAGER: Two-Stream Generative Recommender with Behavior-Semantic Collaboration. KDD 2024: 3245-3254 - [c241]Qijiong Liu, Jieming Zhu, Yanting Yang, Quanyu Dai, Zhaocheng Du, Xiao-Ming Wu, Zhou Zhao, Rui Zhang, Zhenhua Dong:
Multimodal Pretraining, Adaptation, and Generation for Recommendation: A Survey. KDD 2024: 6566-6576 - [c240]Haoyu Zhao, Wenhui Dong, Rui Yu, Zhou Zhao, Bo Du, Yongchao Xu:
MoreStyle: Relax Low-Frequency Constraint of Fourier-Based Image Reconstruction in Generalizable Medical Image Segmentation. MICCAI (8) 2024: 434-444 - [c239]Zhikai Wei, Wenhui Dong, Peilin Zhou, Yuliang Gu, Zhou Zhao, Yongchao Xu:
Prompting Segment Anything Model with Domain-Adaptive Prototype for Generalizable Medical Image Segmentation. MICCAI (8) 2024: 533-543 - [c238]Zhichao Sun, Yuliang Gu, Yepeng Liu, Zerui Zhang, Zhou Zhao, Yongchao Xu:
Position-Guided Prompt Learning for Anomaly Detection in Chest X-Rays. MICCAI (1) 2024: 567-577 - [c237]Zerui Zhang, Zhichao Sun, Zelong Liu, Zhou Zhao, Rui Yu, Bo Du, Yongchao Xu:
Spatial-Aware Attention Generative Adversarial Network for Semi-supervised Anomaly Detection in Medical Image. MICCAI (5) 2024: 638-648 - [c236]Haoyu Zhao, Yuliang Gu, Zhou Zhao, Bo Du, Yongchao Xu, Rui Yu:
WIA-LD2ND: Wavelet-Based Image Alignment for Self-supervised Low-Dose CT Denoising. MICCAI (7) 2024: 764-774 - [c235]Mengze Li, Kairong Han, Jiahe Xu, Yueying Li, Tao Wu, Zhou Zhao, Jiaxu Miao, Shengyu Zhang, Jingyuan Chen:
Cross-modal Observation Hypothesis Inference. ACM Multimedia 2024: 466-475 - [c234]Tao Wu, Mengze Li, Jingyuan Chen, Wei Ji, Wang Lin, Jinyang Gao, Kun Kuang, Zhou Zhao, Fei Wu:
Semantic Alignment for Multimodal Large Language Models. ACM Multimedia 2024: 3489-3498 - [c233]Dongjie Fu, Xize Cheng, Xiaoda Yang, Hanting Wang, Zhou Zhao, Tao Jin:
Boosting Speech Recognition Robustness to Modality-Distortion with Contrast-Augmented Prompts. ACM Multimedia 2024: 3838-3847 - [c232]Tao Jin, Weicai Yan, Ye Wang, Sihang Cai, Qifan Shuai, Zhou Zhao:
Calibrating Prompt from History for Continual Vision-Language Retrieval and Grounding. ACM Multimedia 2024: 4302-4311 - [c231]Huadai Liu, Rongjie Huang, Yang Liu, Hengyuan Cao, Jialei Wang, Xize Cheng, Siqi Zheng, Zhou Zhao:
AudioLCM: Efficient and High-Quality Text-to-Audio Generation with Minimal Inference Steps. ACM Multimedia 2024: 7008-7017 - [c230]Xiaoda Yang, Xize Cheng, Dongjie Fu, Minghui Fang, Jialung Zuo, Shengpeng Ji, Zhou Zhao, Tao Jin:
SyncTalklip: Highly Synchronized Lip-Readable Speaker Generation with Multi-Task Learning. ACM Multimedia 2024: 8149-8158 - [c229]Weicai Yan, Ye Wang, Wang Lin, Zirun Guo, Zhou Zhao, Tao Jin:
Low-rank Prompt Interaction for Continual Vision-Language Retrieval. ACM Multimedia 2024: 8257-8266 - [c228]Zheqi Lv, Shaoxuan He, Tianyu Zhan, Shengyu Zhang, Wenqiao Zhang, Jingyuan Chen, Zhou Zhao, Fei Wu:
Semantic Codebook Learning for Dynamic Recommendation Models. ACM Multimedia 2024: 9611-9620 - [c227]Rongjie Huang, Yongqi Wang, Ruofan Hu, Xiaoshan Xu, Zhiqing Hong, Dongchao Yang, Xize Cheng, Zehan Wang, Ziyue Jiang, Zhenhui Ye, Luping Liu, Siqi Zheng, Zhou Zhao:
VoiceTuner: Self-Supervised Pre-training and Efficient Fine-tuning For Voice Generation. ACM Multimedia 2024: 10630-10639 - [c226]Yongqi Wang, Ruofan Hu, Rongjie Huang, Zhiqing Hong, Ruiqi Li, Wenrui Liu, Fuming You, Tao Jin, Zhou Zhao:
Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt. NAACL-HLT 2024: 4780-4794 - [c225]Dong Yao, Jieming Zhu, Jiahao Xun, Shengyu Zhang, Zhou Zhao, Liqun Deng, Wenqiao Zhang, Zhenhua Dong, Xin Jiang:
MART: Learning Hierarchical Music Audio Representations with Part-Whole Transformer. WWW (Companion Volume) 2024: 967-970 - [i186]Zhenhui Ye, Tianyun Zhong, Yi Ren, Jiaqi Yang, Weichuang Li, Jiawei Huang, Ziyue Jiang, Jinzheng He, Rongjie Huang, Jinglin Liu, Chen Zhang, Xiang Yin, Zejun Ma, Zhou Zhao:
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis. CoRR abs/2401.08503 (2024) - [i185]Qian Yang, Jin Xu, Wenrui Liu, Yunfei Chu, Ziyue Jiang, Xiaohuan Zhou, Yichong Leng, Yuanjun Lv, Zhou Zhao, Chang Zhou, Jingren Zhou:
AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension. CoRR abs/2402.07729 (2024) - [i184]Shengpeng Ji, Ziyue Jiang, Hanting Wang, Jialong Zuo, Zhou Zhao:
MobileSpeech: A Fast and High-Fidelity Framework for Mobile Zero-Shot Text-to-Speech. CoRR abs/2402.09378 (2024) - [i183]Shengpeng Ji, Minghui Fang, Ziyue Jiang, Rongjie Huang, Jialong Zuo, Shulei Wang, Zhou Zhao:
Language-Codec: Reducing the Gaps Between Discrete Codec Representation and Speech Language Models. CoRR abs/2402.12208 (2024) - [i182]Hai Huang, Yan Xia, Shengpeng Ji, Shulei Wang, Hanting Wang, Jieming Zhu, Zhenhua Dong, Zhou Zhao:
Unlocking the Potential of Multimodal Unified Discrete Representation through Training-Free Codebook Optimization and Hierarchical Alignment. CoRR abs/2403.05168 (2024) - [i181]Haoyu Zhao, Yuliang Gu, Zhou Zhao, Bo Du, Yongchao Xu, Rui Yu:
WIA-LD2ND: Wavelet-based Image Alignment for Self-supervised Low-Dose CT Denoising. CoRR abs/2403.11672 (2024) - [i180]Haoyu Zhao, Wenhui Dong, Rui Yu, Zhou Zhao, Bo Du, Yongchao Xu:
MoreStyle: Relax Low-frequency Constraint of Fourier-based Image Reconstruction in Generalizable Medical Image Segmentation. CoRR abs/2403.11689 (2024) - [i179]Yongqi Wang, Ruofan Hu, Rongjie Huang, Zhiqing Hong, Ruiqi Li, Wenrui Liu, Fuming You, Tao Jin, Zhou Zhao:
Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt. CoRR abs/2403.11780 (2024) - [i178]Qijiong Liu, Jieming Zhu, Yanting Yang, Quanyu Dai, Zhaocheng Du, Xiao-Ming Wu, Zhou Zhao, Rui Zhang, Zhenhua Dong:
Multimodal Pretraining, Adaptation, and Generation for Recommendation: A Survey. CoRR abs/2404.00621 (2024) - [i177]Zhiqing Hong, Rongjie Huang, Xize Cheng, Yongqi Wang, Ruiqi Li, Fuming You, Zhou Zhao, Zhimeng Zhang:
Text-to-Song: Towards Controllable Music Generation Incorporating Vocals and Accompaniment. CoRR abs/2404.09313 (2024) - [i176]Kunxi Li, Tianyu Zhan, Shengyu Zhang, Kun Kuang, Jiwei Li, Zhou Zhao, Fei Wu:
MergeNet: Knowledge Migration across Heterogeneous Models, Tasks, and Modalities. CoRR abs/2404.13322 (2024) - [i175]Bo Lin, Yingjing Xu, Xuanwen Bao, Zhou Zhao, Zuyong Zhang, Zhouyang Wang, Jie Zhang, Shuiguang Deng, Jianwei Yin:
SkinGEN: an Explainable Dermatology Diagnosis-to-Generation Framework with Interactive Vision-Language Models. CoRR abs/2404.14755 (2024) - [i174]Zehan Wang, Ziang Zhang, Xize Cheng, Rongjie Huang, Luping Liu, Zhenhui Ye, Haifeng Huang, Yang Zhao, Tao Jin, Peng Gao, Zhou Zhao:
FreeBind: Free Lunch in Unified Multimodal Space via Knowledge Fusion. CoRR abs/2405.04883 (2024) - [i173]Wang Lin, Jingyuan Chen, Jiaxin Shi, Yichen Zhu, Chen Liang, Junzhong Miao, Tao Jin, Zhou Zhao, Fei Wu, Shuicheng Yan, Hanwang Zhang:
Non-confusing Generation of Customized Concepts in Diffusion Models. CoRR abs/2405.06914 (2024) - [i172]Ruiqi Li, Yu Zhang, Yongqi Wang, Zhiqing Hong, Rongjie Huang, Zhou Zhao:
Robust Singing Voice Transcription Serves Synthesis. CoRR abs/2405.09940 (2024) - [i171]Zhichao Sun, Yuliang Gu, Yepeng Liu, Zerui Zhang, Zhou Zhao, Yongchao Xu:
Position-Guided Prompt Learning for Anomaly Detection in Chest X-Rays. CoRR abs/2405.11976 (2024) - [i170]Zerui Zhang, Zhichao Sun, Zelong Liu, Bo Du, Rui Yu, Zhou Zhao, Yongchao Xu:
Spatial-aware Attention Generative Adversarial Network for Semi-supervised Anomaly Detection in Medical Image. CoRR abs/2405.12872 (2024) - [i169]Shengyu Zhang, Ziqi Jiang, Jiangchao Yao, Fuli Feng, Kun Kuang, Zhou Zhao, Shuo Li, Hongxia Yang, Tat-Seng Chua, Fei Wu:
Causal Distillation for Alleviating Performance Heterogeneity in Recommender Systems. CoRR abs/2405.20626 (2024) - [i168]Yongqi Wang, Wenxiang Guo, Rongjie Huang, Jiawei Huang, Zehan Wang, Fuming You, Ruiqi Li, Zhou Zhao:
Frieren: Efficient Video-to-Audio Generation with Rectified Flow Matching. CoRR abs/2406.00320 (2024) - [i167]Huadai Liu, Rongjie Huang, Yang Liu, Hengyuan Cao, Jialei Wang, Xize Cheng, Siqi Zheng, Zhou Zhao:
AudioLCM: Text-to-Audio Generation with Latent Consistency Models. CoRR abs/2406.00356 (2024) - [i166]Shengpeng Ji, Jialong Zuo, Minghui Fang, Siqi Zheng, Qian Chen, Wen Wang, Ziyue Jiang, Hai Huang, Xize Cheng, Rongjie Huang, Zhou Zhao:
ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec. CoRR abs/2406.01205 (2024) - [i165]Ruiqi Li, Rongjie Huang, Yongqi Wang, Zhiqing Hong, Zhou Zhao:
Self-Supervised Singing Voice Pre-Training towards Speech-to-Singing Conversion. CoRR abs/2406.02429 (2024) - [i164]Ye Wang, Jiahao Xun, Mingjie Hong, Jieming Zhu, Tao Jin, Wang Lin, Haoyuan Li, Linjun Li, Yan Xia, Zhou Zhao, Zhenhua Dong:
EAGER: Two-Stream Generative Recommender with Behavior-Semantic Collaboration. CoRR abs/2406.14017 (2024) - [i163]Minghui Fang, Shengpeng Ji, Jialong Zuo, Hai Huang, Yan Xia, Jieming Zhu, Xize Cheng, Xiaoda Yang, Wenrui Liu, Gang Wang, Zhenhua Dong, Zhou Zhao:
ACE: A Generative Cross-Modal Retrieval Framework with Coarse-To-Fine Semantic Modeling. CoRR abs/2406.17507 (2024) - [i162]Ruiqi Li, Zhiqing Hong, Yongqi Wang, Lichao Zhang, Rongjie Huang, Siqi Zheng, Zhou Zhao:
Accompanied Singing Voice Synthesis with Fully Text-controlled Melody. CoRR abs/2407.02049 (2024) - [i161]Zirun Guo, Tao Jin, Zhou Zhao:
Multimodal Prompt Learning with Missing Modalities for Sentiment Analysis and Emotion Recognition. CoRR abs/2407.05374 (2024) - [i160]Zehan Wang, Ziang Zhang, Hang Zhang, Luping Liu, Rongjie Huang, Xize Cheng, Hengshuang Zhao, Zhou Zhao:
OmniBind: Large-scale Omni Multimodal Representation via Binding Spaces. CoRR abs/2407.11895 (2024) - [i159]Huadai Liu, Jialei Wang, Rongjie Huang, Yang Liu, Jiayang Xu, Zhou Zhao:
MEDIC: Zero-shot Music Editing with Disentangled Inversion Control. CoRR abs/2407.13220 (2024) - [i158]Qian Yang, Jialong Zuo, Zhe Su, Ziyue Jiang, Mingze Li, Zhou Zhao, Feiyang Chen, Zhefeng Wang, Baoxing Huai:
MSceneSpeech: A Multi-Scene Speech Dataset For Expressive Speech Synthesis. CoRR abs/2407.14006 (2024) - [i157]Zheqi Lv, Shaoxuan He, Tianyu Zhan, Shengyu Zhang, Wenqiao Zhang, Jingyuan Chen, Zhou Zhao, Fei Wu:
Semantic Codebook Learning for Dynamic Recommendation Models. CoRR abs/2408.00123 (2024) - [i156]Jiawei Huang, Chen Zhang, Yi Ren, Ziyue Jiang, Zhenhui Ye, Jinglin Liu, Jinzheng He, Xiang Yin, Zhou Zhao:
MulliVC: Multi-lingual Voice Conversion With Cycle Consistency. CoRR abs/2408.04708 (2024) - [i155]Tao Wu, Mengze Li, Jingyuan Chen, Wei Ji, Wang Lin, Jinyang Gao, Kun Kuang, Zhou Zhao, Fei Wu:
Semantic Alignment for Multimodal Large Language Models. CoRR abs/2408.12867 (2024) - [i154]Shengpeng Ji, Ziyue Jiang, Xize Cheng, Yifu Chen, Minghui Fang, Jialong Zuo, Qian Yang, Ruiqi Li, Ziang Zhang, Xiaoda Yang, Rongjie Huang, Yidi Jiang, Qian Chen, Siqi Zheng, Wen Wang, Zhou Zhao:
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling. CoRR abs/2408.16532 (2024) - [i153]Qijiong Liu, Jieming Zhu, Lu Fan, Zhou Zhao, Xiao-Ming Wu:
STORE: Streamlining Semantic Tokenization and Generative Recommendation with A Single LLM. CoRR abs/2409.07276 (2024) - [i152]Zhikai Wei, Wenhui Dong, Peilin Zhou, Yuliang Gu, Zhou Zhao, Yongchao Xu:
Prompting Segment Anything Model with Domain-Adaptive Prototype for Generalizable Medical Image Segmentation. CoRR abs/2409.12522 (2024) - [i151]Yu Zhang, Changhao Pan, Wenxiang Guo, Ruiqi Li, Zhiyuan Zhu, Jialei Wang, Wenhao Xu, Jingyu Lu, Zhiqing Hong, Chuxin Wang, LiChao Zhang, Jinzheng He, Ziyue Jiang, Yuxin Chen, Chen Yang, Jiecheng Zhou, Xinyu Cheng, Zhou Zhao:
GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks. CoRR abs/2409.13832 (2024) - [i150]Yu Zhang, Ziyue Jiang, Ruiqi Li, Changhao Pan, Jinzheng He, Rongjie Huang, Chuxin Wang, Zhou Zhao:
TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control. CoRR abs/2409.15977 (2024) - [i149]Wenrui Liu, Zhifang Guo, Jin Xu, Yuanjun Lv, Yunfei Chu, Zhou Zhao, Junyang Lin:
Analyzing and Mitigating Inconsistency in Discrete Audio Tokens for Neural Codec Language Models. CoRR abs/2409.19283 (2024) - [i148]Zhenhui Ye, Tianyun Zhong, Yi Ren, Ziyue Jiang, Jiawei Huang, Rongjie Huang, Jinglin Liu, Jinzheng He, Chen Zhang, Zehan Wang, Xize Chen, Xiang Yin, Zhou Zhao:
MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes. CoRR abs/2410.06734 (2024) - [i147]Huadai Liu, Jialei Wang, Rongjie Huang, Yang Liu, Heng Lu, Wei Xue, Zhou Zhao:
FlashAudio: Rectified Flows for Fast and High-Fidelity Text-to-Audio Generation. CoRR abs/2410.12266 (2024) - [i146]Ruiqi Li, Siqi Zheng, Xize Cheng, Ziang Zhang, Shengpeng Ji, Zhou Zhao:
MuVi: Video-to-Music Generation with Semantic Alignment and Rhythmic Synchronization. CoRR abs/2410.12957 (2024) - [i145]Wenyi Xiao, Zechuan Wang, Leilei Gan, Shuai Zhao, Wanggui He, Luu Anh Tuan, Long Chen, Hao Jiang, Zhou Zhao, Fei Wu:
A Comprehensive Survey of Datasets, Theories, Variants, and Applications in Direct Preference Optimization. CoRR abs/2410.15595 (2024) - [i144]Xize Cheng, Siqi Zheng, Zehan Wang, Minghui Fang, Ziang Zhang, Rongjie Huang, Ziyang Ma, Shengpeng Ji, Jialong Zuo, Tao Jin, Zhou Zhao:
OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup. CoRR abs/2410.21269 (2024) - [i143]Zirun Guo, Tao Jin, Jingyuan Chen, Zhou Zhao:
Classifier-guided Gradient Modulation for Enhanced Multimodal Learning. CoRR abs/2411.01409 (2024) - [i142]Fuming You, Minghui Fang, Li Tang, Rongjie Huang, Yongqi Wang, Zhou Zhao:
MoMu-Diffusion: On Learning Long-Term Motion-Music Synchronization and Correspondence. CoRR abs/2411.01805 (2024) - [i141]Shengpeng Ji, Yifu Chen, Minghui Fang, Jialong Zuo, Jingyu Lu, Hanting Wang, Ziyue Jiang, Long Zhou, Shujie Liu, Xize Cheng, Xiaoda Yang, Zehan Wang, Qian Yang, Jian Li, Yidi Jiang, Jingzhen He, Yunfei Chu, Jin Xu, Zhou Zhao:
WavChat: A Survey of Spoken Dialogue Models. CoRR abs/2411.13577 (2024) - 2023
- [b1]Zhou Zhao:
Heart Segmentation and Evaluation of Fibrosis. (Segmentation cardiaque et évaluation de la fibrose). Sorbonne University, Paris, France, 2023 - [j68]Zhou Zhao, Qingkai Guo, Yu Sun, Ningli An, Pengzhe Hui, Laihao Yang, Xuefeng Chen:
Bioinspired Hierarchical Structure for an Ultrawide-Range Multifunctional Flexible Sensor Using Porous Expandable Polyethylene/Loofah-Like Polyurethane Sponge Material. Adv. Intell. Syst. 5(1) (2023) - [j67]Lei Li, Fuping Wu, Sihan Wang, Xinzhe Luo, Carlos Martín-Isla, Shuwei Zhai, Jianpeng Zhang, Yanfei Liu, Zhen Zhang, Markus J. Ankenbrand, Haochuan Jiang, Xiaoran Zhang, Linhong Wang, Tewodros Weldebirhan Arega, Elif Altunok, Zhou Zhao, Feiyan Li, Jun Ma, Xiaoping Yang, Élodie Puybareau, Ilkay Öksüz, Stéphanie Bricq, Weisheng Li, Kumaradevan Punithakumar, Sotirios A. Tsaftaris, Laura Maria Schreiber, Mingjing Yang, Guocai Liu, Yong Xia, Guotai Wang, Sergio Escalera, Xiahai Zhuang:
MyoPS: A benchmark of myocardial pathology segmentation combining three-sequence cardiac magnetic resonance images. Medical Image Anal. 87: 102808 (2023) - [j66]Shengyu Zhang, Fuli Feng, Kun Kuang, Wenqiao Zhang, Zhou Zhao, Hongxia Yang, Tat-Seng Chua, Fei Wu:
Personalized Latent Structure Learning for Recommendation. IEEE Trans. Pattern Anal. Mach. Intell. 45(8): 10285-10299 (2023) - [j65]Zhenyu Lu, Lu Chen, Hengtai Dai, Haoran Li, Zhou Zhao, Bofang Zheng, Nathan F. Lepora, Chenguang Yang:
Visual-Tactile Robot Grasping Based on Human Skill Learning From Demonstrations Using a Wearable Parallel Hand Exoskeleton. IEEE Robotics Autom. Lett. 8(9): 5384-5391 (2023) - [j64]Yuzhen Guo, Zengxing Zhang, Bin Yao, Jin Chai, Shiqiang Zhang, Jianwei Liu, Zhou Zhao, Chenyang Xue:
Fabrication and Performance of a Ta2O5 Thin Film pH Sensor Manufactured Using MEMS Processes.