


Zhou Zhao
This is just a disambiguation page, and is not intended to be the bibliography of an actual person. Any publication listed on this page has not been assigned to an actual author yet. If you know the true author of one of the publications listed below, you are welcome to contact us.
2020 – today
- 2023
- [b1]Zhou Zhao:
Heart Segmentation and Evaluation of Fibrosis. (Segmentation cardiaque et évaluation de la fibrose). Sorbonne University, Paris, France, 2023 - [j68]Zhou Zhao, Qingkai Guo, Yu Sun, Ningli An, Pengzhe Hui, Laihao Yang, Xuefeng Chen:
Bioinspired Hierarchical Structure for an Ultrawide-Range Multifunctional Flexible Sensor Using Porous Expandable Polyethylene/Loofah-Like Polyurethane Sponge Material. Adv. Intell. Syst. 5(1) (2023) - [j67]Lei Li, Fuping Wu, Sihan Wang, Xinzhe Luo, Carlos Martín-Isla, Shuwei Zhai, Jianpeng Zhang, Yanfei Liu, Zhen Zhang, Markus J. Ankenbrand, Haochuan Jiang, Xiaoran Zhang, Linhong Wang, Tewodros Weldebirhan Arega, Elif Altunok, Zhou Zhao, Feiyan Li, Jun Ma, Xiaoping Yang, Élodie Puybareau, Ilkay Öksüz, Stéphanie Bricq, Weisheng Li, Kumaradevan Punithakumar, Sotirios A. Tsaftaris, Laura Maria Schreiber, Mingjing Yang, Guocai Liu, Yong Xia, Guotai Wang, Sergio Escalera, Xiahai Zhuang:
MyoPS: A benchmark of myocardial pathology segmentation combining three-sequence cardiac magnetic resonance images. Medical Image Anal. 87: 102808 (2023) - [j66]Shengyu Zhang, Fuli Feng, Kun Kuang, Wenqiao Zhang, Zhou Zhao, Hongxia Yang, Tat-Seng Chua, Fei Wu:
Personalized Latent Structure Learning for Recommendation. IEEE Trans. Pattern Anal. Mach. Intell. 45(8): 10285-10299 (2023) - [j65]Zhenyu Lu, Lu Chen, Hengtai Dai, Haoran Li, Zhou Zhao, Bofang Zheng, Nathan F. Lepora, Chenguang Yang:
Visual-Tactile Robot Grasping Based on Human Skill Learning From Demonstrations Using a Wearable Parallel Hand Exoskeleton. IEEE Robotics Autom. Lett. 8(9): 5384-5391 (2023) - [j64]Yuzhen Guo, Zengxing Zhang, Bin Yao, Jin Chai, Shiqiang Zhang, Jianwei Liu, Zhou Zhao, Chenyang Xue:
Fabrication and Performance of a Ta2O5 Thin Film pH Sensor Manufactured Using MEMS Processes. Sensors 23(13): 6061 (2023) - [c210]Zijian Zhang, Zhou Zhao, Jun Yu, Qi Tian:
ShiftDDPMs: Exploring Conditional Diffusion Models by Shifting Diffusion Trajectories. AAAI 2023: 3552-3560 - [c209]Shengyu Zhang, Xusheng Feng, Wenyan Fan, Wenjing Fang, Fuli Feng, Wei Ji, Shuo Li, Li Wang, Shanshan Zhao, Zhou Zhao, Tat-Seng Chua, Fei Wu:
Video-Audio Domain Generalization via Confounder Disentanglement. AAAI 2023: 15322-15330 - [c208]Zehan Wang, Yang Zhao, Haifeng Huang, Yan Xia, Zhou Zhao:
Scene-robust Natural Language Video Localization via Learning Domain-invariant Representations. ACL (Findings) 2023: 144-160 - [c207]Jinzheng He, Jinglin Liu, Zhenhui Ye, Rongjie Huang, Chenye Cui, Huadai Liu, Zhou Zhao:
RMSSinger: Realistic-Music-Score based Singing Voice Synthesis. ACL (Findings) 2023: 236-248 - [c206]Mengze Li, Tianbao Wang, Jiahe Xu, Kairong Han, Shengyu Zhang, Zhou Zhao, Jiaxu Miao, Wenqiao Zhang, Shiliang Pu, Fei Wu:
Multi-modal Action Chain Abductive Reasoning. ACL (1) 2023: 4617-4628 - [c205]Xize Cheng, Tao Jin, Linjun Li, Wang Lin, Xinyu Duan, Zhou Zhao:
OpenSR: Open-Modality Speech Recognition via Maintaining Multi-Modality Alignment. ACL (1) 2023: 6592-6607 - [c204]Rongjie Huang, Yi Ren, Ziyue Jiang, Chenye Cui, Jinglin Liu, Zhou Zhao:
FastDiff 2: Revisiting and Incorporating GANs and Diffusion Models in High-Fidelity Speech Synthesis. ACL (Findings) 2023: 6994-7009 - [c203]Ruiqi Li, Rongjie Huang, Lichao Zhang, Jinglin Liu, Zhou Zhao:
AlignSTS: Speech-to-Singing Conversion via Cross-Modal Alignment. ACL (Findings) 2023: 7074-7088 - [c202]Rongjie Huang, Chunlei Zhang, Yi Ren, Zhou Zhao, Dong Yu:
Prosody-TTS: Improving Prosody with Masked Autoencoder and Conditional Diffusion Model For Expressive Text-to-Speech. ACL (Findings) 2023: 8018-8034 - [c201]Rongjie Huang, Huadai Liu, Xize Cheng, Yi Ren, Linjun Li, Zhenhui Ye, Jinzheng He, Lichao Zhang, Jinglin Liu, Xiang Yin, Zhou Zhao:
AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation. ACL (1) 2023: 8590-8604 - [c200]Zhenhui Ye, Rongjie Huang, Yi Ren, Ziyue Jiang, Jinglin Liu, Jinzheng He, Xiang Yin, Zhou Zhao:
CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-Training. ACL (1) 2023: 9317-9331 - [c199]Ye Wang, Tao Jin, Wang Lin, Xize Cheng, Linjun Li, Zhou Zhao:
Semantic-conditioned Dual Adaptation for Cross-domain Query-based Visual Segmentation. ACL (Findings) 2023: 9797-9815 - [c198]Ye Wang, Wang Lin, Shengyu Zhang, Tao Jin, Linjun Li, Xize Cheng, Zhou Zhao:
Weakly-Supervised Spoken Video Grounding via Semantic Interaction Learning. ACL (1) 2023: 10914-10932 - [c197]Linjun Li, Tao Jin, Xize Cheng, Ye Wang, Wang Lin, Rongjie Huang, Zhou Zhao:
Contrastive Token-Wise Meta-Learning for Unseen Performer Visual Temporal-Aligned Translation. ACL (Findings) 2023: 10993-11007 - [c196]Ziyue Jiang, Qian Yang, Jialong Zuo, Zhenhui Ye, Rongjie Huang, Yi Ren, Zhou Zhao:
FluentSpeech: Stutter-Oriented Automatic Speech Editing with Context-Aware Diffusion Models. ACL (Findings) 2023: 11655-11671 - [c195]Jinglin Liu, Zhenhui Ye, Qian Chen, Siqi Zheng, Wen Wang, Qinglin Zhang, Zhou Zhao:
DopplerBAS: Binaural Audio Synthesis Addressing Doppler Effect. ACL (Findings) 2023: 11905-11912 - [c194]Wang Lin, Tao Jin, Wenwen Pan, Linjun Li, Xize Cheng, Ye Wang, Zhou Zhao:
TAVT: Towards Transferable Audio-Visual Text Generation. ACL (1) 2023: 14983-14999 - [c193]Aoxiong Yin, Tianyun Zhong, Li Tang, Weike Jin, Tao Jin, Zhou Zhao:
Gloss Attention for Gloss-free Sign Language Translation. CVPR 2023: 2551-2562 - [c192]Haoyuan Li, Hao Jiang, Tao Jin, Mengyan Li, Yan Chen, Zhijie Lin, Yang Zhao, Zhou Zhao:
DATE: Domain Adaptive Product Seeker for E-Commerce. CVPR 2023: 19315-19324 - [c191]Mengze Li, Han Wang, Wenqiao Zhang, Jiaxu Miao, Zhou Zhao, Shengyu Zhang, Wei Ji, Fei Wu:
WINNER: Weakly-supervised hIerarchical decompositioN and aligNment for spatio-tEmporal video gRounding. CVPR 2023: 23090-23099 - [c190]Zhou Yu, Lixiang Zheng, Zhou Zhao, Fei Wu, Jianping Fan, Kui Ren, Jun Yu:
ANetQA: A Large-scale Benchmark for Fine-grained Compositional Reasoning over Untrimmed Videos. CVPR 2023: 23191-23200 - [c189]Chenye Cui, Zhou Zhao, Yi Ren, Jinglin Liu, Rongjie Huang, Feiyang Chen, Zhefeng Wang, Baoxing Huai, Fei Wu:
VarietySound: Timbre-Controllable Video to Sound Generation Via Unsupervised Information Disentanglement. ICASSP 2023: 1-5 - [c188]Qinglin Zhang, Chong Deng, Jiaqing Liu, Hai Yu, Qian Chen, Wen Wang, Zhijie Yan, Jinglin Liu, Yi Ren, Zhou Zhao:
MUG: A General Meeting Understanding and Generation Benchmark. ICASSP 2023: 1-5 - [c187]Rongjie Huang, Jinglin Liu, Huadai Liu, Yi Ren, Lichao Zhang, Jinzheng He, Zhou Zhao:
TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation. ICLR 2023 - [c186]Zhenhui Ye, Ziyue Jiang, Yi Ren, Jinglin Liu, Jinzheng He, Zhou Zhao:
GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis. ICLR 2023 - [c185]Rongjie Huang, Jiawei Huang, Dongchao Yang, Yi Ren, Luping Liu, Mingze Li, Zhenhui Ye, Jinglin Liu, Xiang Yin, Zhou Zhao:
Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models. ICML 2023: 13916-13932 - [c184]Zhenyu Lu, Tianqi Yue, Zhou Zhao, Weiyong Si, Ning Wang, Chenguang Yang:
MechTac: A Multifunctional Tendon-Linked Optical Tactile Sensor for In/Out-the-Field-of-View Perception with Deep Learning. IECON 2023: 1-6 - [c183]Yazheng Yang, Zhou Zhao, Qi Liu:
MSSRNet: Manipulating Sequential Style Representation for Unsupervised Text Style Transfer. KDD 2023: 3022-3032 - [c182]Shengyu Zhang, Yunze Tong, Kun Kuang, Fuli Feng, Jiezhong Qiu, Jin Yu, Zhou Zhao, Hongxia Yang, Zhongfei Zhang, Fei Wu:
Stable Prediction on Graphs with Agnostic Distribution Shifts. CDPD 2023: 49-74 - [c181]Mengze Li, Haoyu Zhang, Juncheng Li, Zhou Zhao, Wenqiao Zhang, Shengyu Zhang, Shiliang Pu, Yueting Zhuang, Fei Wu:
Unsupervised Domain Adaptation for Video Object Grounding with Cascaded Debiasing Learning. ACM Multimedia 2023: 3807-3816 - [c180]Tao Jin, Xize Cheng, Linjun Li, Wang Lin, Ye Wang, Zhou Zhao:
Rethinking Missing Modality Learning from a Decoding Perspective. ACM Multimedia 2023: 4431-4439 - [c179]Haonan Shi, Wenwen Pan, Zhou Zhao, Mingmin Zhang, Fei Wu:
Unsupervised Domain Adaptation for Referring Semantic Segmentation. ACM Multimedia 2023: 5807-5818 - [c178]Zhiqing Hong, Chenye Cui, Rongjie Huang, Lichao Zhang, Jinglin Liu, Jinzheng He, Zhou Zhao:
UniSinger: Unified End-to-End Singing Voice Synthesis With Cross-Modality Information Matching. ACM Multimedia 2023: 7569-7579 - [c177]Jiahao Xun, Shengyu Zhang, Yanting Yang, Jieming Zhu, Liqun Deng, Zhou Zhao, Zhenhua Dong, Ruiqi Li, Lichao Zhang, Fei Wu:
DisCover: Disentangled Music Representation Learning for Cover Song Identification. SIGIR 2023: 453-463 - [c176]Liangcai Su, Fan Yan, Jieming Zhu, Xi Xiao, Haoyi Duan, Zhou Zhao, Zhenhua Dong, Ruiming Tang:
Beyond Two-Tower Matching: Learning Sparse Retrievable Cross-Interactions for Recommendation. SIGIR 2023: 548-557 - [i133]Rongjie Huang, Jiawei Huang, Dongchao Yang, Yi Ren, Luping Liu, Mingze Li, Zhenhui Ye, Jinglin Liu, Xiang Yin, Zhou Zhao:
Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models. CoRR abs/2301.12661 (2023) - [i132]Zhenhui Ye, Ziyue Jiang, Yi Ren, Jinglin Liu, Jinzheng He, Zhou Zhao:
GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis. CoRR abs/2301.13430 (2023) - [i131]Zijian Zhang, Zhou Zhao, Jun Yu, Qi Tian:
ShiftDDPMs: Exploring Conditional Diffusion Models by Shifting Diffusion Trajectories. CoRR abs/2302.02373 (2023) - [i130]Xize Cheng, Linjun Li, Tao Jin, Rongjie Huang, Wang Lin, Zehan Wang, Huangdai Liu, Ye Wang, Aoxiong Yin, Zhou Zhao:
MixSpeech: Cross-Modality Self-Learning with Audio-Visual Stream Mixup for Visual Speech Translation and Recognition. CoRR abs/2303.05309 (2023) - [i129]Qinglin Zhang, Chong Deng, Jiaqing Liu, Hai Yu, Qian Chen, Wen Wang, Zhijie Yan, Jinglin Liu, Yi Ren, Zhou Zhao:
Overview of the ICASSP 2023 General Meeting Understanding and Generation Challenge (MUG). CoRR abs/2303.13932 (2023) - [i128]Qinglin Zhang, Chong Deng, Jiaqing Liu, Hai Yu, Qian Chen, Wen Wang, Zhijie Yan, Jinglin Liu, Yi Ren, Zhou Zhao:
MUG: A General Meeting Understanding and Generation Benchmark. CoRR abs/2303.13939 (2023) - [i127]Haoyuan Li, Hao Jiang, Tao Jin, Mengyan Li, Yan Chen, Zhijie Lin, Yang Zhao, Zhou Zhao:
DATE: Domain Adaptive Product Seeker for E-commerce. CoRR abs/2304.03669 (2023) - [i126]Jiong Wang, Zhou Zhao, Fei Wu:
Set-Based Face Recognition Beyond Disentanglement: Burstiness Suppression With Variance Vocabulary. CoRR abs/2304.06249 (2023) - [i125]Rongjie Huang, Mingze Li, Dongchao Yang, Jiatong Shi, Xuankai Chang, Zhenhui Ye, Yuning Wu, Zhiqing Hong, Jiawei Huang, Jinglin Liu, Yi Ren, Zhou Zhao, Shinji Watanabe:
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head. CoRR abs/2304.12995 (2023) - [i124]Zhenhui Ye, Jinzheng He, Ziyue Jiang, Rongjie Huang, Jiawei Huang, Jinglin Liu, Yi Ren, Xiang Yin, Zejun Ma, Zhou Zhao:
GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking Face Generation. CoRR abs/2305.00787 (2023) - [i123]Dong Yao, Shengyu Zhang, Zhou Zhao, Jieming Zhu, Wenqiao Zhang, Rui Zhang, Xiaofei He, Fei Wu:
Denoising Multi-modal Sequential Recommenders with Contrastive Learning. CoRR abs/2305.01915 (2023) - [i122]Zhou Yu, Lixiang Zheng, Zhou Zhao, Fei Wu, Jianping Fan, Kui Ren, Jun Yu:
ANetQA: A Large-scale Benchmark for Fine-grained Compositional Reasoning over Untrimmed Videos. CoRR abs/2305.02519 (2023) - [i121]Ruiqi Li, Rongjie Huang, Lichao Zhang, Jinglin Liu, Zhou Zhao:
AlignSTS: Speech-to-Singing Conversion via Cross-Modal Alignment. CoRR abs/2305.04476 (2023) - [i120]Jinzheng He, Jinglin Liu, Zhenhui Ye, Rongjie Huang, Chenye Cui, Huadai Liu, Zhou Zhao:
RMSSinger: Realistic-Music-Score based Singing Voice Synthesis. CoRR abs/2305.10686 (2023) - [i119]Zhenhui Ye, Rongjie Huang, Yi Ren, Ziyue Jiang, Jinglin Liu, Jinzheng He, Xiang Yin, Zhou Zhao:
CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-training. CoRR abs/2305.10763 (2023) - [i118]Huadai Liu, Rongjie Huang, Jinzheng He, Gang Sun, Ran Shen, Xize Cheng, Zhou Zhao:
Wav2SQL: Direct Generalizable Speech-To-SQL Parsing. CoRR abs/2305.12552 (2023) - [i117]Huadai Liu, Rongjie Huang, Xuan Lin, Wenqiang Xu, Maozong Zheng, Hong Chen, Jinzheng He, Zhou Zhao:
ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer. CoRR abs/2305.12708 (2023) - [i116]Ziyue Jiang, Qian Yang, Jialong Zuo, Zhenhui Ye, Rongjie Huang, Yi Ren, Zhou Zhao:
FluentSpeech: Stutter-Oriented Automatic Speech Editing with Context-Aware Diffusion Models. CoRR abs/2305.13612 (2023) - [i115]Zehan Wang, Yang Zhao, Xize Cheng, Haifeng Huang, Jiageng Liu, Li Tang, Linjun Li, Yongqi Wang, Aoxiong Yin, Ziang Zhang, Zhou Zhao:
Connecting Multi-modal Contrastive Representations. CoRR abs/2305.14381 (2023) - [i114]Rongjie Huang, Huadai Liu, Xize Cheng, Yi Ren, Linjun Li, Zhenhui Ye, Jinzheng He, Lichao Zhang, Jinglin Liu, Xiang Yin, Zhou Zhao:
AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation. CoRR abs/2305.15403 (2023) - [i113]Jiawei Huang, Yi Ren, Rongjie Huang, Dongchao Yang, Zhenhui Ye, Chen Zhang, Jinglin Liu, Xiang Yin, Zejun Ma, Zhou Zhao:
Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation. CoRR abs/2305.18474 (2023) - [i112]Rongjie Huang, Chunlei Zhang, Yongqi Wang, Dongchao Yang, Luping Liu, Zhenhui Ye, Ziyue Jiang, Chao Weng, Zhou Zhao, Dong Yu:
Make-A-Voice: Unified Voice Synthesis With Discrete Representation. CoRR abs/2305.19269 (2023) - [i111]Luping Liu, Zijian Zhang, Yi Ren, Rongjie Huang, Xiang Yin, Zhou Zhao:
Detector Guidance for Multi-Object Text-to-Image Generation. CoRR abs/2306.02236 (2023) - [i110]Zhenhui Ye, Ziyue Jiang, Yi Ren, Jinglin Liu, Chen Zhang, Xiang Yin, Zejun Ma, Zhou Zhao:
Ada-TTA: Towards Adaptive High-Quality Text-to-Talking Avatar Synthesis. CoRR abs/2306.03504 (2023) - [i109]Ziyue Jiang, Yi Ren, Zhenhui Ye, Jinglin Liu, Chen Zhang, Qian Yang, Shengpeng Ji, Rongjie Huang, Chunfeng Wang, Xiang Yin, Zejun Ma, Zhou Zhao:
Mega-TTS: Zero-Shot Text-to-Speech at Scale with Intrinsic Inductive Bias. CoRR abs/2306.03509 (2023) - [i108]Xize Cheng, Tao Jin, Linjun Li, Wang Lin, Xinyu Duan, Zhou Zhao:
OpenSR: Open-Modality Speech Recognition via Maintaining Multi-Modality Alignment. CoRR abs/2306.06410 (2023) - [i107]Yazheng Yang, Zhou Zhao, Qi Liu:
MSSRNet: Manipulating Sequential Style Representation for Unsupervised Text Style Transfer. CoRR abs/2306.07994 (2023) - [i106]Ziyue Jiang, Jinglin Liu, Yi Ren, Jinzheng He, Chen Zhang, Zhenhui Ye, Pengfei Wei, Chunfeng Wang, Xiang Yin, Zejun Ma, Zhou Zhao:
Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech Prompts. CoRR abs/2307.07218 (2023) - [i105]Aoxiong Yin, Tianyun Zhong, Li Tang, Weike Jin, Tao Jin, Zhou Zhao:
Gloss Attention for Gloss-free Sign Language Translation. CoRR abs/2307.07361 (2023) - [i104]Zehan Wang, Haifeng Huang, Yang Zhao, Linjun Li, Xize Cheng, Yichen Zhu, Aoxiong Yin, Zhou Zhao:
Distilling Coarse-to-Fine Semantic Matching Knowledge for Weakly Supervised 3D Visual Grounding. CoRR abs/2307.09267 (2023) - [i103]Jiahao Xun, Shengyu Zhang, Yanting Yang, Jieming Zhu, Liqun Deng, Zhou Zhao, Zhenhua Dong, Ruiqi Li, Lichao Zhang, Fei Wu:
DisCover: Disentangled Music Representation Learning for Cover Song Identification. CoRR abs/2307.09775 (2023) - [i102]Zehan Wang, Haifeng Huang, Yang Zhao, Linjun Li, Xize Cheng, Yichen Zhu, Aoxiong Yin, Zhou Zhao:
3DRP-Net: 3D Relative Position-aware Network for 3D Visual Grounding. CoRR abs/2307.13363 (2023) - [i101]Zehan Wang, Haifeng Huang, Yang Zhao, Ziang Zhang, Zhou Zhao:
Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes. CoRR abs/2308.08769 (2023) - [i100]Shengpeng Ji, Jialong Zuo, Minghui Fang, Ziyue Jiang, Feiyang Chen, Xinyu Duan, Baoxing Huai, Zhou Zhao:
TextrolSpeech: A Text Style Control Speech Corpus With Codec Language Text-to-Speech Models. CoRR abs/2308.14430 (2023) - [i99]Yongqi Wang, Jionghao Bai, Rongjie Huang, Ruiqi Li, Zhiqing Hong, Zhou Zhao:
Speech-to-Speech Translation with Discrete-Unit-Based Style Transfer. CoRR abs/2309.07566 (2023) - [i98]Dongchao Yang, Jinchuan Tian, Xu Tan, Rongjie Huang, Songxiang Liu, Xuankai Chang, Jiatong Shi, Sheng Zhao, Jiang Bian, Xixin Wu, Zhou Zhao, Shinji Watanabe, Helen Meng:
UniAudio: An Audio Foundation Model Toward Universal Audio Generation. CoRR abs/2310.00704 (2023) - [i97]Zehan Wang, Ziang Zhang, Luping Liu, Yang Zhao, Haifeng Huang, Tao Jin, Zhou Zhao:
Extending Multi-modal Contrastive Representations. CoRR abs/2310.08884 (2023) - [i96]Zijian Zhang, Luping Liu, Zhijie Lin, Yichen Zhu, Zhou Zhao:
Unsupervised Discovery of Interpretable Directions in h-space of Pre-trained Diffusion Models. CoRR abs/2310.09912 (2023) - [i95]Haoyi Duan, Yan Xia, Mingze Zhou, Li Tang, Jieming Zhu, Zhou Zhao:
Cross-modal Prompts: Adapting Large Pre-trained Models for Audio-Visual Downstream Tasks. CoRR abs/2311.05152 (2023) - [i94]Haoyuan Li, Zhou Zhao, Zhu Zhang, Zhijie Lin:
Weakly-Supervised Video Moment Retrieval via Regularized Two-Branch Proposal Networks with Erasing Mechanism. CoRR abs/2311.13946 (2023) - 2022
- [j63]Tao Jin, Zhou Zhao, Peng Wang, Jun Yu, Fei Wu:
Interaction augmented transformer with decoupled decoding for video captioning. Neurocomputing 492: 496-507 (2022) - [j62]Pengcheng Zhang, Zhou Zhao, Nannan Wang, Jun Yu, Fei Wu:
Local-Global Graph Pooling via Mutual Information Maximization for Video-Paragraph Retrieval. IEEE Trans. Circuits Syst. Video Technol. 32(10): 7133-7146 (2022) - [j61]Jingkuan Song, Jingqiu Zhang, Lianli Gao, Zhou Zhao, Heng Tao Shen:
AgeGAN++: Face Aging and Rejuvenation With Dual Conditional GANs. IEEE Trans. Multim. 24: 791-804 (2022) - [j60]Zhaoyu Guo, Zhou Zhao, Weike Jin, Dazhou Wang, Ruitao Liu, Jun Yu:
TaoHighlight: Commodity-Aware Multi-Modal Video Highlight Detection in E-Commerce. IEEE Trans. Multim. 24: 2606-2616 (2022) - [j59]Wenhua Wang, Yuqun Zhang, Yulei Sui, Yao Wan, Zhou Zhao, Jian Wu, Philip S. Yu, Guandong Xu:
Reinforcement-Learning-Guided Source Code Summarization Using Hierarchical Attention. IEEE Trans. Software Eng. 48(2): 102-119 (2022) - [c175]Jinzheng He, Zhou Zhao, Yi Ren, Jinglin Liu, Baoxing Huai, Nicholas Jing Yuan:
Flow-Based Unconstrained Lip to Speech Generation. AAAI 2022: 843-851 - [c174]Jinglin Liu, Zhiying Zhu, Yi Ren, Wencan Huang, Baoxing Huai, Nicholas Jing Yuan, Zhou Zhao:
Parallel and High-Fidelity Text-to-Lip Generation. AAAI 2022: 1738-1746 - [c173]Jinglin Liu, Chengxi Li, Yi Ren, Feiyang Chen, Zhou Zhao:
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism. AAAI 2022: 11020-11028 - [c172]Tao Jin, Zhou Zhao, Meng Zhang, Xingshan Zeng:
Prior Knowledge and Memory Enriched Transformer for Sign Language Translation. ACL (Findings) 2022: 3766-3775 - [c171]Jinglin Liu, Chengxi Li, Yi Ren, Zhiying Zhu, Zhou Zhao:
Learning the Beauty in Songs: Neural Singing Voice Beautifier. ACL (1) 2022: 7970-7983 - [c170]Yi Ren, Xu Tan, Tao Qin, Zhou Zhao, Tie-Yan Liu:
Revisiting Over-Smoothness in Text to Speech. ACL (1) 2022: 8197-8213 - [c169]Mengze Li, Tianbao Wang, Haoyu Zhang, Shengyu Zhang, Zhou Zhao, Jiaxu Miao, Wenqiao Zhang, Wenming Tan, Jin Wang, Peng Wang, Shiliang Pu, Fei Wu:
End-to-End Modeling via Information Tree for One-Shot Natural Language Spatial Video Grounding. ACL (1) 2022: 8707-8717 - [c168]Wenwen Pan, Haonan Shi, Zhou Zhao, Jieming Zhu, Xiuqiang He, Zhigeng Pan, Lianli Gao, Jun Yu, Fei Wu, Qi Tian:
Wnet: Audio-Guided Video Object Segmentation via Wavelet-Based Cross-Modal Denoising Networks. CVPR 2022: 1310-1321 - [c167]Aoxiong Yin, Zhou Zhao, Weike Jin, Meng Zhang, Xingshan Zeng, Xiaofei He:
MLSLT: Towards Multilingual Sign Language Translation. CVPR 2022: 5099-5109 - [c166]Xinyu Lyu, Lianli Gao, Yuyu Guo, Zhou Zhao, Hao Huang, Heng Tao Shen, Jingkuan Song:
Fine-Grained Predicates Learning for Scene Graph Generation. CVPR 2022: 19445-19453 - [c165]Yan Xia, Zhou Zhao:
Cross-modal Background Suppression for Audio-Visual Event Localization. CVPR 2022: 19957-19966 - [c164]Lichao Zhang, Yi Ren, Liqun Deng, Zhou Zhao:
HiFiDenoise: High-Fidelity Denoising Text to Speech with Adversarial Networks. ICASSP 2022: 7232-7236 - [c163]Yi Ren, Ming Lei, Zhiying Huang, Shiliang Zhang, Qian Chen, Zhijie Yan, Zhou Zhao:
Prosospeech: Enhancing Prosody with Quantized Vector Pre-Training in Text-To-Speech. ICASSP 2022: 7577-7581 - [c162]Luping Liu, Yi Ren, Zhijie Lin, Zhou Zhao:
Pseudo Numerical Methods for Diffusion Models on Manifolds. ICLR 2022 - [c161]Rongjie Huang, Max W. Y. Lam, Jun Wang, Dan Su, Dong Yu, Yi Ren, Zhou Zhao:
FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis. IJCAI 2022: 4157-4163 - [c160]Zhenhui Ye, Zhou Zhao, Yi Ren, Fei Wu:
SyntaSpeech: Syntax-Aware Generative Adversarial Text-to-Speech. IJCAI 2022: 4468-4474 - [c159]Lichao Zhang, Zhou Zhao, Yi Ren, Liqun Deng:
EditSinger: Zero-Shot Text-Based Singing Voice Editing System with Diverse Prosody Modeling. IJCAI 2022: 4503-4509 - [c158]Zhou Zhao, Zhenyu Lu:
Multi-purpose Tactile Perception Based on Deep Learning in a New Tendon-driven Optical Tactile Sensor. IROS 2022: 2099-2104 - [c157]Yuxiao Lin, Zhihao Du, Shiliang Zhang, Fan Yu, Zhou Zhao, Fei Wu:
Separate-to-Recognize: Joint Multi-target Speech Separation and Speech Recognition for Speaker-attributed ASR. ISCSLP 2022: 150-154 - [c156]Rongjie Huang, Chenye Cui, Feiyang Chen, Yi Ren, Jinglin Liu, Zhou Zhao, Baoxing Huai, Zhefeng Wang:
SingGAN: Generative Adversarial Network For High-Fidelity Singing Voice Generation. ACM Multimedia 2022: 2525-2535 - [c155]Rongjie Huang, Zhou Zhao, Huadai Liu, Jinglin Liu, Chenye Cui, Yi Ren:
ProDiff: Progressive Fast Diffusion Model for High-Quality Text-to-Speech. ACM Multimedia 2022: 2595-2605 - [c154]Mengze Li, Tianbao Wang, Haoyu Zhang, Shengyu Zhang, Zhou Zhao, Wenqiao Zhang, Jiaxu Miao, Shiliang Pu, Fei Wu:
HERO: HiErarchical spatio-tempoRal reasOning with Contrastive Action Correspondence for End-to-End Video Object Grounding. ACM Multimedia 2022: 3801-3810 - [c153]Tao Jin, Zhou Zhao, Meng Zhang, Xingshan Zeng:
MC-SLT: Towards Low-Resource Signer-Adaptive Sign Language Translation. ACM Multimedia 2022: 4939-4947 - [c152]