default search action

combined dblp search
author search
venue search
publication search

ask others

Jialong Zuo

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2025
[j1]
- view
  authority control:
- export record
  dblp key:
  - journals/ivc/HongZHZTGS25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/ivc/HongZHZTGS25
Jiahao Hong, Jialong Zuo, Chuchu Han, Ruochen Zheng, Ming Tian, Changxin Gao, Nong Sang:
Spatial cascaded clustering and weighted memory for unsupervised person re-identification. Image Vis. Comput. 156: 105478 (2025)
[c23]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/ZuoN0ZHSG025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/ZuoN0ZHSG025
Jialong Zuo, Ying Nie, Tianyu Guo, Huaxin Zhang, Jiahao Hong, Nong Sang, Changxin Gao, Kai Han:
L-Man: A Large Multi-modal Model Unifying Human-centric Tasks. AAAI 2025: 11095-11103
[c22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/aaai/JiJZ0C0025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/aaai/JiJZ0C0025
Shengpeng Ji, Ziyue Jiang, Jialong Zuo, Minghui Fang, Yifu Chen, Tao Jin, Zhou Zhao:
Speech Watermarking with Discrete Intermediate Representations. AAAI 2025: 24239-24247
[c21]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/JiCWZ0000CZ025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/JiCWZ0000CZ025
Shengpeng Ji, Qian Chen, Wen Wang, Jialong Zuo, Minghui Fang, Ziyue Jiang, Hai Huang, Zehan Wang, Xize Cheng, Siqi Zheng, Zhou Zhao:
ControlSpeech: Towards Simultaneous and Independent Zero-shot Speaker Cloning and Zero-shot Language Style Control. ACL (1) 2025: 6966-6981
[c20]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/Ji0Z0WWH025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/Ji0Z0WWH025
Shengpeng Ji, Minghui Fang, Jialong Zuo, Ziyue Jiang, Dingdong Wang, Hanting Wang, Hai Huang, Zhou Zhao:
Language-Codec: Bridging Discrete Codec Representations and Speech Language Models. ACL (1) 2025: 13332-13345
[c19]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/0002JZ00ZCY0WD025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/0002JZ00ZCY0WD025
Minghui Fang, Shengpeng Ji, Jialong Zuo, Hai Huang, Yan Xia, Jieming Zhu, Xize Cheng, Xiaoda Yang, Wenrui Liu, Gang Wang, Zhenhua Dong, Zhou Zhao:
CART: A Generative Cross-Modal Retrieval Framework With Coarse-To-Fine Semantic Modeling. ACL (1) 2025: 15120-15133
[c18]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/ZuoJ0L0CYCD025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/ZuoJ0L0CYCD025
Jialong Zuo, Shengpeng Ji, Minghui Fang, Mingze Li, Ziyue Jiang, Xize Cheng, Xiaoda Yang, Feiyang Chen, Xinyu Duan, Zhou Zhao:
Rhythm Controllable and Efficient Zero-Shot Voice Conversion via Shortcut Flow Matching. ACL (1) 2025: 16203-16217
[c17]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/coling/0003BCZJJ0YYZ25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/coling/0003BCZJJ0YYZ25
Wenrui Liu, Jionghao Bai, Xize Cheng, Jialong Zuo, Ziyue Jiang, Shengpeng Ji, Minghui Fang, Xiaoda Yang, Qian Yang, Zhou Zhao:
VoxpopuliTTS: a large-scale multilingual TTS corpus for zero-shot speech generation. COLING 2025: 10293-10297
[c16]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/ZhangX0ZHGZ0S25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/ZhangX0ZHGZ0S25
Huaxin Zhang, Xiaohao Xu, Xiang Wang, Jialong Zuo, Xiaonan Huang, Changxin Gao, Shanjun Zhang, Li Yu, Nong Sang:
Holmes-VAU: Towards Long-term Video Anomaly Understanding at Any Granularity. CVPR 2025: 13843-13853
[c15]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/ZuoJ00C00ZTG025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/ZuoJ00C00ZTG025
Jialong Zuo, Shengpeng Ji, Minghui Fang, Ziyue Jiang, Xize Cheng, Qian Yang, Wenrui Liu, Guangyan Zhang, Zehai Tu, Yiwen Guo, Zhou Zhao:
Enhancing Expressive Voice Conversion with Discrete Pitch-Conditioned Flow Matching Model. ICASSP 2025: 1-5
[c14]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/ChengZW0Z0JZ0025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ChengZW0Z0JZ0025
Xize Cheng, Siqi Zheng, Zehan Wang, Minghui Fang, Ziang Zhang, Rongjie Huang, Shengpeng Ji, Jialong Zuo, Tao Jin, Zhou Zhao:
OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup. ICLR 2025
[c13]
- view
  - electronic edition @ openreview.net (open access)
  - details & citations
- export record
  dblp key:
  - conf/iclr/Ji00C0Z0C0LZY0J25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iclr/Ji00C0Z0C0LZY0J25
Shengpeng Ji, Ziyue Jiang, Wen Wang, Yifu Chen, Minghui Fang, Jialong Zuo, Qian Yang, Xize Cheng, Zehan Wang, Ruiqi Li, Ziang Zhang, Xiaoda Yang, Rongjie Huang, Yidi Jiang, Qian Chen, Siqi Zheng, Zhou Zhao:
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling. ICLR 2025
[c12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/0002JZC0YHZ025
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/0002JZC0YHZ025
Minghui Fang, Shengpeng Ji, Jialong Zuo, Xize Cheng, Wenrui Liu, Xiaoda Yang, Ruofan Hu, Jieming Zhu, Zhou Zhao:
GTA: Towards Generative Text-To-Audio Retrieval via Multi-Scale Tokenizer. INTERSPEECH 2025
[c11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/LuanYCH0LZDML25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/LuanYCH0LZDML25
Kaixuan Luan, Xiaoda Yang, Shile Cai, Ruofan Hu, Minghui Fang, Wenrui Liu, Jialong Zuo, Jiaqi Duan, Yuhang Ma, Junyu Lu:
MelRe: Vision-Based Mel-Spectrogram Restoration. INTERSPEECH 2025
[c10]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/000300YL0ZY00LC25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/000300YL0ZY00LC25
Wenrui Liu, Qian Chen, Wen Wang, Guanrou Yang, Weiqin Li, Minghui Fang, Jialong Zuo, Xiaoda Yang, Tao Jin, Jin Xu, Zemin Liu, Yafeng Chen, Jionghao Bai, Zhifang Guo:
Speech Token Prediction via Compressed-to-fine Language Modeling for Speech Generation. ACM Multimedia 2025: 10632-10641
[i28]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-05471
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-05471
Jialong Zuo, Shengpeng Ji, Minghui Fang, Ziyue Jiang, Xize Cheng, Qian Yang, Wenrui Liu, Guangyan Zhang, Zehai Tu, Yiwen Guo, Zhou Zhao:
Enhancing Expressive Voice Conversion with Discrete Pitch-Conditioned Flow Matching Model. CoRR abs/2502.05471 (2025)
[i27]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-18924
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-18924
Ziyue Jiang, Yi Ren, Ruiqi Li, Shengpeng Ji, Zhenhui Ye, Chen Zhang, Jionghao Bai, Xiaoda Yang, Jialong Zuo, Yu Zhang, Rui Liu, Xiang Yin, Zhou Zhao:
Sparse Alignment Enhanced Latent Diffusion Transformer for Zero-Shot Speech Synthesis. CoRR abs/2502.18924 (2025)
[i26]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-09558
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-09558
Shengpeng Ji, Tianle Liang, Yangzhuo Li, Jialong Zuo, Minghui Fang, Jinzheng He, Yifu Chen, Zhengqing Liu, Ziyue Jiang, Xize Cheng, Siqi Zheng, Jin Xu, Junyang Lin, Zhou Zhao:
WavReward: Spoken Dialogue Models With Generalist Reward Evaluators. CoRR abs/2505.09558 (2025)
[i25]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-01014
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-01014
Jialong Zuo, Shengpeng Ji, Minghui Fang, Mingze Li, Ziyue Jiang, Xize Cheng, Xiaoda Yang, Feiyang Chen, Xinyu Duan, Zhou Zhao:
Rhythm Controllable and Efficient Zero-Shot Voice Conversion via Shortcut Flow Matching. CoRR abs/2506.01014 (2025)
[i24]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-09385
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-09385
Jialong Zuo, Yongtai Deng, Mengdan Tan, Rui Jin, Dongyue Wu, Nong Sang, Liang Pan, Changxin Gao:
ReID5o: Achieving Omni Multi-modal Person Re-identification in a Single Model. CoRR abs/2506.09385 (2025)
[i23]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-23674
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-23674
Dongyue Wu, Zilin Guo, Jialong Zuo, Nong Sang, Changxin Gao:
Partial Forward Blocking: A Novel Data Pruning Paradigm for Lossless Training Acceleration. CoRR abs/2506.23674 (2025)
[i22]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2509-00503
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-00503
Jialong Zuo, Guangyan Zhang, Minghui Fang, Shengpeng Ji, Xiaoqi Jiao, Jingyu Li, Yiwen Guo, Zhou Zhao:
Entropy-based Coarse and Compressed Semantic Speech Representation Learning. CoRR abs/2509.00503 (2025)
[i21]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2510-12422
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-12422
Jialong Zuo, Yongtai Deng, Lingdong Kong, Jingkang Yang, Rui Jin, Yiwei Zhang, Nong Sang, Liang Pan, Ziwei Liu, Changxin Gao:
VideoLucy: Deep Memory Backtracking for Long Video Understanding. CoRR abs/2510.12422 (2025)
[i20]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2511-10334
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2511-10334
Wenti Yin, Huaxin Zhang, Xiang Wang, Yuqing Lu, Yicheng Zhang, Bingquan Gong, Jialong Zuo, Li Yu, Changxin Gao, Nong Sang:
Learning to Tell Apart: Weakly Supervised Video Anomaly Detection via Disentangled Semantic Alignment. CoRR abs/2511.10334 (2025)
[i19]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2512-10958
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2512-10958
Ao Liang, Lingdong Kong, Tianyi Yan, Hongsi Liu, Wesley Yang, Ziqi Huang, Wei Yin, Jialong Zuo, Yixuan Hu, Dekai Zhu, Dongyue Lu, Youquan Liu, Guangfeng Jiang, Linfeng Li, Xiangtai Li, Long Zhuo, Lai Xing Ng, Benoit R. Cottereau, Changxin Gao, Liang Pan, Wei Tsang Ooi, Ziwei Liu:
WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World. CoRR abs/2512.10958 (2025)
[i18]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2512-15110
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2512-15110
Jialong Zuo, Haoyou Deng, Hanyu Zhou, Jiaxin Zhu, Yicheng Zhang, Yiwei Zhang, Yongxin Yan, Kaixing Huang, Weisen Chen, Yongtai Deng, Rui Jin, Nong Sang, Changxin Gao:
Is Nano Banana Pro a Low-Level Vision All-Rounder? A Comprehensive Evaluation on 14 Tasks and 40 Datasets. CoRR abs/2512.15110 (2025)
2024
[c9]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/Ji0WZZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/Ji0WZZ24
Shengpeng Ji, Ziyue Jiang, Hanting Wang, Jialong Zuo, Zhou Zhao:
MobileSpeech: A Fast and High-Fidelity Framework for Mobile Zero-Shot Text-to-Speech. ACL (1) 2024: 13588-13600
[c8]
- view
  authority control:
- export record
  dblp key:
  - conf/cvpr/ZuoZNZ0S0G24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/ZuoZNZ0S0G24
Jialong Zuo, Hanyu Zhou, Ying Nie, Feng Zhang, Tianyu Guo, Nong Sang, Yunhe Wang, Changxin Gao:
UFineBench: Towards Text-based Person Retrieval with Ultra-fine Granularity. CVPR 2024: 22010-22019
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/YangCDQH0JZHZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/YangCDQH0JZHZ024
Xiaoda Yang, Xize Cheng, Jiaqi Duan, Hongshun Qiu, Minjie Hong, Minghui Fang, Shengpeng Ji, Jialong Zuo, Zhiqing Hong, Zhimeng Zhang, Tao Jin:
AudioVSR: Enhancing Video Speech Recognition with Audio Data. EMNLP 2024: 15352-15361
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/icassp/JiZ00CDHZ24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/icassp/JiZ00CDHZ24
Shengpeng Ji, Jialong Zuo, Minghui Fang, Ziyue Jiang, Feiyang Chen, Xinyu Duan, Baoxing Huai, Zhou Zhao:
TextrolSpeech: A Text Style Control Speech Corpus with Codec Language Text-to-Speech Models. ICASSP 2024: 10301-10305
[c5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - conf/interspeech/YangZS0L0C0H24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/interspeech/YangZS0L0C0H24
Qian Yang, Jialong Zuo, Zhe Su, Ziyue Jiang, Mingze Li, Zhou Zhao, Feiyang Chen, Zhefeng Wang, Baoxing Huai:
MSceneSpeech: A Multi-Scene Speech Dataset For Expressive Speech Synthesis. INTERSPEECH 2024
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/mm/YangCF0ZJZ024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/mm/YangCF0ZJZ024
Xiaoda Yang, Xize Cheng, Dongjie Fu, Minghui Fang, Jialong Zuo, Shengpeng Ji, Zhou Zhao, Tao Jin:
SyncTalklip: Highly Synchronized Lip-Readable Speaker Generation with Multi-Task Learning. ACM Multimedia 2024: 8149-8158
[c3]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/ZuoHZYZGS024
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ZuoHZYZGS024
Jialong Zuo, Jiahao Hong, Feng Zhang, Changqian Yu, Hanyu Zhou, Changxin Gao, Nong Sang, Jingdong Wang:
PLIP: Language-Image Pre-training for Person Representation Learning. NeurIPS 2024
[c2]
- view
  - electronic edition @ nips.cc (open access)
  - details & citations
- export record
  dblp key:
  - conf/nips/ZuoNZZW0SG24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/nips/ZuoNZZW0SG24
Jialong Zuo, Ying Nie, Hanyu Zhou, Huaxin Zhang, Haoyu Wang, Tianyu Guo, Nong Sang, Changxin Gao:
Cross-video Identity Correlating for Person Re-identification Pre-training. NeurIPS 2024
[i17]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-09378
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-09378
Shengpeng Ji, Ziyue Jiang, Hanting Wang, Jialong Zuo, Zhou Zhao:
MobileSpeech: A Fast and High-Fidelity Framework for Mobile Zero-Shot Text-to-Speech. CoRR abs/2402.09378 (2024)
[i16]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2402-12208
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-12208
Shengpeng Ji, Minghui Fang, Ziyue Jiang, Rongjie Huang, Jialong Zuo, Shulei Wang, Zhou Zhao:
Language-Codec: Reducing the Gaps Between Discrete Codec Representation and Speech Language Models. CoRR abs/2402.12208 (2024)
[i15]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2403-00261
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-00261
Jiahao Hong, Jialong Zuo, Chuchu Han, Ruochen Zheng, Ming Tian, Changxin Gao, Nong Sang:
Spatial Cascaded Clustering and Weighted Memory for Unsupervised Person Re-identification. CoRR abs/2403.00261 (2024)
[i14]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-01205
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-01205
Shengpeng Ji, Jialong Zuo, Minghui Fang, Siqi Zheng, Qian Chen, Wen Wang, Ziyue Jiang, Hai Huang, Xize Cheng, Rongjie Huang, Zhou Zhao:
ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec. CoRR abs/2406.01205 (2024)
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-12235
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-12235
Huaxin Zhang, Xiaohao Xu, Xiang Wang, Jialong Zuo, Chuchu Han, Xiaonan Huang, Changxin Gao, Yuehuan Wang, Nong Sang:
Holmes-VAD: Towards Unbiased and Explainable Video Anomaly Detection via Multi-modal LLM. CoRR abs/2406.12235 (2024)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-17507
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-17507
Minghui Fang, Shengpeng Ji, Jialong Zuo, Hai Huang, Yan Xia, Jieming Zhu, Xize Cheng, Xiaoda Yang, Wenrui Liu, Gang Wang, Zhenhua Dong, Zhou Zhao:
ACE: A Generative Cross-Modal Retrieval Framework with Coarse-To-Fine Semantic Modeling. CoRR abs/2406.17507 (2024)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2407-14006
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-14006
Qian Yang, Jialong Zuo, Zhe Su, Ziyue Jiang, Mingze Li, Zhou Zhao, Feiyang Chen, Zhefeng Wang, Baoxing Huai:
MSceneSpeech: A Multi-Scene Speech Dataset For Expressive Speech Synthesis. CoRR abs/2407.14006 (2024)
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2408-16532
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-16532
Shengpeng Ji, Ziyue Jiang, Xize Cheng, Yifu Chen, Minghui Fang, Jialong Zuo, Qian Yang, Ruiqi Li, Ziang Zhang, Xiaoda Yang, Rongjie Huang, Yidi Jiang, Qian Chen, Siqi Zheng, Wen Wang, Zhou Zhao:
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling. CoRR abs/2408.16532 (2024)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-18569
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-18569
Jialong Zuo, Ying Nie, Hanyu Zhou, Huaxin Zhang, Haoyu Wang, Tianyu Guo, Nong Sang, Changxin Gao:
Cross-video Identity Correlating for Person Re-identification Pre-training. CoRR abs/2409.18569 (2024)
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-21269
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-21269
Xize Cheng, Siqi Zheng, Zehan Wang, Minghui Fang, Ziang Zhang, Rongjie Huang, Ziyang Ma, Shengpeng Ji, Jialong Zuo, Tao Jin, Zhou Zhao:
OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup. CoRR abs/2410.21269 (2024)
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2411-13577
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2411-13577
Shengpeng Ji, Yifu Chen, Minghui Fang, Jialong Zuo, Jingyu Lu, Hanting Wang, Ziyue Jiang, Long Zhou, Shujie Liu, Xize Cheng, Xiaoda Yang, Zehan Wang, Qian Yang, Jian Li, Yidi Jiang, Jingzhen He, Yunfei Chu, Jin Xu, Zhou Zhao:
WavChat: A Survey of Spoken Dialogue Models. CoRR abs/2411.13577 (2024)
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2412-06171
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-06171
Huaxin Zhang, Xiaohao Xu, Xiang Wang, Jialong Zuo, Xiaonan Huang, Changxin Gao, Shanjun Zhang, Li Yu, Nong Sang:
Holmes-VAU: Towards Long-term Video Anomaly Understanding at Any Granularity. CoRR abs/2412.06171 (2024)
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2412-13917
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-13917
Shengpeng Ji, Ziyue Jiang, Jialong Zuo, Minghui Fang, Yifu Chen, Tao Jin, Zhou Zhao:
Speech Watermarking with Discrete Intermediate Representations. CoRR abs/2412.13917 (2024)
2023
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/acl/JiangYZYHRZ23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/JiangYZYHRZ23
Ziyue Jiang, Qian Yang, Jialong Zuo, Zhenhui Ye, Rongjie Huang, Yi Ren, Zhou Zhao:
FluentSpeech: Stutter-Oriented Automatic Speech Editing with Context-Aware Diffusion Models. ACL (Findings) 2023: 11655-11671
[i4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-08386
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-08386
Jialong Zuo, Changqian Yu, Nong Sang, Changxin Gao:
PLIP: Language-Image Pre-training for Person Representation Learning. CoRR abs/2305.08386 (2023)
[i3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2305-13612
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-13612
Ziyue Jiang, Qian Yang, Jialong Zuo, Zhenhui Ye, Rongjie Huang, Yi Ren, Zhou Zhao:
FluentSpeech: Stutter-Oriented Automatic Speech Editing with Context-Aware Diffusion Models. CoRR abs/2305.13612 (2023)
[i2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-14430
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-14430
Shengpeng Ji, Jialong Zuo, Minghui Fang, Ziyue Jiang, Feiyang Chen, Xinyu Duan, Baoxing Huai, Zhou Zhao:
TextrolSpeech: A Text Style Control Speech Corpus With Codec Language Text-to-Speech Models. CoRR abs/2308.14430 (2023)
[i1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2312-03441
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-03441
Jialong Zuo, Hanyu Zhou, Ying Nie, Feng Zhang, Tianyu Guo, Nong Sang, Yunhe Wang, Changxin Gao:
UFineBench: Towards Text-based Person Retrieval with Ultra-fine Granularity. CoRR abs/2312.03441 (2023)

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.