


Li Yuan 0007
Person information
- affiliation: Peking University, School of Electronic and Computer Engineering, Beijing, China
- affiliation: National University of Singapore, Singapore
Other persons with the same name
- Li Yuan — disambiguation page
- Li Yuan 0001 — University of Science and Technology Beijing, School of Automation and Electrical Engineering, China
- Li Yuan 0002 — Harbin Institute of Technology, Shenzhen Graduate School, Key Laboratory of Network Oriented Intelligent Computation, China
- Li Yuan 0003 — Chinese Academy of Sciences, Academy of Mathematics and Systems Science, LSEC & NCMIS, Beijing, China (and 1 more)
- Li Yuan 0004 — Lamar University, Department of Electrical Engineering, Beaumont, TX, USA
- Li Yuan 0005 — Beijing University of Technology, College of Life Science and Bioengineering, China
- Li Yuan 0006 — RWTH Aachen University, Institute of Imaging and Computer Vision, Germany
2020 – today
- 2025
- [j14]Tao Wang, Li Yuan, Xinchao Wang, Jiashi Feng:
Learning Box Regression and Mask Segmentation Under Long-Tailed Distribution with Gradient Transfusing. Int. J. Comput. Vis. 133(2): 951-967 (2025) - [c48]Shuo Yang, Kun-Peng Ning, Yu-Yang Liu, Jia-Yu Yao, Yong-Hong Tian, Yi-Bing Song, Li Yuan:
Is Parameter Collision Hindering Continual Learning in LLMs? COLING 2025: 4243-4259 - 2024
- [j13]Ping Li, Yu Zhang, Li Yuan, Xianghua Xu:
Fully Transformer-Equipped Architecture for end-to-end Referring Video Object Segmentation. Inf. Process. Manag. 61(1): 103566 (2024) - [j12]Haonan Qiu, Munan Ning, Zeyin Song, Wei Fang, Yanqi Chen, Tao Sun, Zhengyu Ma, Li Yuan, Yonghong Tian:
Self-architectural knowledge distillation for spiking neural networks. Neural Networks 178: 106475 (2024) - [j11]Ping Li, Yu Zhang, Li Yuan, Huaxin Xiao, Binbin Lin, Xianghua Xu:
Efficient Long-Short Temporal Attention network for unsupervised Video Object Segmentation. Pattern Recognit. 146: 110078 (2024) - [j10]Ping Li, Yu Zhang, Li Yuan, Jian Zhao, Xianghua Xu, Xiaoqin Zhang:
Adversarial Attacks on Video Object Segmentation With Hard Region Discovery. IEEE Trans. Circuits Syst. Video Technol. 34(6): 5049-5062 (2024) - [j9]Shiyu Li, Pengchong Qiao, Lin Wang, Munan Ning, Li Yuan, Yefeng Zheng, Jie Chen:
An Organ-Aware Diagnosis Framework for Radiology Report Generation. IEEE Trans. Medical Imaging 43(12): 4253-4265 (2024) - [j8]Li Yuan, Tao Wang, Xiaopeng Zhang, Francis Eng Hock Tay, Zequn Jie, Yonghong Tian, Wei Liu, Jiashi Feng:
Learnable Central Similarity Quantization for Efficient Image and Video Retrieval. IEEE Trans. Neural Networks Learn. Syst. 35(12): 18717-18730 (2024) - [c47]Zesen Cheng, Kehan Li, Peng Jin, Siheng Li, Xiangyang Ji, Li Yuan, Chang Liu, Jie Chen:
Parallel Vertex Diffusion for Unified Visual Grounding. AAAI 2024: 1326-1334 - [c46]Meng Cao, Haoran Tang, Jinfa Huang, Peng Jin, Can Zhang, Ruyang Liu, Long Chen, Xiaodan Liang, Li Yuan, Ge Li:
RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter. ACL (Findings) 2024: 7160-7174 - [c45]Kun-Peng Ning, Ming Pang, Zheng Fang, Xue Jiang, Xi-Wei Zhao, Changping Peng, Zhangang Lin, Jinghe Hu, Jingping Shao, Li Yuan:
Towards Better Seach Query Classification with Distribution-Diverse Multi-Expert Knowledge Distillation in JD Ads Search. CIKM 2024: 4786-4794 - [c44]Tao Wang, Lei Jin, Zheng Wang, Jianshu Li, Liang Li, Fang Zhao, Yu Cheng, Li Yuan, Li Zhou, Junliang Xing, Jian Zhao:
SynSP: Synergy of Smoothness and Precision in Pose Sequences Refinement. CVPR 2024: 1824-1833 - [c43]Yian Zhao, Kehan Li, Zesen Cheng, Pengchong Qiao, Xiawu Zheng, Rongrong Ji, Chang Liu, Li Yuan, Jie Chen:
GraCo: Granularity-Controllable Interactive Segmentation. CVPR 2024: 3501-3510 - [c42]Peng Jin, Ryuichi Takanobu, Wancai Zhang, Xiaochun Cao, Li Yuan:
Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding. CVPR 2024: 13700-13710 - [c41]Mingyue Guo, Li Yuan, Zhaoyi Yan, Binghui Chen, Yaowei Wang, Qixiang Ye:
Regressor-Segmenter Mutual Prompt Learning for Crowd Counting. CVPR 2024: 28380-28389 - [c40]Kehan Li, Yanbo Fan, Yang Wu, Zhongqian Sun, Wei Yang, Xiangyang Ji, Li Yuan, Jie Chen:
Learning Pseudo 3D Guidance for View-Consistent Texturing with 2D Diffusion. ECCV (86) 2024: 18-34 - [c39]Hao Li, Yanhao Jia, Peng Jin, Zesen Cheng, Kehan Li, Jialu Sui, Chang Liu, Li Yuan:
FreestyleRet: Retrieving Images from Style-Diversified Queries. ECCV (23) 2024: 258-274 - [c38]Wangbo Yu, Li Yuan, Yan-Pei Cao, Xiangjun Gao, Xiaoyu Li, Wenbo Hu, Quan Long, Ying Shan, Yonghong Tian:
HiFi-123: Towards High-Fidelity One Image to 3D Content Generation. ECCV (73) 2024: 258-274 - [c37]Junwu Zhang, Zhenyu Tang, Yatian Pang, Xinhua Cheng, Peng Jin, Yida Wei, Xing Zhou, Munan Ning, Li Yuan:
Repaint123: Fast and High-Quality One Image to 3D Generation with Progressive Controllable Repainting. ECCV (25) 2024: 303-320 - [c36]Peng Jin, Hao Li, Zesen Cheng, Kehan Li, Runyi Yu, Chang Liu, Xiangyang Ji, Li Yuan, Jie Chen:
Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation. ECCV (25) 2024: 392-409 - [c35]Zhongwei Wan, Ziang Wu, Che Liu, Jinfa Huang, Zhihong Zhu, Peng Jin, Longyue Wang, Li Yuan:
LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference. EMNLP (Findings) 2024: 4065-4078 - [c34]Jun Niu, Zhaokun Zhou, Kaiwei Che, Li Yuan:
A Multi-modal Spiking Meta-learner with Brain-Inspired Task-Aware Modulation Scheme. ICANN (10) 2024: 341-352 - [c33]Haonan Qiu, Zeyin Song, Yanqi Chen, Munan Ning, Wei Fang, Tao Sun, Zhengyu Ma, Li Yuan, Yonghong Tian:
Temporal Contrastive Learning for Spiking Neural Networks. ICANN (10) 2024: 422-436 - [c32]Liuzhenghao Lv, Wei Fang, Li Yuan, Yonghong Tian:
Optimal ANN-SNN Conversion with Group Neurons. ICASSP 2024: 6475-6479 - [c31]Xinhua Cheng, Tianyu Yang, Jianan Wang, Yu Li, Lei Zhang, Jian Zhang, Li Yuan:
Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts. ICLR 2024 - [c30]Bin Zhu, Bin Lin, Munan Ning, Yang Yan, Jiaxi Cui, Hongfa Wang, Yatian Pang, Wenhao Jiang, Junwu Zhang, Zongwei Li, Caiwan Zhang, Zhifeng Li, Wei Liu, Li Yuan:
LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment. ICLR 2024 - [c29]Shaodong Wang, Yunyang Ge, Liuhan Chen, Haiyang Zhou, Qian Wang, Xinhua Cheng, Li Yuan:
Prompt2Poster: Automatically Artistic Chinese Poster Creation from Prompt Only. ACM Multimedia 2024: 10716-10724 - [c28]Zhaokun Zhou, Yijie Lu, Yanhao Jia, Kaiwei Che, Jun Niu, Liwei Huang, Xinyu Shi, Yuesheng Zhu, Guoqi Li, Zhaofei Yu, Li Yuan:
Spiking Transformer with Experts Mixture. NeurIPS 2024 - [c27]Chenlin Zhou, Han Zhang, Zhaokun Zhou, Liutao Yu, Liwei Huang, Xiaopeng Fan, Li Yuan, Zhengyu Ma, Huihui Zhou, Yonghong Tian:
QKFormer: Hierarchical Spiking Transformer using Q-K Attention. NeurIPS 2024 - [i87]Zhaokun Zhou, Kaiwei Che, Wei Fang, Keyu Tian, Yuesheng Zhu, Shuicheng Yan, Yonghong Tian, Li Yuan:
Spikformer V2: Join the High Accuracy Club on ImageNet with an SNN Ticket. CoRR abs/2401.02020 (2024) - [i86]Hao Li, Da Long, Li Yuan, Yonghong Tian, Xinchang Wang, Fanyang Mo:
Deep peak property learning for efficient chiral molecules ECD spectra prediction. CoRR abs/2401.03403 (2024) - [i85]Bin Lin, Zhenyu Tang, Yang Ye, Jiaxi Cui, Bin Zhu, Peng Jin, Junwu Zhang, Munan Ning, Li Yuan:
MoE-LLaVA: Mixture of Experts for Large Vision-Language Models. CoRR abs/2401.15947 (2024) - [i84]Bin Zhu, Peng Jin, Munan Ning, Bin Lin, Jinfa Huang, Qi Song, Jiaxi Cui, Junwu Zhang, Zhenyu Tang, Mingjun Pan, Xing Zhou, Li Yuan:
LLMBind: A Unified Modality-Task Integration Framework. CoRR abs/2402.14891 (2024) - [i83]Liuzhenghao Lv, Zongying Lin, Hao Li, Yuyang Liu, Jiaxi Cui, Calvin Yu-Chian Chen, Li Yuan, Yonghong Tian:
ProLLaMA: A Protein Large Language Model for Multi-Task Protein Language Processing. CoRR abs/2402.16445 (2024) - [i82]Zongying Lin, Hao Li, Liuzhenghao Lv, Bin Lin, Junwu Zhang, Calvin Yu-Chian Chen, Li Yuan, Yonghong Tian:
TaxDiff: Taxonomic-Guided Diffusion Model for Protein Sequence Generation. CoRR abs/2402.17156 (2024) - [i81]Liuzhenghao Lv, Wei Fang, Li Yuan, Yonghong Tian:
Optimal ANN-SNN Conversion with Group Neurons. CoRR abs/2402.19061 (2024) - [i80]Yatian Pang, Tanghui Jia, Yujun Shi, Zhenyu Tang, Junwu Zhang, Xinhua Cheng, Xing Zhou, Francis E. H. Tay, Li Yuan:
Envision3D: One Image to 3D with Anchor Views Interpolation. CoRR abs/2403.08902 (2024) - [i79]Chenlin Zhou, Han Zhang, Zhaokun Zhou, Liutao Yu, Liwei Huang, Xiaopeng Fan, Li Yuan, Zhengyu Ma, Huihui Zhou, Yonghong Tian:
QKFormer: Hierarchical Spiking Transformer using Q-K Attention. CoRR abs/2403.16552 (2024) - [i78]Shenghai Yuan, Jinfa Huang, Yujun Shi, Yongqi Xu, Ruijie Zhu, Bin Lin, Xinhua Cheng, Li Yuan, Jiebo Luo:
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators. CoRR abs/2404.05014 (2024) - [i77]Yian Zhao, Kehan Li, Zesen Cheng, Pengchong Qiao, Xiawu Zheng, Rongrong Ji, Chang Liu, Li Yuan, Jie Chen:
GraCo: Granularity-Controllable Interactive Segmentation. CoRR abs/2405.00587 (2024) - [i76]Meng Cao, Haoran Tang, Jinfa Huang, Peng Jin, Can Zhang, Ruyang Liu, Long Chen, Xiaodan Liang, Li Yuan, Ge Li:
RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter. CoRR abs/2405.19465 (2024) - [i75]Wangbo Yu, Chaoran Feng, Jiye Tang, Xu Jia, Li Yuan, Yonghong Tian:
EvaGaussians: Event Stream Assisted Gaussian Splatting from Blurry Images. CoRR abs/2405.20224 (2024) - [i74]Lin Chen, Xilin Wei, Jinsong Li, Xiaoyi Dong, Pan Zhang, Yuhang Zang, Zehui Chen, Haodong Duan, Bin Lin, Zhenyu Tang, Li Yuan, Yu Qiao, Dahua Lin, Feng Zhao, Jiaqi Wang:
ShareGPT4Video: Improving Video Understanding and Generation with Better Captions. CoRR abs/2406.04325 (2024) - [i73]Zhongwei Wan, Ziang Wu, Che Liu, Jinfa Huang, Zhihong Zhu, Peng Jin, Longyue Wang, Li Yuan:
LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Inference. CoRR abs/2406.18139 (2024) - [i72]Shenghai Yuan, Jinfa Huang, Yongqi Xu, Yaoyang Liu, Shaofeng Zhang, Yujun Shi, Ruijie Zhu, Xinhua Cheng, Jiebo Luo, Li Yuan:
ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation. CoRR abs/2406.18522 (2024) - [i71]Peng Jin, Hao Li, Zesen Cheng, Kehan Li, Runyi Yu, Chang Liu, Xiangyang Ji, Li Yuan, Jie Chen:
Local Action-Guided Motion Diffusion Model for Text-to-Motion Generation. CoRR abs/2407.10528 (2024) - [i70]Haiyang Zhou, Xinhua Cheng, Wangbo Yu, Yonghong Tian, Li Yuan:
HoloDreamer: Holistic 3D Panoramic World Generation from Text Descriptions. CoRR abs/2407.15187 (2024) - [i69]Zhenyu Tang, Junwu Zhang, Xinhua Cheng, Wangbo Yu, Chaoran Feng, Yatian Pang, Bin Lin, Li Yuan:
Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle. CoRR abs/2407.19548 (2024) - [i68]Wangbo Yu, Jinbo Xing, Li Yuan, Wenbo Hu, Xiaoyu Li, Zhipeng Huang, Xiangjun Gao, Tien-Tsin Wong, Ying Shan, Yonghong Tian:
ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis. CoRR abs/2409.02048 (2024) - [i67]Zijun Chen, Yu Wang, Liuzhenghao Lv, Hao Li, Zongying Lin, Li Yuan, Yonghong Tian:
Multi-granularity Score-based Generative Framework Enables Efficient Inverse Design of Complex Organics. CoRR abs/2409.07912 (2024) - [i66]Peng Jin, Bo Zhu, Li Yuan, Shuicheng Yan:
MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts. CoRR abs/2410.07348 (2024) - [i65]Shuo Yang, Kun-Peng Ning, Yu-Yang Liu, Jia-Yu Yao, Yonghong Tian, Yi-Bing Song, Li Yuan:
Is Parameter Collision Hindering Continual Learning in LLMs? CoRR abs/2410.10179 (2024) - [i64]Peng Jin, Bo Zhu, Li Yuan, Shuicheng Yan:
MoH: Multi-Head Attention as Mixture-of-Head Attention. CoRR abs/2410.11842 (2024) - [i63]Kaiwei Che, Zhaokun Zhou, Li Yuan, Jianguo Zhang, Yonghong Tian, Luziwei Leng:
Spatial-Temporal Search for Spiking Neural Networks. CoRR abs/2410.18580 (2024) - [i62]Kaiwei Che, Wei Fang, Zhengyu Ma, Li Yuan, Timothée Masquelier, Yonghong Tian:
ETTFS: An Efficient Training Framework for Time-to-First-Spike Neuron. CoRR abs/2410.23619 (2024) - [i61]Kun-Peng Ning, Hai-Jian Ke, Yu-Yang Liu, Jia-Yu Yao, Yonghong Tian, Li Yuan:
Sparse Orthogonal Parameters Tuning for Continual Learning. CoRR abs/2411.02813 (2024) - [i60]Shenghai Yuan, Jinfa Huang, Xianyi He, Yunyuan Ge, Yujun Shi, Liuhan Chen, Jiebo Luo, Li Yuan:
Identity-Preserving Text-to-Video Generation by Frequency Decomposition. CoRR abs/2411.17440 (2024) - [i59]Zongjian Li, Bin Lin, Yang Ye, Liuhan Chen, Xinhua Cheng, Shenghai Yuan, Li Yuan:
WF-VAE: Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model. CoRR abs/2411.17459 (2024) - [i58]Bin Lin, Yunyang Ge, Xinhua Cheng, Zongjian Li, Bin Zhu, Shaodong Wang, Xianyi He, Yang Ye, Shenghai Yuan, Liuhan Chen, Tanghui Jia, Junwu Zhang, Zhenyu Tang, Yatian Pang, Bin She, Cen Yan, Zhiheng Hu, Xiaoyi Dong, Lin Chen, Zhang Pan, Xing Zhou, Shaoling Dong, Yonghong Tian, Li Yuan:
Open-Sora Plan: Open-Source Large Video Generation Model. CoRR abs/2412.00131 (2024) - [i57]Yatian Pang, Bin Zhu, Bin Lin, Mingzhe Zheng, Francis E. H. Tay, Ser-Nam Lim, Harry Yang, Li Yuan:
DreamDance: Animating Human Images by Enriching 3D Geometry Cues from 2D Poses. CoRR abs/2412.00397 (2024) - [i56]Yatian Pang, Peng Jin, Shuo Yang, Bin Lin, Bin Zhu, Zhenyu Tang, Liuhan Chen, Francis E. H. Tay, Ser-Nam Lim, Harry Yang, Li Yuan:
Next Patch Prediction for Autoregressive Visual Generation. CoRR abs/2412.15321 (2024) - [i55]Zhipeng Huang, Wangbo Yu, Xinhua Cheng, ChengShu Zhao, Yunyang Ge, Mingyi Guo, Li Yuan, Yonghong Tian:
RoomPainter: View-Integrated Diffusion for Consistent Indoor Scene Texturing. CoRR abs/2412.16778 (2024) - [i54]Liuzhenghao Lv, Hao Li, Yu Wang, Zhiyuan Yan, Zijun Chen, Zongying Lin, Li Yuan, Yonghong Tian:
Navigating Chemical-Linguistic Sharing Space with Heterogeneous Molecular Encoding. CoRR abs/2412.20888 (2024) - [i53]Peng Jin, Hao Li, Li Yuan, Shuicheng Yan, Jie Chen:
Hierarchical Banzhaf Interaction for General Video-Language Representation Learning. CoRR abs/2412.20964 (2024) - 2023
- [j7]Qibin Hou, Zihang Jiang, Li Yuan, Ming-Ming Cheng, Shuicheng Yan, Jiashi Feng:
Vision Permutator: A Permutable MLP-Like Architecture for Visual Recognition. IEEE Trans. Pattern Anal. Mach. Intell. 45(1): 1328-1334 (2023) - [j6]Li Yuan, Qibin Hou, Zihang Jiang, Jiashi Feng, Shuicheng Yan:
VOLO: Vision Outlooker for Visual Recognition. IEEE Trans. Pattern Anal. Mach. Intell. 45(5): 6575-6586 (2023) - [j5]Munan Ning, Donghuan Lu, Yujia Xie, Dongdong Chen, Dong Wei, Yefeng Zheng, Yonghong Tian, Shuicheng Yan, Li Yuan:
MADAv2: Advanced Multi-Anchor Based Active Domain Adaptation Segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 45(11): 13553-13566 (2023) - [j4]Ping Li, Jiachen Cao, Li Yuan, Qinghao Ye, Xianghua Xu:
Truncated attention-aware proposal networks with multi-scale dilation for temporal action detection. Pattern Recognit. 142: 109684 (2023) - [j3]Yatian Pang, Francis Eng Hock Tay, Li Yuan, Zhenghua Chen:
Masked Autoencoders for 3D Point Cloud Self-supervised Learning. World Sci. Annu. Rev. Artif. Intell. 1: 2440001:1-2440001:22 (2023) - [c26]Peng Jin, Jinfa Huang, Pengfei Xiong, Shangxuan Tian, Chang Liu, Xiangyang Ji, Li Yuan, Jie Chen:
Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning. CVPR 2023: 2472-2482 - [c25]Kehan Li, Zhennan Wang, Zesen Cheng, Runyi Yu, Yian Zhao, Guoli Song, Chang Liu, Li Yuan, Jie Chen:
ACSeg: Adaptive Conceptualization for Unsupervised Semantic Segmentation. CVPR 2023: 7162-7172 - [c24]Zesen Cheng, Pengchong Qiao, Kehan Li, Siheng Li, Pengxu Wei, Xiangyang Ji, Li Yuan, Chang Liu, Jie Chen:
Out-of-Candidate Rectification for Weakly Supervised Semantic Segmentation. CVPR 2023: 23673-23684 - [c23]Zeyin Song, Yifan Zhao, Yujun Shi, Peixi Peng, Li Yuan, Yonghong Tian:
Learning with Fantasy: Semantic-Aware Virtual Contrastive Constraint for Few-Shot Class-Incremental Learning. CVPR 2023: 24183-24192 - [c22]Kehan Li, Yian Zhao, Zhennan Wang, Zesen Cheng, Peng Jin, Xiangyang Ji, Li Yuan, Chang Liu, Jie Chen:
Multi-granularity Interaction Simulation for Unsupervised Interactive Segmentation. ICCV 2023: 666-676 - [c21]Peng Jin, Hao Li, Zesen Cheng, Kehan Li, Xiangyang Ji, Chang Liu, Li Yuan, Jie Chen:
DiffusionRet: Generative Text-Video Retrieval with Diffusion Model. ICCV 2023: 2470-2481 - [c20]Mingjian Ni, Guangyao Chen, Xiawu Zheng, Peixi Peng, Li Yuan, Yonghong Tian:
Learning Sparse Neural Networks with Identity Layers. ICIG (3) 2023: 91-102 - [c19]Zhaokun Zhou, Yuesheng Zhu, Chao He, Yaowei Wang, Shuicheng Yan, Yonghong Tian, Li Yuan:
Spikformer: When Spiking Neural Network Meets Transformer. ICLR 2023 - [c18]Peng Jin, Hao Li, Zesen Cheng, Jinfa Huang, Zhennan Wang, Li Yuan, Chang Liu, Jie Chen:
Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set Alignment. IJCAI 2023: 938-946 - [c17]Peng Jin, Yang Wu, Yanbo Fan, Zhongqian Sun, Wei Yang, Li Yuan:
Act As You Wish: Fine-Grained Control of Motion Diffusion Model with Hierarchical Semantic Graphs. NeurIPS 2023 - [c16]Man Yao, Jiakui Hu, Zhaokun Zhou, Li Yuan, Yonghong Tian, Bo Xu, Guoqi Li:
Spike-driven Transformer. NeurIPS 2023 - [i52]Munan Ning, Donghuan Lu, Yujia Xie, Dongdong Chen, Dong Wei, Yefeng Zheng, Yonghong Tian, Shuicheng Yan, Li Yuan:
MADAv2: Advanced Multi-Anchor Based Active Domain Adaptation Segmentation. CoRR abs/2301.07354 (2023) - [i51]Zesen Cheng, Kehan Li, Peng Jin, Xiangyang Ji, Li Yuan, Chang Liu, Jie Chen:
Parallel Vertex Diffusion for Unified Visual Grounding. CoRR abs/2303.07216 (2023) - [i50]Peng Jin, Hao Li, Zesen Cheng, Kehan Li, Xiangyang Ji, Chang Liu, Li Yuan, Jie Chen:
DiffusionRet: Generative Text-Video Retrieval with Diffusion Model. CoRR abs/2303.09867 (2023) - [i49]Kehan Li, Yian Zhao, Zhennan Wang, Zesen Cheng, Peng Jin, Xiangyang Ji, Li Yuan, Chang Liu, Jie Chen:
Multi-granularity Interaction Simulation for Unsupervised Interactive Segmentation. CoRR abs/2303.13399 (2023) - [i48]Peng Jin, Jinfa Huang, Pengfei Xiong, Shangxuan Tian, Chang Liu, Xiangyang Ji, Li Yuan, Jie Chen:
Video-Text as Game Players: Hierarchical Banzhaf Interaction for Cross-Modal Representation Learning. CoRR abs/2303.14369 (2023) - [i47]Zeyin Song, Yifan Zhao, Yujun Shi, Peixi Peng, Li Yuan, Yonghong Tian:
Learning with Fantasy: Semantic-Aware Virtual Contrastive Constraint for Few-Shot Class-Incremental Learning. CoRR abs/2304.00426 (2023) - [i46]Peng Jin, Hao Li, Zesen Cheng, Jinfa Huang, Zhennan Wang, Li Yuan, Chang Liu, Jie Chen:
Text-Video Retrieval with Disentangled Conceptualization and Set-to-Set Alignment. CoRR abs/2305.12218 (2023) - [i45]Munan Ning, Yujia Xie, Dongdong Chen, Zeyin Song, Lu Yuan, Yonghong Tian, Qixiang Ye, Li Yuan:
Album Storytelling with Iterative Story-aware Captioning and Large Language Models. CoRR abs/2305.12943 (2023) - [i44]Haonan Qiu, Zeyin Song, Yanqi Chen, Munan Ning, Wei Fang, Tao Sun, Zhengyu Ma, Li Yuan, Yonghong Tian:
Temporal Contrastive Learning for Spiking Neural Networks. CoRR abs/2305.13909 (2023) - [i43]Dongxu Yue, Qin Guo, Munan Ning, Jiaxi Cui, Yuesheng Zhu, Li Yuan:
ChatFace: Chat-Guided Real Face Editing via Diffusion Latent Space Manipulation. CoRR abs/2305.14742 (2023) - [i42]Kaiwei Che, Zhaokun Zhou, Zhengyu Ma, Wei Fang, Yanqi Chen, Shuaijie Shen, Li Yuan, Yonghong Tian:
Auto-Spikformer: Spikformer Architecture Search. CoRR abs/2306.00807 (2023) - [i41]Jiaxi Cui, Zongjian Li, Yang Yan, Bohua Chen, Li Yuan:
ChatLaw: Open-Source Legal Large Language Model with Integrated External Knowledge Bases. CoRR abs/2306.16092 (2023) - [i40]Man Yao, Jiakui Hu, Zhaokun Zhou, Li Yuan, Yonghong Tian, Bo Xu, Guoqi Li:
Spike-driven Transformer. CoRR abs/2307.01694 (2023) - [i39]Mingjian Ni, Guangyao Chen, Xiawu Zheng, Peixi Peng, Li Yuan, Yonghong Tian:
Learning Sparse Neural Networks with Identity Layers. CoRR abs/2307.07389 (2023) - [i38]Ping Li, Yu Zhang, Li Yuan, Huaxin Xiao, Binbin Lin, Xianghua Xu:
Efficient Long-Short Temporal Attention Network for Unsupervised Video Object Segmentation. CoRR abs/2309.11707 (2023) - [i37]Ping Li, Yu Zhang, Li Yuan, Xianghua Xu:
Fully Transformer-Equipped Architecture for End-to-End Referring Video Object Segmentation. CoRR abs/2309.11933 (2023) - [i36]Ping Li, Junjie Chen, Li Yuan, Xianghua Xu, Mingli Song:
Triple-View Knowledge Distillation for Semi-Supervised Semantic Segmentation. CoRR abs/2309.12557 (2023) - [i35]Ping Li, Yu Zhang, Li Yuan, Jian Zhao, Xianghua Xu, Xiaoqin Zhang:
Adversarial Attacks on Video Object Segmentation with Hard Region Discovery. CoRR abs/2309.13857 (2023) - [i34]Bin Zhu, Bin Lin, Munan Ning, Yang Yan, Jiaxi Cui, Hongfa Wang, Yatian Pang, Wenhao Jiang, Junwu Zhang, Zongwei Li, Wancai Zhang, Zhifeng Li, Wei Liu, Li Yuan:
LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment. CoRR abs/2310.01852 (2023) - [i33]Wangbo Yu, Li Yuan, Yan-Pei Cao, Xiangjun Gao, Xiaoyu Li, Long Quan, Ying Shan, Yonghong Tian:
HiFi-123: Towards High-fidelity One Image to 3D Content Generation. CoRR abs/2310.06744 (2023) - [i32]Xinhua Cheng, Tianyu Yang, Jianan Wang, Yu Li, Lei Zhang, Jian Zhang, Li Yuan:
Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts. CoRR abs/2310.11784 (2023) - [i31]Peng Jin, Yang Wu, Yanbo Fan, Zhongqian Sun, Yang Wei, Li Yuan:
Act As You Wish: Fine-Grained Control of Motion Diffusion Model with Hierarchical Semantic Graphs. CoRR abs/2311.01015 (2023) - [i30]Peng Jin, Ryuichi Takanobu, Caiwan Zhang, Xiaochun Cao, Li Yuan:
Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding. CoRR abs/2311.08046 (2023) - [i29]Bin Lin, Yang Ye, Bin Zhu, Jiaxi Cui, Munan Ning, Peng Jin, Li Yuan:
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection. CoRR abs/2311.10122 (2023) - [i28]Mingyue Guo, Li Yuan, Zhaoyi Yan, Binghui Chen, Yaowei Wang, Qixiang Ye:
Regressor-Segmenter Mutual Prompt Learning for Crowd Counting. CoRR abs/2312.01711 (2023) - [i27]Hao Li, Curise Jia, Peng Jin, Zesen Cheng, Kehan Li, Jialu Sui, Chang Liu, Li Yuan:
FreestyleRet: Retrieving Images from Style-Diversified Queries. CoRR abs/2312.02428 (2023) - [i26]Jiaxi Cui, Liuzhenghao Lv, Jing Wen, Rongsheng Wang, Jing Tang, Yonghong Tian, Li Yuan:
Machine Mindset: An MBTI Exploration of Large Language Models. CoRR abs/2312.12999 (2023) - [i25]Junwu Zhang, Zhenyu Tang, Yatian Pang, Xinhua Cheng, Peng Jin, Yida Wei, Munan Ning, Li Yuan:
Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting. CoRR abs/2312.13271 (2023) - 2022
- [c15]Jiawang Bai, Li Yuan, Shu-Tao Xia, Shuicheng Yan, Zhifeng Li, Wei Liu:
Improving Vision Transformers by Revisiting High-Frequency Components. ECCV (24) 2022: 1-18 - [c14]Kehan Li, Runyi Yu, Zhennan Wang, Li Yuan, Guoli Song, Jie Chen:
Locality Guidance for Improving Vision Transformers on Tiny Datasets. ECCV (24) 2022: 110-127 - [c13]