


Yan Lu 0001
Person information
- affiliation: Microsoft Research Asia, Beijing, China
- affiliation (PhD 2003): Harbin Institute of Technology, China
Other persons with the same name
- Yan Lu — disambiguation page
- Yan Lu 0002 — University of Macau, State Key Laboratory of Analog and Mixed-Signal VLSI, Macau (and 1 more)
- Yan Lu 0003 — Xidian University, China (and 1 more)
- Yan Lu 0005 — Zhejiang University, Hangzhou, Zhejiang Province, People's Republic of China
- Yan Lu 0006 — New York University, NY, USA
- Yan Lu 0007 — Old Dominion University, Norfolk, VA, USA
2020 – today
- 2025
- [j40]Yan Lu, Xinzhu Ma, Lei Yang, Tianzhu Zhang, Yating Liu, Qi Chu, Tong He, Yonghui Li, Wanli Ouyang:
GUPNet++: Geometry Uncertainty Propagation Network for Monocular 3D Object Detection. IEEE Trans. Pattern Anal. Mach. Intell. 47(2): 900-915 (2025)
- 2024
- [j39]Jin Gao, Yan Lu, Xiaojuan Qi, Yutong Kou, Bing Li, Liang Li, Shan Yu, Weiming Hu:
Recursive Least-Squares Estimator-Aided Online Learning for Visual Tracking. IEEE Trans. Pattern Anal. Mach. Intell. 46(3): 1881-1897 (2024) - [j38]Zhiwei Zhao, Bin Liu, Yan Lu, Qi Chu, Nenghai Yu, Chang Wen Chen:
Joint Identity-Aware Mixstyle and Graph-Enhanced Prototype for Clothes-Changing Person Re-Identification. IEEE Trans. Multim. 26: 3457-3468 (2024) - [j37]Wufei Ma, Jiahao Li, Bin Li, Yan Lu:
Uncertainty-Aware Deep Video Compression With Ensembles. IEEE Trans. Multim. 26: 7863-7872 (2024) - [j36]Jing Zhao, Bin Li, Jiahao Li, Ruiqin Xiong, Yan Lu:
A Universal Optimization Framework for Learning-based Image Codec. ACM Trans. Multim. Comput. Commun. Appl. 20(1): 16:1-16:19 (2024) - [j35]Zhaoyang Jia, Yan Lu, Houqiang Li:
Exploring Neighbor Correspondence Matching for Multiple-hypotheses Video Frame Synthesis. ACM Trans. Multim. Comput. Commun. Appl. 20(4): 111:1-111:20 (2024) - [c137]Yaqi Zhang, Di Huang, Bin Liu, Shixiang Tang, Yan Lu, Lu Chen, Lei Bai, Qi Chu, Nenghai Yu, Wanli Ouyang:
MotionGPT: Finetuned LLMs Are General-Purpose Motion Generators. AAAI 2024: 7368-7376 - [c136]Zhiwei Zhao, Bin Liu, Yan Lu, Qi Chu, Nenghai Yu:
Unifying Multi-Modal Uncertainty Modeling and Semantic Alignment for Text-to-Image Person Re-identification. AAAI 2024: 7534-7542 - [c135]Xiang Li, Jinglu Wang, Xiaohao Xu, Xiulian Peng, Rita Singh, Yan Lu, Bhiksha Raj:
QDFormer: Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition. CVPR 2024: 3402-3413 - [c134]Zhaoyang Jia, Jiahao Li, Bin Li, Houqiang Li, Yan Lu:
Generative Latent Coding for Ultra-Low Bitrate Image Compression. CVPR 2024: 26088-26098 - [c133]Tianci Bi, Xiaoyi Zhang, Zhizheng Zhang, Wenxuan Xie, Cuiling Lan, Yan Lu, Nanning Zheng:
Text Grouping Adapter: Adapting Pre-Trained Text Detector for Layout Analysis. CVPR 2024: 28150-28159 - [c132]Xin Kang, Lei Chu, Jiahao Li, Xuejin Chen, Yan Lu:
Hierarchical Intra-Modal Correlation Learning for Label-Free 3D Semantic Segmentation. CVPR 2024: 28244-28253 - [c131]Linfeng Qi, Zhaoyang Jia, Jiahao Li, Bin Li, Houqiang Li, Yan Lu:
Long-Term Temporal Context Gathering for Neural Video Compression. ECCV (66) 2024: 305-322 - [c130]Huaying Xue, Xiulian Peng, Yan Lu:
Low-Latency Speech Enhancement via Speech Token Generation. ICASSP 2024: 661-665 - [c129]Ganlin Yang, Guoqiang Wei, Zhizheng Zhang, Yan Lu, Dong Liu:
Mask-Based Modeling for Neural Radiance Fields. ICLR 2024 - [c128]Jingwen Fu, Tao Yang, Yuwang Wang, Yan Lu, Nanning Zheng:
Breaking through the learning plateaus of in-context learning in Transformer. ICML 2024 - [c127]Zhijun Jia, Huaying Xue, Xiulian Peng, Yan Lu:
Convert and Speak: Zero-shot Accent Conversion with Minimum Supervision. ACM Multimedia 2024: 4446-4454 - [c126]Jiaqi Xu, Cuiling Lan, Wenxuan Xie, Xuejin Chen, Yan Lu:
Slot-VLM: Object-Event Slots for Video-Language Modeling. NeurIPS 2024 - [i85]Yifei Xin, Xiulian Peng, Yan Lu:
Masked Audio Modeling with CLAP and Multi-Objective Learning. CoRR abs/2401.15953 (2024) - [i84]Tao Yang, Cuiling Lan, Yan Lu, Nanning Zheng:
Diffusion Model with Cross Attention as an Inductive Bias for Disentanglement. CoRR abs/2402.09712 (2024) - [i83]Jiaqi Xu, Cuiling Lan, Wenxuan Xie, Xuejin Chen, Yan Lu:
Slot-VLM: SlowFast Slots for Video-Language Modeling. CoRR abs/2402.13088 (2024) - [i82]Zhipeng Huang, Zhizheng Zhang, Zheng-Jun Zha, Yan Lu, Baining Guo:
RelationVLM: Making Large Vision-Language Models Understand Visual Relations. CoRR abs/2403.12801 (2024) - [i81]Tianci Bi, Xiaoyi Zhang, Zhizheng Zhang, Wenxuan Xie, Cuiling Lan, Yan Lu, Nanning Zheng:
Text Grouping Adapter: Adapting Pre-trained Text Detector for Layout Analysis. CoRR abs/2405.07481 (2024) - [i80]Jingwen Fu, Zhizheng Zhang, Yan Lu, Nanning Zheng:
A General Theory for Compositional Generalization. CoRR abs/2405.11743 (2024) - [i79]Zhijun Jia, Huaying Xue, Xiulian Peng, Yan Lu:
Convert and Speak: Zero-shot Accent Conversion with Minimum Supervision. CoRR abs/2408.10096 (2024) - [i78]Xingrui Wang, Cuiling Lan, Hanxin Zhu, Zhibo Chen, Yan Lu:
GSemSplat: Generalizable Semantic 3D Gaussian Splatting from Uncalibrated Image Pairs. CoRR abs/2412.16932 (2024)
- 2023
- [j34]Yan Lu, Siwei Kou, Xiaopeng Wang:
Micro-Doppler Effect and Sparse Representation Analysis of Underwater Targets. Sensors 23(19): 8066 (2023) - [j33]Xue Jiang, Xiulian Peng, Huaying Xue, Yuan Zhang, Yan Lu:
Latent-Domain Predictive Neural Speech Coding. IEEE ACM Trans. Audio Speech Lang. Process. 31: 2111-2123 (2023) - [j32]Jinyang Huang, Bin Liu, Chenglin Miao, Yan Lu, Qijia Zheng, Yu Wu, Jiancun Liu, Lu Su, Chang Wen Chen:
PhaseAnti: An Anti-Interference WiFi-Based Activity Recognition System Using Interference-Independent Phase Component. IEEE Trans. Mob. Comput. 22(5): 2938-2954 (2023) - [j31]Xihua Sheng, Jiahao Li, Bin Li, Li Li, Dong Liu, Yan Lu:
Temporal Context Mining for Learned Video Compression. IEEE Trans. Multim. 25: 7311-7322 (2023) - [j30]Xiang Li, Jinglu Wang, Xiao Li, Yan Lu:
Video Instance Segmentation by Instance Flow Assembly. IEEE Trans. Multim. 25: 7469-7479 (2023) - [c125]Guoqiang Wei, Zhizheng Zhang, Cuiling Lan, Yan Lu, Zhibo Chen:
Active Token Mixer. AAAI 2023: 2759-2767 - [c124]Mude Hui, Zhizheng Zhang, Xiaoyi Zhang, Wenxuan Xie, Yuwang Wang, Yan Lu:
Unifying Layout Generation with a Decoupled Diffusion Model. CVPR 2023: 1942-1951 - [c123]Kun Yan, Xiao Li, Fangyun Wei, Jinglu Wang, Chenbin Zhang, Ping Wang, Yan Lu:
Two-shot Video Object Segmentation. CVPR 2023: 2257-2267 - [c122]Yuchen Ren, Zhendong Mao, Shancheng Fang, Yan Lu, Tong He, Hao Du, Yongdong Zhang, Wanli Ouyang:
Crossing the Gap: Domain Generalization for Image Captioning. CVPR 2023: 2871-2880 - [c121]Yue Gao, Yuan Zhou, Jinglu Wang, Xiao Li, Xiang Ming, Yan Lu:
High-Fidelity and Freely Controllable Talking Head Video Generation. CVPR 2023: 5609-5619 - [c120]Linfeng Qi, Jiahao Li, Bin Li, Houqiang Li, Yan Lu:
Motion Information Propagation for Neural Video Compression. CVPR 2023: 6111-6120 - [c119]Shiqi Lin, Zhizheng Zhang, Zhipeng Huang, Yan Lu, Cuiling Lan, Peng Chu, Quanzeng You, Jiang Wang, Zicheng Liu, Amey Parulkar, Viraj Navkal, Zhibo Chen:
Deep Frequency Filtering for Domain Generalization. CVPR 2023: 11797-11807 - [c118]Mingfang Zhang, Jinglu Wang, Xiao Li, Yifei Huang, Yoichi Sato, Yan Lu:
Structural Multiplane Image: Bridging Neural View Synthesis and 3D Reconstruction. CVPR 2023: 16707-16716 - [c117]Jiahao Li, Bin Li, Yan Lu:
Neural Video Compression with Diverse Contexts. CVPR 2023: 22616-22626 - [c116]Xue Jiang, Xiulian Peng, Yuan Zhang, Yan Lu:
Disentangled Feature Learning for Real-Time Neural Speech Coding. ICASSP 2023: 1-5 - [c115]Shuo Wang, Xiangyu Kong, Xiulian Peng, Hesam Movassagh, Vinod Prakash, Yan Lu:
Dasformer: Deep Alternating Spectrogram Transformer For Multi/Single-Channel Speech Separation. ICASSP 2023: 1-5 - [c114]Yifei Xin, Xiulian Peng, Yan Lu:
Improving Speech Enhancement via Event-Based Query. ICASSP 2023: 1-5 - [c113]Huaying Xue, Xiulian Peng, Yan Lu:
Contrast-PLC: Contrastive Learning for Packet Loss Concealment. ICASSP 2023: 1-5 - [c112]Yaqi Zhang, Yan Lu, Bin Liu, Zhiwei Zhao, Qi Chu, Nenghai Yu:
Evopose: A Recursive Transformer for 3D Human Pose Estimation with Kinematic Structure Priors. ICASSP 2023: 1-5 - [c111]Chengyu Zheng, Yuan Zhou, Xiulian Peng, Yuan Zhang, Yan Lu:
Real-Time Speech Enhancement with Dynamic Attention Span. ICASSP 2023: 1-5 - [c110]Zhipeng Huang, Zhizheng Zhang, Cuiling Lan, Zheng-Jun Zha, Yan Lu, Baining Guo:
Adaptive Frequency Filters As Efficient Global Token Mixers. ICCV 2023: 6026-6036 - [c109]Yushuang Wu, Xiao Li, Jinglu Wang, Xiaoguang Han, Shuguang Cui, Yan Lu:
Efficient View Synthesis with Neural Radiance Distribution Field. ICCV 2023: 18460-18469 - [c108]Xiang Li, Jinglu Wang, Xiaohao Xu, Xiao Li, Bhiksha Raj, Yan Lu:
Robust Referring Video Object Segmentation with Cyclic Structural Consensus. ICCV 2023: 22179-22188 - [c107]Zongyu Guo, Cuiling Lan, Zhizheng Zhang, Yan Lu, Zhibo Chen:
Versatile Neural Processes for Learning Implicit Neural Representations. ICLR 2023 - [c106]Guo-Hua Wang, Jiahao Li, Bin Li, Yan Lu:
EVC: Towards Real-Time Neural Image Compression with Mask Decay. ICLR 2023 - [c105]Yixin Wan, Yuan Zhou, Xiulian Peng, Kai-Wei Chang, Yan Lu:
ABC-KD: Attention-Based-Compression Knowledge Distillation for Deep Learning-Based Noise Suppression. INTERSPEECH 2023: 2528-2532 - [c104]Yifei Xin, Xiulian Peng, Yan Lu:
Masked Audio Modeling with CLAP and Multi-Objective Learning. INTERSPEECH 2023: 2763-2767 - [c103]Cong Huang, Jiahao Li, Lei Chu, Dong Liu, Yan Lu:
Disentangle Propagation and Restoration for Efficient Video Recovery. ACM Multimedia 2023: 8336-8345 - [c102]Jingwen Fu, Zhizheng Zhang, Dacheng Yin, Yan Lu, Nanning Zheng:
Learning Trajectories are Generalization Indicators. NeurIPS 2023 - [c101]Tao Yang, Yuwang Wang, Yan Lu, Nanning Zheng:
DisDiff: Unsupervised Disentanglement of Diffusion Probabilistic Models. NeurIPS 2023 - [i77]Zongyu Guo, Cuiling Lan, Zhizheng Zhang, Yan Lu, Zhibo Chen:
Versatile Neural Processes for Learning Implicit Neural Representations. CoRR abs/2301.08883 (2023) - [i76]Guo-Hua Wang, Jiahao Li, Bin Li, Yan Lu:
EVC: Towards Real-Time Neural Image Compression with Mask Decay. CoRR abs/2302.05071 (2023) - [i75]Chengyu Zheng, Yuan Zhou, Xiulian Peng, Yuan Zhang, Yan Lu:
Real-time speech enhancement with dynamic attention span. CoRR abs/2302.10377 (2023) - [i74]Shuo Wang, Xiangyu Kong, Xiulian Peng, Hesam Movassagh, Vinod Prakash, Yan Lu:
DasFormer: Deep Alternating Spectrogram Transformer for Multi/Single-Channel Speech Separation. CoRR abs/2302.10657 (2023) - [i73]Yifei Xin, Xiulian Peng, Yan Lu:
Improving Speech Enhancement via Event-based Query. CoRR abs/2302.11558 (2023) - [i72]Chengyu Zheng, Yuan Zhou, Xiulian Peng, Yuan Zhang, Yan Lu:
Time-Variance Aware Real-Time Speech Enhancement. CoRR abs/2302.13063 (2023) - [i71]Huaying Xue, Xiulian Peng, Yan Lu:
Contrast-PLC: Contrastive Learning for Packet Loss Concealment. CoRR abs/2302.13284 (2023) - [i70]Jiahao Li, Bin Li, Yan Lu:
Neural Video Compression with Diverse Contexts. CoRR abs/2302.14402 (2023) - [i69]Mude Hui, Zhizheng Zhang, Xiaoyi Zhang, Wenxuan Xie, Yuwang Wang, Yan Lu:
Unifying Layout Generation with a Decoupled Diffusion Model. CoRR abs/2303.05049 (2023) - [i68]Mingfang Zhang, Jinglu Wang, Xiao Li, Yifei Huang, Yoichi Sato, Yan Lu:
Structural Multiplane Image: Bridging Neural View Synthesis and 3D Reconstruction. CoRR abs/2303.05937 (2023) - [i67]Kun Yan, Xiao Li, Fangyun Wei, Jinglu Wang, Chenbin Zhang, Ping Wang, Yan Lu:
Two-shot Video Object Segmentation. CoRR abs/2303.12078 (2023) - [i66]Ganlin Yang, Guoqiang Wei, Zhizheng Zhang, Yan Lu, Dong Liu:
MRVM-NeRF: Mask-Based Pretraining for Neural Radiance Fields. CoRR abs/2304.04962 (2023) - [i65]Yue Gao, Yuan Zhou, Jinglu Wang, Xiao Li, Xiang Ming, Yan Lu:
High-Fidelity and Freely Controllable Talking Head Video Generation. CoRR abs/2304.10168 (2023) - [i64]Jingwen Fu, Zhizheng Zhang, Dacheng Yin, Yan Lu, Nanning Zheng:
Learning Trajectories are Generalization Indicators. CoRR abs/2304.12579 (2023) - [i63]Xulin Li, Yan Lu, Bin Liu, Yuenan Hou, Yating Liu, Qi Chu, Wanli Ouyang, Nenghai Yu:
Clothes-Invariant Feature Learning by Causal Intervention for Clothes-Changing Person Re-identification. CoRR abs/2305.06145 (2023) - [i62]Yixin Wan, Yuan Zhou, Xiulian Peng, Kai-Wei Chang, Yan Lu:
ABC-KD: Attention-Based-Compression Knowledge Distillation for Deep Learning-Based Noise Suppression. CoRR abs/2305.16665 (2023) - [i61]Tao Yang, Yuwang Wang, Cuiling Lan, Yan Lu, Nanning Zheng:
Vector-based Representation is the Key: A Study on Disentanglement and Compositional Generalization. CoRR abs/2305.18063 (2023) - [i60]Zhizheng Zhang, Xiaoyi Zhang, Wenxuan Xie, Yan Lu:
Responsible Task Automation: Empowering Large Language Models as Responsible Task Automators. CoRR abs/2306.01242 (2023) - [i59]Yaqi Zhang, Yan Lu, Bin Liu, Zhiwei Zhao, Qi Chu, Nenghai Yu:
EVOPOSE: A Recursive Transformer For 3D Human Pose Estimation With Kinematic Structure Priors. CoRR abs/2306.09615 (2023) - [i58]Yaqi Zhang, Di Huang, Bin Liu, Shixiang Tang, Yan Lu, Lu Chen, Lei Bai, Qi Chu, Nenghai Yu, Wanli Ouyang:
MotionGPT: Finetuned LLMs are General-Purpose Motion Generators. CoRR abs/2306.10900 (2023) - [i57]Zhipeng Huang, Zhizheng Zhang, Cuiling Lan, Zheng-Jun Zha, Yan Lu, Baining Guo:
Adaptive Frequency Filters As Efficient Global Token Mixers. CoRR abs/2307.14008 (2023) - [i56]Yushuang Wu, Xiao Li, Jinglu Wang, Xiaoguang Han, Shuguang Cui, Yan Lu:
Efficient View Synthesis with Neural Radiance Distribution Field. CoRR abs/2308.11130 (2023) - [i55]Jingwen Fu, Tao Yang, Yuwang Wang, Yan Lu, Nanning Zheng:
How does representation impact in-context learning: A exploration on a synthetic task. CoRR abs/2309.06054 (2023) - [i54]Xiang Li, Jinglu Wang, Xiaohao Xu, Xiulian Peng, Rita Singh, Yan Lu, Bhiksha Raj:
Rethinking Audiovisual Segmentation with Semantic Quantization and Decomposition. CoRR abs/2310.00132 (2023) - [i53]Zhizheng Zhang, Wenxuan Xie, Xiaoyi Zhang, Yan Lu:
Reinforced UI Instruction Grounding: Towards a Generic UI Task Automation API. CoRR abs/2310.04716 (2023) - [i52]Huaying Xue, Xiulian Peng, Yan Lu:
Low-latency Speech Enhancement via Speech Token Generation. CoRR abs/2310.08981 (2023) - [i51]Yan Lu, Xinzhu Ma, Lei Yang, Tianzhu Zhang, Yating Liu, Qi Chu, Tong He, Yonghui Li, Wanli Ouyang:
GUPNet++: Geometry Uncertainty Propagation Network for Monocular 3D Object Detection. CoRR abs/2310.15624 (2023) - [i50]Jiaqi Xu, Cuiling Lan, Wenxuan Xie, Xuejin Chen, Yan Lu:
Retrieval-based Video Language Model for Efficient Long Video Question Answering. CoRR abs/2312.04931 (2023)
- 2022
- [j29]Zengyi Qin, Jinglu Wang, Yan Lu:
MonoGRNet: A General Framework for Monocular 3D Object Detection. IEEE Trans. Pattern Anal. Mach. Intell. 44(9): 5170-5184 (2022) - [j28]Chengyu Zheng, Yuan Zhou, Xiulian Peng, Yuan Zhang, Yan Lu:
Time-Variance Aware Dynamic Kernel Generation for Real-Time Acoustic Echo Cancellation. IEEE Signal Process. Lett. 29: 967-971 (2022) - [c100]Xiang Li, Jinglu Wang, Xiao Li, Yan Lu:
Hybrid Instance-Aware Temporal Fusion for Online Video Instance Segmentation. AAAI 2022: 1429-1437 - [c99]Xiaohao Xu, Jinglu Wang, Xiao Li, Yan Lu:
Reliable Propagation-Correction Modulation for Video Object Segmentation. AAAI 2022: 2946-2954 - [c98]Cong Huang, Jiahao Li, Bin Li, Dong Liu, Yan Lu:
Neural Compression-Based Feature Learning for Video Restoration. CVPR 2022: 5862-5871 - [c97]Yizhou Zhao, Xun Guo, Yan Lu:
Semantic-aligned Fusion Transformer for One-shot Object Detection. CVPR 2022: 7591-7601 - [c96]Haoqing Wang, Xun Guo, Zhi-Hong Deng, Yan Lu:
Rethinking Minimal Sufficient Representation in Contrastive Learning. CVPR 2022: 16020-16029 - [c95]Gusi Te, Xiu Li, Xiao Li, Jinglu Wang, Wei Hu, Yan Lu:
Neural Capture of Animatable 3D Human from Monocular Video. ECCV (6) 2022: 275-291 - [c94]Xulin Li, Yan Lu, Bin Liu, Yating Liu, Guojun Yin, Qi Chu, Jinyang Huang, Feng Zhu, Rui Zhao, Nenghai Yu:
Counterfactual Intervention Feature Transfer for Visible-Infrared Person Re-identification. ECCV (26) 2022: 381-398 - [c93]Xue Jiang, Xiulian Peng, Chengyu Zheng, Huaying Xue, Yuan Zhang, Yan Lu:
End-to-End Neural Speech Coding for Real-Time Communications. ICASSP 2022: 866-870 - [c92]Xiaoyu Wang, Xiangyu Kong, Xiulian Peng, Yan Lu:
Multi-Modal Multi-Correlation Learning for Audio-Visual Speech Separation. INTERSPEECH 2022: 886-890 - [c91]Huaying Xue, Xiulian Peng, Xue Jiang, Yan Lu:
Towards Error-Resilient Neural Speech Coding. INTERSPEECH 2022: 4217-4221 - [c90]Xue Jiang, Xiulian Peng, Huaying Xue, Yuan Zhang, Yan Lu:
Cross-Scale Vector Quantization for Scalable Neural Speech Coding. INTERSPEECH 2022: 4222-4226 - [c89]Jiahao Li, Bin Li, Yan Lu:
Hybrid Spatial-Temporal Entropy Modelling for Neural Video Compression. ACM Multimedia 2022: 1503-1511 - [c88]Xiaohao Xu, Jinglu Wang, Xiang Ming, Yan Lu:
Towards Robust Video Object Segmentation with Adaptive Object Calibration. ACM Multimedia 2022: 2709-2718 - [c87]Zhaoyang Jia, Yan Lu, Houqiang Li:
Neighbor Correspondence Matching for Flow-based Video Frame Synthesis. ACM Multimedia 2022: 5389-5397 - [c86]Tao Yang, Yuwang Wang, Yan Lu, Nanning Zheng:
Visual Concepts Tokenization. NeurIPS 2022 - [c85]Tao Yu, Zhizheng Zhang, Cuiling Lan, Yan Lu, Zhibo Chen:
Mask-based Latent Reconstruction for Reinforcement Learning. NeurIPS 2022 - [c84]Yizhou Zhao, Zhenyang Li, Xun Guo, Yan Lu:
Alignment-guided Temporal Attention for Video Action Recognition. NeurIPS 2022 - [c83]Hanlei Yu, Bin Liu, Yan Lu, Qi Chu, Nenghai Yu:
Multi-view Geometry Distillation for Cloth-Changing Person ReID. PRCV (1) 2022: 29-41 - [c82]Xulin Li, Bin Liu, Yan Lu, Qi Chu, Nenghai Yu:
Cloth-Aware Center Cluster Loss for Cloth-Changing Person Re-identification. PRCV (1) 2022: 527-539 - [i49]Xue Jiang, Xiulian Peng, Chengyu Zheng, Huaying Xue, Yuan Zhang, Yan Lu:
End-to-End Neural Audio Coding for Real-Time Communications. CoRR abs/2201.09429 (2022) - [i48]Tao Yu, Zhizheng Zhang, Cuiling Lan, Zhibo Chen, Yan Lu:
Mask-based Latent Reconstruction for Reinforcement Learning. CoRR abs/2201.12096 (2022) - [i47]Guoqiang Wei, Zhizheng Zhang, Cuiling Lan, Yan Lu, Zhibo Chen:
ActiveMLP: An MLP-like Architecture with Active Token Mixer. CoRR abs/2203.06108 (2022) - [i46]Haoqing Wang, Xun Guo, Zhi-Hong Deng, Yan Lu:
Rethinking Minimal Sufficient Representation in Contrastive Learning. CoRR abs/2203.07004 (2022) - [i45]Yizhou Zhao, Xun Guo, Yan Lu:
Semantic-aligned Fusion Transformer for One-shot Object Detection. CoRR abs/2203.09093 (2022) - [i44]Cong Huang, Jiahao Li, Bin Li, Dong Liu, Yan Lu:
Neural Compression-Based Feature Learning for Video Restoration. CoRR abs/2203.09208 (2022) - [i43]Shiqi Lin, Zhizheng Zhang, Zhipeng Huang, Yan Lu, Cuiling Lan, Peng Chu, Quanzeng You, Jiang Wang, Zicheng Liu, Amey Parulkar, Viraj Navkal, Zhibo Chen:
Deep Frequency Filtering for Domain Generalization. CoRR abs/2203.12198 (2022) - [i42]Tao Yang, Yuwang Wang, Yan Lu, Nanning Zheng:
Visual Concepts Tokenization. CoRR abs/2205.10093 (2022) - [i41]Tao Yang, Shenglong Zhou, Yuwang Wang, Yan Lu, Nanning Zheng:
Test-time Batch Normalization. CoRR abs/2205.10210 (2022) - [i40]Xiaohao Xu, Jinglu Wang, Xiang Ming, Yan Lu:
Towards Robust Video Object Segmentation with Adaptive Object Calibration. CoRR abs/2207.00887 (2022) - [i39]Huaying Xue, Xiulian Peng, Xue Jiang, Yan Lu:
Towards Error-Resilient Neural Speech Coding. CoRR abs/2207.00993 (2022) - [i38]Xiaoyu Wang, Xiangyu Kong, Xiulian Peng, Yan Lu:
Multi-Modal Multi-Correlation Learning for Audio-Visual Speech Separation. CoRR abs/2207.01197 (2022) - [i37]Xiang Li, Jinglu Wang, Xiaohao Xu, Xiao Li, Yan Lu, Bhiksha Raj:
R^2VOS: Robust Referring Video Object Segmentation via Relational Multimodal Cycle Consistency. CoRR abs/2207.01203 (2022)