


default search action
Shanghang Zhang
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2025
- [j17]Tianqi Liu
, Yanjun Qin
, Shanghang Zhang
, Xiaoming Tao
:
Empowering Corner Case Detection in Autonomous Vehicles With Multimodal Large Language Models. IEEE Signal Process. Lett. 32: 51-55 (2025) - 2024
- [j16]Zhenghong Wang, Sijie Ruan, Tianqiang Huang, Haoyi Zhou, Shanghang Zhang, Yi Wang, Leye Wang, Zhou Huang, Yu Liu:
A lightweight multi-layer perceptron for efficient multivariate time series forecasting. Knowl. Based Syst. 288: 111463 (2024) - [j15]Ji Ma
, Hongming Dai, Yao Mu
, Pengying Wu
, Hao Wang, Xiaowei Chi, Yang Fei
, Shanghang Zhang
, Chang Liu
:
DOZE: A Dataset for Open-Vocabulary Zero-Shot Object Navigation in Dynamic Environments. IEEE Robotics Autom. Lett. 9(9): 7389-7396 (2024) - [j14]Jianlin Xie
, Guanqun Wang
, Yin Zhuang
, Can Li
, Tong Zhang
, He Chen
, Liang Chen
, Shanghang Zhang
:
DECOR: Dynamic Decoupling and Multiobjective Optimization for Long-Tailed Remote Sensing Image Classification. IEEE Trans. Geosci. Remote. Sens. 62: 1-17 (2024) - [j13]Xingqun Qi
, Zhuojie Wu
, Wenxuan Zou
, Min Ren
, Yifan Gao
, Muyi Sun
, Shanghang Zhang
, Caifeng Shan
, Zhenan Sun
:
Exploring Generalizable Distillation for Efficient Medical Image Segmentation. IEEE J. Biomed. Health Informatics 28(7): 4170-4183 (2024) - [j12]Jianing Li
, Ming Lu, Jiaming Liu, Yandong Guo
, Yuan Du
, Li Du
, Shanghang Zhang
:
BEV-LGKD: A Unified LiDAR-Guided Knowledge Distillation Framework for Multi-View BEV 3D Object Detection. IEEE Trans. Intell. Veh. 9(1): 2489-2498 (2024) - [c95]Hongzhi Gao, Zheng Chen, Zehui Chen, Lin Chen, Jiaming Liu, Shanghang Zhang, Feng Zhao:
Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection. AAAI 2024: 1797-1805 - [c94]Senqiao Yang, Jiarui Wu, Jiaming Liu, Xiaoqi Li, Qizhe Zhang, Mingjie Pan, Yulu Gan, Zehui Chen, Shanghang Zhang:
Exploring Sparse Visual Prompt for Domain Adaptive Dense Prediction. AAAI 2024: 16334-16342 - [c93]Dongmei Zhang, Chang Li, Renrui Zhang, Shenghao Xie, Wei Xue, Xiaodong Xie, Shanghang Zhang:
FM-OV3D: Foundation Model-Based Cross-Modal Knowledge Blending for Open-Vocabulary 3D Detection. AAAI 2024: 16723-16731 - [c92]Rongyu Zhang, Yulin Luo, Jiaming Liu, Huanrui Yang, Zhen Dong, Denis A. Gudovskiy, Tomoyuki Okuno, Yohei Nakata, Kurt Keutzer, Yuan Du, Shanghang Zhang:
Efficient Deweahter Mixture-of-Experts with Uncertainty-Aware Feature-Wise Linear Modulation. AAAI 2024: 16812-16820 - [c91]Junyi Yao, Yijiang Liu, Zhen Dong, Mingfei Guo, Helan Hu, Kurt Keutzer, Li Du, Daquan Zhou, Shanghang Zhang:
PromptCoT: Align Prompt Distribution via Adapted Chain-of-Thought. CVPR 2024: 7027-7037 - [c90]Xingqun Qi, Jiahao Pan, Peng Li, Ruibin Yuan, Xiaowei Chi, Mengfei Li, Wenhan Luo, Wei Xue, Shanghang Zhang, Qifeng Liu, Yike Guo:
Weakly-Supervised Emotion Transition Learning for Diverse 3D Co-Speech Gesture Generation. CVPR 2024: 10424-10434 - [c89]Guanqun Wang, Jiaming Liu, Chenxuan Li, Yuan Zhang, Junpeng Ma, Xinyu Wei, Kevin Zhang, Maurice Chong, Renrui Zhang, Yijiang Liu, Shanghang Zhang:
Cloud-Device Collaborative Learning for Multimodal Large Language Models. CVPR 2024: 12646-12655 - [c88]Yuan Zhang, Tao Huang, Jiaming Liu, Tao Jiang, Kuan Cheng, Shanghang Zhang:
FreeKD: Knowledge Distillation via Semantic Frequency Prompt. CVPR 2024: 15931-15940 - [c87]Xiaobao Wei, Renrui Zhang, Jiarui Wu, Jiaming Liu, Ming Lu, Yandong Guo, Shanghang Zhang:
NTO3D: Neural Target Object 3D Reconstruction with Segment Anything. CVPR 2024: 20352-20362 - [c86]Zhi Zhang, Qizhe Zhang, Zijun Gao, Renrui Zhang, Ekaterina Shutova, Shiji Zhou, Shanghang Zhang:
Gradient-based Parameter Selection for Efficient Fine-Tuning. CVPR 2024: 28566-28577 - [c85]Jiaming Liu, Ran Xu, Senqiao Yang, Renrui Zhang, Qizhe Zhang, Zehui Chen, Yandong Guo, Shanghang Zhang:
Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation. CVPR 2024: 28653-28663 - [c84]Xiaobao Wei
, Jiajun Cao, Yizhu Jin, Ming Lu, Guangyu Wang, Shanghang Zhang:
I-MedSAM: Implicit Medical Image Segmentation with Segment Anything. ECCV (10) 2024: 90-107 - [c83]Yulin Luo, Ruichuan An, Bocheng Zou, Yiming Tang, Jiaming Liu, Shanghang Zhang:
LLM as Dataset Analyst: Subpopulation Structure Discovery with Large Language Model. ECCV (33) 2024: 235-252 - [c82]Xinyan Chen, Jiaxin Ge, Tianjun Zhang, Jiaming Liu, Shanghang Zhang:
Learning from Mistakes: Iterative Prompt Relabeling for Text-to-Image Diffusion Model Training. EMNLP (Findings) 2024: 2937-2952 - [c81]Shitian Zhao, Renrui Zhang, Xu Luo, Yan Wang, Shanghang Zhang, Peng Gao:
Unleashing the Potentials of Likelihood Composition for Multi-modal Language Models. EMNLP (Findings) 2024: 10152-10163 - [c80]Chi-Min Chan, Weize Chen, Yusheng Su, Jianxuan Yu, Wei Xue, Shanghang Zhang, Jie Fu, Zhiyuan Liu:
ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate. ICLR 2024 - [c79]Jiaming Liu, Senqiao Yang, Peidong Jia, Renrui Zhang, Ming Lu, Yandong Guo, Wei Xue, Shanghang Zhang:
ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation. ICLR 2024 - [c78]Rui Ma, Mengxi Guo, Peidong Jia, Chenxuan Li, Yi Hou, Yuan Li, Xiaodong Xie, Shanghang Zhang:
Enhanced Blind Watermarking Against Black-Box Noise: Leveraging CIN Framework. ICME 2024: 1-6 - [c77]Dongmei Zhang, Ray Zhang, Fan Yang, Yuan Li, Huizhu Jia, Xiaodong Xie, Shanghang Zhang:
VLUReID: Exploiting Vision-Language Knowledge for Unsupervised Person Re-Identification. ICME 2024: 1-6 - [c76]Yijia Zhang, Lingran Zhao, Shijie Cao, Sicheng Zhang, Wenqiang Wang, Ting Cao, Fan Yang, Mao Yang, Shanghang Zhang, Ningyi Xu:
Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models. ICME 2024: 1-6 - [c75]Anthony Chen, Huanrui Yang, Yulu Gan, Denis A. Gudovskiy, Zhen Dong, Haofan Wang, Tomoyuki Okuno, Yohei Nakata, Kurt Keutzer, Shanghang Zhang:
Split-Ensemble: Efficient OOD-aware Ensemble via Task and Model Splitting. ICML 2024 - [c74]Pengying Wu, Yao Mu, Bingxian Wu, Yi Hou, Ji Ma, Shanghang Zhang, Chang Liu:
VoroNav: Voronoi-based Zero-shot Object Navigation with Large Language Model. ICML 2024 - [c73]Yixiong Zou, Shanghang Zhang, Haichen Zhou, Yuhua Li, Ruixuan Li:
Compositional Few-Shot Class-Incremental Learning. ICML 2024 - [c72]Jiayi Ni, Senqiao Yang, Ran Xu, Jiaming Liu, Xiaoqi Li, Wenyu Jiao, Zehui Chen, Yi Liu, Shanghang Zhang:
Distribution-Aware Continual Test-Time Adaptation for Semantic Segmentation. ICRA 2024: 3044-3050 - [c71]Jiaming Liu, Qizhe Zhang, Xiaoqi Li, Jianing Li
, Guanqun Wang, Ming Lu, Tiejun Huang, Shanghang Zhang:
Unsupervised Spike Depth Estimation via Cross-modality Cross-domain Knowledge Transfer. ICRA 2024: 9109-9116 - [c70]Jiaming Liu, Rongyu Zhang, Xiaoqi Li, Xiaowei Chi, Zehui Chen, Ming Lu, Yandong Guo, Shanghang Zhang:
BEVUDA: Multi-geometric Space Alignments for Domain Adaptive BEV 3D Object Detection. ICRA 2024: 9487-9494 - [c69]Mingjie Pan, Jiaming Liu, Renrui Zhang, Peixiang Huang, Xiaoqi Li, Hongwei Xie, Bing Wang, Li Liu, Shanghang Zhang:
RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision. ICRA 2024: 12404-12411 - [c68]Rongyu Zhang
, Zefan Cai
, Huanrui Yang
, Zidong Liu
, Denis A. Gudovskiy
, Tomoyuki Okuno
, Yohei Nakata
, Kurt Keutzer
, Baobao Chang
, Yuan Du
, Li Du
, Shanghang Zhang
:
VeCAF: Vision-language Collaborative Active Finetuning with Training Objective Awareness. ACM Multimedia 2024: 5451-5459 - [c67]Peng Li, Yuan Liu, Xiaoxiao Long, Feihu Zhang, Cheng Lin, Mengfei Li, Xingqun Qi, Shanghang Zhang, Wei Xue, Wenhan Luo, Ping Tan, Wenping Wang, Qifeng Liu, Yike Guo:
Era3D: High-Resolution Multiview Diffusion using Efficient Row-wise Attention. NeurIPS 2024 - [c66]Jiaming Liu, Mengzhen Liu, Zhenyu Wang, Pengju An, Xiaoqi Li, Kaichen Zhou, Senqiao Yang, Renrui Zhang, Yandong Guo, Shanghang Zhang:
RoboMamba: Efficient Vision-Language-Action Model for Robotic Reasoning and Manipulation. NeurIPS 2024 - [c65]Shiyao Li, Xuefei Ning, Shanghang Zhang, Lidong Guo, Tianchen Zhao, Huazhong Yang, Yu Wang:
TCP: Triplet Contrastive-relationship Preserving for Class-Incremental Learning. WACV 2024: 2020-2029 - [i131]Pengying Wu, Yao Mu, Bingxian Wu, Yi Hou, Ji Ma, Shanghang Zhang, Chang Liu:
VoroNav: Voronoi-based Zero-shot Object Navigation with Large Language Model. CoRR abs/2401.02695 (2024) - [i130]Mengfei Li, Ming Lu, Xiaofang Li, Shanghang Zhang:
RustNeRF: Robust Neural Radiance Field with Low-Quality Images. CoRR abs/2401.03257 (2024) - [i129]Rongyu Zhang, Zefan Cai, Huanrui Yang, Zidong Liu, Denis A. Gudovskiy, Tomoyuki Okuno, Yohei Nakata, Kurt Keutzer, Baobao Chang, Yuan Du, Li Du, Shanghang Zhang:
VeCAF: VLM-empowered Collaborative Active Finetuning with Training Objective Awareness. CoRR abs/2401.07853 (2024) - [i128]Jianing Li, Xi Nan, Ming Lu, Li Du, Shanghang Zhang:
Proximity QA: Unleashing the Power of Multi-Modal Large Language Models for Spatial Proximity Analysis. CoRR abs/2401.17862 (2024) - [i127]Tianyu Chen, Haoyi Zhou
, Ying Li, Hao Wang, Chonghan Gao, Shanghang Zhang, Jianxin Li:
Building Flexible Machine Learning Models for Scientific Computing at Scale. CoRR abs/2402.16014 (2024) - [i126]Zehui Chen, Qiuchen Wang, Zhenyu Li, Jiaming Liu, Shanghang Zhang, Feng Zhao:
A Vanilla Multi-Task Framework for Dense Visual Prediction Solution to 1st VCL Challenge - Multi-Task Robustness Track. CoRR abs/2402.17319 (2024) - [i125]Ji Ma, Hongming Dai, Yao Mu, Pengying Wu, Hao Wang, Xiaowei Chi, Yang Fei, Shanghang Zhang, Chang Liu:
DOZE: A Dataset for Open-Vocabulary Zero-Shot Object Navigation in Dynamic Environments. CoRR abs/2402.19007 (2024) - [i124]Yueru Jia, Yuhui Yuan, Aosong Cheng, Chuke Wang, Ji Li, Huizhu Jia, Shanghang Zhang:
DesignEdit: Multi-Layered Latent Decomposition and Fusion for Unified & Accurate Image Editing. CoRR abs/2403.14487 (2024) - [i123]Hongzhi Gao, Zheng Chen, Zehui Chen, Lin Chen, Jiaming Liu, Shanghang Zhang, Feng Zhao:
Point-DETR3D: Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection. CoRR abs/2403.15317 (2024) - [i122]Weifeng Lin, Xinyu Wei, Ruichuan An, Peng Gao, Bocheng Zou
, Yulin Luo, Siyuan Huang, Shanghang Zhang, Hongsheng Li:
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want. CoRR abs/2403.20271 (2024) - [i121]Gaole Dai, Zhenyu Wang, Qinwen Xu, Ming Lu, Wen Chen, Boxin Shi, Shanghang Zhang, Tie-Jun Huang:
SpikeNVS: Enhancing Novel View Synthesis from Blurry Images via Spike Camera. CoRR abs/2404.06710 (2024) - [i120]Yiwen Tang, Jiaming Liu, Dong Wang, Zhigang Wang, Shanghang Zhang, Bin Zhao, Xuelong Li:
Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding. CoRR abs/2404.07989 (2024) - [i119]Yijiang Liu, Rongyu Zhang, Huanrui Yang, Kurt Keutzer, Yuan Du, Li Du, Shanghang Zhang:
Intuition-aware Mixture-of-Rank-1-Experts for Parameter Efficient Finetuning. CoRR abs/2404.08985 (2024) - [i118]Yulin Luo, Ruichuan An, Bocheng Zou, Yiming Tang, Jiaming Liu, Shanghang Zhang:
LLM as Dataset Analyst: Subpopulation Structure Discovery with Large Language Model. CoRR abs/2405.02363 (2024) - [i117]Peng Li, Yuan Liu, Xiaoxiao Long, Feihu Zhang, Cheng Lin, Mengfei Li, Xingqun Qi, Shanghang Zhang, Wenhan Luo
, Ping Tan, Wenping Wang, Qifeng Liu, Yike Guo:
Era3D: High-Resolution Multiview Diffusion using Efficient Row-wise Attention. CoRR abs/2405.11616 (2024) - [i116]Yuan Zhang, Fei Xiao, Tao Huang, Chun-Kai Fan, Hongyuan Dong, Jiawen Li, Jiacong Wang, Kuan Cheng, Shanghang Zhang, Haoyuan Guo:
Unveiling the Tapestry of Consistency in Large Vision-Language Models. CoRR abs/2405.14156 (2024) - [i115]Rongyu Zhang, Aosong Cheng, Yulin Luo, Gaole Dai, Huanrui Yang, Jiaming Liu, Ran Xu, Li Du, Yuan Du, Yanbing Jiang, Shanghang Zhang:
Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation. CoRR abs/2405.16486 (2024) - [i114]Xingqun Qi, Hengyuan Zhang, Yatian Wang, Jiahao Pan, Chen Liu, Peng Li, Xiaowei Chi, Mengfei Li, Qixun Zhang, Wei Xue, Shanghang Zhang, Wenhan Luo
, Qifeng Liu, Yike Guo:
CoCoGesture: Toward Coherent Co-speech 3D Gesture Generation in the Wild. CoRR abs/2405.16874 (2024) - [i113]Yixiong Zou, Shanghang Zhang, Haichen Zhou, Yuhua Li, Ruixuan Li:
Compositional Few-Shot Class-Incremental Learning. CoRR abs/2405.17022 (2024) - [i112]Jiaming Liu, Chenxuan Li, Guanqun Wang, Lily Lee, Kaichen Zhou, Sixiang Chen, Chuyan Xiong, Jiaxin Ge, Renrui Zhang, Shanghang Zhang:
Self-Corrected Multimodal Large Language Model for End-to-End Robot Manipulation. CoRR abs/2405.17418 (2024) - [i111]Gaole Dai, Cheng-Ching Tseng, Qingpo Wuwu, Rongyu Zhang, Shaokang Wang, Ming Lu, Tiejun Huang, Yu Zhou, Ali Ata Tuz, Matthias Gunzer, Jianxu Chen, Shanghang Zhang:
Implicit Neural Image Field for Biological Microscopy Image Compression. CoRR abs/2405.19012 (2024) - [i110]Nan Huang, Xiaobao Wei, Wenzhao Zheng, Pengju An, Ming Lu, Wei Zhan, Masayoshi Tomizuka, Kurt Keutzer, Shanghang Zhang:
S3Gaussian: Self-Supervised Street Gaussians for Autonomous Driving. CoRR abs/2405.20323 (2024) - [i109]Jiaming Liu, Mengzhen Liu, Zhenyu Wang, Lily Lee, Kaichen Zhou, Pengju An, Senqiao Yang, Renrui Zhang, Yandong Guo, Shanghang Zhang:
RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation. CoRR abs/2406.04339 (2024) - [i108]Guanqun Wang, Xinyu Wei, Jiaming Liu, Ray Zhang, Yichi Zhang, Kevin Zhang, Maurice Chong, Shanghang Zhang:
MR-MLLM: Mutual Reinforcement of Multimodal Comprehension and Vision Perception. CoRR abs/2406.15768 (2024) - [i107]Huanrui Yang, Yafeng Huang, Zhen Dong, Denis A. Gudovskiy, Tomoyuki Okuno, Yohei Nakata, Yuan Du, Kurt Keutzer, Shanghang Zhang:
Fisher-aware Quantization for DETR Detectors with Critical-category Objectives. CoRR abs/2407.03442 (2024) - [i106]Renrui Zhang, Xinyu Wei, Dongzhi Jiang, Yichi Zhang, Ziyu Guo, Chengzhuo Tong, Jiaming Liu, Aojun Zhou, Bin Wei, Shanghang Zhang, Peng Gao, Hongsheng Li:
MAVIS: Mathematical Visual Instruction Tuning. CoRR abs/2407.08739 (2024) - [i105]Shanghang Zhang, Gaole Dai, Tie-Jun Huang, Jianxu Chen:
Multimodal Large Language Models for Bioimage Analysis. CoRR abs/2407.19778 (2024) - [i104]Xiaowei Chi, Yatian Wang, Aosong Cheng, Pengjun Fang, Zeyue Tian, Yingqing He, Zhaoyang Liu, Xingqun Qi, Jiahao Pan, Rongyu Zhang, Mengfei Li, Ruibin Yuan, Yanbing Jiang, Wei Xue, Wenhan Luo, Qifeng Chen, Shanghang Zhang, Qifeng Liu, Yike Guo:
MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions. CoRR abs/2407.20962 (2024) - [i103]Zhongyu Zhao, Menghang Dong, Rongyu Zhang, Wenzhao Zheng, Yunpeng Zhang, Huanrui Yang, Dalong Du, Kurt Keutzer, Shanghang Zhang:
FactorLLM: Factorizing Knowledge via Mixture of Experts for Large Language Models. CoRR abs/2408.11855 (2024) - [i102]Gaole Dai, Yiming Tang, Chunkai Fan, Qizhe Zhang, Zhi Zhang, Yulu Gan, Chengqing Zeng, Shanghang Zhang, Tiejun Huang:
Discovering Long-Term Effects on Parameter Efficient Fine-tuning. CoRR abs/2409.06706 (2024) - [i101]Xiaohong Liu, Guoxing Yang, Yulin Luo, Jiaji Mao, Xiang Zhang, Ming Gao, Shanghang Zhang, Jun Shen, Guangyu Wang:
Expert-level vision-language foundation model for real-world radiology and comprehensive evaluation. CoRR abs/2409.16183 (2024) - [i100]Shitian Zhao, Renrui Zhang, Xu Luo, Yan Wang, Shanghang Zhang, Peng Gao:
Unleashing the Potentials of Likelihood Composition for Multi-modal Language Models. CoRR abs/2410.00363 (2024) - [i99]Yuan Zhang, Chun-Kai Fan, Junpeng Ma, Wenzhao Zheng, Tao Huang, Kuan Cheng, Denis A. Gudovskiy, Tomoyuki Okuno, Yohei Nakata, Kurt Keutzer, Shanghang Zhang:
SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference. CoRR abs/2410.04417 (2024) - [i98]Xiaowei Chi, Hengyuan Zhang, Chun-Kai Fan, Xingqun Qi, Rongyu Zhang, Anthony Chen, Chi-Min Chan, Wei Xue, Wenhan Luo, Shanghang Zhang, Yike Guo:
EVA: An Embodied World Model for Future Video Anticipation. CoRR abs/2410.15461 (2024) - [i97]Shenghao Xie, Wenqiang Zu, Mingyang Zhao, Duo Su, Shilong Liu, Ruohua Shi, Guoqi Li, Shanghang Zhang, Lei Ma:
Towards Unifying Understanding and Generation in the Era of Vision Foundation Models: A Survey from the Autoregression Perspective. CoRR abs/2410.22217 (2024) - [i96]Bowen Liu, Haoyang Li, Shuning Wang, Shuo Nie, Shanghang Zhang:
Subgraph Aggregation for Out-of-Distribution Generalization on Graphs. CoRR abs/2410.22228 (2024) - [i95]Anthony Chen, Jianjin Xu, Wenzhao Zheng, Gaole Dai, Yida Wang, Renrui Zhang, Haofan Wang, Shanghang Zhang:
Training-free Regional Prompting for Diffusion Transformers. CoRR abs/2411.02395 (2024) - [i94]Xinyang Huang, Chuang Zhu, Bowen Zhang, Shanghang Zhang:
Learning from Different Samples: A Source-free Framework for Semi-supervised Domain Adaptation. CoRR abs/2411.06665 (2024) - [i93]Ruichuan An, Sihan Yang, Ming Lu, Kai Zeng, Yulin Luo, Ying Chen, Jiajun Cao, Hao Liang, Qi She, Shanghang Zhang, Wentao Zhang:
MC-LLaVA: Multi-Concept Personalized Vision-Language Model. CoRR abs/2411.11706 (2024) - [i92]Xiaobao Wei, Qingpo Wuwu, Zhongyu Zhao, Zhuangzhe Wu, Nan Huang, Ming Lu, Ningning MA, Shanghang Zhang:
EMD: Explicit Motion Modeling for High-Quality Street Gaussian Splatting. CoRR abs/2411.15582 (2024) - [i91]Zhi Zhang, Jiayi Shen, Congfeng Cao, Gaole Dai, Shiji Zhou, Qizhe Zhang, Shanghang Zhang, Ekaterina Shutova:
Proactive Gradient Conflict Mitigation in Multi-Task Learning: A Sparse Training Perspective. CoRR abs/2411.18615 (2024) - [i90]Yueru Jia, Jiaming Liu, Sixiang Chen, Chenyang Gu, Zhilue Wang, Longzan Luo, Lily Lee, Pengwei Wang, Zhongyuan Wang, Renrui Zhang, Shanghang Zhang:
Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation. CoRR abs/2411.18623 (2024) - [i89]Qizhe Zhang, Aosong Cheng, Ming Lu, Zhiyong Zhuo, Minqi Wang, Jiajun Cao, Shaobo Guo, Qi She, Shanghang Zhang:
[CLS] Attention is All You Need for Training-Free Visual Token Pruning: Make VLM Inference Faster. CoRR abs/2412.01818 (2024) - [i88]Lening Wang, Wenzhao Zheng, Dalong Du, Yunpeng Zhang, Yilong Ren, Han Jiang, Zhiyong Cui, Haiyang Yu, Jie Zhou, Jiwen Lu, Shanghang Zhang:
Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model. CoRR abs/2412.05280 (2024) - [i87]Yuming Li, Peidong Jia, Daiwei Hong, Yueru Jia, Qi She, Rui Zhao, Ming Lu, Shanghang Zhang:
ASGDiffusion: Parallel High-Resolution Generation with Asynchronous Structure Guidance. CoRR abs/2412.06163 (2024) - [i86]Zixun Xie, Sicheng Zuo, Wenzhao Zheng, Yunpeng Zhang, Dalong Du, Jie Zhou, Jiwen Lu, Shanghang Zhang:
GPD-1: Generative Pre-training for Driving. CoRR abs/2412.08643 (2024) - [i85]Wenzhao Zheng, Junjie Wu, Yao Zheng, Sicheng Zuo, Zixun Xie, Longchao Yang, Yong Pan, Zhihui Hao, Peng Jia, Xianpeng Lang, Shanghang Zhang:
GaussianAD: Gaussian-Centric End-to-End Autonomous Driving. CoRR abs/2412.10371 (2024) - [i84]Kun Wu, Chengkai Hou, Jiaming Liu, Zhengping Che, Xiaozhu Ju, Zhuqin Yang, Meng Li, Yinuo Zhao, Zhiyuan Xu, Guang Yang, Zhen Zhao, Guangyu Li, Zhao Jin, Lecheng Wang, Jilei Mao, Xinhua Wang, Shichao Fan, Ning Liu, Pei Ren, Qiang Zhang, Yaoxu Lyu, Mengzhen Liu, Jingyang He, Yulin Luo, Zeyu Gao, Chenxuan Li, Chenyang Gu, Yankai Fu, Di Wu, Xingyu Wang, Sixiang Chen, Zhenyu Wang, Pengju An, Siyuan Qian, Shanghang Zhang, Jian Tang:
RoboMIND: Benchmark on Multi-embodiment Intelligence Normative Data for Robot Manipulation. CoRR abs/2412.13877 (2024) - [i83]Kuangzhi Ge, Lingjun Chen, Kevin Zhang, Yulin Luo, Tianyu Shi, Liaoyuan Fan, Xiang Li, Guanqun Wang, Shanghang Zhang:
SCBench: A Sports Commentary Benchmark for Video LLMs. CoRR abs/2412.17637 (2024) - [i82]Jianxu Chen, Florian Jug, Susanne M. Rafelski, Shanghang Zhang:
The Emerging Issues in Bioimaging AI Publications and Research (Dagstuhl Seminar 24042). Dagstuhl Reports 14(1): 90-107 (2024) - 2023
- [j11]Haoyi Zhou
, Jianxin Li, Shanghang Zhang, Shuai Zhang
, Mengyi Yan
, Hui Xiong:
Expanding the prediction capacity in long sequence time-series forecasting. Artif. Intell. 318: 103886 (2023) - [j10]Guanqun Wang
, He Chen, Liang Chen, Yin Zhuang, Shanghang Zhang, Tong Zhang
, Hao Dong, Peng Gao:
P2FEViT: Plug-and-Play CNN Feature Embedded Hybrid Vision Transformer for Remote Sensing Image Classification. Remote. Sens. 15(7): 1773 (2023) - [j9]Atabak Dehban
, Shanghang Zhang, Nino Cauli
, Lorenzo Jamone
, José Santos-Victor
:
Learning Deep Features for Robotic Inference From Physical Interactions. IEEE Trans. Cogn. Dev. Syst. 15(3): 985-999 (2023) - [j8]Yi Hou
, Shanghang Zhang, Rui Ma, Huizhu Jia
, Xiaodong Xie
:
Frame-Recurrent Video Crowd Counting. IEEE Trans. Circuits Syst. Video Technol. 33(9): 5186-5199 (2023) - [j7]Shiji Zhou
, Zhi Wang
, Chenghao Hu, Yinan Mao, Haopeng Yan, Shanghang Zhang
, Chuan Wu
, Wenwu Zhu
:
Caching in Dynamic Environments: A Near-Optimal Online Learning Approach. IEEE Trans. Multim. 25: 792-804 (2023) - [c64]Yuheng Lu, Chenfeng Xu, Xiaobao Wei, Xiaodong Xie, Masayoshi Tomizuka, Kurt Keutzer, Shanghang Zhang:
Open-Vocabulary Point-Cloud Object Detection without 3D Annotation. CVPR 2023: 1190-1199 - [c63]Anthony Chen, Kevin Zhang, Renrui Zhang, Zihan Wang, Yuheng Lu, Yandong Guo, Shanghang Zhang:
PiMAE: Point Cloud and Image Interactive Masked Autoencoders for 3D Object Detection. CVPR 2023: 5291-5301 - [c62]Lianzhe Wang, Shiji Zhou, Shanghang Zhang, Xu Chu, Heng Chang, Wenwu Zhu:
Improving Generalization of Meta-Learning with Inverted Regularization at Inner-Level. CVPR 2023: 7826-7835 - [c61]Yuqing Ma, Hainan Li, Zhange Zhang, Jinyang Guo, Shanghang Zhang, Ruihao Gong
, Xianglong Liu:
Annealing-based Label-Transfer Learning for Open World Object Detection. CVPR 2023: 11454-11463 - [c60]Yulu Gan, Mingjie Pan, Rongyu Zhang, Zijian Ling, Lingran Zhao, Jiaming Liu, Shanghang Zhang:
Cloud-Device Collaborative Adaptation to Continual Changing Environments in the Real-World. CVPR 2023: 12157-12166 - [c59]Xiaowei Chi, Jiaming Liu, Ming Lu, Rongyu Zhang, Zhaoqing Wang, Yandong Guo, Shanghang Zhang:
BEV-SAN: Accurate BEV 3D Object Detection via Slice Attention Networks. CVPR 2023: 17461-17470 - [c58]Jianyang Gu, Kai Wang, Hao Luo, Chen Chen, Wei Jiang, Yuqiang Fang, Shanghang Zhang, Yang You, Jian Zhao:
MSINet: Twins Contrastive Search of Multi-Scale Interaction for Object ReID. CVPR 2023: 19243-19253 - [c57]Yijiang Liu
, Huanrui Yang, Zhen Dong, Kurt Keutzer, Li Du, Shanghang Zhang:
NoisyQuant: Noisy Bias-Enhanced Post-Training Activation Quantization for Vision Transformers. CVPR 2023: 20321-20330 - [c56]Lirui Xiao, Huanrui Yang, Zhen Dong, Kurt Keutzer, Li Du, Shanghang Zhang:
CSQ: Growing Mixed-Precision Quantization Scheme with Bi-level Continuous Sparsification. DAC 2023: 1-6 - [c55]Mingrui He, Tianyu Chen, Haoyi Zhou
, Shanghang Zhang, Jianxin Li:
BadRes: Reveal the Backdoors Through Residual Connection. ICASSP 2023: 1-5 - [c54]Xiangyang Zhu, Renrui Zhang, Bowei He
, Ziyu Guo, Ziyao Zeng, Zipeng Qin, Shanghang Zhang, Peng Gao:
PointCLIP V2: Prompting CLIP and GPT for Powerful 3D Open-world Learning. ICCV 2023: 2639-2650 - [c53]Yifan Zhang, Zhen Dong, Huanrui Yang, Ming Lu, Cheng-Ching Tseng, Yuan Du, Kurt Keutzer, Li Du, Shanghang Zhang:
QD-BEV : Quantization-aware View-guided Distillation for Multi-view 3D Object Detection. ICCV 2023: 3802-3812 - [c52]Xiuyu Li, Yijiang Liu
, Long Lian, Huanrui Yang, Zhen Dong, Daniel Kang, Shanghang Zhang, Kurt Keutzer:
Q-Diffusion: Quantizing Diffusion Models. ICCV 2023: 17489-17499 - [c51]Xu Chu, Yujie Jin, Xin Wang, Shanghang Zhang, Yasha Wang, Wenwu Zhu, Hong Mei:
Wasserstein Barycenter Matching for Graph Size Generalization of Message Passing Neural Networks. ICML 2023: 6158-6184 - [c50]Can Li, He Chen, Yin Zhuang, Shanghang Zhang:
Uncertainty-Aware Dynamic Learning for Cross-Domain Few-Shot Scene Classification from Remote Sensing Imagery. IGARSS 2023: 5778-5781 - [c49]Tianqi Liu, Shanghang Zhang, Yanjun Qin, Xiaoming Tao:
A Text Prompt-Based Approach for Zero-Shot Corner Case Object Detection in Autonomous Driving. ITSC 2023: 3241-3246 - [c48]