Shanghang Zhang

2020 – today

2026
[j27]
  - https://dblp.org/rec/journals/bspc/ZhangQZT26
Wenqi Zhang, Yanjun Qin, Shanghang Zhang, Xiaoming Tao:
How EEG-based cross-subject driving emotion is recognized: A multi-source transfer manifold learning model. Biomed. Signal Process. Control. 112: 108454 (2026)
[j26]
  - https://dblp.org/rec/journals/inffus/QiZWPLSXZHLG26
Xingqun Qi, Hengyuan Zhang, Yatian Wang, Jiahao Pan, Chen Liu, Muyi Sun, Wei Xue, Shanghang Zhang, Sirui Han, Qifeng Liu, Yike Guo:
CoCoGesture: Towards coherent co-speech 3D gesture generation in the wild. Inf. Fusion 126: 103613 (2026)
[c130]
  - https://dblp.org/rec/conf/aaai/ZhangDZHCDDWDZ26
Rongyu Zhang, Menghang Dong, Yuan Zhang, Liang Heng, Xiaowei Chi, Gaole Dai, Li Du, Dan Wang, Yuan Du, Shanghang Zhang:
MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation. AAAI 2026: 18764-18772
[c129]
  - https://dblp.org/rec/conf/aaai/MaZLWZSZ26
Junpeng Ma, Qizhe Zhang, Ming Lu, Zhibin Wang, Qiang Zhou, Jun Song, Shanghang Zhang:
MMG-Vid: Maximizing Marginal Gains at Segment-level and Token-level for Efficient Video LLMs. AAAI 2026: 24253-24261
[i224]
  - https://dblp.org/rec/journals/corr/abs-2601-01618
Huajie Tan, Peterson Co, Yijie Xu, Shanyu Rong, Yuheng Ji, Cheng Chi, Xiansheng Chen, Qiongyu Zhang, Zhongxia Zhao, Pengwei Wang, Zhongyuan Wang, Shanghang Zhang:
Action-Sketcher: From Reasoning to Action via Visual Sketches for Long-Horizon Robotic Manipulation. CoRR abs/2601.01618 (2026)
[i223]
  - https://dblp.org/rec/journals/corr/abs-2601-04137
Chun-Kai Fan, Xiaowei Chi, Xiaozhu Ju, Hao Li, Yong Bao, Yu-Kai Wang, Lizhang Chen, Zhiyuan Jiang, Kuangzhi Ge, Ying Li, Weishi Mi, Qingpo Wuwu, Peidong Jia, Yulin Luo, Kevin Zhang, Zhiyuan Qin, Yong Dai, Sirui Han, Yike Guo, Shanghang Zhang, Jian Tang:
Wow, wo, val! A Comprehensive Embodied World Model Evaluation Turing Test. CoRR abs/2601.04137 (2026)
[i222]
  - https://dblp.org/rec/journals/corr/abs-2601-05248
Zhuoyang Liu, Jiaming Liu, Hao Chen, Jiale Yu, Ziyu Guo, Chengkai Hou, Chenyang Gu, Xiangju Mi, Renrui Zhang, Kun Wu, Zhengping Che, Jian Tang, Pheng-Ann Heng, Shanghang Zhang:
LaST₀: Latent Spatio-Temporal Chain-of-Thought for Robotic Vision-Language-Action Model. CoRR abs/2601.05248 (2026)
[i221]
  - https://dblp.org/rec/journals/corr/abs-2601-14352
Huajie Tan, Enshen Zhou, Zhiyu Li, Yijie Xu, Yuheng Ji, Xiansheng Chen, Cheng Chi, Pengwei Wang, Huizhu Jia, Yulong Ao, Mingyu Cao, Sixiang Chen, Zhe Li, Mengzhen Liu, Zixiao Wang, Shanyu Rong, Yaoxu Lyu, Zhongxia Zhao, Peterson Co, Yibo Li, Yi Han, Shaoxuan Xie, Guocai Yao, Songjing Wang, Leiduo Zhang, Xi Yang, Yance Jiao, Donghai Shi, Kunchang Xie, Shaokai Nie, Chunlei Men, Yonghua Lin, Zhongyuan Wang, Tiejun Huang, Shanghang Zhang:
RoboBrain 2.5: Depth in Sight, Time in Mind. CoRR abs/2601.14352 (2026)
[i220]
  - https://dblp.org/rec/journals/corr/abs-2601-16007
Chak-Wing Mak, Guanyu Zhu, Boyi Zhang, Hongji Li, Xiaowei Chi, Kevin Zhang, Yichen Wu, Yangfan He, Chun-Kai Fan, Wentao Lu, Kuangzhi Ge, Xinyu Fang, Hongyang He, Kuan Lu, Tianxiang Xu, Li Zhang, Yongxin Ni, Youhua Li, Shanghang Zhang:
PhysicsMind: Sim and Real Mechanics Benchmarking for Physical Reasoning and Prediction in Foundational VLMs and World Models. CoRR abs/2601.16007 (2026)
[i219]
  - https://dblp.org/rec/journals/corr/abs-2601-18323
Weishi Mi, Yong Bao, Xiaowei Chi, Xiaozhu Ju, Zhiyuan Qin, Kuangzhi Ge, Kai Tang, Peidong Jia, Shanghang Zhang, Jian Tang:
TC-IDM: Grounding Video Generation for Executable Zero-shot Robot Motion. CoRR abs/2601.18323 (2026)
[i218]
  - https://dblp.org/rec/journals/corr/abs-2601-21570
Zixing Lei, Genjia Liu, Yuanshuo Zhang, Qipeng Liu, Chuan Wen, Shanghang Zhang, Wenzhao Lian, Siheng Chen:
EmboCoach-Bench: Benchmarking AI Agents on Developing Embodied Robots. CoRR abs/2601.21570 (2026)
[i217]
  - https://dblp.org/rec/journals/corr/abs-2602-01166
Shuanghao Bai, Jing Lyu, Wanqi Zhou, Zhe Li, Dakai Wang, Lei Xing, Xiaoguang Zhao, Pengwei Wang, Zhongyuan Wang, Cheng Chi, Badong Chen, Shanghang Zhang:
Latent Reasoning VLA: Latent Thinking and Prediction for Vision-Language-Action Models. CoRR abs/2602.01166 (2026)
[i216]
  - https://dblp.org/rec/journals/corr/abs-2602-01594
Wenzhuo Liu, Qiannan Guo, Zhen Wang, Wenshuo Wang, Lei Yang, Yicheng Qiao, Lening Wang, Zhiwei Liu, Chen Lv, Shanghang Zhang, Junqiang Xi, Huaping Liu:
UV-M3TL: A Unified and Versatile Multimodal Multi-Task Learning Framework for Assistive Driving Perception. CoRR abs/2602.01594 (2026)
[i215]
  - https://dblp.org/rec/journals/corr/abs-2602-04228
Shuanghao Bai, Dakai Wang, Cheng Chi, Wanqi Zhou, Jing Lyu, Xiaoguang Zhao, Pengwei Wang, Zhongyuan Wang, Lei Xing, Shanghang Zhang, Badong Chen:
Reshaping Action Error Distributions for Reliable Vision-Language-Action Models. CoRR abs/2602.04228 (2026)
[i214]
  - https://dblp.org/rec/journals/corr/abs-2602-06825
Yuming Li, Qingyu Li, Chengyu Bai, Xiangyang Luo, Zeyue Xue, Wenyu Qin, Meng Wang, Yikai Wang, Shanghang Zhang:
AEGPO: Adaptive Entropy-Guided Policy Optimization for Diffusion Models. CoRR abs/2602.06825 (2026)
[i213]
  - https://dblp.org/rec/journals/corr/abs-2602-09023
Qinwen Xu, Jiaming Liu, Rui Zhou, Shaojun Shi, Nuowei Han, Zhuoyang Liu, Chenyang Gu, Shuo Gu, Yang Yue, Gao Huang, Wenzhao Zheng, Sirui Han, Peng Jia, Shanghang Zhang:
TwinRL-VLA: Digital Twin-Driven Reinforcement Learning for Real-World Robotic Manipulation. CoRR abs/2602.09023 (2026)
[i212]
  - https://dblp.org/rec/journals/corr/abs-2602-23893
Bowen Yang, Zishuo Li, Yang Sun, Changtao Miao, Yifan Yang, Man Luo, Xiaotong Yan, Feng Jiang, Jinchuan Shi, Yankai Fu, Ning Chen, Junkai Zhao, Pengwei Wang, Guocai Yao, Shanghang Zhang, Hao Chen, Zhe Li, Kai Zhu:
AoE: Always-on Egocentric Human Video Collection for Embodied AI. CoRR abs/2602.23893 (2026)
2025
[j25]
  - https://dblp.org/rec/journals/aei/LiuQZT25
Tianqi Liu, Yanjun Qin, Shanghang Zhang, Xiaoming Tao:
A diffusion-based feature enhancement approach for driving behavior classification with EEG data. Adv. Eng. Informatics 65: 103279 (2025)
[j24]
  - https://dblp.org/rec/journals/ijcv/WangHHSZC25
Yupei Wang, Xiaoxing Hu, Yongkang Hu, Zhuoran Sun, Shanghang Zhang, Liang Chen:
Boosting Domain Generalization in Remote Sensing Image Segmentation via Style Mapping and General Prototypical Contrast. Int. J. Comput. Vis. 133(12): 8526-8545 (2025)
[j23]
  - https://dblp.org/rec/journals/iotj/LiuQZDT25
Tianqi Liu, Yanjun Qin, Shanghang Zhang, Yiping Duan, Xiaoming Tao:
EEG-Driven Classification of Driver Mental Workload in Diverse Environments: A Dual-Branch Network for Efficient In-Vehicle Applications. IEEE Internet Things J. 12(17): 34846-34862 (2025)
[j22]
  - https://dblp.org/rec/journals/ncs/DaiZWTZWQLTGHCZ25
Gaole Dai, Rongyu Zhang, Qingpo Wuwu, Cheng-Ching Tseng, Yu Zhou, Shaokang Wang, Siyuan Qian, Ming Lu, Ali Ata Tuz, Matthias Gunzer, Tiejun Huang, Jianxu Chen, Shanghang Zhang:
Implicit neural image field for biological microscopy image compression. Nat. Comput. Sci. 5(11): 1041-1050 (2025)
[j21]
  - https://dblp.org/rec/journals/spl/LiuQZT25
Tianqi Liu, Yanjun Qin, Shanghang Zhang, Xiaoming Tao:
Empowering Corner Case Detection in Autonomous Vehicles With Multimodal Large Language Models. IEEE Signal Process. Lett. 32: 51-55 (2025)
[j20]
  - https://dblp.org/rec/journals/symmetry/ZouMWLZQH25
Chen Zou, Qingsen Ma, Jia Wang, Ming Lu, Shanghang Zhang, Zhaowei Qu, Zhaofeng He:
GaussianEnhancer++: A General GS-Agnostic Rendering Enhancer. Symmetry 17(3): 442 (2025)
[j19]
  - https://dblp.org/rec/journals/tcsv/ZhangLLCWDDZ25
Rongyu Zhang, Jiaming Liu, Xiaoqi Li, Xiaowei Chi, Dan Wang, Li Du, Yuan Du, Shanghang Zhang:
BEVUDA++: Geometric-Aware Unsupervised Domain Adaptation for Multi-View 3D Object Detection. IEEE Trans. Circuits Syst. Video Technol. 35(5): 5109-5122 (2025)
[j18]
  - https://dblp.org/rec/journals/tmc/ZhangDLDDWZW25
Rongyu Zhang, Xize Duan, Jiaming Liu, Li Du, Yuan Du, Dan Wang, Shanghang Zhang, Fangxin Wang:
RepCaM++: Exploring Transparent Visual Prompt With Inference-Time Re-Parameterization for Neural Video Delivery. IEEE Trans. Mob. Comput. 24(9): 8930-8944 (2025)
[j17]
  - https://dblp.org/rec/journals/tnn/QiSWLLZZS25
Xingqun Qi, Muyi Sun, Zijian Wang, Jiaming Liu, Qi Li, Fang Zhao, Shanghang Zhang, Caifeng Shan:
Biphasic Face Photo-Sketch Synthesis via Semantic-Driven Generative Adversarial Network With Graph Representation Learning. IEEE Trans. Neural Networks Learn. Syst. 36(2): 2182-2195 (2025)
[c128]
  - https://dblp.org/rec/conf/aaai/JiaCYWLJZ25
Yueru Jia, Aosong Cheng, Yuhui Yuan, Chuke Wang, Ji Li, Huizhu Jia, Shanghang Zhang:
DesignEdit: Unify Spatial-Aware Image Editing via Training-free Inpainting with a Multi-Layered Latent Diffusion Framework. AAAI 2025: 3958-3966
[c127]
  - https://dblp.org/rec/conf/aaai/YangLZPG0CG0GZ25
Senqiao Yang, Jiaming Liu, Renrui Zhang, Mingjie Pan, Ziyu Guo, Xiaoqi Li, Zehui Chen, Peng Gao, Hongsheng Li, Yandong Guo, Shanghang Zhang:
LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding. AAAI 2025: 9247-9255
[c126]
  - https://dblp.org/rec/conf/aaai/LiuLWNZ25
Bowen Liu, Haoyang Li, Shuning Wang, Shuo Nie, Shanghang Zhang:
Subgraph Aggregation for Out-of-Distribution Generalization on Graphs. AAAI 2025: 18763-18771
[c125]
  - https://dblp.org/rec/conf/acl/PingZMWZZ25
Bowen Ping, Jiali Zeng, Fandong Meng, Shuo Wang, Jie Zhou, Shanghang Zhang:
LongDPO: Unlock Better Long-form Generation Abilities for LLMs via Critique-augmented Stepwise Information. ACL (Findings) 2025: 7613-7632
[c124]
  - https://dblp.org/rec/conf/acl/ZhangHXZZWZWZX25
Lingfeng Zhang, Xiaoshuai Hao, Qinwen Xu, Qiang Zhang, Xinyao Zhang, Pengwei Wang, Jing Zhang, Zhongyuan Wang, Shanghang Zhang, Renjing Xu:
MapNav: A Novel Memory Representation via Annotated Semantic Maps for VLM-based Vision-and-Language Navigation. ACL (1) 2025: 13032-13056
[c123]
  - https://dblp.org/rec/conf/cvpr/JiTSHZZWZ0AXSL025
Yuheng Ji, Huajie Tan, Jiayu Shi, Xiaoshuai Hao, Yuan Zhang, Hengyuan Zhang, Pengwei Wang, Mengdi Zhao, Yao Mu, Pengju An, Xinda Xue, Qinghang Su, Huaihai Lyu, Xiaolong Zheng, Jiaming Liu, Zhongyuan Wang, Shanghang Zhang:
RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete. CVPR 2025: 1724-1734
[c122]
  - https://dblp.org/rec/conf/cvpr/HuangZXKZKW25
Nan Huang, Wenzhao Zheng, Chenfeng Xu, Kurt Keutzer, Shanghang Zhang, Angjoo Kanazawa, Qianqian Wang:
Segment Any Motion in Videos. CVPR 2025: 3406-3416
[c121]
  - https://dblp.org/rec/conf/cvpr/Jia0CGWL0WWZZ25
Yueru Jia, Jiaming Liu, Sixiang Chen, Chenyang Gu, Zhilue Wang, Longzan Luo, Xiaoqi Li, Pengwei Wang, Zhongyuan Wang, Renrui Zhang, Shanghang Zhang:
Lift3D Policy: Lifting 2D Foundation Models for Robust 3D Robotic Manipulation. CVPR 2025: 17347-17358
[c120]
  - https://dblp.org/rec/conf/cvpr/XuWCLJZXHZX25
Jinchang Xu, Shaokang Wang, Jintao Chen, Zhe Li, Peidong Jia, Fei Zhao, Guoqing Xiang, Zhijian Hao, Shanghang Zhang, Xiaodong Xie:
Decouple Distortion from Perception: Region Adaptive Diffusion for Extreme-low Bitrate Perception Image Compression. CVPR 2025: 18051-18061
[c119]
  - https://dblp.org/rec/conf/cvpr/CaoZHLZAMZ25
Jiajun Cao, Yuan Zhang, Tao Huang, Ming Lu, Qizhe Zhang, Ruichuan An, Ningning Ma, Shanghang Zhang:
MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders. CVPR 2025: 19846-19856
[c118]
  - https://dblp.org/rec/conf/cvpr/0009XZ00PXH0Z025
Xiaoqi Li, Jingyun Xu, Mingxu Zhang, Jiaming Liu, Yan Shen, Iaroslav Ponomarenko, Jiahui Xu, Liang Heng, Siyuan Huang, Shanghang Zhang, Hao Dong:
Object-Centric Prompt-Driven Vision-Language-Action Model for Robotic Manipulation. CVPR 2025: 27638-27648
[c117]
  - https://dblp.org/rec/conf/dcc/WangXLYYZXJ25
Shaokang Wang, Guoqing Xiang, Wenzhao Li, Mingyuan Yang, Fan Yang, Shanghang Zhang, Xiaodong Xie, Huizhu Jia:
Three-Stage Progressive Pre-Analysis Framework for VMAF Controllable Image Coding. DCC 2025: 404
[c116]
  - https://dblp.org/rec/conf/icassp/0007MWLZH25
Chen Zou, Qingsen Ma, Jia Wang, Ming Lu, Shanghang Zhang, Zhaofeng He:
GaussianEnhancer: A General Rendering Enhancer for Gaussian Splatting. ICASSP 2025: 1-5
[c115]
  - https://dblp.org/rec/conf/icassp/WangXXZX25
Shaokang Wang, Guoqing Xiang, Jinchang Xu, Shanghang Zhang, Xiaodong Xie:
Efficient Quality Controllable Neural Image Compression based on QD-Model. ICASSP 2025: 1-5
[c114]
  - https://dblp.org/rec/conf/iccv/HaoYZWWKYFYQMSCZZZSLN25
Ruiyang Hao, Haibao Yu, Jiaru Zhong, Chuanye Wang, Jiahao Wang, Yiming Kan, Wenxian Yang, Siqi Fan, Huilin Yin, Jianing Qiu, Yao Mu, Jiankai Sun, Li Chen, Walter Zimmer, Dandan Zhang, Shanghang Zhang, Mac Schwager, Ping Luo, Zaiqing Nie:
Research Challenges and Progress in the End-to-End V2X Cooperative Autonomous Driving Competition. ICCVW 2025: 1849-1860
[c113]
  - https://dblp.org/rec/conf/iclr/LinWA0ZL0Z025
Weifeng Lin, Xinyu Wei, Ruichuan An, Peng Gao, Bocheng Zou, Yulin Luo, Siyuan Huang, Shanghang Zhang, Hongsheng Li:
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want. ICLR 2025
[c112]
  - https://dblp.org/rec/conf/iclr/QiWZPXZLLG25
Xingqun Qi, Yatian Wang, Hengyuan Zhang, Jiahao Pan, Wei Xue, Shanghang Zhang, Wenhan Luo, Qifeng Liu, Yike Guo:
Co3Gesture: Towards Coherent Concurrent Co-speech 3D Gesture Generation with Interactive Diffusion. ICLR 2025
[c111]
  - https://dblp.org/rec/conf/iclr/ZhangWJGZT0ZZG025
Renrui Zhang, Xinyu Wei, Dongzhi Jiang, Ziyu Guo, Yichi Zhang, Chengzhuo Tong, Jiaming Liu, Aojun Zhou, Shanghang Zhang, Peng Gao, Hongsheng Li:
MAVIS: Mathematical Visual Instruction Tuning with an Automatic Data Engine. ICLR 2025
[c110]
  - https://dblp.org/rec/conf/icmcs/WangLXXZX25
Shaokang Wang, Dingquan Li, Guoqing Xiang, Jinchang Xu, Shanghang Zhang, Xiaodong Xie:
Adaptive Semantic Compression: Compatible Bitstream for Scalable Human-Machine Perception Sample Adaption. ICME 2025: 1-6
[c109]
  - https://dblp.org/rec/conf/icml/0020FMZ0CGONKZ25
Yuan Zhang, Chun-Kai Fan, Junpeng Ma, Wenzhao Zheng, Tao Huang, Kuan Cheng, Denis A. Gudovskiy, Tomoyuki Okuno, Yohei Nakata, Kurt Keutzer, Shanghang Zhang:
SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference. ICML 2025
[c108]
  - https://dblp.org/rec/conf/icml/ChenZ00GSZ025
Tianyu Chen, Haoyi Zhou, Ying Li, Hao Wang, Chonghan Gao, Rongye Shi, Shanghang Zhang, Jianxin Li:
OmniArch: Building Foundation Model for Scientific Computing. ICML 2025
[c107]
  - https://dblp.org/rec/conf/icml/ChiFZQZCC0LZG25
Xiaowei Chi, Chun-Kai Fan, Hengyuan Zhang, Xingqun Qi, Rongyu Zhang, Anthony Chen, Chi-Min Chan, Wei Xue, Qifeng Liu, Shanghang Zhang, Yike Guo:
Empowering World Models with Reflection for Embodied Video Prediction. ICML 2025
[c106]
  - https://dblp.org/rec/conf/icml/DaiFT00GZTZ025
Gaole Dai, Chun-Kai Fan, Yiming Tang, Zhi Zhang, Yuan Zhang, Yulu Gan, Qizhe Zhang, Cheng-Ching Tseng, Shanghang Zhang, Tiejun Huang:
SAN: Hypothesizing Long-Term Synaptic Development and Neural Engram Mechanism in Scalable Model's Parameter-Efficient Fine-Tuning. ICML 2025
[c105]
  - https://dblp.org/rec/conf/icml/WuwuGCHZWLZZ25
Qingpo Wuwu, Chonghan Gao, Tianyu Chen, Yihang Huang, Yuekai Zhang, Jianing Wang, Jianxin Li, Haoyi Zhou, Shanghang Zhang:
PINNsAgent: Automated PDE Surrogation with Large Language Models. ICML 2025
[c104]
  - https://dblp.org/rec/conf/icra/HuangZYCZ25
Nan Huang, Ting Zhang, Yuhui Yuan, Dong Chen, Shanghang Zhang:
High-Quality 3D Creation From a Single Image Using Subject-Specific Knowledge Prior. ICRA 2025: 199-206
[c103]
  - https://dblp.org/rec/conf/icra/LiLLWGZDZ25
Jianing Li, Ming Lu, Juntao Liu, Hao Wang, Chenyang Gu, Wenzhao Zheng, Li Du, Shanghang Zhang:
SliceOcc: Indoor 3D Semantic Occupancy Prediction with Vertical Slice Representation. ICRA 2025: 15762-15768
[c102]
  - https://dblp.org/rec/conf/ijcai/ChenZLWZZZ025
Tianyu Chen, Haoyi Zhou, Ying Li, Hao Wang, Zhenzhe Zhang, Tianchen Zhu, Shanghang Zhang, Jianxin Li:
FreqMoE: Dynamic Frequency Enhancement for Neural PDE Solvers. IJCAI 2025: 7356-7364
[c101]
  - https://dblp.org/rec/conf/iros/TangZHWWWZ25
Yingbo Tang, Shuaike Zhang, Xiaoshuai Hao, Pengwei Wang, Jianlong Wu, Zhongyuan Wang, Shanghang Zhang:
AffordGrasp: In-Context Affordance Reasoning for Open-Vocabulary Task-Oriented Grasping in Clutter. IROS 2025: 9433-9439
[c100]
  - https://dblp.org/rec/conf/iros/HengLMLLWWJGZZD25
Liang Heng, Xiaoqi Li, Shangqing Mao, Jiaming Liu, Ruolin Liu, Jingli Wei, Yu-Kai Wang, Yueru Jia, Chenyang Gu, Rui Zhao, Shanghang Zhang, Hao Dong:
RwoR: Generating Robot Demonstrations from Human Hand Collection for Policy Learning without Robot. IROS 2025: 13544-13551
[c99]
  - https://dblp.org/rec/conf/itsc/LiuQZT25
Tianqi Liu, Yanjun Qin, Shanghang Zhang, Xiaoming Tao:
Driver Road Rage Detection and Event Perception Analysis Based on EEG Functional Connectivity. ITSC 2025: 2356-2361
[c98]
  - https://dblp.org/rec/conf/mm/WangWZLBLLZZ25
Hao Wang, Xiaobao Wei, Xiaoan Zhang, Jianing Li, Chengyu Bai, Ying Li, Ming Lu, Wenzhao Zheng, Shanghang Zhang:
EmbodiedOcc++: Boosting Embodied 3D Occupancy Prediction with Plane Regularization and Uncertainty Sampler. ACM Multimedia 2025: 925-934
[c97]
  - https://dblp.org/rec/conf/mm/ZhangHTZ00MZ25
Shuyi Zhang, Xiaoshuai Hao, Yingbo Tang, Lingfeng Zhang, Pengwei Wang, Zhongyuan Wang, Hongxuan Ma, Shanghang Zhang:
Video-CoT: A Comprehensive Dataset for Spatiotemporal Understanding of Videos Based on Chain-of-Thought. ACM Multimedia 2025: 12745-12752
[d1]
  - https://dblp.org/rec/data/11/ZhangC25d
Shanghang Zhang, Jianxu Chen:
Implicit Neural Image Field for Biological Microscopy Image Compression. Zenodo, 2025
[i211]
  - https://dblp.org/rec/journals/corr/abs-2501-01709
Jiajun Cao, Yuan Zhang, Tao Huang, Ming Lu, Qizhe Zhang, Ruichuan An, Ningning Ma, Shanghang Zhang:
MoVE-KD: Knowledge Distillation for VLMs with Mixture of Visual Encoders. CoRR abs/2501.01709 (2025)
[i210]
  - https://dblp.org/rec/journals/corr/abs-2501-12053
Qingpo Wuwu, Chonghan Gao, Tianyu Chen, Yihang Huang, Yuekai Zhang, Jianing Wang, Jianxin Li, Haoyi Zhou, Shanghang Zhang:
PINNsAgent: Automated PDE Surrogation with Large Language Models. CoRR abs/2501.12053 (2025)
[i209]
  - https://dblp.org/rec/journals/corr/abs-2501-16684
Jianing Li, Ming Lu, Hao Wang, Chenyang Gu, Wenzhao Zheng, Li Du, Shanghang Zhang:
SliceOcc: Indoor 3D Semantic Occupancy Prediction with Vertical Slice Representation. CoRR abs/2501.16684 (2025)
[i208]
  - https://dblp.org/rec/journals/corr/abs-2502-02095
Bowen Ping, Jiali Zeng, Fandong Meng, Shuo Wang, Jie Zhou, Shanghang Zhang:
LongDPO: Unlock Better Long-form Generation Abilities for LLMs via Critique-augmented Stepwise Information. CoRR abs/2502.02095 (2025)
[i207]
  - https://dblp.org/rec/journals/corr/abs-2502-08449
Yankai Fu, Qiuxuan Feng, Ning Chen, Zichen Zhou, Mengzhen Liu, Mingdong Wu, Tianxing Chen, Shanyu Rong, Jiaming Liu, Hao Dong, Shanghang Zhang:
CordViP: Correspondence-based Visuomotor Policy for Dexterous Manipulation in Real-World. CoRR abs/2502.08449 (2025)
[i206]
  - https://dblp.org/rec/journals/corr/abs-2502-13451
Lingfeng Zhang, Xiaoshuai Hao, Qinwen Xu, Qiang Zhang, Xinyao Zhang, Pengwei Wang, Jing Zhang, Zhongyuan Wang, Shanghang Zhang, Renjing Xu:
MapNav: A Novel Memory Representation via Annotated Semantic Maps for VLM-based Vision-and-Language Navigation. CoRR abs/2502.13451 (2025)
[i205]
  - https://dblp.org/rec/journals/corr/abs-2502-21257
Yuheng Ji, Huajie Tan, Jiayu Shi, Xiaoshuai Hao, Yuan Zhang, Hengyuan Zhang, Pengwei Wang, Mengdi Zhao, Yao Mu, Pengju An, Xinda Xue, Qinghang Su, Huaihai Lyu, Xiaolong Zheng, Jiaming Liu, Zhongyuan Wang, Shanghang Zhang:
RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete. CoRR abs/2502.21257 (2025)
[i204]
  - https://dblp.org/rec/journals/corr/abs-2503-00778
Yingbo Tang, Shuaike Zhang, Xiaoshuai Hao, Pengwei Wang, Jianlong Wu, Zhongyuan Wang, Shanghang Zhang:
AffordGrasp: In-Context Affordance Reasoning for Open-Vocabulary Task-Oriented Grasping in Clutter. CoRR abs/2503.00778 (2025)
[i203]
  - https://dblp.org/rec/journals/corr/abs-2503-10631
Jiaming Liu, Hao Chen, Pengju An, Zhuoyang Liu, Renrui Zhang, Chenyang Gu, Xiaoqi Li, Ziyu Guo, Sixiang Chen, Mengzhen Liu, Chengkai Hou, Mengdi Zhao, KC alex Zhou, Pheng-Ann Heng, Shanghang Zhang:
HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model. CoRR abs/2503.10631 (2025)
[i202]
  dblp key:
  - journals/corr/abs-2503-16545
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-16545
Xinyan Chen, Jiaxin Ge, Hongming Dai, Qiang Zhou, Qiuxuan Feng, Jingtong Hu, Yizhou Wang, Jiaming Liu, Shanghang Zhang:
EmpathyAgent: Can Embodied Agents Conduct Empathetic Actions? CoRR abs/2503.16545 (2025)
[i201]
  dblp key:
  - journals/corr/abs-2503-20384
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-20384
Rongyu Zhang, Menghang Dong, Yuan Zhang, Liang Heng, Xiaowei Chi, Gaole Dai, Li Du, Dan Wang, Yuan Du, Shanghang Zhang:
MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation. CoRR abs/2503.20384 (2025)
[i200]
  dblp key:
  - journals/corr/abs-2503-20752
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-20752
Huajie Tan, Yuheng Ji, Xiaoshuai Hao, Minglan Lin, Pengwei Wang, Zhongyuan Wang, Shanghang Zhang:
Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning. CoRR abs/2503.20752 (2025)
[i199]
  dblp key:
  - journals/corr/abs-2503-22268
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-22268
Nan Huang, Wenzhao Zheng, Chenfeng Xu, Kurt Keutzer, Shanghang Zhang, Angjoo Kanazawa, Qianqian Wang:
Segment Any Motion in Videos. CoRR abs/2503.22268 (2025)
[i198]
  dblp key:
  - journals/corr/abs-2504-09540
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2504-09540
Hao Wang, Xiaobao Wei, Xiaoan Zhang, Jianing Li, Chengyu Bai, Ying Li, Ming Lu, Wenzhao Zheng, Shanghang Zhang:
EmbodiedOcc++: Boosting Embodied 3D Occupancy Prediction with Plane Regularization and Uncertainty Sampler. CoRR abs/2504.09540 (2025)
[i197]
  dblp key:
  - journals/corr/abs-2504-16464
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2504-16464
Ying Li, Xiaobao Wei, Xiaowei Chi, Yuming Li, Zhongyu Zhao, Hao Wang, Ningning Ma, Ming Lu, Shanghang Zhang:
ManipDreamer: Boosting Robotic Manipulation World Model with Action Tree and Visual Guidance. CoRR abs/2504.16464 (2025)
[i196]
  dblp key:
  - journals/corr/abs-2505-01746
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-01746
Xingqun Qi, Yatian Wang, Hengyuan Zhang, Jiahao Pan, Wei Xue, Shanghang Zhang, Wenhan Luo, Qifeng Liu, Yike Guo:
Co³Gesture: Towards Coherent Concurrent Co-speech 3D Gesture Generation with Interactive Diffusion. CoRR abs/2505.01746 (2025)
[i195]
  dblp key:
  - journals/corr/abs-2505-02166
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-02166
Xiaoqi Li, Lingyun Xu, Mingxu Zhang, Jiaming Liu, Yan Shen, Iaroslav Ponomarenko, Jiahui Xu, Liang He, Siyuan Huang, Shanghang Zhang, Hao Dong:
CrayonRobo: Object-Centric Prompt-Driven Vision-Language-Action Model for Robotic Manipulation. CoRR abs/2505.02166 (2025)
[i194]
  dblp key:
  - journals/corr/abs-2505-03673
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-03673
Huajie Tan, Xiaoshuai Hao, Minglan Lin, Pengwei Wang, Yaoxu Lyu, Mingyu Cao, Zhongyuan Wang, Shanghang Zhang:
RoboOS: A Hierarchical Embodied Framework for Cross-Embodiment and Multi-Agent Collaboration. CoRR abs/2505.03673 (2025)
[i193]
  dblp key:
  - journals/corr/abs-2505-06858
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-06858
Tianyu Chen, Haoyi Zhou, Ying Li, Hao Wang, Zhenzhe Zhang, Tianchen Zhu, Shanghang Zhang, Jianxin Li:
FreqMoE: Dynamic Frequency Enhancement for Neural PDE Solvers. CoRR abs/2505.06858 (2025)
[i192]
  dblp key:
  - journals/corr/abs-2505-11920
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-11920
Guangrun Li, Yaoxu Lyu, Zhuoyang Liu, Chengkai Hou, Jieyu Zhang, Shanghang Zhang:
H2R: A Human-to-Robot Data Augmentation for Robot Pre-training from Videos. CoRR abs/2505.11920 (2025)
[i191]
  dblp key:
  - journals/corr/abs-2505-12239
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-12239
Jianheng Tang, Huiping Zhuang, Di Fang, Jiaxu Li, Feijiang Han, Yajiang Huang, Kejia Fan, Leye Wang, Zhanxing Zhu, Shanghang Zhang, Houbing Herbert Song, Yunhuai Liu:
ACU: Analytic Continual Unlearning for Efficient and Exact Forgetting with Privacy Preservation. CoRR abs/2505.12239 (2025)
[i190]
  dblp key:
  - journals/corr/abs-2505-12245
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-12245
Jianheng Tang, Huiping Zhuang, Jingyu He, Run He, Jingchao Wang, Kejia Fan, Anfeng Liu, Tian Wang, Leye Wang, Zhanxing Zhu, Shanghang Zhang, Houbing Herbert Song, Yunhuai Liu:
AFCL: Analytic Federated Continual Learning for Spatio-Temporal Invariance of Non-IID Data. CoRR abs/2505.12245 (2025)
[i189]
  dblp key:
  - journals/corr/abs-2505-18049
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-18049
Gaole Dai, Menghang Dong, Rongyu Zhang, Ruichuan An, Shanghang Zhang, Tie-Jun Huang:
SpikeGen: Generative Framework for Visual Spike Stream Processing. CoRR abs/2505.18049 (2025)
[i188]
  dblp key:
  - journals/corr/abs-2505-20610
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-20610
Xiaobao Wei, Xiaoan Zhang, Hao Wang, Qingpo Wuwu, Ming Lu, Wenzhao Zheng, Shanghang Zhang:
OmniIndoor3D: Comprehensive Indoor 3D Reconstruction. CoRR abs/2505.20610 (2025)
[i187]
  dblp key:
  - journals/corr/abs-2505-22421
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-22421
Anthony Chen, Wenzhao Zheng, Yida Wang, Xueyang Zhang, Kun Zhan, Peng Jia, Kurt Keutzer, Shanghang Zhang:
GeoDrive: 3D Geometry-Informed Driving World Model with Precise Action Control. CoRR abs/2505.22421 (2025)
[i186]
  dblp key:
  - journals/corr/abs-2506-01953
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-01953
Hao Chen, Jiaming Liu, Chenyang Gu, Zhuoyang Liu, Renrui Zhang, Xiaoqi Li, Xiao He, Yandong Guo, Chi-Wing Fu, Shanghang Zhang, Pheng-Ann Heng:
Fast-in-Slow: A Dual-System Foundation Model Unifying Fast Manipulation within Slow Reasoning. CoRR abs/2506.01953 (2025)
[i185]
  dblp key:
  - journals/corr/abs-2506-04308
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-04308
Enshen Zhou, Jingkun An, Cheng Chi, Yi Han, Shanyu Rong, Chi Zhang, Pengwei Wang, Zhongyuan Wang, Tie-Jun Huang, Lu Sheng, Shanghang Zhang:
RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics. CoRR abs/2506.04308 (2025)
[i184]
  dblp key:
  - journals/corr/abs-2506-06690
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-06690
Hao Wang, Chengkai Hou, Xianglong Li, Yankai Fu, Chenxuan Li, Ning Chen, Gaole Dai, Jiaming Liu, Tie-Jun Huang, Shanghang Zhang:
SpikePingpong: High-Frequency Spike Vision-based Robot Learning for Precise Striking in Table Tennis Game. CoRR abs/2506.06690 (2025)
[i183]
  dblp key:
  - journals/corr/abs-2506-08817
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-08817
Shuyi Zhang, Xiaoshuai Hao, Yingbo Tang, Lingfeng Zhang, Pengwei Wang, Zhongyuan Wang, Hongxuan Ma, Shanghang Zhang:
Video-CoT: A Comprehensive Dataset for Spatiotemporal Understanding of Videos Based on Chain-of-Thought. CoRR abs/2506.08817 (2025)
[i182]
  dblp key:
  - journals/corr/abs-2506-10967
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-10967
Qizhe Zhang, Mengzhen Liu, Lichen Li, Ming Lu, Yuan Zhang, Junwen Pan, Qi She, Shanghang Zhang:
Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs. CoRR abs/2506.10967 (2025)
[i181]
  dblp key:
  - journals/corr/abs-2506-16112
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-16112
Yuan Zhang, Chun-Kai Fan, Tao Huang, Ming Lu, Sicheng Yu, Junwen Pan, Kuan Cheng, Qi She, Shanghang Zhang:
AutoV: Learning to Retrieve Visual Prompt for Large Vision-Language Models. CoRR abs/2506.16112 (2025)
[i180]
  dblp key:
  - journals/corr/abs-2506-16119
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-16119
Chengyu Bai, Yuming Li, Zhongyu Zhao, Jintao Chen, Peidong Jia, Qi She, Ming Lu, Shanghang Zhang:
FastInit: Fast Noise Initialization for Temporally Consistent Video Generation. CoRR abs/2506.16119 (2025)
[i179]
  dblp key:
  - journals/corr/abs-2506-18897
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-18897
Xiaowei Chi, Kuangzhi Ge, Jiaming Liu, Siyuan Zhou, Peidong Jia, Zichen He, Yuzhen Liu, Tingguang Li, Lei Han, Sirui Han, Shanghang Zhang, Yike Guo:
MinD: Unified Visual Imagination and Control via Hierarchical World Models. CoRR abs/2506.18897 (2025)
[i178]
  dblp key:
  - journals/corr/abs-2506-21669
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-21669
Wanxin Tian, Shijie Zhang, Kevin Zhang, Xiaowei Chi, Yulin Luo, Junyu Lu, Chunkai Fan, Qiang Zhou, Yiming Zhao, Ning Liu, Siyu Lin, Zhiyuan Qin, Xiaozhu Ju, Shanghang Zhang, Jian Tang:
SEEA-R1: Tree-Structured Reinforcement Fine-Tuning for Self-Evolving Embodied Agents. CoRR abs/2506.21669 (2025)
[i177]
  dblp key:
  - journals/corr/abs-2507-01961
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2507-01961
Sixiang Chen, Jiaming Liu, Siyuan Qian, Han Jiang, Lily Li, Renrui Zhang, Zhuoyang Liu, Chenyang Gu, Chengkai Hou, Pengwei Wang, Zhongyuan Wang, Shanghang Zhang:
AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulation. CoRR abs/2507.01961 (2025)
[i176]
  dblp key:
  - journals/corr/abs-2507-02029
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2507-02029
Mingyu Cao, Huajie Tan, Yuheng Ji, Minglan Lin, Zhiyu Li, Zhou Cao, Pengwei Wang, Enshen Zhou, Yi Han, Yingbo Tang, Xiangqi Xu, Wei Guo, Yaoxu Lyu, Yijie Xu, Jiayu Shi, Mengfei Du, Cheng Chi, Mengdi Zhao, Xiaoshuai Hao, Junkai Zhao, Xiaojie Zhang, Shanyu Rong, Huaihai Lyu, Zhengliang Cai, Yankai Fu, Ning Chen, Bolun Zhang, Lingfeng Zhang, Shuyi Zhang, Dong Liu, Xi Feng, Songjing Wang, Xiaodan Liu, Yance Jiao, Mengsi Lyu, Zhuo Chen, Chenrui He, Yulong Ao, Xue Sun, Zheqi He, Jingshu Zheng, Xi Yang, Donghai Shi, Kunchang Xie, Bochao Zhang, Shaokai Nie, Chunlei Men, Yonghua Lin, Zhongyuan Wang, Tiejun Huang, Shanghang Zhang:
RoboBrain 2.0 Technical Report. CoRR abs/2507.02029 (2025)
[i175]
  dblp key:
  - journals/corr/abs-2507-03930
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2507-03930
Liang Heng, Xiaoqi Li, Shangqing Mao, Jiaming Liu, Ruolin Liu, Jingli Wei, Yu-Kai Wang, Yueru Jia, Chenyang Gu, Rui Zhao, Shanghang Zhang, Hao Dong:
RwoR: Generating Robot Demonstrations from Human Hand Collection for Policy Learning without Robot. CoRR abs/2507.03930 (2025)
[i174]
  dblp key:
  - journals/corr/abs-2507-21610
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2507-21610
Ruiyang Hao, Haibao Yu, Jiaru Zhong, Chuanye Wang, Jiahao Wang, Yiming Kan, Wenxian Yang, Siqi Fan, Huilin Yin, Jianing Qiu, Yao Mu, Jiankai Sun, Li Chen, Walter Zimmer, Dandan Zhang, Shanghang Zhang, Mac Schwager, Wei Huang, Xiaobo Zhang, Ping Luo, Zaiqing Nie:
Research Challenges and Progress in the End-to-End V2X Cooperative Autonomous Driving Competition. CoRR abs/2507.21610 (2025)
[i173]
  dblp key:
  - journals/corr/abs-2507-23318
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2507-23318
Jiajun Cao, Qizhe Zhang, Peidong Jia, Xuhui Zhao, Bo Lan, Xiaoan Zhang, Xiaobao Wei, Sixiang Chen, Zhuo Li, Yang Wang, Liyun Li, Xianming Liu, Ming Lu, Shanghang Zhang:
FastDriveVLA: Efficient End-to-End Driving via Plug-and-Play Reconstruction-based Token Pruning. CoRR abs/2507.23318 (2025)
[i172]
  dblp key:
  - journals/corr/abs-2508-03142
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2508-03142
Chengyu Bai, Jintao Chen, Xiang Bai, Yilong Chen, Qi She, Ming Lu, Shanghang Zhang:
UniEdit-I: Training-free Image Editing for Unified VLM via Iterative Understanding, Editing and Verifying. CoRR abs/2508.03142 (2025)
[i171]
  dblp key:
  - journals/corr/abs-2508-04598
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2508-04598
Lingfeng Zhang, Xiaoshuai Hao, Yingbo Tang, Haoxiang Fu, Xinyu Zheng, Pengwei Wang, Zhongyuan Wang, Wenbo Ding, Shanghang Zhang:
NavA³: Understanding Any Instruction, Navigating Anywhere, Finding Anything. CoRR abs/2508.04598 (2025)
[i170]
  dblp key:
  - journals/corr/abs-2508-16943
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2508-16943
Haozhuo Zhang, Jingkai Sun, Michele Caprio, Jian Tang, Shanghang Zhang, Qiang Zhang, Wei Pan:
HumanoidVerse: A Versatile Humanoid for Vision-Language Guided Multi-Object Rearrangement. CoRR abs/2508.16943 (2025)
[i169]
  dblp key:
  - journals/corr/abs-2508-17230
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2508-17230
Chengkai Hou, Yanjie Ze, Yankai Fu, Zeyu Gao, Songbo Hu, Yue Yu, Shanghang Zhang, Huazhe Xu:
4D Visual Pre-training for Robot Learning. CoRR abs/2508.17230 (2025)
[i168]
  dblp key:
  - journals/corr/abs-2508-21044
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2508-21044
Junpeng Ma, Qizhe Zhang, Ming Lu, Zhibin Wang, Qiang Zhou, Jun Song, Shanghang Zhang:
MMG-Vid: Maximizing Marginal Gains at Segment-level and Token-level for Efficient Video LLMs. CoRR abs/2508.21044 (2025)
[i167]
  dblp key:
  - journals/corr/abs-2509-05314
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-05314
Ying Li, Xiaobao Wei, Xiaowei Chi, Yuming Li, Zhongyu Zhao, Hao Wang, Ningning Ma, Ming Lu, Shanghang Zhang:
ManipDreamer3D: Synthesizing Plausible Robotic Manipulation Video with Occupancy-aware 3D Trajectory. CoRR abs/2509.05314 (2025)
[i166]
  dblp key:
  - journals/corr/abs-2509-06040
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-06040
Yuming Li, Yikai Wang, Yuying Zhu, Zhongyu Zhao, Ming Lu, Qi She, Shanghang Zhang:
BranchGRPO: Stable and Efficient GRPO with Structured Branching in Diffusion Models. CoRR abs/2509.06040 (2025)
[i165]
  dblp key:
  - journals/corr/abs-2509-09674
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-09674
Haozhan Li, Yuxin Zuo, Jiale Yu, Yuhao Zhang, Zhaohui Yang, Kaiyan Zhang, Xuekai Zhu, Yuchen Zhang, Tianxing Chen, Ganqu Cui, Dehui Wang, Dingxiang Luo, Yuchen Fan, Youbang Sun, Jia Zeng, Jiangmiao Pang, Shanghang Zhang, Yu Wang, Yao Mu, Bowen Zhou, Ning Ding:
SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning. CoRR abs/2509.09674 (2025)
[i164]
  dblp key:
  - journals/corr/abs-2509-14002
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-14002
Rongyu Zhang, Xize Duan, Jiaming Liu, Li Du, Yuan Du, Dan Wang, Shanghang Zhang, Fangxin Wang:
RepCaM++: Exploring Transparent Visual Prompt With Inference-Time Re-Parameterization for Neural Video Delivery. CoRR abs/2509.14002 (2025)
[i163]
  dblp key:
  - journals/corr/abs-2509-14151
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-14151
Rongyu Zhang, Jiaming Liu, Xiaoqi Li, Xiaowei Chi, Dan Wang, Li Du, Yuan Du, Shanghang Zhang:
BEVUDA++: Geometric-aware Unsupervised Domain Adaptation for Multi-View 3D Object Detection. CoRR abs/2509.14151 (2025)
[i162]
  dblp key:
  - journals/corr/abs-2509-17759
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-17759
Chengbo Yuan, Rui Zhou, Mengzhen Liu, Yingdong Hu, Shengjie Wang, Li Yi, Chuan Wen, Shanghang Zhang, Yang Gao:
MotionTrans: Human VR Data Enable Motion-Level Learning for Robotic Manipulation Policies. CoRR abs/2509.17759 (2025)
[i161]
  dblp key:
  - journals/corr/abs-2509-22583
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-22583
Gaole Dai, Chenghao Zhou, Yu Zhou, Rongyu Zhang, Yuan Zhang, Chengkai Hou, Tie-Jun Huang, Jianxu Chen, Shanghang Zhang:
Orochi: Versatile Biomedical Image Processor. CoRR abs/2509.22583 (2025)
[i160]
  dblp key:
  - journals/corr/abs-2509-22642
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-22642
Xiaowei Chi, Peidong Jia, Chun-Kai Fan, Xiaozhu Ju, Weishi Mi, Kevin Zhang, Zhiyuan Qin, Wanxin Tian, Kuangzhi Ge, Hao Li, Zezhong Qian, Anthony Chen, Qiang Zhou, Yueru Jia, Jiaming Liu, Yong Dai, Qingpo Wuwu, Chengyu Bai, Yu-Kai Wang, Ying Li, Lizhang Chen, Yong Bao, Zhiyuan Jiang, Jiacheng Zhu, Kai Tang, Ruichuan An, Yulin Luo, Qiuxuan Feng, Siyuan Zhou, Chi-Min Chan, Chengkai Hou, Wei Xue, Sirui Han, Yike Guo, Shanghang Zhang, Jian Tang:
WoW: Towards a World omniscient World model Through Embodied Interaction. CoRR abs/2509.22642 (2025)
[i159]
  dblp key:
  - journals/corr/abs-2509-25681
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-25681
Junjie Wen, Minjie Zhu, Jiaming Liu, Zhiyuan Liu, Yicun Yang, Linfeng Zhang, Shanghang Zhang, Yichen Zhu, Yi Xu:
dVLA: Diffusion Vision-Language-Action Model with Multimodal Chain-of-Thought. CoRR abs/2509.25681 (2025)
[i158]
  dblp key:
  - journals/corr/abs-2509-26642
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-26642
Zhuoyang Liu, Jiaming Liu, Jiadong Xu, Nuowei Han, Chenyang Gu, Hao Chen, Kaichen Zhou, Renrui Zhang, Kai-Chin Hsieh, Kun Wu, Zhengping Che, Jian Tang, Shanghang Zhang:
MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation. CoRR abs/2509.26642 (2025)
[i157]
  dblp key:
  - journals/corr/abs-2510-00483
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-00483
Yuheng Ji, Huajie Tan, Cheng Chi, Yijie Xu, Yuting Zhao, Enshen Zhou, Huaihai Lyu, Pengwei Wang, Zhongyuan Wang, Shanghang Zhang, Xiaolong Zheng:
MathSticks: A Benchmark for Visual Symbolic Compositional Reasoning with Matchstick Puzzles. CoRR abs/2510.00483 (2025)
[i156]
  dblp key:
  - journals/corr/abs-2510-00855
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-00855
Kevin Zhang, Kuangzhi Ge, Xiaowei Chi, Renrui Zhang, Shaojun Shi, Zhen Dong, Sirui Han, Shanghang Zhang:
Can World Models Benefit VLMs for World Dynamics? CoRR abs/2510.00855 (2025)
[i155]
  dblp key:
  - journals/corr/abs-2510-07181
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-07181
Yi Han, Cheng Chi, Enshen Zhou, Shanyu Rong, Jingkun An, Pengwei Wang, Zhongyuan Wang, Lu Sheng, Shanghang Zhang:
TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics. CoRR abs/2510.07181 (2025)
[i154]
  dblp key:
  - journals/corr/abs-2510-07313
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-07313
Zezhong Qian, Xiaowei Chi, Yuming Li, Shizun Wang, Zhiyuan Qin, Xiaozhu Ju, Sirui Han, Shanghang Zhang:
WristWorld: Generating Wrist-Views via 4D World Models for Robotic Manipulation. CoRR abs/2510.07313 (2025)
[i153]
  dblp key:
  - journals/corr/abs-2510-09667
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-09667
Huaihai Lyu, Chaofan Chen, Senwei Xie, Pengwei Wang, Xiansheng Chen, Shanghang Zhang, Changsheng Xu:
OmniSAT: Compact Action Token, Faster Auto Regression. CoRR abs/2510.09667 (2025)
[i152]
  dblp key:
  - journals/corr/abs-2510-10903
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-10903
Shuanghao Bai, Wenxuan Song, Jiayi Chen, Yuheng Ji, Zhide Zhong, Jin Yang, Han Zhao, Wanqi Zhou, Wei Zhao, Zhe Li, Pengxiang Ding, Cheng Chi, Haoang Li, Chang Xu, Xiaolong Zheng, Donglin Wang, Shanghang Zhang, Badong Chen:
Towards a Unified Understanding of Robot Manipulation: A Comprehensive Survey. CoRR abs/2510.10903 (2025)
[i151]
  dblp key:
  - journals/corr/abs-2510-14952
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-14952
Zhe Li, Cheng Chi, Yangyang Wei, Boan Zhu, Yibo Peng, Tao Huang, Pengwei Wang, Zhongyuan Wang, Shanghang Zhang, Chang Xu:
From Language to Locomotion: Retargeting-free Humanoid Control via Motion Latent Guidance. CoRR abs/2510.14952 (2025)
[i150]
  dblp key:
  - journals/corr/abs-2510-17801
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-17801
Yulin Luo, Chun-Kai Fan, Menghang Dong, Jiayu Shi, Mengdi Zhao, Bo-Wen Zhang, Cheng Chi, Jiaming Liu, Gaole Dai, Rongyu Zhang, Ruichuan An, Kun Wu, Zhengping Che, Shaoxuan Xie, Guocai Yao, Zhongxia Zhao, Pengwei Wang, Guang Liu, Zhongyuan Wang, Tiejun Huang, Shanghang Zhang:
Robobench: A Comprehensive Evaluation Benchmark for Multimodal Large Language Models as Embodied Brain. CoRR abs/2510.17801 (2025)
[i149]
  dblp key:
  - journals/corr/abs-2510-26536
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-26536
Huajie Tan, Cheng Chi, Xiansheng Chen, Yuheng Ji, Zhongxia Zhao, Xiaoshuai Hao, Yaoxu Lyu, Mingyu Cao, Junkai Zhao, Huaihai Lyu, Enshen Zhou, Ning Chen, Yankai Fu, Cheng Peng, Wei Guo, Dong Liang, Zhuo Chen, Mengsi Lyu, Chenrui He, Yulong Ao, Yonghua Lin, Pengwei Wang, Zhongyuan Wang, Shanghang Zhang:
RoboOS-NeXT: A Unified Memory-based Framework for Lifelong, Scalable, and Robust Multi-Robot Collaboration. CoRR abs/2510.26536 (2025)
[i148]
  dblp key:
  - journals/corr/abs-2511-00940
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2511-00940
Zhe Li, Xiang Bai, Jieyu Zhang, Zhuangzhe Wu, Che Xu, Ying Li, Chengkai Hou, Shanghang Zhang:
URDF-Anything: Constructing Articulated Objects with 3D Multimodal Language Model. CoRR abs/2511.00940 (2025)
[i147]
  dblp key:
  - journals/corr/abs-2511-02776
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2511-02776
Shichao Fan, Kun Wu, Zhengping Che, Xinhua Wang, Di Wu, Fei Liao, Ning Liu, Yixue Zhang, Zhen Zhao, Zhiyuan Xu, Meng Li, Qingjie Liu, Shanghang Zhang, Min Wan, Jian Tang:
XR-1: Towards Versatile Vision-Language-Action Models via Learning Unified Vision-Motion Representations. CoRR abs/2511.02776 (2025)
[i146]
  dblp key:
  - journals/corr/abs-2511-07278
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2511-07278
Yilong Chen, Xiang Bai, Zhibin Wang, Chengyu Bai, Yuhan Dai, Ming Lu, Shanghang Zhang:
StreamKV: Streaming Video Question-Answering with Segment-based KV Cache Retrieval and Compression. CoRR abs/2511.07278 (2025)
[i145]
  dblp key:
  - journals/corr/abs-2511-13207
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2511-13207
Cheng Peng, Zhenzhe Zhang, Cheng Chi, Xiaobao Wei, Yanhao Zhang, Heng Wang, Pengwei Wang, Zhongyuan Wang, Jing Liu, Shanghang Zhang:
PIGEON: VLM-Driven Object Navigation via Points of Interest Selection. CoRR abs/2511.13207 (2025)
[i144]
  dblp key:
  - journals/corr/abs-2511-17106
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2511-17106
Yuan Zhang, Ming Lu, Junwen Pan, Tao Huang, Kuan Cheng, Qi She, Shanghang Zhang:
ChainV: Atomic Visual Hints Make Multimodal Reasoning Shorter and Better. CoRR abs/2511.17106 (2025)
[i143]
  dblp key:
  - journals/corr/abs-2511-17366
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2511-17366
Yankai Fu, Ning Chen, Junkai Zhao, Shaozhe Shan, Guocai Yao, Pengwei Wang, Zhongyuan Wang, Shanghang Zhang:
METIS: Multi-Source Egocentric Training for Integrated Dexterous Vision-Language-Action Model. CoRR abs/2511.17366 (2025)
[i142]
  dblp key:
  - journals/corr/abs-2511-17441
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2511-17441
Shihan Wu, Xuecheng Liu, Shaoxuan Xie, Pengwei Wang, Xinghang Li, Bowen Yang, Zhe Li, Kai Zhu, Hongyu Wu, Yiheng Liu, Zhaoye Long, Yue Wang, Chong Liu, Dihan Wang, Ziqiang Ni, Xiang Yang, You Liu, Ruoxuan Feng, Runtian Xu, Lei Zhang, Denghang Huang, Chenghao Jin, Anlan Yin, Xinlong Wang, Zhenguo Sun, Junkai Zhao, Mengfei Du, Mingyu Cao, Xiansheng Chen, Hongyang Cheng, Xiaojie Zhang, Yankai Fu, Ning Chen, Cheng Chi, Sixiang Chen, Huaihai Lyu, Xiaoshuai Hao, Yequan Wang, Bo Lei, Dong Liu, Xi Yang, Yance Jiao, Tengfei Pan, Yunyan Zhang, Songjing Wang, Ziqian Zhang, Xu Liu, Ji Zhang, Caowei Meng, Zhizheng Zhang, Jiyang Gao, Song Wang, Xiaokun Leng, Zhiqiang Xie, Zhenzhen Zhou, Peng Huang, Wu Yang, Yandong Guo, Yichao Zhu, Suibing Zheng, Hao Cheng, Xinmin Ding, Yang Yue, Huanqian Wang, Chi Chen, Jingrui Pang, YuXi Qian, Haoran Geng, Lianli Gao, Haiyuan Li, Bin Fang, Gao Huang, Yaodong Yang, Hao Dong, He Wang, Hang Zhao, Yadong Mu, Di Hu, Hao Zhao, Tiejun Huang, Shanghang Zhang, Yonghua Lin, Zhongyuan Wang, Guocai Yao:
RoboCOIN: An Open-Sourced Bimanual Robotic Data COllection for INtegrated Manipulation. CoRR abs/2511.17441 (2025)
[i141]
  dblp key:
  - journals/corr/abs-2511-17961
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2511-17961
Hao Wang, Xiaobao Wei, Ying Li, Qingpo Wuwu, Dongli Wu, Jiajun Cao, Ming Lu, Wenzhao Zheng, Shanghang Zhang:
RoboArmGS: High-Quality Robotic Arm Splatting via Bézier Curve Refinement. CoRR abs/2511.17961 (2025)
[i140]
  dblp key:
  - journals/corr/abs-2511-22134
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2511-22134
Zhen Fang, Zhuoyang Liu, Jiaming Liu, Hao Chen, Yu Zeng, Shiting Huang, Zehui Chen, Lin Chen, Shanghang Zhang, Feng Zhao:
DualVLA: Building a Generalizable Embodied Agent via Partial Decoupling of Reasoning and Action. CoRR abs/2511.22134 (2025)
[i139]
  dblp key:
  - journals/corr/abs-2512-02013
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2512-02013
Chenyang Gu, Jiaming Liu, Hao Chen, Runzhong Huang, Qingpo Wuwu, Zhuoyang Liu, Xiaoqi Li, Ying Li, Renrui Zhang, Peng Jia, Pheng-Ann Heng, Shanghang Zhang:
ManualVLA: A Unified VLA Model for Chain-of-Thought Manual Generation and Robotic Manipulation. CoRR abs/2512.02013 (2025)
[i138]
  dblp key:
  - journals/corr/abs-2512-03044
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2512-03044
Yueru Jia, Jiaming Liu, Shengbang Liu, Rui Zhou, Wanhe Yu, Yuyang Yan, Xiaowei Chi, Yandong Guo, Boxin Shi, Shanghang Zhang:
Video2Act: A Dual-System Video Diffusion Policy with Robotic Spatio-Motional Modeling. CoRR abs/2512.03044 (2025)
[i137]
  dblp key:
  - journals/corr/abs-2512-13660
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2512-13660
Enshen Zhou, Cheng Chi, Yibo Li, Jingkun An, Jiayuan Zhang, Shanyu Rong, Yi Han, Yuheng Ji, Mengzhen Liu, Pengwei Wang, Zhongyuan Wang, Lu Sheng, Shanghang Zhang:
RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics. CoRR abs/2512.13660 (2025)
[i136]
  dblp key:
  - journals/corr/abs-2512-21714
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2512-21714
Junjun Hu, Jintao Chen, Haochen Bai, Minghua Luo, Shichao Xie, Ziyi Chen, Fei Liu, Zedong Chu, Xinda Xue, Botao Ren, Xiaolong Wu, Mu Xu, Shanghang Zhang:
AstraNav-World: World Model for Foresight Control and Consistency. CoRR abs/2512.21714 (2025)
[i135]
  dblp key:
  - journals/corr/abs-2512-22983
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2512-22983
Shuanghao Bai, Wenxuan Song, Jiayi Chen, Yuheng Ji, Zhide Zhong, Jin Yang, Han Zhao, Wanqi Zhou, Zhe Li, Pengxiang Ding, Cheng Chi, Chang Xu, Xiaolong Zheng, Donglin Wang, Haoang Li, Shanghang Zhang, Badong Chen:
Embodied Robot Manipulation in the Era of Foundation Models: Planning and Learning Perspectives. CoRR abs/2512.22983 (2025)
[i134]
  dblp key:
  - journals/corr/abs-2512-23649
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2512-23649
Zhe Li, Cheng Chi, Boan Zhu, Yangyang Wei, Shuanghao Bai, Yuheng Ji, Yibo Peng, Tao Huang, Pengwei Wang, Zhongyuan Wang, S.-H. Gary Chan, Chang Xu, Shanghang Zhang:
RoboMirror: Understand Before You Imitate for Video to Humanoid Locomotion. CoRR abs/2512.23649 (2025)
[i133]
  dblp key:
  - journals/corr/abs-2512-23650
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2512-23650
Zhe Li, Cheng Chi, Yangyang Wei, Boan Zhu, Tao Huang, Zhenguo Sun, Yibo Peng, Pengwei Wang, Zhongyuan Wang, Fangzhou Liu, Chang Xu, Shanghang Zhang:
Do You Have Freestyle? Expressive Humanoid Locomotion via Audio Control. CoRR abs/2512.23650 (2025)
[i132]
  dblp key:
  - journals/corr/abs-2512-23703
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2512-23703
Huajie Tan, Sixiang Chen, Yijie Xu, Zixiao Wang, Yuheng Ji, Cheng Chi, Yaoxu Lyu, Zhongxia Zhao, Xiansheng Chen, Peterson Co, Shaoxuan Xie, Guocai Yao, Pengwei Wang, Zhongyuan Wang, Shanghang Zhang:
Robo-Dopamine: General Process Reward Modeling for High-Precision Robotic Manipulation. CoRR abs/2512.23703 (2025)
[i131]
  dblp key:
  - journals/corr/abs-2512-24653
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2512-24653
Chengkai Hou, Kun Wu, Jiaming Liu, Zhengping Che, Di Wu, Fei Liao, Guangrun Li, Jingyang He, Qiuxuan Feng, Zhao Jin, Chenyang Gu, Zhuoyang Liu, Nuowei Han, Xiangju Mi, Yaoxu Lv, Yankai Fu, Gaole Dai, Langzhe Gu, Tao Li, Yuheng Zhang, Yixue Zhang, Xinhua Wang, Shichao Fan, Meng Li, Zhen Zhao, Ning Liu, Zhiyuan Xu, Pei Ren, Junjie Ji, Haonan Liu, Kuan Cheng, Shanghang Zhang, Jian Tang:
RoboMIND 2.0: A Multimodal, Bimanual Mobile Manipulation Dataset for Generalizable Embodied Intelligence. CoRR abs/2512.24653 (2025)
2024
[j16]
  dblp key:
  - journals/kbs/WangRHZZWWHL24
  persistent URL:
  - https://dblp.org/rec/journals/kbs/WangRHZZWWHL24
Zhenghong Wang, Sijie Ruan, Tianqiang Huang, Haoyi Zhou, Shanghang Zhang, Yi Wang, Leye Wang, Zhou Huang, Yu Liu:
A lightweight multi-layer perceptron for efficient multivariate time series forecasting. Knowl. Based Syst. 288: 111463 (2024)
[j15]
  dblp key:
  - journals/ral/MaDMWWCFZL24
  persistent URL:
  - https://dblp.org/rec/journals/ral/MaDMWWCFZL24
Ji Ma, Hongming Dai, Yao Mu, Pengying Wu, Hao Wang, Xiaowei Chi, Yang Fei, Shanghang Zhang, Chang Liu:
DOZE: A Dataset for Open-Vocabulary Zero-Shot Object Navigation in Dynamic Environments. IEEE Robotics Autom. Lett. 9(9): 7389-7396 (2024)
[j14]
  dblp key:
  - journals/tgrs/XieWZLZCCZ24
  persistent URL:
  - https://dblp.org/rec/journals/tgrs/XieWZLZCCZ24
Jianlin Xie, Guanqun Wang, Yin Zhuang, Can Li, Tong Zhang, He Chen, Liang Chen, Shanghang Zhang:
DECOR: Dynamic Decoupling and Multiobjective Optimization for Long-Tailed Remote Sensing Image Classification. IEEE Trans. Geosci. Remote. Sens. 62: 1-17 (2024)
[j13]
  dblp key:
  - journals/titb/QiWZRGSZSS24
  persistent URL:
  - https://dblp.org/rec/journals/titb/QiWZRGSZSS24
Xingqun Qi, Zhuojie Wu, Wenxuan Zou, Min Ren, Yifan Gao, Muyi Sun, Shanghang Zhang, Caifeng Shan, Zhenan Sun:
Exploring Generalizable Distillation for Efficient Medical Image Segmentation. IEEE J. Biomed. Health Informatics 28(7): 4170-4183 (2024)
[j12]
  dblp key:
  - journals/tiv/LiLLGDDZ24
  persistent URL:
  - https://dblp.org/rec/journals/tiv/LiLLGDDZ24
Jianing Li, Ming Lu, Jiaming Liu, Yandong Guo, Yuan Du, Li Du, Shanghang Zhang:
BEV-LGKD: A Unified LiDAR-Guided Knowledge Distillation Framework for Multi-View BEV 3D Object Detection. IEEE Trans. Intell. Veh. 9(1): 2489-2498 (2024)
[c96]
  dblp key:
  - conf/aaai/GaoCCCLZZ24
  persistent URL:
  - https://dblp.org/rec/conf/aaai/GaoCCCLZZ24
Hongzhi Gao, Zheng Chen, Zehui Chen, Lin Chen, Jiaming Liu, Shanghang Zhang, Feng Zhao:
Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection. AAAI 2024: 1797-1805
[c95]
  dblp key:
  - conf/aaai/YangWL0ZPGCZ24
  persistent URL:
  - https://dblp.org/rec/conf/aaai/YangWL0ZPGCZ24
Senqiao Yang, Jiarui Wu, Jiaming Liu, Xiaoqi Li, Qizhe Zhang, Mingjie Pan, Yulu Gan, Zehui Chen, Shanghang Zhang:
Exploring Sparse Visual Prompt for Domain Adaptive Dense Prediction. AAAI 2024: 16334-16342
[c94]
  dblp key:
  - conf/aaai/ZhangLZXXXZ24
  persistent URL:
  - https://dblp.org/rec/conf/aaai/ZhangLZXXXZ24
Dongmei Zhang, Chang Li, Renrui Zhang, Shenghao Xie, Wei Xue, Xiaodong Xie, Shanghang Zhang:
FM-OV3D: Foundation Model-Based Cross-Modal Knowledge Blending for Open-Vocabulary 3D Detection. AAAI 2024: 16723-16731
[c93]
  dblp key:
  - conf/aaai/ZhangLLYDGONKDZ24
  persistent URL:
  - https://dblp.org/rec/conf/aaai/ZhangLLYDGONKDZ24
Rongyu Zhang, Yulin Luo, Jiaming Liu, Huanrui Yang, Zhen Dong, Denis A. Gudovskiy, Tomoyuki Okuno, Yohei Nakata, Kurt Keutzer, Yuan Du, Shanghang Zhang:
Efficient Deweather Mixture-of-Experts with Uncertainty-Aware Feature-Wise Linear Modulation. AAAI 2024: 16812-16820
[c92]
  dblp key:
  - conf/cvpr/YaoLDGHKDZZ24
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/YaoLDGHKDZZ24
Junyi Yao, Yijiang Liu, Zhen Dong, Mingfei Guo, Helan Hu, Kurt Keutzer, Li Du, Daquan Zhou, Shanghang Zhang:
PromptCoT: Align Prompt Distribution via Adapted Chain-of-Thought. CVPR 2024: 7027-7037
[c91]
  dblp key:
  - conf/cvpr/QiPLYCLLXZLG24
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/QiPLYCLLXZLG24
Xingqun Qi, Jiahao Pan, Peng Li, Ruibin Yuan, Xiaowei Chi, Mengfei Li, Wenhan Luo, Wei Xue, Shanghang Zhang, Qifeng Liu, Yike Guo:
Weakly-Supervised Emotion Transition Learning for Diverse 3D Co-Speech Gesture Generation. CVPR 2024: 10424-10434
[c90]
  dblp key:
  - conf/cvpr/WangLLZMWZCZLZ24
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/WangLLZMWZCZLZ24
Guanqun Wang, Jiaming Liu, Chenxuan Li, Yuan Zhang, Junpeng Ma, Xinyu Wei, Kevin Zhang, Maurice Chong, Renrui Zhang, Yijiang Liu, Shanghang Zhang:
Cloud-Device Collaborative Learning for Multimodal Large Language Models. CVPR 2024: 12646-12655
[c89]
  dblp key:
  - conf/cvpr/ZhangHLJCZ24
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/ZhangHLJCZ24
Yuan Zhang, Tao Huang, Jiaming Liu, Tao Jiang, Kuan Cheng, Shanghang Zhang:
FreeKD: Knowledge Distillation via Semantic Frequency Prompt. CVPR 2024: 15931-15940
[c88]
  dblp key:
  - conf/cvpr/WeiZWLLGZ24
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/WeiZWLLGZ24
Xiaobao Wei, Renrui Zhang, Jiarui Wu, Jiaming Liu, Ming Lu, Yandong Guo, Shanghang Zhang:
NTO3D: Neural Target Object 3D Reconstruction with Segment Anything. CVPR 2024: 20352-20362
[c87]
  dblp key:
  - conf/cvpr/ZhangZGZSZZ24
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/ZhangZGZSZZ24
Zhi Zhang, Qizhe Zhang, Zijun Gao, Renrui Zhang, Ekaterina Shutova, Shiji Zhou, Shanghang Zhang:
Gradient-based Parameter Selection for Efficient Fine-Tuning. CVPR 2024: 28566-28577
[c86]
  dblp key:
  - conf/cvpr/LiuXYZZCGZ24
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/LiuXYZZCGZ24
Jiaming Liu, Ran Xu, Senqiao Yang, Renrui Zhang, Qizhe Zhang, Zehui Chen, Yandong Guo, Shanghang Zhang:
Continual-MAE: Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation. CVPR 2024: 28653-28663
[c85]
  dblp key:
  - conf/eccv/WeiCJLWZ24
  persistent URL:
  - https://dblp.org/rec/conf/eccv/WeiCJLWZ24
Xiaobao Wei, Jiajun Cao, Yizhu Jin, Ming Lu, Guangyu Wang, Shanghang Zhang:
I-MedSAM: Implicit Medical Image Segmentation with Segment Anything. ECCV (10) 2024: 90-107
[c84]
  dblp key:
  - conf/eccv/LuoAZTLZ24
  persistent URL:
  - https://dblp.org/rec/conf/eccv/LuoAZTLZ24
Yulin Luo, Ruichuan An, Bocheng Zou, Yiming Tang, Jiaming Liu, Shanghang Zhang:
LLM as Dataset Analyst: Subpopulation Structure Discovery with Large Language Model. ECCV (33) 2024: 235-252
[c83]
  dblp key:
  - conf/emnlp/ChenGZ0Z24
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/ChenGZ0Z24
Xinyan Chen, Jiaxin Ge, Tianjun Zhang, Jiaming Liu, Shanghang Zhang:
Learning from Mistakes: Iterative Prompt Relabeling for Text-to-Image Diffusion Model Training. EMNLP (Findings) 2024: 2937-2952
[c82]
  dblp key:
  - conf/emnlp/ZhaoZLWZ024
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/ZhaoZLWZ024
Shitian Zhao, Renrui Zhang, Xu Luo, Yan Wang, Shanghang Zhang, Peng Gao:
Unleashing the Potentials of Likelihood Composition for Multi-modal Language Models. EMNLP (Findings) 2024: 10152-10163
[c81]
  dblp key:
  - conf/iclr/ChanCSYXZF024
  persistent URL:
  - https://dblp.org/rec/conf/iclr/ChanCSYXZF024
Chi-Min Chan, Weize Chen, Yusheng Su, Jianxuan Yu, Wei Xue, Shanghang Zhang, Jie Fu, Zhiyuan Liu:
ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate. ICLR 2024
[c80]
  dblp key:
  - conf/iclr/LiuYJZLGXZ24
  persistent URL:
  - https://dblp.org/rec/conf/iclr/LiuYJZLGXZ24
Jiaming Liu, Senqiao Yang, Peidong Jia, Renrui Zhang, Ming Lu, Yandong Guo, Wei Xue, Shanghang Zhang:
ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation. ICLR 2024
[c79]
  dblp key:
  - conf/icmcs/MaGJLHLXZ24
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/MaGJLHLXZ24
Rui Ma, Mengxi Guo, Peidong Jia, Chenxuan Li, Yi Hou, Yuan Li, Xiaodong Xie, Shanghang Zhang:
Enhanced Blind Watermarking Against Black-Box Noise: Leveraging CIN Framework. ICME 2024: 1-6
[c78]
  dblp key:
  - conf/icmcs/Zhang0YLJXZ24
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/Zhang0YLJXZ24
Dongmei Zhang, Ray Zhang, Fan Yang, Yuan Li, Huizhu Jia, Xiaodong Xie, Shanghang Zhang:
VLUReID: Exploiting Vision-Language Knowledge for Unsupervised Person Re-Identification. ICME 2024: 1-6
[c77]
  dblp key:
  - conf/icmcs/ZhangZCZWCYYZX24
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/ZhangZCZWCYYZX24
Yijia Zhang, Lingran Zhao, Shijie Cao, Sicheng Zhang, Wenqiang Wang, Ting Cao, Fan Yang, Mao Yang, Shanghang Zhang, Ningyi Xu:
Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models. ICME 2024: 1-6
[c76]
  dblp key:
  - conf/icml/ChenYGGDWONKZ24
  persistent URL:
  - https://dblp.org/rec/conf/icml/ChenYGGDWONKZ24
Anthony Chen, Huanrui Yang, Yulu Gan, Denis A. Gudovskiy, Zhen Dong, Haofan Wang, Tomoyuki Okuno, Yohei Nakata, Kurt Keutzer, Shanghang Zhang:
Split-Ensemble: Efficient OOD-aware Ensemble via Task and Model Splitting. ICML 2024: 7568-7585
[c75]
  dblp key:
  - conf/icml/WuMWHMZ024
  persistent URL:
  - https://dblp.org/rec/conf/icml/WuMWHMZ024
Pengying Wu, Yao Mu, Bingxian Wu, Yi Hou, Ji Ma, Shanghang Zhang, Chang Liu:
VoroNav: Voronoi-based Zero-shot Object Navigation with Large Language Model. ICML 2024: 53757-53775
[c74]
  dblp key:
  - conf/icml/ZouZZ0024
  persistent URL:
  - https://dblp.org/rec/conf/icml/ZouZZ0024
Yixiong Zou, Shanghang Zhang, Haichen Zhou, Yuhua Li, Ruixuan Li:
Compositional Few-Shot Class-Incremental Learning. ICML 2024: 62964-62977
[c73]
  dblp key:
  - conf/icra/NiYXLLJCLZ24
  persistent URL:
  - https://dblp.org/rec/conf/icra/NiYXLLJCLZ24
Jiayi Ni, Senqiao Yang, Ran Xu, Jiaming Liu, Xiaoqi Li, Wenyu Jiao, Zehui Chen, Yi Liu, Shanghang Zhang:
Distribution-Aware Continual Test-Time Adaptation for Semantic Segmentation. ICRA 2024: 3044-3050
[c72]
  dblp key:
  - conf/icra/LiuZLLWL0Z24
  persistent URL:
  - https://dblp.org/rec/conf/icra/LiuZLLWL0Z24
Jiaming Liu, Qizhe Zhang, Xiaoqi Li, Jianing Li, Guanqun Wang, Ming Lu, Tiejun Huang, Shanghang Zhang:
Unsupervised Spike Depth Estimation via Cross-modality Cross-domain Knowledge Transfer. ICRA 2024: 9109-9116
[c71]
  dblp key:
  - conf/icra/LiuZLCCLGZ24
  persistent URL:
  - https://dblp.org/rec/conf/icra/LiuZLCCLGZ24
Jiaming Liu, Rongyu Zhang, Xiaoqi Li, Xiaowei Chi, Zehui Chen, Ming Lu, Yandong Guo, Shanghang Zhang:
BEVUDA: Multi-geometric Space Alignments for Domain Adaptive BEV 3D Object Detection. ICRA 2024: 9487-9494
[c70]
  dblp key:
  - conf/icra/PanLZHLXWLZ24
  persistent URL:
  - https://dblp.org/rec/conf/icra/PanLZHLXWLZ24
Mingjie Pan, Jiaming Liu, Renrui Zhang, Peixiang Huang, Xiaoqi Li, Hongwei Xie, Bing Wang, Li Liu, Shanghang Zhang:
RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision. ICRA 2024: 12404-12411
[c69]
  dblp key:
  - conf/mm/ZhangCYLGONKCDD24
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZhangCYLGONKCDD24
Rongyu Zhang, Zefan Cai, Huanrui Yang, Zidong Liu, Denis A. Gudovskiy, Tomoyuki Okuno, Yohei Nakata, Kurt Keutzer, Baobao Chang, Yuan Du, Li Du, Shanghang Zhang:
VeCAF: Vision-language Collaborative Active Finetuning with Training Objective Awareness. ACM Multimedia 2024: 5451-5459
[c68]
  dblp key:
  - conf/nips/0020X0FDLWCZG24
  persistent URL:
  - https://dblp.org/rec/conf/nips/0020X0FDLWCZG24
Yuan Zhang, Fei Xiao, Tao Huang, Chun-Kai Fan, Hongyuan Dong, Jiawen Li, Jiacong Wang, Kuan Cheng, Shanghang Zhang, Haoyuan Guo:
Unveiling the Tapestry of Consistency in Large Vision-Language Models. NeurIPS 2024
[c67]
  dblp key:
  - conf/nips/LiLLZLLQZXLTWLG24
  persistent URL:
  - https://dblp.org/rec/conf/nips/LiLLZLLQZXLTWLG24
Peng Li, Yuan Liu, Xiaoxiao Long, Feihu Zhang, Cheng Lin, Mengfei Li, Xingqun Qi, Shanghang Zhang, Wei Xue, Wenhan Luo, Ping Tan, Wenping Wang, Qifeng Liu, Yike Guo:
Era3D: High-Resolution Multiview Diffusion using Efficient Row-wise Attention. NeurIPS 2024
[c66]
  dblp key:
  - conf/nips/LiuLWALZYZGZ24
  persistent URL:
  - https://dblp.org/rec/conf/nips/LiuLWALZYZGZ24
Jiaming Liu, Mengzhen Liu, Zhenyu Wang, Pengju An, Xiaoqi Li, Kaichen Zhou, Senqiao Yang, Renrui Zhang, Yandong Guo, Shanghang Zhang:
RoboMamba: Efficient Vision-Language-Action Model for Robotic Reasoning and Manipulation. NeurIPS 2024
[c65]
  dblp key:
  - conf/wacv/LiNZGZY024
  persistent URL:
  - https://dblp.org/rec/conf/wacv/LiNZGZY024
Shiyao Li, Xuefei Ning, Shanghang Zhang, Lidong Guo, Tianchen Zhao, Huazhong Yang, Yu Wang:
TCP: Triplet Contrastive-relationship Preserving for Class-Incremental Learning. WACV 2024: 2020-2029
[i130]
  dblp key:
  - journals/corr/abs-2401-02695
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-02695
Pengying Wu, Yao Mu, Bingxian Wu, Yi Hou, Ji Ma, Shanghang Zhang, Chang Liu:
VoroNav: Voronoi-based Zero-shot Object Navigation with Large Language Model. CoRR abs/2401.02695 (2024)
[i129]
  dblp key:
  - journals/corr/abs-2401-03257
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-03257
Mengfei Li, Ming Lu, Xiaofang Li, Shanghang Zhang:
RustNeRF: Robust Neural Radiance Field with Low-Quality Images. CoRR abs/2401.03257 (2024)
[i128]
  dblp key:
  - journals/corr/abs-2401-07853
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-07853
Rongyu Zhang, Zefan Cai, Huanrui Yang, Zidong Liu, Denis A. Gudovskiy, Tomoyuki Okuno, Yohei Nakata, Kurt Keutzer, Baobao Chang, Yuan Du, Li Du, Shanghang Zhang:
VeCAF: VLM-empowered Collaborative Active Finetuning with Training Objective Awareness. CoRR abs/2401.07853 (2024)
[i127]
  dblp key:
  - journals/corr/abs-2401-17862
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-17862
Jianing Li, Xi Nan, Ming Lu, Li Du, Shanghang Zhang:
Proximity QA: Unleashing the Power of Multi-Modal Large Language Models for Spatial Proximity Analysis. CoRR abs/2401.17862 (2024)
[i126]
  dblp key:
  - journals/corr/abs-2402-16014
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-16014
Tianyu Chen, Haoyi Zhou, Ying Li, Hao Wang, Chonghan Gao, Shanghang Zhang, Jianxin Li:
Building Flexible Machine Learning Models for Scientific Computing at Scale. CoRR abs/2402.16014 (2024)
[i125]
  dblp key:
  - journals/corr/abs-2402-17319
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-17319
Zehui Chen, Qiuchen Wang, Zhenyu Li, Jiaming Liu, Shanghang Zhang, Feng Zhao:
A Vanilla Multi-Task Framework for Dense Visual Prediction Solution to 1st VCL Challenge - Multi-Task Robustness Track. CoRR abs/2402.17319 (2024)
[i124]
  dblp key:
  - journals/corr/abs-2402-19007
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-19007
Ji Ma, Hongming Dai, Yao Mu, Pengying Wu, Hao Wang, Xiaowei Chi, Yang Fei, Shanghang Zhang, Chang Liu:
DOZE: A Dataset for Open-Vocabulary Zero-Shot Object Navigation in Dynamic Environments. CoRR abs/2402.19007 (2024)
[i123]
  dblp key:
  - journals/corr/abs-2403-14487
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-14487
Yueru Jia, Yuhui Yuan, Aosong Cheng, Chuke Wang, Ji Li, Huizhu Jia, Shanghang Zhang:
DesignEdit: Multi-Layered Latent Decomposition and Fusion for Unified & Accurate Image Editing. CoRR abs/2403.14487 (2024)
[i122]
  dblp key:
  - journals/corr/abs-2403-15317
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-15317
Hongzhi Gao, Zheng Chen, Zehui Chen, Lin Chen, Jiaming Liu, Shanghang Zhang, Feng Zhao:
Point-DETR3D: Leveraging Imagery Data with Spatial Point Prior for Weakly Semi-supervised 3D Object Detection. CoRR abs/2403.15317 (2024)
[i121]
  dblp key:
  - journals/corr/abs-2403-20271
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-20271
Weifeng Lin, Xinyu Wei, Ruichuan An, Peng Gao, Bocheng Zou, Yulin Luo, Siyuan Huang, Shanghang Zhang, Hongsheng Li:
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want. CoRR abs/2403.20271 (2024)
[i120]
  dblp key:
  - journals/corr/abs-2404-06710
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-06710
Gaole Dai, Zhenyu Wang, Qinwen Xu, Ming Lu, Wen Chen, Boxin Shi, Shanghang Zhang, Tie-Jun Huang:
SpikeNVS: Enhancing Novel View Synthesis from Blurry Images via Spike Camera. CoRR abs/2404.06710 (2024)
[i119]
  dblp key:
  - journals/corr/abs-2404-07989
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-07989
Yiwen Tang, Jiaming Liu, Dong Wang, Zhigang Wang, Shanghang Zhang, Bin Zhao, Xuelong Li:
Any2Point: Empowering Any-modality Large Models for Efficient 3D Understanding. CoRR abs/2404.07989 (2024)
[i118]
  dblp key:
  - journals/corr/abs-2404-08985
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-08985
Yijiang Liu, Rongyu Zhang, Huanrui Yang, Kurt Keutzer, Yuan Du, Li Du, Shanghang Zhang:
Intuition-aware Mixture-of-Rank-1-Experts for Parameter Efficient Finetuning. CoRR abs/2404.08985 (2024)
[i117]
  dblp key:
  - journals/corr/abs-2405-02363
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-02363
Yulin Luo, Ruichuan An, Bocheng Zou, Yiming Tang, Jiaming Liu, Shanghang Zhang:
LLM as Dataset Analyst: Subpopulation Structure Discovery with Large Language Model. CoRR abs/2405.02363 (2024)
[i116]
  dblp key:
  - journals/corr/abs-2405-11616
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-11616
Peng Li, Yuan Liu, Xiaoxiao Long, Feihu Zhang, Cheng Lin, Mengfei Li, Xingqun Qi, Shanghang Zhang, Wenhan Luo, Ping Tan, Wenping Wang, Qifeng Liu, Yike Guo:
Era3D: High-Resolution Multiview Diffusion using Efficient Row-wise Attention. CoRR abs/2405.11616 (2024)
[i115]
  dblp key:
  - journals/corr/abs-2405-14156
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-14156
Yuan Zhang, Fei Xiao, Tao Huang, Chun-Kai Fan, Hongyuan Dong, Jiawen Li, Jiacong Wang, Kuan Cheng, Shanghang Zhang, Haoyuan Guo:
Unveiling the Tapestry of Consistency in Large Vision-Language Models. CoRR abs/2405.14156 (2024)
[i114]
  dblp key:
  - journals/corr/abs-2405-16486
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-16486
Rongyu Zhang, Aosong Cheng, Yulin Luo, Gaole Dai, Huanrui Yang, Jiaming Liu, Ran Xu, Li Du, Yuan Du, Yanbing Jiang, Shanghang Zhang:
Decomposing the Neurons: Activation Sparsity via Mixture of Experts for Continual Test Time Adaptation. CoRR abs/2405.16486 (2024)
[i113]
  dblp key:
  - journals/corr/abs-2405-17022
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-17022
Yixiong Zou, Shanghang Zhang, Haichen Zhou, Yuhua Li, Ruixuan Li:
Compositional Few-Shot Class-Incremental Learning. CoRR abs/2405.17022 (2024)
[i112]
  dblp key:
  - journals/corr/abs-2405-17418
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-17418
Jiaming Liu, Chenxuan Li, Guanqun Wang, Lily Lee, Kaichen Zhou, Sixiang Chen, Chuyan Xiong, Jiaxin Ge, Renrui Zhang, Shanghang Zhang:
Self-Corrected Multimodal Large Language Model for End-to-End Robot Manipulation. CoRR abs/2405.17418 (2024)
[i111]
  dblp key:
  - journals/corr/abs-2405-19012
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-19012
Gaole Dai, Cheng-Ching Tseng, Qingpo Wuwu, Rongyu Zhang, Shaokang Wang, Ming Lu, Tiejun Huang, Yu Zhou, Ali Ata Tuz, Matthias Gunzer, Jianxu Chen, Shanghang Zhang:
Implicit Neural Image Field for Biological Microscopy Image Compression. CoRR abs/2405.19012 (2024)
[i110]
  dblp key:
  - journals/corr/abs-2405-20323
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-20323
Nan Huang, Xiaobao Wei, Wenzhao Zheng, Pengju An, Ming Lu, Wei Zhan, Masayoshi Tomizuka, Kurt Keutzer, Shanghang Zhang:
S³Gaussian: Self-Supervised Street Gaussians for Autonomous Driving. CoRR abs/2405.20323 (2024)
[i109]
  dblp key:
  - journals/corr/abs-2406-04339
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-04339
Jiaming Liu, Mengzhen Liu, Zhenyu Wang, Lily Lee, Kaichen Zhou, Pengju An, Senqiao Yang, Renrui Zhang, Yandong Guo, Shanghang Zhang:
RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation. CoRR abs/2406.04339 (2024)
[i108]
- dblp key: journals/corr/abs-2406-15768
- persistent URL: https://dblp.org/rec/journals/corr/abs-2406-15768
Guanqun Wang, Xinyu Wei, Jiaming Liu, Ray Zhang, Yichi Zhang, Kevin Zhang, Maurice Chong, Shanghang Zhang:
MR-MLLM: Mutual Reinforcement of Multimodal Comprehension and Vision Perception. CoRR abs/2406.15768 (2024)
[i107]
- dblp key: journals/corr/abs-2407-03442
- persistent URL: https://dblp.org/rec/journals/corr/abs-2407-03442
Huanrui Yang, Yafeng Huang, Zhen Dong, Denis A. Gudovskiy, Tomoyuki Okuno, Yohei Nakata, Yuan Du, Kurt Keutzer, Shanghang Zhang:
Fisher-aware Quantization for DETR Detectors with Critical-category Objectives. CoRR abs/2407.03442 (2024)
[i106]
- dblp key: journals/corr/abs-2407-08739
- persistent URL: https://dblp.org/rec/journals/corr/abs-2407-08739
Renrui Zhang, Xinyu Wei, Dongzhi Jiang, Yichi Zhang, Ziyu Guo, Chengzhuo Tong, Jiaming Liu, Aojun Zhou, Bin Wei, Shanghang Zhang, Peng Gao, Hongsheng Li:
MAVIS: Mathematical Visual Instruction Tuning. CoRR abs/2407.08739 (2024)
[i105]
- dblp key: journals/corr/abs-2407-19778
- persistent URL: https://dblp.org/rec/journals/corr/abs-2407-19778
Shanghang Zhang, Gaole Dai, Tie-Jun Huang, Jianxu Chen:
Multimodal Large Language Models for Bioimage Analysis. CoRR abs/2407.19778 (2024)
[i104]
- dblp key: journals/corr/abs-2407-20962
- persistent URL: https://dblp.org/rec/journals/corr/abs-2407-20962
Xiaowei Chi, Yatian Wang, Aosong Cheng, Pengjun Fang, Zeyue Tian, Yingqing He, Zhaoyang Liu, Xingqun Qi, Jiahao Pan, Rongyu Zhang, Mengfei Li, Ruibin Yuan, Yanbing Jiang, Wei Xue, Wenhan Luo, Qifeng Chen, Shanghang Zhang, Qifeng Liu, Yike Guo:
MMTrail: A Multimodal Trailer Video Dataset with Language and Music Descriptions. CoRR abs/2407.20962 (2024)
[i103]
- dblp key: journals/corr/abs-2408-11855
- persistent URL: https://dblp.org/rec/journals/corr/abs-2408-11855
Zhongyu Zhao, Menghang Dong, Rongyu Zhang, Wenzhao Zheng, Yunpeng Zhang, Huanrui Yang, Dalong Du, Kurt Keutzer, Shanghang Zhang:
FactorLLM: Factorizing Knowledge via Mixture of Experts for Large Language Models. CoRR abs/2408.11855 (2024)
[i102]
- dblp key: journals/corr/abs-2409-06706
- persistent URL: https://dblp.org/rec/journals/corr/abs-2409-06706
Gaole Dai, Yiming Tang, Chunkai Fan, Qizhe Zhang, Zhi Zhang, Yulu Gan, Chengqing Zeng, Shanghang Zhang, Tiejun Huang:
Discovering Long-Term Effects on Parameter Efficient Fine-tuning. CoRR abs/2409.06706 (2024)
[i101]
- dblp key: journals/corr/abs-2409-16183
- persistent URL: https://dblp.org/rec/journals/corr/abs-2409-16183
Xiaohong Liu, Guoxing Yang, Yulin Luo, Jiaji Mao, Xiang Zhang, Ming Gao, Shanghang Zhang, Jun Shen, Guangyu Wang:
Expert-level vision-language foundation model for real-world radiology and comprehensive evaluation. CoRR abs/2409.16183 (2024)
[i100]
- dblp key: journals/corr/abs-2410-00363
- persistent URL: https://dblp.org/rec/journals/corr/abs-2410-00363
Shitian Zhao, Renrui Zhang, Xu Luo, Yan Wang, Shanghang Zhang, Peng Gao:
Unleashing the Potentials of Likelihood Composition for Multi-modal Language Models. CoRR abs/2410.00363 (2024)
[i99]
- dblp key: journals/corr/abs-2410-04417
- persistent URL: https://dblp.org/rec/journals/corr/abs-2410-04417
Yuan Zhang, Chun-Kai Fan, Junpeng Ma, Wenzhao Zheng, Tao Huang, Kuan Cheng, Denis A. Gudovskiy, Tomoyuki Okuno, Yohei Nakata, Kurt Keutzer, Shanghang Zhang:
SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference. CoRR abs/2410.04417 (2024)
[i98]
- dblp key: journals/corr/abs-2410-15461
- persistent URL: https://dblp.org/rec/journals/corr/abs-2410-15461
Xiaowei Chi, Hengyuan Zhang, Chun-Kai Fan, Xingqun Qi, Rongyu Zhang, Anthony Chen, Chi-Min Chan, Wei Xue, Wenhan Luo, Shanghang Zhang, Yike Guo:
EVA: An Embodied World Model for Future Video Anticipation. CoRR abs/2410.15461 (2024)
[i97]
- dblp key: journals/corr/abs-2410-22217
- persistent URL: https://dblp.org/rec/journals/corr/abs-2410-22217
Shenghao Xie, Wenqiang Zu, Mingyang Zhao, Duo Su, Shilong Liu, Ruohua Shi, Guoqi Li, Shanghang Zhang, Lei Ma:
Towards Unifying Understanding and Generation in the Era of Vision Foundation Models: A Survey from the Autoregression Perspective. CoRR abs/2410.22217 (2024)
[i96]
- dblp key: journals/corr/abs-2410-22228
- persistent URL: https://dblp.org/rec/journals/corr/abs-2410-22228
Bowen Liu, Haoyang Li, Shuning Wang, Shuo Nie, Shanghang Zhang:
Subgraph Aggregation for Out-of-Distribution Generalization on Graphs. CoRR abs/2410.22228 (2024)
[i95]
- dblp key: journals/corr/abs-2411-02395
- persistent URL: https://dblp.org/rec/journals/corr/abs-2411-02395
Anthony Chen, Jianjin Xu, Wenzhao Zheng, Gaole Dai, Yida Wang, Renrui Zhang, Haofan Wang, Shanghang Zhang:
Training-free Regional Prompting for Diffusion Transformers. CoRR abs/2411.02395 (2024)
[i94]
- dblp key: journals/corr/abs-2411-06665
- persistent URL: https://dblp.org/rec/journals/corr/abs-2411-06665
Xinyang Huang, Chuang Zhu, Bowen Zhang, Shanghang Zhang:
Learning from Different Samples: A Source-free Framework for Semi-supervised Domain Adaptation. CoRR abs/2411.06665 (2024)
[i93]
- dblp key: journals/corr/abs-2411-11706
- persistent URL: https://dblp.org/rec/journals/corr/abs-2411-11706
Ruichuan An, Sihan Yang, Ming Lu, Kai Zeng, Yulin Luo, Ying Chen, Jiajun Cao, Hao Liang, Qi She, Shanghang Zhang, Wentao Zhang:
MC-LLaVA: Multi-Concept Personalized Vision-Language Model. CoRR abs/2411.11706 (2024)
[i92]
- dblp key: journals/corr/abs-2411-15582
- persistent URL: https://dblp.org/rec/journals/corr/abs-2411-15582
Xiaobao Wei, Qingpo Wuwu, Zhongyu Zhao, Zhuangzhe Wu, Nan Huang, Ming Lu, Ningning Ma, Shanghang Zhang:
EMD: Explicit Motion Modeling for High-Quality Street Gaussian Splatting. CoRR abs/2411.15582 (2024)
[i91]
- dblp key: journals/corr/abs-2411-18615
- persistent URL: https://dblp.org/rec/journals/corr/abs-2411-18615
Zhi Zhang, Jiayi Shen, Congfeng Cao, Gaole Dai, Shiji Zhou, Qizhe Zhang, Shanghang Zhang, Ekaterina Shutova:
Proactive Gradient Conflict Mitigation in Multi-Task Learning: A Sparse Training Perspective. CoRR abs/2411.18615 (2024)
[i90]
- dblp key: journals/corr/abs-2411-18623
- persistent URL: https://dblp.org/rec/journals/corr/abs-2411-18623
Yueru Jia, Jiaming Liu, Sixiang Chen, Chenyang Gu, Zhilue Wang, Longzan Luo, Lily Lee, Pengwei Wang, Zhongyuan Wang, Renrui Zhang, Shanghang Zhang:
Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation. CoRR abs/2411.18623 (2024)
[i89]
- dblp key: journals/corr/abs-2412-01818
- persistent URL: https://dblp.org/rec/journals/corr/abs-2412-01818
Qizhe Zhang, Aosong Cheng, Ming Lu, Zhiyong Zhuo, Minqi Wang, Jiajun Cao, Shaobo Guo, Qi She, Shanghang Zhang:
[CLS] Attention is All You Need for Training-Free Visual Token Pruning: Make VLM Inference Faster. CoRR abs/2412.01818 (2024)
[i88]
- dblp key: journals/corr/abs-2412-05280
- persistent URL: https://dblp.org/rec/journals/corr/abs-2412-05280
Lening Wang, Wenzhao Zheng, Dalong Du, Yunpeng Zhang, Yilong Ren, Han Jiang, Zhiyong Cui, Haiyang Yu, Jie Zhou, Jiwen Lu, Shanghang Zhang:
Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model. CoRR abs/2412.05280 (2024)
[i87]
- dblp key: journals/corr/abs-2412-06163
- persistent URL: https://dblp.org/rec/journals/corr/abs-2412-06163
Yuming Li, Peidong Jia, Daiwei Hong, Yueru Jia, Qi She, Rui Zhao, Ming Lu, Shanghang Zhang:
ASGDiffusion: Parallel High-Resolution Generation with Asynchronous Structure Guidance. CoRR abs/2412.06163 (2024)
[i86]
- dblp key: journals/corr/abs-2412-08643
- persistent URL: https://dblp.org/rec/journals/corr/abs-2412-08643
Zixun Xie, Sicheng Zuo, Wenzhao Zheng, Yunpeng Zhang, Dalong Du, Jie Zhou, Jiwen Lu, Shanghang Zhang:
GPD-1: Generative Pre-training for Driving. CoRR abs/2412.08643 (2024)
[i85]
- dblp key: journals/corr/abs-2412-10371
- persistent URL: https://dblp.org/rec/journals/corr/abs-2412-10371
Wenzhao Zheng, Junjie Wu, Yao Zheng, Sicheng Zuo, Zixun Xie, Longchao Yang, Yong Pan, Zhihui Hao, Peng Jia, Xianpeng Lang, Shanghang Zhang:
GaussianAD: Gaussian-Centric End-to-End Autonomous Driving. CoRR abs/2412.10371 (2024)
[i84]
- dblp key: journals/corr/abs-2412-13877
- persistent URL: https://dblp.org/rec/journals/corr/abs-2412-13877
Kun Wu, Chengkai Hou, Jiaming Liu, Zhengping Che, Xiaozhu Ju, Zhuqin Yang, Meng Li, Yinuo Zhao, Zhiyuan Xu, Guang Yang, Zhen Zhao, Guangyu Li, Zhao Jin, Lecheng Wang, Jilei Mao, Xinhua Wang, Shichao Fan, Ning Liu, Pei Ren, Qiang Zhang, Yaoxu Lyu, Mengzhen Liu, Jingyang He, Yulin Luo, Zeyu Gao, Chenxuan Li, Chenyang Gu, Yankai Fu, Di Wu, Xingyu Wang, Sixiang Chen, Zhenyu Wang, Pengju An, Siyuan Qian, Shanghang Zhang, Jian Tang:
RoboMIND: Benchmark on Multi-embodiment Intelligence Normative Data for Robot Manipulation. CoRR abs/2412.13877 (2024)
[i83]
- dblp key: journals/corr/abs-2412-17637
- persistent URL: https://dblp.org/rec/journals/corr/abs-2412-17637
Kuangzhi Ge, Lingjun Chen, Kevin Zhang, Yulin Luo, Tianyu Shi, Liaoyuan Fan, Xiang Li, Guanqun Wang, Shanghang Zhang:
SCBench: A Sports Commentary Benchmark for Video LLMs. CoRR abs/2412.17637 (2024)
[i82]
- dblp key: journals/dagstuhl-reports/0001JRZ24
- persistent URL: https://dblp.org/rec/journals/dagstuhl-reports/0001JRZ24
Jianxu Chen, Florian Jug, Susanne M. Rafelski, Shanghang Zhang:
The Emerging Issues in Bioimaging AI Publications and Research (Dagstuhl Seminar 24042). Dagstuhl Reports 14(1): 90-107 (2024)
2023
[j11]
- dblp key: journals/ai/ZhouLZZYX23
- persistent URL: https://dblp.org/rec/journals/ai/ZhouLZZYX23
Haoyi Zhou, Jianxin Li, Shanghang Zhang, Shuai Zhang, Mengyi Yan, Hui Xiong:
Expanding the prediction capacity in long sequence time-series forecasting. Artif. Intell. 318: 103886 (2023)
[j10]
- dblp key: journals/remotesensing/WangCCZZZDG23
- persistent URL: https://dblp.org/rec/journals/remotesensing/WangCCZZZDG23
Guanqun Wang, He Chen, Liang Chen, Yin Zhuang, Shanghang Zhang, Tong Zhang, Hao Dong, Peng Gao:
P2FEViT: Plug-and-Play CNN Feature Embedded Hybrid Vision Transformer for Remote Sensing Image Classification. Remote. Sens. 15(7): 1773 (2023)
[j9]
- dblp key: journals/tamd/DehbanZCJS23
- persistent URL: https://dblp.org/rec/journals/tamd/DehbanZCJS23
Atabak Dehban, Shanghang Zhang, Nino Cauli, Lorenzo Jamone, José Santos-Victor:
Learning Deep Features for Robotic Inference From Physical Interactions. IEEE Trans. Cogn. Dev. Syst. 15(3): 985-999 (2023)
[j8]
- dblp key: journals/tcsv/HouZMJX23
- persistent URL: https://dblp.org/rec/journals/tcsv/HouZMJX23
Yi Hou, Shanghang Zhang, Rui Ma, Huizhu Jia, Xiaodong Xie:
Frame-Recurrent Video Crowd Counting. IEEE Trans. Circuits Syst. Video Technol. 33(9): 5186-5199 (2023)
[j7]
- dblp key: journals/tmm/ZhouWHMYZWZ23
- persistent URL: https://dblp.org/rec/journals/tmm/ZhouWHMYZWZ23
Shiji Zhou, Zhi Wang, Chenghao Hu, Yinan Mao, Haopeng Yan, Shanghang Zhang, Chuan Wu, Wenwu Zhu:
Caching in Dynamic Environments: A Near-Optimal Online Learning Approach. IEEE Trans. Multim. 25: 792-804 (2023)
[c64]
- dblp key: conf/cvpr/LuXWXTKZ23
- persistent URL: https://dblp.org/rec/conf/cvpr/LuXWXTKZ23
Yuheng Lu, Chenfeng Xu, Xiaobao Wei, Xiaodong Xie, Masayoshi Tomizuka, Kurt Keutzer, Shanghang Zhang:
Open-Vocabulary Point-Cloud Object Detection without 3D Annotation. CVPR 2023: 1190-1199
[c63]
- dblp key: conf/cvpr/ChenZZWLGZ23
- persistent URL: https://dblp.org/rec/conf/cvpr/ChenZZWLGZ23
Anthony Chen, Kevin Zhang, Renrui Zhang, Zihan Wang, Yuheng Lu, Yandong Guo, Shanghang Zhang:
PiMAE: Point Cloud and Image Interactive Masked Autoencoders for 3D Object Detection. CVPR 2023: 5291-5301
[c62]
- dblp key: conf/cvpr/WangZZCC023
- persistent URL: https://dblp.org/rec/conf/cvpr/WangZZCC023
Lianzhe Wang, Shiji Zhou, Shanghang Zhang, Xu Chu, Heng Chang, Wenwu Zhu:
Improving Generalization of Meta-Learning with Inverted Regularization at Inner-Level. CVPR 2023: 7826-7835
[c61]
- dblp key: conf/cvpr/MaLZGZG023
- persistent URL: https://dblp.org/rec/conf/cvpr/MaLZGZG023
Yuqing Ma, Hainan Li, Zhange Zhang, Jinyang Guo, Shanghang Zhang, Ruihao Gong, Xianglong Liu:
Annealing-based Label-Transfer Learning for Open World Object Detection. CVPR 2023: 11454-11463
[c60]
- dblp key: conf/cvpr/GanPZLZLZ23
- persistent URL: https://dblp.org/rec/conf/cvpr/GanPZLZLZ23
Yulu Gan, Mingjie Pan, Rongyu Zhang, Zijian Ling, Lingran Zhao, Jiaming Liu, Shanghang Zhang:
Cloud-Device Collaborative Adaptation to Continual Changing Environments in the Real-World. CVPR 2023: 12157-12166
[c59]
- dblp key: conf/cvpr/ChiLLZWGZ23
- persistent URL: https://dblp.org/rec/conf/cvpr/ChiLLZWGZ23
Xiaowei Chi, Jiaming Liu, Ming Lu, Rongyu Zhang, Zhaoqing Wang, Yandong Guo, Shanghang Zhang:
BEV-SAN: Accurate BEV 3D Object Detection via Slice Attention Networks. CVPR 2023: 17461-17470
[c58]
- dblp key: conf/cvpr/Gu00C0FZ0023
- persistent URL: https://dblp.org/rec/conf/cvpr/Gu00C0FZ0023
Jianyang Gu, Kai Wang, Hao Luo, Chen Chen, Wei Jiang, Yuqiang Fang, Shanghang Zhang, Yang You, Jian Zhao:
MSINet: Twins Contrastive Search of Multi-Scale Interaction for Object ReID. CVPR 2023: 19243-19253
[c57]
- dblp key: conf/cvpr/LiuYDKDZ23
- persistent URL: https://dblp.org/rec/conf/cvpr/LiuYDKDZ23
Yijiang Liu, Huanrui Yang, Zhen Dong, Kurt Keutzer, Li Du, Shanghang Zhang:
NoisyQuant: Noisy Bias-Enhanced Post-Training Activation Quantization for Vision Transformers. CVPR 2023: 20321-20330
[c56]
- dblp key: conf/dac/XiaoYDKDZ23
- persistent URL: https://dblp.org/rec/conf/dac/XiaoYDKDZ23
Lirui Xiao, Huanrui Yang, Zhen Dong, Kurt Keutzer, Li Du, Shanghang Zhang:
CSQ: Growing Mixed-Precision Quantization Scheme with Bi-level Continuous Sparsification. DAC 2023: 1-6
[c55]
- dblp key: conf/icassp/HeCZZL23
- persistent URL: https://dblp.org/rec/conf/icassp/HeCZZL23
Mingrui He, Tianyu Chen, Haoyi Zhou, Shanghang Zhang, Jianxin Li:
BadRes: Reveal the Backdoors Through Residual Connection. ICASSP 2023: 1-5
[c54]
- dblp key: conf/iccv/ZhuZHGZQZG23
- persistent URL: https://dblp.org/rec/conf/iccv/ZhuZHGZQZG23
Xiangyang Zhu, Renrui Zhang, Bowei He, Ziyu Guo, Ziyao Zeng, Zipeng Qin, Shanghang Zhang, Peng Gao:
PointCLIP V2: Prompting CLIP and GPT for Powerful 3D Open-world Learning. ICCV 2023: 2639-2650
[c53]
- dblp key: conf/iccv/ZhangDYLTDKDZ23
- persistent URL: https://dblp.org/rec/conf/iccv/ZhangDYLTDKDZ23
Yifan Zhang, Zhen Dong, Huanrui Yang, Ming Lu, Cheng-Ching Tseng, Yuan Du, Kurt Keutzer, Li Du, Shanghang Zhang:
QD-BEV : Quantization-aware View-guided Distillation for Multi-view 3D Object Detection. ICCV 2023: 3802-3812
[c52]
- dblp key: conf/iccv/LiLLYDKZK23
- persistent URL: https://dblp.org/rec/conf/iccv/LiLLYDKZK23
Xiuyu Li, Yijiang Liu, Long Lian, Huanrui Yang, Zhen Dong, Daniel Kang, Shanghang Zhang, Kurt Keutzer:
Q-Diffusion: Quantizing Diffusion Models. ICCV 2023: 17489-17499
[c51]
- dblp key: conf/icml/ChuJWZW0023
- persistent URL: https://dblp.org/rec/conf/icml/ChuJWZW0023
Xu Chu, Yujie Jin, Xin Wang, Shanghang Zhang, Yasha Wang, Wenwu Zhu, Hong Mei:
Wasserstein Barycenter Matching for Graph Size Generalization of Message Passing Neural Networks. ICML 2023: 6158-6184
[c50]
- dblp key: conf/igarss/LiCZZ23
- persistent URL: https://dblp.org/rec/conf/igarss/LiCZZ23
Can Li, He Chen, Yin Zhuang, Shanghang Zhang:
Uncertainty-Aware Dynamic Learning for Cross-Domain Few-Shot Scene Classification from Remote Sensing Imagery. IGARSS 2023: 5778-5781
[c49]
- dblp key: conf/itsc/LiuZQT23
- persistent URL: https://dblp.org/rec/conf/itsc/LiuZQT23
Tianqi Liu, Shanghang Zhang, Yanjun Qin, Xiaoming Tao:
A Text Prompt-Based Approach for Zero-Shot Corner Case Object Detection in Autonomous Driving. ITSC 2023: 3241-3246
[c48]
- dblp key: conf/itsc/ZhangQZT23
- persistent URL: https://dblp.org/rec/conf/itsc/ZhangQZT23
Wenqi Zhang, Yanjun Qin, Shanghang Zhang, Xiaoming Tao:
Electroencephalogram-Based Driver Emotional State Detection with Manifold Learning. ITSC 2023: 3329-3334
[c47]
- dblp key: conf/miccai/PanGZLZWZL23
- persistent URL: https://dblp.org/rec/conf/miccai/PanGZLZWZL23
Mingjie Pan, Yulu Gan, Fangxu Zhou, Jiaming Liu, Ying Zhang, Aimin Wang, Shanghang Zhang, Dawei Li:
DiffuseIR: Diffusion Models for Isotropic Reconstruction of 3D Microscopic Images. MICCAI (10) 2023: 323-332
[c46]
- dblp key: conf/nips/ZhouLJWZZ023
- persistent URL: https://dblp.org/rec/conf/nips/ZhouLJWZZ023
Qiang Zhou, Weize Li, Lihan Jiang, Guoliang Wang, Guyue Zhou, Shanghang Zhang, Hao Zhao:
PAD: A Dataset and Benchmark for Pose-agnostic Anomaly Detection. NeurIPS 2023
[c45]
- dblp key: conf/nossdav/ZhangDLS00LGZ23
- persistent URL: https://dblp.org/rec/conf/nossdav/ZhangDLS00LGZ23
Rongyu Zhang, Lixuan Du, Jiaming Liu, Congcong Song, Fangxin Wang, Xiaoqi Li, Ming Lu, Yandong Guo, Shanghang Zhang:
RepCaM: Re-parameterization Content-aware Modulation for Neural Video Delivery. NOSSDAV 2023: 1-7
[c44]
- dblp key: conf/wmcsa/XuZZZSL0X23
- persistent URL: https://dblp.org/rec/conf/wmcsa/XuZZZSL0X23
Kenuo Xu, Kexing Zhou, Chengxuan Zhu, Shanghang Zhang, Boxin Shi, Xiaoqiang Li, Tiejun Huang, Chenren Xu:
When Visible Light (Backscatter) Communication Meets Neuromorphic Cameras in V2X. HotMobile 2023: 42-48
[i81]
- dblp key: journals/corr/abs-2302-04304
- persistent URL: https://dblp.org/rec/journals/corr/abs-2302-04304
Xiuyu Li, Long Lian, Yijiang Liu, Huanrui Yang, Zhen Dong, Daniel Kang, Shanghang Zhang, Kurt Keutzer:
Q-Diffusion: Quantizing Diffusion Models. CoRR abs/2302.04304 (2023)
[i80]
- dblp key: journals/corr/abs-2303-07065
- persistent URL: https://dblp.org/rec/journals/corr/abs-2303-07065
Jianyang Gu, Kai Wang, Hao Luo, Chen Chen, Wei Jiang, Yuqiang Fang, Shanghang Zhang, Yang You, Jian Zhao:
MSINet: Twins Contrastive Search of Multi-Scale Interaction for Object ReID. CoRR abs/2303.07065 (2023)
[i79]
- dblp key: journals/corr/abs-2303-08129
- persistent URL: https://dblp.org/rec/journals/corr/abs-2303-08129
Anthony Chen, Kevin Zhang, Renrui Zhang, Zihan Wang, Yuheng Lu, Yandong Guo, Shanghang Zhang:
PiMAE: Point Cloud and Image Interactive Masked Autoencoders for 3D Object Detection. CoRR abs/2303.08129 (2023)
[i78]
- dblp key: journals/corr/abs-2303-09792
- persistent URL: https://dblp.org/rec/journals/corr/abs-2303-09792
Senqiao Yang, Jiarui Wu, Jiaming Liu, Xiaoqi Li, Qizhe Zhang, Mingjie Pan, Yulu Gan, Shanghang Zhang:
Exploring Sparse Visual Prompt for Cross-domain Semantic Segmentation. CoRR abs/2303.09792 (2023)
[i77]
- dblp key: journals/corr/abs-2303-13739
- persistent URL: https://dblp.org/rec/journals/corr/abs-2303-13739
Yulin Luo, Rui Zhao, Xiaobao Wei, Jinwei Chen, Yijie Lu, Shenghao Xie, Tianyu Wang, Ruiqin Xiong, Ming Lu, Shanghang Zhang:
MoWE: Mixture of Weather Experts for Multiple Adverse Weather Removal. CoRR abs/2303.13739 (2023)
[i76]
- dblp key: journals/corr/abs-2304-07919
- persistent URL: https://dblp.org/rec/journals/corr/abs-2304-07919
Jiaxin Ge, Hongyin Luo, Siyuan Qian, Yulu Gan, Jie Fu, Shanghang Zhang:
Chain of Thought Prompt Tuning in Vision Language Models. CoRR abs/2304.07919 (2023)
[i75]
- dblp key: journals/corr/abs-2305-12356
- persistent URL: https://dblp.org/rec/journals/corr/abs-2305-12356
Yijia Zhang, Lingran Zhao, Shijie Cao, Wenqiang Wang, Ting Cao, Fan Yang, Mao Yang, Shanghang Zhang, Ningyi Xu:
Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models. CoRR abs/2305.12356 (2023)
[i74]
- dblp key: journals/corr/abs-2306-04344
- persistent URL: https://dblp.org/rec/journals/corr/abs-2306-04344
Jiaming Liu, Senqiao Yang, Peidong Jia, Ming Lu, Yandong Guo, Wei Xue, Shanghang Zhang:
ViDA: Homeostatic Visual Domain Adapter for Continual Test Time Adaptation. CoRR abs/2306.04344 (2023)
[i73]
- dblp key: journals/corr/abs-2306-09117
- persistent URL: https://dblp.org/rec/journals/corr/abs-2306-09117
Mingjie Pan, Li Liu, Jiaming Liu, Peixiang Huang, Longlong Wang, Shanghang Zhang, Shaoqing Xu, Zhiyi Lai, Kuiyuan Yang:
UniOcc: Unifying Vision-Centric 3D Occupancy Prediction with Geometric and Semantic Rendering. CoRR abs/2306.09117 (2023)
[i72]
- dblp key: journals/corr/abs-2306-12109
- persistent URL: https://dblp.org/rec/journals/corr/abs-2306-12109
Mingjie Pan, Yulu Gan, Fangxu Zhou, Jiaming Liu, Aimin Wang, Shanghang Zhang, Dawei Li:
DiffuseIR: Diffusion Models For Isotropic Reconstruction of 3D Microscopic Images. CoRR abs/2306.12109 (2023)
[i71]
- dblp key: journals/corr/abs-2307-00313
- persistent URL: https://dblp.org/rec/journals/corr/abs-2307-00313
Peidong Jia, Jiaming Liu, Senqiao Yang, Jiarui Wu, Xiaodong Xie, Shanghang Zhang:
PM-DETR: Domain Adaptive Prompt Memory for Object Detection with Transformers. CoRR abs/2307.00313 (2023)
[i70]
- dblp key: journals/corr/abs-2308-07201
- persistent URL: https://dblp.org/rec/journals/corr/abs-2308-07201
Chi-Min Chan, Weize Chen, Yusheng Su, Jianxuan Yu, Wei Xue, Shanghang Zhang, Jie Fu, Zhiyuan Liu:
ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate. CoRR abs/2308.07201 (2023)
[i69]
- dblp key: journals/corr/abs-2308-10515
- persistent URL: https://dblp.org/rec/journals/corr/abs-2308-10515
Yifan Zhang, Zhen Dong, Huanrui Yang, Ming Lu, Cheng-Ching Tseng, Yuan Du, Kurt Keutzer, Li Du, Shanghang Zhang:
QD-BEV : Quantization-aware View-guided Distillation for Multi-view 3D Object Detection. CoRR abs/2308.10515 (2023)
[i68]
- dblp key: journals/corr/abs-2309-09502
- persistent URL: https://dblp.org/rec/journals/corr/abs-2309-09502
Mingjie Pan, Jiaming Liu, Renrui Zhang, Peixiang Huang, Xiaoqi Li, Li Liu, Shanghang Zhang:
RenderOcc: Vision-Centric 3D Occupancy Prediction with 2D Rendering Supervision. CoRR abs/2309.09502 (2023)
[i67]
- dblp key: journals/corr/abs-2309-12790
- persistent URL: https://dblp.org/rec/journals/corr/abs-2309-12790
Xiaobao Wei, Renrui Zhang, Jiarui Wu, Jiaming Liu, Ming Lu, Yandong Guo, Shanghang Zhang:
NOC: High-Quality Neural Object Cloning with 3D Lifting of Segment Anything. CoRR abs/2309.12790 (2023)
[i66]
- dblp key: journals/corr/abs-2309-13604
- persistent URL: https://dblp.org/rec/journals/corr/abs-2309-13604
Jiayi Ni, Senqiao Yang, Jiaming Liu, Xiaoqi Li, Wenyu Jiao, Ran Xu, Zehui Chen, Yi Liu, Shanghang Zhang:
Distribution-Aware Continual Test Time Adaptation for Semantic Segmentation. CoRR abs/2309.13604 (2023)
[i65]
- dblp key: journals/corr/abs-2310-07716
- persistent URL: https://dblp.org/rec/journals/corr/abs-2310-07716
Qiang Zhou, Weize Li, Lihan Jiang, Guoliang Wang, Guyue Zhou, Shanghang Zhang, Hao Zhao:
PAD: A Dataset and Benchmark for Pose-agnostic Anomaly Detection. CoRR abs/2310.07716 (2023)
[i64]
- dblp key: journals/corr/abs-2310-10909
- persistent URL: https://dblp.org/rec/journals/corr/abs-2310-10909
Zihan Qiu, Zhen Liu, Shuicheng Yan, Shanghang Zhang, Jie Fu:
Heterogenous Memory Augmented Neural Networks. CoRR abs/2310.10909 (2023)
[i63]
- dblp key: journals/corr/abs-2311-12079
- persistent URL: https://dblp.org/rec/journals/corr/abs-2311-12079
Yuan Zhang, Tao Huang, Jiaming Liu, Tao Jiang, Kuan Cheng, Shanghang Zhang:
FreeKD: Knowledge Distillation via Semantic Frequency Prompt. CoRR abs/2311.12079 (2023)
[i62]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-16974
Peidong Jia, Chenxuan Li, Zeyu Liu, Yichao Shen, Xingru Chen, Yuhui Yuan, Yinglin Zheng, Dong Chen, Ji Li, Xiaodong Xie, Shanghang Zhang, Baining Guo:
COLE: A Hierarchical Generation Framework for Graphic Design. CoRR abs/2311.16974 (2023)
[i61]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-17081
Xiaobao Wei, Jiajun Cao, Yizhu Jin, Ming Lu, Guangyu Wang, Shanghang Zhang:
I-MedSAM: Implicit Medical Image Segmentation with Segment Anything. CoRR abs/2311.17081 (2023)
[i60]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-17532
Xingqun Qi, Jiahao Pan, Peng Li, Ruibin Yuan, Xiaowei Chi, Mengfei Li, Wenhan Luo, Wei Xue, Shanghang Zhang, Qifeng Liu, Yike Guo:
Weakly-Supervised Emotion Transition Learning for Diverse 3D Co-speech Gesture Generation. CoRR abs/2311.17532 (2023)
[i59]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-17963
Xiaowei Chi, Yijiang Liu, Zhengkai Jiang, Rongyu Zhang, Ziyi Lin, Renrui Zhang, Peng Gao, Chaoyou Fu, Shanghang Zhang, Qifeng Liu, Yike Guo:
ChatIllusion: Efficient-Aligning Interleaved Generation ability with Visual Instruction Model. CoRR abs/2311.17963 (2023)
[i58]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-01361
Jianchen Zhao, Cheng-Ching Tseng, Ming Lu, Ruichuan An, Xiaobao Wei, He Sun, Shanghang Zhang:
MoEC: Mixture of Experts Implicit Neural Compression. CoRR abs/2312.01361 (2023)
[i57]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-02923
Qizhe Zhang, Bocheng Zou, Ruichuan An, Jiaming Liu, Shanghang Zhang:
Split & Merge: Unlocking the Potential of Visual Adapters via Sparse Training. CoRR abs/2312.02923 (2023)
[i56]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-09148
Anthony Chen, Huanrui Yang, Yulu Gan, Denis A. Gudovskiy, Zhen Dong, Haofan Wang, Tomoyuki Okuno, Yohei Nakata, Shanghang Zhang, Kurt Keutzer:
Split-Ensemble: Efficient OOD-aware Ensemble via Task and Model Splitting. CoRR abs/2312.09148 (2023)
[i55]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-10136
Zhi Zhang, Qizhe Zhang, Zijun Gao, Renrui Zhang, Ekaterina Shutova, Shiji Zhou, Shanghang Zhang:
Gradient-based Parameter Selection for Efficient Fine-Tuning. CoRR abs/2312.10136 (2023)
[i54]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-11535
Nan Huang, Ting Zhang, Yuhui Yuan, Dong Chen, Shanghang Zhang:
Customize-It-3D: High-Quality 3D Creation from A Single Image Using Subject-Specific Knowledge Prior. CoRR abs/2312.11535 (2023)
[i53]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-12480
Jiaming Liu, Ran Xu, Senqiao Yang, Renrui Zhang, Qizhe Zhang, Zehui Chen, Yandong Guo, Shanghang Zhang:
Adaptive Distribution Masked Autoencoders for Continual Test-Time Adaptation. CoRR abs/2312.12480 (2023)
[i52]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-14074
Senqiao Yang, Jiaming Liu, Ray Zhang, Mingjie Pan, Zoey Guo, Xiaoqi Li, Zehui Chen, Peng Gao, Yandong Guo, Shanghang Zhang:
LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding. CoRR abs/2312.14074 (2023)
[i51]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-14465
Dongmei Zhang, Chang Li, Ray Zhang, Shenghao Xie, Wei Xue, Xiaodong Xie, Shanghang Zhang:
FM-OV3D: Foundation Model-based Cross-modal Knowledge Blending for Open-Vocabulary 3D Detection. CoRR abs/2312.14465 (2023)
[i50]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-16204
Jiaxin Ge, Xinyan Chen, Tianjun Zhang, Shanghang Zhang:
Iterative Prompt Relabeling for diffusion model with RLDF. CoRR abs/2312.16204 (2023)
[i49]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-16279
Guanqun Wang, Jiaming Liu, Chenxuan Li, Junpeng Ma, Yuan Zhang, Xinyu Wei, Kevin Zhang, Maurice Chong, Ray Zhang, Yijiang Liu, Shanghang Zhang:
Cloud-Device Collaborative Learning for Multimodal Large Language Models. CoRR abs/2312.16279 (2023)
[i48]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-16610
Rongyu Zhang, Yulin Luo, Jiaming Liu, Huanrui Yang, Zhen Dong, Denis A. Gudovskiy, Tomoyuki Okuno, Yohei Nakata, Kurt Keutzer, Yuan Du, Shanghang Zhang:
Efficient Deweather Mixture-of-Experts with Uncertainty-aware Feature-wise Linear Modulation. CoRR abs/2312.16610 (2023)
2022
[j6]
  persistent URL:
  - https://dblp.org/rec/journals/tmm/ZhouWZWZ22
Shiji Zhou, Lianzhe Wang, Shanghang Zhang, Zhi Wang, Wenwu Zhu:
Active Gradual Domain Adaptation: Dataset and Approach. IEEE Trans. Multim. 24: 1210-1220 (2022)
[j5]
  persistent URL:
  - https://dblp.org/rec/journals/tnn/ZhaoYZLZWKGSSK22
Sicheng Zhao, Xiangyu Yue, Shanghang Zhang, Bo Li, Han Zhao, Bichen Wu, Ravi Krishna, Joseph E. Gonzalez, Alberto L. Sangiovanni-Vincentelli, Sanjit A. Seshia, Kurt Keutzer:
A Review of Single-Source Deep Unsupervised Visual Domain Adaptation. IEEE Trans. Neural Networks Learn. Syst. 33(2): 473-493 (2022)
[c43]
  persistent URL:
  - https://dblp.org/rec/conf/aistats/Zhou0ZWCW022
Shiji Zhou, Han Zhao, Shanghang Zhang, Lianzhe Wang, Heng Chang, Zhi Wang, Wenwu Zhu:
Online Continual Adaptation with Active Self-Training. AISTATS 2022: 8852-8883
[c42]
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/ZhangZZJZCZL022
Chongzhi Zhang, Mingyuan Zhang, Shanghang Zhang, Daisheng Jin, Qiang Zhou, Zhongang Cai, Haiyu Zhao, Xianglong Liu, Ziwei Liu:
Delving Deep into the Generalization of Vision Transformers under Distribution Shifts. CVPR 2022: 7267-7276
[c41]
  persistent URL:
  - https://dblp.org/rec/conf/eccv/LiLWLLCYGZ22
Xiaoqi Li, Jiaming Liu, Shizun Wang, Cheng Lyu, Ming Lu, Yurong Chen, Anbang Yao, Yandong Guo, Shanghang Zhang:
Efficient Meta-Tuning for Content-Aware Neural Video Delivery. ECCV (18) 2022: 308-324
[c40]
  persistent URL:
  - https://dblp.org/rec/conf/eccv/YuLWZNGOLKZ22
Jinze Yu, Jiaming Liu, Xiaobao Wei, Haoyi Zhou, Yohei Nakata, Denis A. Gudovskiy, Tomoyuki Okuno, Jianxin Li, Kurt Keutzer, Shanghang Zhang:
MTTrans: Cross-domain Object Detection with Mean Teacher Transformer. ECCV (9) 2022: 629-645
[c39]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/DengLZG22
Shikuang Deng, Yuhang Li, Shanghang Zhang, Shi Gu:
Temporal Efficient Training of Spiking Neural Network via Gradient Re-weighting. ICLR 2022
[c38]
  persistent URL:
  - https://dblp.org/rec/conf/icml/ChuJZWWZM22
Xu Chu, Yujie Jin, Wenwu Zhu, Yasha Wang, Xin Wang, Shanghang Zhang, Hong Mei:
DNA: Domain Generalization with Diversified Neural Averaging. ICML 2022: 4010-4034
[c37]
  persistent URL:
  - https://dblp.org/rec/conf/icra/LiuZZLDKDZ22
Minzhe Liu, Qiang Zhou, Hengshuang Zhao, Jianing Li, Yuan Du, Kurt Keutzer, Li Du, Shanghang Zhang:
Prototype-Voxel Contrastive Learning for LiDAR Point Cloud Panoptic Segmentation. ICRA 2022: 9243-9250
[c36]
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/LiCDKZ22
Tian Li, Xiang Chen, Zhen Dong, Kurt Keutzer, Shanghang Zhang:
Domain-Adaptive Text Classification with Structured Knowledge from Unlabeled Data. IJCAI 2022: 4216-4222
[c35]
  persistent URL:
  - https://dblp.org/rec/conf/nips/WeiZZGZZYL22
Xiuying Wei, Yunchen Zhang, Xiangguo Zhang, Ruihao Gong, Shanghang Zhang, Qi Zhang, Fengwei Yu, Xianglong Liu:
Outlier Suppression: Pushing the Limit of Low-bit Transformer Language Models. NeurIPS 2022
[c34]
  persistent URL:
  - https://dblp.org/rec/conf/nips/ZhouXZP0022
Haoyi Zhou, Siyang Xiao, Shanghang Zhang, Jieqi Peng, Shuai Zhang, Jianxin Li:
Jump Self-attention: Capturing High-order Statistics in Transformers. NeurIPS 2022
[c33]
  persistent URL:
  - https://dblp.org/rec/conf/nips/ZouZLL22
Yixiong Zou, Shanghang Zhang, Yuhua Li, Ruixuan Li:
Margin-Based Few-Shot Class-Incremental Learning with Class-Level Overfitting Mitigation. NeurIPS 2022
[c32]
  persistent URL:
  - https://dblp.org/rec/conf/wacv/ReedYNEVM0ZGMKD22
Colorado J. Reed, Xiangyu Yue, Ani Nrusimha, Sayna Ebrahimi, Vivek Vijaykumar, Richard Mao, Bo Li, Shanghang Zhang, Devin Guillory, Sean Metzger, Kurt Keutzer, Trevor Darrell:
Self-Supervised Pretraining Improves Self-Supervised Pretraining. WACV 2022: 1050-1060
[i47]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2202-11946
Shikuang Deng, Yuhang Li, Shanghang Zhang, Shi Gu:
Temporal Efficient Training of Spiking Neural Network via Gradient Re-weighting. CoRR abs/2202.11946 (2022)
[i46]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-01643
Jinze Yu, Jiaming Liu, Xiaobao Wei, Haoyi Zhou, Yohei Nakata, Denis A. Gudovskiy, Tomoyuki Okuno, Jianxin Li, Kurt Keutzer, Shanghang Zhang:
Cross-Domain Object Detection with Mean-Teacher Transformer. CoRR abs/2205.01643 (2022)
[i45]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-02162
Zhen Dong, Kaicheng Zhou, Guohao Li, Qiang Zhou, Mingfei Guo, Bernard Ghanem, Kurt Keutzer, Shanghang Zhang:
UnrealNAS: Can We Search Neural Architectures with Unreal Data? CoRR abs/2205.02162 (2022)
[i44]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-09591
Tian Li, Xiang Chen, Zhen Dong, Weijiang Yu, Yijun Yan, Kurt Keutzer, Shanghang Zhang:
Domain-Adaptive Text Classification with Structured Knowledge from Unlabeled Data. CoRR abs/2206.09591 (2022)
[i43]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-01987
Yuheng Lu, Chenfeng Xu, Xiaobao Wei, Xiaodong Xie, Masayoshi Tomizuka, Kurt Keutzer, Shanghang Zhang:
Open-Vocabulary 3D Detection via Image-level Class and Debiased Cross-modal Contrastive Learning. CoRR abs/2207.01987 (2022)
[i42]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-09691
Xiaoqi Li, Jiaming Liu, Shizun Wang, Cheng Lyu, Ming Lu, Yurong Chen, Anbang Yao, Yandong Guo, Shanghang Zhang:
Efficient Meta-Tuning for Content-aware Neural Video Delivery. CoRR abs/2207.09691 (2022)
[i41]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-12527
Jiaming Liu, Qizhe Zhang, Jianing Li, Ming Lu, Tiejun Huang, Shanghang Zhang:
Unsupervised Spike Depth Estimation via Cross-modality Cross-domain Knowledge Transfer. CoRR abs/2208.12527 (2022)
[i40]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2208-12653
Jianing Li, Jiaming Liu, Xiaobao Wei, Jiyuan Zhang, Ming Lu, Lei Ma, Li Du, Tiejun Huang, Shanghang Zhang:
Uncertainty Guided Depth Fusion for Spike Camera. CoRR abs/2208.12653 (2022)
[i39]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-07125
Mingrui He, Tianyu Chen, Haoyi Zhou, Shanghang Zhang, Jianxin Li:
BadRes: Reveal the Backdoors through Residual Connection. CoRR abs/2209.07125 (2022)
[i38]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-13325
Xiuying Wei, Yunchen Zhang, Xiangguo Zhang, Ruihao Gong, Shanghang Zhang, Qi Zhang, Fengwei Yu, Xianglong Liu:
Outlier Suppression: Pushing the Limit of Low-bit Transformer Language Models. CoRR abs/2209.13325 (2022)
[i37]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-04524
Yixiong Zou, Shanghang Zhang, Yuhua Li, Ruixuan Li:
Margin-Based Few-Shot Class-Incremental Learning with Class-Level Overfitting Mitigation. CoRR abs/2210.04524 (2022)
[i36]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-11682
Xiangyang Zhu, Renrui Zhang, Bowei He, Ziyao Zeng, Shanghang Zhang, Peng Gao:
PointCLIP V2: Adapting CLIP for Powerful 3D Open-world Learning. CoRR abs/2211.11682 (2022)
[i35]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-16056
Yijiang Liu, Huanrui Yang, Zhen Dong, Kurt Keutzer, Li Du, Shanghang Zhang:
NoisyQuant: Noisy Bias-Enhanced Post-Training Activation Quantization for Vision Transformers. CoRR abs/2211.16056 (2022)
[i34]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-17126
Jiaming Liu, Rongyu Zhang, Xiaowei Chi, Xiaoqi Li, Ming Lu, Yandong Guo, Shanghang Zhang:
Multi-latent Space Alignments for Unsupervised Domain Adaptation in Multi-view 3D Object Detection. CoRR abs/2211.17126 (2022)
[i33]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-00623
Jianing Li, Ming Lu, Jiaming Liu, Yandong Guo, Li Du, Shanghang Zhang:
BEV-LGKD: A Unified LiDAR-Guided Knowledge Distillation Framework for BEV 3D Object Detection. CoRR abs/2212.00623 (2022)
[i32]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-00972
Yulu Gan, Mingjie Pan, Rongyu Zhang, Zijian Ling, Lingran Zhao, Jiaming Liu, Shanghang Zhang:
Cloud-Device Collaborative Adaptation to Continual Changing Environments in the Real-world. CoRR abs/2212.00972 (2022)
[i31]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-01231
Xiaowei Chi, Jiaming Liu, Ming Lu, Rongyu Zhang, Zhaoqing Wang, Yandong Guo, Shanghang Zhang:
BEV-SAN: Accurate BEV 3D Object Detection via Slice Attention Networks. CoRR abs/2212.01231 (2022)
[i30]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-02770
Lirui Xiao, Huanrui Yang, Zhen Dong, Kurt Keutzer, Li Du, Shanghang Zhang:
CSQ: Growing Mixed-Precision Quantization Scheme with Bi-level Continuous Sparsification. CoRR abs/2212.02770 (2022)
2021
[j4]
  persistent URL:
  - https://dblp.org/rec/journals/ijon/LiPNZPZL21
Chen Li, Xutan Peng, Yuhang Niu, Shanghang Zhang, Hao Peng, Chuan Zhou, Jianxin Li:
Learning graph attention-aware knowledge graph embedding. Neurocomputing 461: 516-529 (2021)
[c31]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/ZhouZPZLXZ21
Haoyi Zhou, Shanghang Zhang, Jieqi Peng, Shuai Zhang, Jianxin Li, Hui Xiong, Wancai Zhang:
Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting. AAAI 2021: 11106-11115
[c30]
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/0080WZLKD021
Bo Li, Yezhen Wang, Shanghang Zhang, Dongsheng Li, Kurt Keutzer, Trevor Darrell, Han Zhao:
Learning Invariant Representations and Risks for Semi-Supervised Domain Adaptation. CVPR 2021: 1104-1113
[c29]
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/YueZZ0DKS21
Xiangyu Yue, Zangwei Zheng, Shanghang Zhang, Yang Gao, Trevor Darrell, Kurt Keutzer, Alberto L. Sangiovanni-Vincentelli:
Prototypical Cross-Domain Self-Supervised Learning for Few-Shot Unsupervised Domain Adaptation. CVPR 2021: 13834-13844
[c28]
  persistent URL:
  - https://dblp.org/rec/conf/icassp/LiCZDK21
Tian Li, Xiang Chen, Shanghang Zhang, Zhen Dong, Kurt Keutzer:
Cross-Domain Sentiment Classification with Contrastive Learning and Mutual Information Maximization. ICASSP 2021: 8203-8207
[c27]
  persistent URL:
  - https://dblp.org/rec/conf/iccv/LiuFZ0FY21
Yunze Liu, Qingnan Fan, Shanghang Zhang, Hao Dong, Thomas A. Funkhouser, Li Yi:
Contrastive Multimodal Fusion with TupleInfoNCE. ICCV 2021: 734-743
[c26]
  persistent URL:
  - https://dblp.org/rec/conf/iccv/LuoCZZZYL0Z021
Zhipeng Luo, Zhongang Cai, Changqing Zhou, Gongjie Zhang, Haiyu Zhao, Shuai Yi, Shijian Lu, Hongsheng Li, Shanghang Zhang, Ziwei Liu:
Unsupervised Domain Adaptive 3D Detection with Multi-Level Consistency. ICCV 2021: 8846-8855
[c25]
  persistent URL:
  - https://dblp.org/rec/conf/icdm/ZhangLZZZW21
Shuai Zhang, Jianxin Li, Haoyi Zhou, Qishan Zhu, Shanghang Zhang, Danding Wang:
MERITS: Medication Recommendation for Chronic Disease with Irregular Time-Series. ICDM 2021: 1481-1486
[c24]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/MaKZH21
Xuezhe Ma, Xiang Kong, Shanghang Zhang, Eduard H. Hovy:
Decoupling Global and Local Representations via Invertible Generative Flows. ICLR 2021
[c23]
  persistent URL:
  - https://dblp.org/rec/conf/kdd/Zhou0PZZ21
Haoyi Zhou, Jianxin Li, Jieqi Peng, Shuai Zhang, Shanghang Zhang:
Triplet Attention: Rethinking the Similarity in Transformers. KDD 2021: 2378-2388
[c22]
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZouZC0KM21
Yixiong Zou, Shanghang Zhang, Guangyao Chen, Yonghong Tian, Kurt Keutzer, José M. F. Moura:
Annotation-Efficient Untrimmed Video Action Recognition. ACM Multimedia 2021: 487-495
[c21]
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZouZY0M21
Yixiong Zou, Shanghang Zhang, Jianpeng Yu, Yonghong Tian, José M. F. Moura:
Revisiting Mid-Level Patterns for Cross-Domain Few-Shot Recognition. ACM Multimedia 2021: 741-749
[c20]
  persistent URL:
  - https://dblp.org/rec/conf/nips/LiGZDHG21
Yuhang Li, Yufei Guo, Shanghang Zhang, Shikuang Deng, Yongqing Hai, Shi Gu:
Differentiable Spike: Rethinking Gradient-Descent for Training Spiking Neural Networks. NeurIPS 2021: 23426-23439
[i29]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-12718
Colorado J. Reed, Xiangyu Yue, Ani Nrusimha, Sayna Ebrahimi, Vivek Vijaykumar, Richard Mao, Bo Li, Shanghang Zhang, Devin Guillory, Sean Metzger, Kurt Keutzer, Trevor Darrell:
Self-Supervised Pretraining Improves Self-Supervised Pretraining. CoRR abs/2103.12718 (2021)
[i28]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2103-16765
Xiangyu Yue, Zangwei Zheng, Shanghang Zhang, Yang Gao, Trevor Darrell, Kurt Keutzer, Alberto L. Sangiovanni-Vincentelli:
Prototypical Cross-domain Self-supervised Learning for Few-shot Unsupervised Domain Adaptation. CoRR abs/2103.16765 (2021)
[i27]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-06526
Shiji Zhou, Han Zhao, Shanghang Zhang, Lianzhe Wang, Heng Chang, Zhi Wang, Wenwu Zhu:
Online Continual Adaptation with Active Self-Training. CoRR abs/2106.06526 (2021)
[i26]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2106-07617
Chongzhi Zhang, Mingyuan Zhang, Shanghang Zhang, Daisheng Jin, Qiang Zhou, Zhongang Cai, Haiyu Zhao, Shuai Yi, Xianglong Liu, Ziwei Liu:
Delving Deep into the Generalization of Vision Transformers under Distribution Shifts. CoRR abs/2106.07617 (2021)
[i25]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-02575
Yunze Liu, Qingnan Fan, Shanghang Zhang, Hao Dong, Thomas A. Funkhouser, Li Yi:
Contrastive Multimodal Fusion with TupleInfoNCE. CoRR abs/2107.02575 (2021)
[i24]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2107-11355
Zhipeng Luo, Zhongang Cai, Changqing Zhou, Gongjie Zhang, Haiyu Zhao, Shuai Yi, Shijian Lu, Hongsheng Li, Shanghang Zhang, Ziwei Liu:
Unsupervised Domain Adaptive 3D Detection with Multi-Level Consistency. CoRR abs/2107.11355 (2021)
[i23]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2110-14240
Haojin Liao, Xiaolin Song, Sicheng Zhao, Shanghang Zhang, Xiangyu Yue, Xingxu Yao, Yueming Zhang, Tengfei Xing, Pengfei Xu, Qiang Wang:
2nd Place Solution for VisDA 2021 Challenge - Universally Domain Adaptive Image Recognition. CoRR abs/2110.14240 (2021)
2020
[j3]
  persistent URL:
  - https://dblp.org/rec/journals/kbs/LiPZPYHDW20
Chen Li, Xutan Peng, Shanghang Zhang, Hao Peng, Philip S. Yu, Min He, Linfeng Du, Lihong Wang:
Modeling relation paths for knowledge base completion via joint adversarial training. Knowl. Based Syst. 201-202: 105865 (2020)
[c19]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/ZhaoWZGLS0HCK20
Sicheng Zhao, Guangzhi Wang, Shanghang Zhang, Yang Gu, Yaxian Li, Zhichao Song, Pengfei Xu, Runbo Hu, Hua Chai, Kurt Keutzer:
Multi-Source Distilling Domain Adaptation. AAAI 2020: 12975-12983
[c18]
  persistent URL:
  - https://dblp.org/rec/conf/eccv/0001XCKHZW20
Xinwei Sun, Yilun Xu, Peng Cao, Yuqing Kong, Lingjing Hu, Shanghang Zhang, Yizhou Wang:
TCGM: An Information-Theoretic Framework for Semi-supervised Multi-modality Learning. ECCV (3) 2020: 171-188
[c17]
  persistent URL:
  - https://dblp.org/rec/conf/eccv/MeiZZZ20
Ke Mei, Chuang Zhu, Jiaqi Zou, Shanghang Zhang:
Instance Adaptive Self-training for Unsupervised Domain Adaptation. ECCV (26) 2020: 415-430
[c16]
  persistent URL:
  - https://dblp.org/rec/conf/ijcai/SongZSXX20
Congzheng Song, Shanghang Zhang, Najmeh Sadoughi, Pengtao Xie, Eric P. Xing:
Generalized Zero-Shot Text Classification for ICD Coding. IJCAI 2020: 4018-4024
[c15]
  persistent URL:
  - https://dblp.org/rec/conf/mm/ZouZC0WM20
Yixiong Zou, Shanghang Zhang, Ke Chen, Yonghong Tian, Yaowei Wang, José M. F. Moura:
Compositional Few-Shot Recognition with Primitive Discovery and Enhancing. ACM Multimedia 2020: 156-164
[i22]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2004-11820
Xuezhe Ma, Xiang Kong, Shanghang Zhang, Eduard H. Hovy:
Decoupling Global and Local Representations from/for Image Generation. CoRR abs/2004.11820 (2020)
[i21]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2005-06047
Yixiong Zou, Shanghang Zhang, Ke Chen, José M. F. Moura, Yaowei Wang, Yonghong Tian:
Compositional Few-Shot Recognition with Primitive Discovery and Enhancing. CoRR abs/2005.06047 (2020)
[i20]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2006-13352
Bo Li, Yezhen Wang, Tong Che, Shanghang Zhang, Sicheng Zhao, Pengfei Xu, Wei Zhou, Yoshua Bengio, Kurt Keutzer:
Rethinking Distributional Matching Based Domain Adaptation. CoRR abs/2006.13352 (2020)
[i19]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-04234
Xingyi Yang, Xuehai He, Yuxiao Liang, Yue Yang, Shanghang Zhang, Pengtao Xie:
Transfer Learning or Self-supervised Learning? A Tale of Two Pretraining Paradigms. CoRR abs/2007.04234 (2020)
[i18]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2007-06793
Xinwei Sun, Yilun Xu, Peng Cao, Yuqing Kong, Lingjing Hu, Shanghang Zhang, Yizhou Wang:
TCGM: An Information-Theoretic Framework for Semi-Supervised Multi-Modality Learning. CoRR abs/2007.06793 (2020)
[i17]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-03128
Yixiong Zou, Shanghang Zhang, José M. F. Moura, Jianpeng Yu, Yonghong Tian:
Revisiting Mid-Level Patterns for Distant-Domain Few-Shot Recognition. CoRR abs/2008.03128 (2020)
[i16]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2008-12197
Ke Mei, Chuang Zhu, Jiaqi Zou, Shanghang Zhang:
Instance Adaptive Self-Training for Unsupervised Domain Adaptation. CoRR abs/2008.12197 (2020)
[i15]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2009-00155
Sicheng Zhao, Xiangyu Yue, Shanghang Zhang, Bo Li, Han Zhao, Bichen Wu, Ravi Krishna, Joseph E. Gonzalez, Alberto L. Sangiovanni-Vincentelli, Sanjit A. Seshia, Kurt Keutzer:
A Review of Single-Source Deep Unsupervised Visual Domain Adaptation. CoRR abs/2009.00155 (2020)
[i14]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-04647
Bo Li, Yezhen Wang, Shanghang Zhang, Dongsheng Li, Trevor Darrell, Kurt Keutzer, Han Zhao:
Learning Invariant Representations and Risks for Semi-supervised Domain Adaptation. CoRR abs/2010.04647 (2020)
[i13]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2010-16088
Tian Li, Xiang Chen, Shanghang Zhang, Zhen Dong, Kurt Keutzer:
Cross-Domain Sentiment Classification With Contrastive Learning and Mutual Information Maximization. CoRR abs/2010.16088 (2020)
[i12]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2011-14478
Yixiong Zou, Shanghang Zhang, Guangyao Chen, Yonghong Tian, Kurt Keutzer, José M. F. Moura:
Annotation-Efficient Untrimmed Video Action Recognition. CoRR abs/2011.14478 (2020)
[i11]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-02943
Tian Li, Xiang Chen, Shanghang Zhang, Zhen Dong, Kurt Keutzer:
Cross-Domain Sentiment Classification with In-Domain Contrastive Learning. CoRR abs/2012.02943 (2020)
[i10]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-07436
Haoyi Zhou, Shanghang Zhang, Jieqi Peng, Shuai Zhang, Jianxin Li, Hui Xiong, Wancai Zhang:
Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting. CoRR abs/2012.07436 (2020)
[i9]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2012-13089
Yunze Liu, Li Yi, Shanghang Zhang, Qingnan Fan, Thomas A. Funkhouser, Hao Dong:
P4Contrast: Contrastive Learning with Pairs of Point-Pixel Pairs for RGB-D Scene Understanding. CoRR abs/2012.13089 (2020)

2010 – 2019

2019
[j2]
  persistent URL:
  - https://dblp.org/rec/journals/access/ZhuDZ19
Chuang Zhu, Huihui Dong, Shanghang Zhang:
Feature Fusion for Image Retrieval With Adaptive Bitrate Allocation and Hard Negative Mining. IEEE Access 7: 161858-161870 (2019)
[c14]
  persistent URL:
  - https://dblp.org/rec/conf/nips/MaKZH19
Xuezhe Ma, Xiang Kong, Shanghang Zhang, Eduard H. Hovy:
MaCow: Masked Convolutional Generative Flow. NeurIPS 2019: 5891-5900
[c13]
  persistent URL:
  - https://dblp.org/rec/conf/nips/NiZ019
Jian Ni, Shanghang Zhang, Haiyong Xie:
Dual Adversarial Semantics-Consistent Network for Generalized Zero-Shot Learning. NeurIPS 2019: 6143-6154
[i8]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1907-05570
Jian Ni, Shanghang Zhang, Haiyong Xie:
Dual Adversarial Semantics-Consistent Network for Generalized Zero-Shot Learning. CoRR abs/1907.05570 (2019)
[i7]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1909-13154
Congzheng Song, Shanghang Zhang, Najmeh Sadoughi, Pengtao Xie, Eric P. Xing:
Generalized Zero-shot ICD Coding. CoRR abs/1909.13154 (2019)
[i6]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1911-11554
Sicheng Zhao, Guangzhi Wang, Shanghang Zhang, Yang Gu, Yaxian Li, Zhichao Song, Pengfei Xu, Runbo Hu, Hua Chai, Kurt Keutzer:
Multi-source Distilling Domain Adaptation. CoRR abs/1911.11554 (2019)
2018
[b1]
  persistent URL:
  - https://dblp.org/rec/phd/us/Zhang18i
Shanghang Zhang:
Deep Understanding of Urban Mobility from Cityscape Webcams. Carnegie Mellon University, USA, 2018
[c12]
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/ZhangS0MCM18
Shanghang Zhang, Xiaohui Shen, Zhe Lin, Radomír Mech, João Paulo Costeira, José M. F. Moura:
Learning to Understand Image Blur. CVPR 2018: 6586-6595
[c11]
  persistent URL:
  - https://dblp.org/rec/conf/icc/DasGZKM18
Rajshekhar Das, Akshay Gadre, Shanghang Zhang, Swarun Kumar, José M. F. Moura:
A Deep Learning Approach to IoT Authentication. ICC 2018: 1-6
[c10]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/0002ZWCMG18
Han Zhao, Shanghang Zhang, Guanhang Wu, João Paulo Costeira, José M. F. Moura, Geoffrey J. Gordon:
Multiple Source Domain Adaptation with Adversarial Learning. ICLR (Workshop) 2018
[c9]
  persistent URL:
  - https://dblp.org/rec/conf/nips/ZhaoZWMCG18
Han Zhao, Shanghang Zhang, Guanhang Wu, José M. F. Moura, João Paulo Costeira, Geoffrey J. Gordon:
Adversarial Multiple Source Domain Adaptation. NeurIPS 2018: 8568-8579
[i5]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1810-06033
Chen Li, Xutan Peng, Shanghang Zhang, Jianxin Li, Lihong Wang:
Hierarchical Attention Networks for Knowledge Base Completion via Joint Adversarial Training. CoRR abs/1810.06033 (2018)
2017
[c8]
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/ZhangWCM17
Shanghang Zhang, Guanhang Wu, João Paulo Costeira, José M. F. Moura:
Understanding Traffic Density from Large-Scale Web Camera Data. CVPR 2017: 4264-4273
[c7]
  persistent URL:
  - https://dblp.org/rec/conf/iccv/ZhangWCM17
Shanghang Zhang, Guanhang Wu, João Paulo Costeira, José M. F. Moura:
FCN-rLSTM: Deep Spatio-Temporal Neural Networks for Vehicle Counting in City Cameras. ICCV 2017: 3687-3696
[i4]
  persistent URL:
  - https://dblp.org/rec/journals/corr/ZhangWCM17
Shanghang Zhang, Guanhang Wu, João Paulo Costeira, José M. F. Moura:
Understanding Traffic Density from Large-Scale Web Camera Data. CoRR abs/1703.05868 (2017)
[i3]
  persistent URL:
  - https://dblp.org/rec/journals/corr/ZhaoZWCMG17
Han Zhao, Shanghang Zhang, Guanhang Wu, João Paulo Costeira, José M. F. Moura, Geoffrey J. Gordon:
Multiple Source Domain Adaptation with Adversarial Training of Neural Networks. CoRR abs/1705.09684 (2017)
[i2]
  persistent URL:
  - https://dblp.org/rec/journals/corr/ZhangWCM17aa
Shanghang Zhang, Guanhang Wu, João Paulo Costeira, José M. F. Moura:
FCN-rLSTM: Deep Spatio-Temporal Neural Networks for Vehicle Counting in City Cameras. CoRR abs/1707.09476 (2017)
[i1]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-1710-10370
Jian Du, Shanghang Zhang, Guanhang Wu, José M. F. Moura, Soummya Kar:
Topology adaptive graph convolutional networks. CoRR abs/1710.10370 (2017)
2015
[c6]
  persistent URL:
  - https://dblp.org/rec/conf/icip/ToropovGZKM15
Evgeny Toropov, Liangyan Gui, Shanghang Zhang, Satwik Kottur, José M. F. Moura:
Traffic flow from a low frame rate city camera. ICIP 2015: 3802-3806
2014
[c5]
  persistent URL:
  - https://dblp.org/rec/conf/itc/ZhangLBSCB14
Shanghang Zhang, Xin Li, Ronald D. Blanton, José Machado da Silva, John M. Carulli Jr., Kenneth M. Butler:
Bayesian model fusion: Enabling test cost reduction of analog/RF circuits via wafer-level spatial variation modeling. ITC 2014: 1-10
2013
[j1]
  persistent URL:
  - https://dblp.org/rec/journals/tmm/ZhuJZHXG13
Chuang Zhu, Huizhu Jia, Shanghang Zhang, Xiaofeng Huang, Xiaodong Xie, Wen Gao:
On a Highly Efficient RDO-Based Mode Decision Pipeline Design for AVS. IEEE Trans. Multim. 15(8): 1815-1829 (2013)
[c4]
  persistent URL:
  - https://dblp.org/rec/conf/iscas/LiZJXG13
Yuan Li, Shanghang Zhang, Huizhu Jia, Xiaodong Xie, Wen Gao:
A high-throughput low-latency arithmetic encoder design for HDTV. ISCAS 2013: 998-1001
2012
[c3]
  persistent URL:
  - https://dblp.org/rec/conf/icmcs/WeiZZJXG12
Kaijin Wei, Rongwei Zhou, Shanghang Zhang, Huizhu Jia, Don Xie, Wen Gao:
An Optimized Hardware Video Encoder for AVS with Level C+ Data Reuse Scheme for Motion Estimation. ICME 2012: 1055-1060
[c2]
  persistent URL:
  - https://dblp.org/rec/conf/pcs/WeiZJXG12
Kaijin Wei, Shanghang Zhang, Huizhu Jia, Don Xie, Wen Gao:
A flexible and high-performance hardware video encoder architecture. PCS 2012: 373-376
[c1]
  persistent URL:
  - https://dblp.org/rec/conf/vcip/ZhangWJXG12
Shanghang Zhang, Kaijin Wei, Huizhu Jia, Xiaodong Xie, Wen Gao:
An efficient foreground-based surveillance video coding scheme in low bit-rate compression. VCIP 2012: 1-6
