


default search action
Siteng Huang
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
- [c18]Han Zhao, Min Zhang, Wei Zhao, Pengxiang Ding, Siteng Huang, Donglin Wang:
Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference. AAAI 2025: 10421-10429 - [i26]Xuyang Liu, Ziming Wang, Yuhang Han, Yingyao Wang, Jiale Yuan, Jun Song, Bo Zheng, Linfeng Zhang
, Siteng Huang, Honggang Chen:
Compression with Global Guidance: Towards Training-free High-Resolution MLLMs Acceleration. CoRR abs/2501.05179 (2025) - [i25]Pengxiang Ding, Jianfei Ma, Xinyang Tong, Binghong Zou, Xinxin Luo, Yiguo Fan, Ting Wang, Hongchao Lu, Panzhong Mo, Jinxin Liu, Yuefan Wang, Huaicheng Zhou, Wenshuo Feng, Jiacheng Liu, Siteng Huang, Donglin Wang:
Humanoid-VLA: Towards Universal Humanoid Control with Visual Integration. CoRR abs/2502.14795 (2025) - [i24]Minghui Lin, Xiang Wang, Yishan Wang, Shu Wang, Fengqi Dai, Pengxiang Ding, Cunxiang Wang, Zhengrong Zuo, Nong Sang, Siteng Huang, Donglin Wang:
Exploring the Evolution of Physics Cognition in Video Generation: A Survey. CoRR abs/2503.21765 (2025) - [i23]Xiaomin Yu, Pengxiang Ding, Wenjie Zhang, Siteng Huang, Songyang Gao, Chengwei Qin, Kejian Wu, Zhaoxin Fan, Ziyue Qiao, Donglin Wang:
Unicorn: Text-Only Data Synthesis for Vision Language Model Training. CoRR abs/2503.22655 (2025) - 2024
- [c17]Shuanghao Bai, Min Zhang, Wanqi Zhou
, Siteng Huang
, Zhirong Luan, Donglin Wang, Badong Chen:
Prompt-Based Distribution Alignment for Unsupervised Domain Adaptation. AAAI 2024: 729-737 - [c16]Biao Gong, Siteng Huang, Yutong Feng, Shiwei Zhang, Yuyuan Li, Yu Liu:
Check, Locate, Rectify: A Training-Free Layout Calibration System for Text- to- Image Generation. CVPR 2024: 6624-6634 - [c15]Siteng Huang, Biao Gong, Yutong Feng, Xi Chen, Yuqian Fu, Yu Liu, Donglin Wang:
Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation. CVPR 2024: 7797-7806 - [c14]Siteng Huang, Biao Gong, Yutong Feng, Min Zhang, Yiliang Lv, Donglin Wang:
Troika: Multi-Path Cross-Modal Traction for Compositional Zero-Shot Learning. CVPR 2024: 24005-24014 - [c13]Yang Liu, Pengxiang Ding, Siteng Huang, Min Zhang, Han Zhao, Donglin Wang:
PiTe: Pixel-Temporal Alignment for Large Video-Language Model. ECCV (5) 2024: 160-176 - [c12]Pengxiang Ding, Han Zhao, Wenjie Zhang, Wenxuan Song, Min Zhang, Siteng Huang, Ningxi Yang, Donglin Wang:
QUAR-VLA: Vision-Language-Action Model for Quadruped Robots. ECCV (5) 2024: 352-367 - [c11]Xuyang Liu
, Siteng Huang
, Yachen Kang, Honggang Chen, Donglin Wang:
VGDIFFZERO: Text-To-Image Diffusion Models Can Be Zero-Shot Visual Grounders. ICASSP 2024: 2765-2769 - [c10]Ting Liu, Xuyang Liu
, Siteng Huang, Honggang Chen, Quanjun Yin, Long Qin, Donglin Wang, Yue Hu:
DARA: Domain- and Relation-Aware Adapters Make Parameter-Efficient Tuning for Visual Grounding. ICME 2024: 1-6 - [c9]Can Cui
, Siteng Huang
, Wenxuan Song
, Pengxiang Ding
, Min Zhang
, Donglin Wang
:
ProFD: Prompt-Guided Feature Disentangling for Occluded Person Re-Identification. ACM Multimedia 2024: 1583-1592 - [i22]Han Zhao, Min Zhang, Wei Zhao, Pengxiang Ding, Siteng Huang, Donglin Wang:
Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference. CoRR abs/2403.14520 (2024) - [i21]Ting Liu, Xuyang Liu, Siteng Huang, Honggang Chen, Quanjun Yin, Long Qin, Donglin Wang, Yue Hu:
DARA: Domain- and Relation-aware Adapters Make Parameter-efficient Tuning for Visual Grounding. CoRR abs/2405.06217 (2024) - [i20]Ting Liu, Xuyang Liu, Liangtao Shi, Zunnan Xu, Siteng Huang, Yi Xin, Quanjun Yin:
Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and Inference. CoRR abs/2405.14700 (2024) - [i19]Xuyang Liu, Ting Liu, Siteng Huang, Yue Hu, Quanjun Yin, Donglin Wang, Honggang Chen:
M2IST: Multi-Modal Interactive Side-Tuning for Memory-efficient Referring Expression Comprehension. CoRR abs/2407.01131 (2024) - [i18]Fengyuan Dai, Siteng Huang, Min Zhang, Biao Gong, Donglin Wang:
Focus-Consistent Multi-Level Aggregation for Compositional Zero-Shot Learning. CoRR abs/2408.17083 (2024) - [i17]Yang Liu, Pengxiang Ding, Siteng Huang, Min Zhang, Han Zhao, Donglin Wang:
PiTe: Pixel-Temporal Alignment for Large Video-Language Model. CoRR abs/2409.07239 (2024) - [i16]Can Cui, Siteng Huang, Wenxuan Song, Pengxiang Ding, Min Zhang, Donglin Wang:
ProFD: Prompt-Guided Feature Disentangling for Occluded Person Re-Identification. CoRR abs/2409.20081 (2024) - [i15]Chang Zou, Xuyang Liu, Ting Liu, Siteng Huang, Linfeng Zhang
:
Accelerating Diffusion Transformers with Token-wise Feature Caching. CoRR abs/2410.05317 (2024) - [i14]Yuhang Han, Xuyang Liu, Pengxiang Ding, Donglin Wang, Honggang Chen, Qingsen Yan, Siteng Huang:
Rethinking Token Reduction in MLLMs: Towards a Unified Paradigm for Training-Free Acceleration. CoRR abs/2411.17686 (2024) - [i13]Zhefei Gong, Pengxiang Ding, Shangke Lyu, Siteng Huang, Mingyang Sun, Wei Zhao, Zhaoxin Fan, Donglin Wang:
CARP: Visuomotor Policy Learning via Coarse-to-Fine Autoregressive Prediction. CoRR abs/2412.06782 (2024) - [i12]Bofang Jia, Pengxiang Ding, Can Cui, Mingyang Sun, Pengfang Qian, Siteng Huang, Zhaoxin Fan, Donglin Wang:
Score and Distribution Matching Policy: Advanced Accelerated Visuomotor Policies via Matched Distillation. CoRR abs/2412.09265 (2024) - [i11]Xinyang Tong, Pengxiang Ding, Donglin Wang, Wenjie Zhang, Can Cui, Mingyang Sun, Yiguo Fan, Han Zhao, Hongyin Zhang, Yonghao Dang, Siteng Huang, Shangke Lyu:
QUART-Online: Latency-Free Large Multimodal Language Model for Quadruped Robot Learning. CoRR abs/2412.15576 (2024) - 2023
- [c8]Siteng Huang
, Biao Gong, Yulin Pan, Jianwen Jiang, Yiliang Lv, Yuyuan Li, Donglin Wang:
VoP: Text-Video Co-Operative Prompt Tuning for Cross-Modal Retrieval. CVPR 2023: 6565-6574 - [c7]Siteng Huang
, Qiyao Wei
, Donglin Wang
:
Reference-Limited Compositional Zero-Shot Learning. ICMR 2023: 443-451 - [i10]Siteng Huang, Biao Gong, Yutong Feng, Yiliang Lv, Donglin Wang:
Troika: Multi-Path Cross-Modal Traction for Compositional Zero-Shot Learning. CoRR abs/2303.15230 (2023) - [i9]Xuyang Liu
, Siteng Huang, Yachen Kang, Honggang Chen, Donglin Wang:
VGDiffZero: Text-to-image Diffusion Models Can Be Zero-shot Visual Grounders. CoRR abs/2309.01141 (2023) - [i8]Biao Gong, Siteng Huang, Yutong Feng, Shiwei Zhang, Yuyuan Li, Yu Liu:
Check, Locate, Rectify: A Training-Free Layout Calibration System for Text-to-Image Generation. CoRR abs/2311.15773 (2023) - [i7]Siteng Huang, Biao Gong, Yutong Feng, Xi Chen, Yuqian Fu, Yu Liu, Donglin Wang:
Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation. CoRR abs/2311.15841 (2023) - [i6]Shuanghao Bai, Min Zhang, Wanqi Zhou, Siteng Huang, Zhirong Luan, Donglin Wang, Badong Chen:
Prompt-based Distribution Alignment for Unsupervised Domain Adaptation. CoRR abs/2312.09553 (2023) - 2022
- [c6]Min Zhang, Siteng Huang
, Wenbin Li, Donglin Wang:
Tree Structure-Aware Few-Shot Image Classification via Hierarchical Aggregation. ECCV (20) 2022: 453-470 - [c5]Min Zhang, Siteng Huang
, Donglin Wang:
Domain Generalized Few-Shot Image Classification via Meta Regularization Network. ICASSP 2022: 3748-3752 - [i5]Min Zhang, Siteng Huang, Wenbin Li, Donglin Wang:
Tree Structure-Aware Few-Shot Image Classification via Hierarchical Aggregation. CoRR abs/2207.06989 (2022) - [i4]Siteng Huang, Qiyao Wei, Donglin Wang:
Reference-Limited Compositional Zero-Shot Learning. CoRR abs/2208.10046 (2022) - [i3]Siteng Huang, Biao Gong, Yulin Pan, Jianwen Jiang, Yiliang Lv, Yuyuan Li, Donglin Wang:
VoP: Text-Video Co-operative Prompt Tuning for Cross-Modal Retrieval. CoRR abs/2211.12764 (2022) - 2021
- [c4]Siteng Huang
, Min Zhang, Yachen Kang, Donglin Wang:
Attributes-Guided and Pure-Visual Attention Alignment for Few-Shot Recognition. AAAI 2021: 7840-7847 - [c3]Zhengyu Chen
, Jixie Ge, Heshen Zhan, Siteng Huang
, Donglin Wang:
Pareto Self-Supervised Training for Few-Shot Learning. CVPR 2021: 13663-13672 - [c2]Zifeng Zhuang, Xintao Xiang, Siteng Huang
, Donglin Wang:
HINFShot: A Challenge Dataset for Few-Shot Node Classification in Heterogeneous Information Network. ICMR 2021: 429-436 - [i2]Zhengyu Chen, Jixie Ge, Heshen Zhan, Siteng Huang, Donglin Wang:
Pareto Self-Supervised Training for Few-Shot Learning. CoRR abs/2104.07841 (2021) - 2020
- [i1]Siteng Huang, Min Zhang, Yachen Kang, Donglin Wang:
Attributes-Guided and Pure-Visual Attention Alignment for Few-Shot Recognition. CoRR abs/2009.04724 (2020)
2010 – 2019
- 2019
- [c1]Siteng Huang
, Donglin Wang, Xuehan Wu, Ao Tang:
DSANet: Dual Self-Attention Network for Multivariate Time Series Forecasting. CIKM 2019: 2129-2132
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2025-05-03 00:03 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint