Kaihang Pan

2020 – today

2025
[j2]
persistent URL: https://dblp.org/rec/journals/tmm/ChenPDWZTX25
Dong Chen, Kaihang Pan, Guangyu Dai, Guoming Wang, Yueting Zhuang, Siliang Tang, Mingliang Xu:
Improving Vision Anomaly Detection With the Guidance of Language Modality. IEEE Trans. Multim. 27: 1410-1419 (2025)
[c11]
persistent URL: https://dblp.org/rec/conf/cvpr/QiuGQPYL0TZC25
Haiyi Qiu, Minghe Gao, Long Qian, Kaihang Pan, Qifan Yu, Juncheng Li, Wenjie Wang, Siliang Tang, Yueting Zhuang, Tat-Seng Chua:
STEP: Enhancing Video-LLMs' Compositional Reasoning by Spatio-Temporal Graph-guided Self-Training. CVPR 2025: 3284-3294
[c10]
persistent URL: https://dblp.org/rec/conf/cvpr/YuCYPWW0TZZ25
Qifan Yu, Wei Chow, Zhongqi Yue, Kaihang Pan, Yang Wu, Xiaoyang Wan, Juncheng Li, Siliang Tang, Hanwang Zhang, Yueting Zhuang:
AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea. CVPR 2025: 26125-26135
[c9]
persistent URL: https://dblp.org/rec/conf/cvpr/PanLYAJZLTZ25
Kaihang Pan, Wang Lin, Zhongqi Yue, Tenglong Ao, Liyu Jia, Wei Zhao, Juncheng Li, Siliang Tang, Hanwang Zhang:
Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens. CVPR 2025: 26136-26146
[c8]
persistent URL: https://dblp.org/rec/conf/icml/000100L0LWWZMSZ25
Hao Fei, Yuan Zhou, Juncheng Li, Xiangtai Li, Qingshan Xu, Bobo Li, Shengqiong Wu, Yaoting Wang, Junbao Zhou, Jiahao Meng, Qingyu Shi, Zhiyuan Zhou, Liangtao Shi, Minghe Gao, Daoan Zhang, Zhiqi Ge, Siliang Tang, Kaihang Pan, Yaobo Ye, Haobo Yuan, Tao Zhang, Weiming Wu, Tianjie Ju, Zixiang Meng, Shilin Xu, Liyu Jia, Wentao Hu, Meng Luo, Jiebo Luo, Tat-Seng Chua, Shuicheng Yan, Hanwang Zhang:
On Path to Multimodal Generalist: General-Level and General-Bench. ICML 2025
[c7]
persistent URL: https://dblp.org/rec/conf/icml/BuWYGMZPL000TZ25
Wendong Bu, Yang Wu, Qifan Yu, Minghe Gao, Bingchen Miao, Zhenkui Zhang, Kaihang Pan, Yunfei Li, Mengze Li, Wei Ji, Juncheng Li, Siliang Tang, Yueting Zhuang:
What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensional Benchmark for Essential Virtual Agent Capabilities. ICML 2025
[i20]
persistent URL: https://dblp.org/rec/journals/corr/abs-2504-14666
Kaihang Pan, Wang Lin, Zhongqi Yue, Tenglong Ao, Liyu Jia, Wei Zhao, Juncheng Li, Siliang Tang, Hanwang Zhang:
Generative Multimodal Pretraining with Discrete Diffusion Timestep Tokens. CoRR abs/2504.14666 (2025)
[i19]
persistent URL: https://dblp.org/rec/journals/corr/abs-2504-15932
Wang Lin, Liyu Jia, Wentao Hu, Kaihang Pan, Zhongqi Yue, Wei Zhao, Jingyuan Chen, Fei Wu, Hanwang Zhang:
Reasoning Physical Video Generation with Diffusion Timestep Tokens via Reinforcement Learning. CoRR abs/2504.15932 (2025)
[i18]
persistent URL: https://dblp.org/rec/journals/corr/abs-2505-04620
Hao Fei, Yuan Zhou, Juncheng Li, Xiangtai Li, Qingshan Xu, Bobo Li, Shengqiong Wu, Yaoting Wang, Junbao Zhou, Jiahao Meng, Qingyu Shi, Zhiyuan Zhou, Liangtao Shi, Minghe Gao, Daoan Zhang, Zhiqi Ge, Weiming Wu, Siliang Tang, Kaihang Pan, Yaobo Ye, Haobo Yuan, Tao Zhang, Tianjie Ju, Zixiang Meng, Shilin Xu, Liyu Jia, Wentao Hu, Meng Luo, Jiebo Luo, Tat-Seng Chua, Shuicheng Yan, Hanwang Zhang:
On Path to Multimodal Generalist: General-Level and General-Bench. CoRR abs/2505.04620 (2025)
[i17]
persistent URL: https://dblp.org/rec/journals/corr/abs-2505-07538
Bohan Wang, Zhongqi Yue, Fengda Zhang, Shuo Chen, Li'an Bi, Junzhe Zhang, Xue Song, Kennard Yanting Chan, Jiachun Pan, Weijia Wu, Mingze Zhou, Wang Lin, Kaihang Pan, Saining Zhang, Liyu Jia, Wentao Hu, Wei Zhao, Hanwang Zhang:
Selftok: Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning. CoRR abs/2505.07538 (2025)
[i16]
persistent URL: https://dblp.org/rec/journals/corr/abs-2506-01480
Kaihang Pan, Yang Wu, Wendong Bu, Kai Shen, Juncheng Li, Yingting Wang, Yunfei Li, Siliang Tang, Jun Xiao, Fei Wu, Hang Zhao, Yueting Zhuang:
Unlocking Aha Moments via Reinforcement Learning: Advancing Collaborative Visual Comprehension and Generation. CoRR abs/2506.01480 (2025)
[i15]
persistent URL: https://dblp.org/rec/journals/corr/abs-2506-05501
Kaihang Pan, Wendong Bu, Yuruo Wu, Yang Wu, Kai Shen, Yunfei Li, Hang Zhao, Juncheng Li, Siliang Tang, Yueting Zhuang:
FocusDiff: Advancing Fine-Grained Text-Image Alignment for Autoregressive Visual Generation through RL. CoRR abs/2506.05501 (2025)
[i14]
persistent URL: https://dblp.org/rec/journals/corr/abs-2506-08933
Wendong Bu, Yang Wu, Qifan Yu, Minghe Gao, Bingchen Miao, Zhenkui Zhang, Kaihang Pan, Yunfei Li, Mengze Li, Wei Ji, Juncheng Li, Siliang Tang, Yueting Zhuang:
What Limits Virtual Agent Application? OmniBench: A Scalable Multi-Dimensional Benchmark for Essential Virtual Agent Capabilities. CoRR abs/2506.08933 (2025)
[i13]
persistent URL: https://dblp.org/rec/journals/corr/abs-2509-05714
Zhaoyu Fan, Kaihang Pan, Mingze Zhou, Bosheng Qin, Juncheng Li, Shengyu Zhang, Wenqiao Zhang, Siliang Tang, Fei Wu, Yueting Zhuang:
Towards Meta-Cognitive Knowledge Editing for Multimodal LLMs. CoRR abs/2509.05714 (2025)
[i12]
persistent URL: https://dblp.org/rec/journals/corr/abs-2512-00387
Kaihang Pan, Weile Chen, Haiyi Qiu, Qifan Yu, Wendong Bu, Zehan Wang, Yun Zhu, Juncheng Li, Siliang Tang:
WiseEdit: Benchmarking Cognition- and Creativity-Informed Image Editing. CoRR abs/2512.00387 (2025)
[i11]
persistent URL: https://dblp.org/rec/journals/corr/abs-2512-19159
Wendong Bu, Kaihang Pan, Yuze Lin, Jiacheng Li, Kai Shen, Wenqiao Zhang, Juncheng Li, Jun Xiao, Siliang Tang:
OmniMoGen: Unifying Human Motion Generation via Learning from Interleaved Text-Motion Instructions. CoRR abs/2512.19159 (2025)
2024
[j1]
persistent URL: https://dblp.org/rec/journals/tkde/GuoTLPW24
Jianhao Guo, Siliang Tang, Juncheng Li, Kaihang Pan, Lingfei Wu:
RustGraph: Robust Anomaly Detection in Dynamic Graphs by Jointly Learning Structural-Temporal Dependency. IEEE Trans. Knowl. Data Eng. 36(7): 3472-3485 (2024)
[c6]
persistent URL: https://dblp.org/rec/conf/iclr/0006PGG0ZCTZZ24
Juncheng Li, Kaihang Pan, Zhiqi Ge, Minghe Gao, Wei Ji, Wenqiao Zhang, Tat-Seng Chua, Siliang Tang, Hanwang Zhang, Yueting Zhuang:
Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions. ICLR 2024
[c5]
persistent URL: https://dblp.org/rec/conf/icml/PanT0FCYCZZ24
Kaihang Pan, Siliang Tang, Juncheng Li, Zhaoyu Fan, Wei Chow, Shuicheng Yan, Tat-Seng Chua, Yueting Zhuang, Hanwang Zhang:
Auto-Encoding Morph-Tokens for Multimodal LLM. ICML 2024
[c4]
persistent URL: https://dblp.org/rec/conf/nips/ChowLYP0GSTZS24
Wei Chow, Juncheng Li, Qifan Yu, Kaihang Pan, Hao Fei, Zhiqi Ge, Shuai Yang, Siliang Tang, Hanwang Zhang, Qianru Sun:
Unified Generative and Discriminative Training for Multi-modal Large Language Models. NeurIPS 2024
[c3]
persistent URL: https://dblp.org/rec/conf/nips/PanF0Y0THZS24
Kaihang Pan, Zhaoyu Fan, Juncheng Li, Qifan Yu, Hao Fei, Siliang Tang, Richang Hong, Hanwang Zhang, Qianru Sun:
Towards Unified Multimodal Editing with Enhanced Knowledge Collaboration. NeurIPS 2024
[c2]
persistent URL: https://dblp.org/rec/conf/sigir/Pan0W0S0LLCT24
Kaihang Pan, Juncheng Li, Wenjie Wang, Hao Fei, Hongye Song, Wei Ji, Jun Lin, Xiaozhong Liu, Tat-Seng Chua, Siliang Tang:
I3: Intent-Introspective Retrieval Conditioned on Instructions. SIGIR 2024: 1839-1849
[i10]
persistent URL: https://dblp.org/rec/journals/corr/abs-2405-01926
Kaihang Pan, Siliang Tang, Juncheng Li, Zhaoyu Fan, Wei Chow, Shuicheng Yan, Tat-Seng Chua, Yueting Zhuang, Hanwang Zhang:
Auto-Encoding Morph-Tokens for Multimodal LLM. CoRR abs/2405.01926 (2024)
[i9]
persistent URL: https://dblp.org/rec/journals/corr/abs-2409-19872
Kaihang Pan, Zhaoyu Fan, Juncheng Li, Qifan Yu, Hao Fei, Siliang Tang, Richang Hong, Hanwang Zhang, Qianru Sun:
Towards Unified Multimodal Editing with Enhanced Knowledge Collaboration. CoRR abs/2409.19872 (2024)
[i8]
persistent URL: https://dblp.org/rec/journals/corr/abs-2411-00304
Wei Chow, Juncheng Li, Qifan Yu, Kaihang Pan, Hao Fei, Zhiqi Ge, Shuai Yang, Siliang Tang, Hanwang Zhang, Qianru Sun:
Unified Generative and Discriminative Training for Multi-modal Large Language Models. CoRR abs/2411.00304 (2024)
[i7]
persistent URL: https://dblp.org/rec/journals/corr/abs-2411-15738
Qifan Yu, Wei Chow, Zhongqi Yue, Kaihang Pan, Yang Wu, Xiaoyang Wan, Juncheng Li, Siliang Tang, Hanwang Zhang, Yueting Zhuang:
AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea. CoRR abs/2411.15738 (2024)
[i6]
persistent URL: https://dblp.org/rec/journals/corr/abs-2412-00161
Haiyi Qiu, Minghe Gao, Long Qian, Kaihang Pan, Qifan Yu, Juncheng Li, Wenjie Wang, Siliang Tang, Yueting Zhuang, Tat-Seng Chua:
STEP: Enhancing Video-LLMs' Compositional Reasoning by Spatio-Temporal Graph-guided Self-Training. CoRR abs/2412.00161 (2024)
[i5]
persistent URL: https://dblp.org/rec/journals/corr/abs-2412-10342
Zhiqi Ge, Juncheng Li, Xinglei Pang, Minghe Gao, Kaihang Pan, Wang Lin, Hao Fei, Wenqiao Zhang, Siliang Tang, Yueting Zhuang:
Iris: Breaking GUI Complexity with Adaptive Focus and Self-Refining. CoRR abs/2412.10342 (2024)
2023
[c1]
persistent URL: https://dblp.org/rec/conf/emnlp/Pan0SLLT23
Kaihang Pan, Juncheng Li, Hongye Song, Jun Lin, Xiaozhong Liu, Siliang Tang:
Self-supervised Meta-Prompt Learning with Meta-Gradient Regularization for Few-shot Generalization. EMNLP (Findings) 2023: 1059-1077
[i4]
persistent URL: https://dblp.org/rec/journals/corr/abs-2303-12314
Kaihang Pan, Juncheng Li, Hongye Song, Jun Lin, Xiaozhong Liu, Siliang Tang:
Meta-augmented Prompt Tuning for Better Few-shot Learning. CoRR abs/2303.12314 (2023)
[i3]
persistent URL: https://dblp.org/rec/journals/corr/abs-2308-04152
Juncheng Li, Kaihang Pan, Zhiqi Ge, Minghe Gao, Hanwang Zhang, Wei Ji, Wenqiao Zhang, Tat-Seng Chua, Siliang Tang, Yueting Zhuang:
Empowering Vision-Language Models to Follow Interleaved Vision-Language Instructions. CoRR abs/2308.04152 (2023)
[i2]
persistent URL: https://dblp.org/rec/journals/corr/abs-2308-10025
Kaihang Pan, Juncheng Li, Hongye Song, Hao Fei, Wei Ji, Shuo Zhang, Jun Lin, Xiaozhong Liu, Siliang Tang:
ControlRetriever: Harnessing the Power of Instructions for Controllable Retrieval. CoRR abs/2308.10025 (2023)
[i1]
persistent URL: https://dblp.org/rec/journals/corr/abs-2310-02821
Dong Chen, Kaihang Pan, Guoming Wang, Yueting Zhuang, Siliang Tang:
Improving Vision Anomaly Detection with the Guidance of Language Modality. CoRR abs/2310.02821 (2023)
