Yuxuan Wang 0004

Person information

affiliation: Alibaba Inc., Qwen team, Beijing, China
affiliation: Peking University, Institute of Computer Technology, Beijing, China
affiliation: Peking University, Center for Data Science, Beijing, China
affiliation: Beijing Institute for General Artificial Intelligence (BIGAI), China

2020 – today

2025
[j1]
  persistent URL:
  - https://dblp.org/rec/journals/tmlr/JiaLKWWWWZSLQLHZZ25
Zixia Jia, Jiaqi Li, Yipeng Kang, Yuxuan Wang, Tong Wu, Quansen Wang, Xiaobo Wang, Shuyi Zhang, Junzhe Shen, Qing Li, Siyuan Qi, Yitao Liang, Di He, Zilong Zheng, Song-Chun Zhu:
The AI Hippocampus: How Far are We From Human Memory? Trans. Mach. Learn. Res. 2025 (2025)
[c11]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/WangMWLL025
Yueqian Wang, Xiaojun Meng, Yuxuan Wang, Jianxin Liang, Qun Liu, Dongyan Zhao:
Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding. AAAI 2025: 25425-25433
[c10]
  persistent URL:
  - https://dblp.org/rec/conf/cogsci/PengMW0000Z25
Yongqian Peng, Yuxi Ma, Mengmeng Wang, Yuxuan Wang, Yizhou Wang, Chi Zhang, Yixin Zhu, Zilong Zheng:
Probing and Inducing Combinational Creativity in Vision-Language Models. CogSci 2025
[c9]
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/WangWCW0Z25
Yuxuan Wang, Yueqian Wang, Bo Chen, Tong Wu, Dongyan Zhao, Zilong Zheng:
OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts. CVPR 2025: 18925-18935
[c8]
  persistent URL:
  - https://dblp.org/rec/conf/icml/WuSJ0Z25
Tong Wu, Junzhe Shen, Zixia Jia, Yuxuan Wang, Zilong Zheng:
TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation. ICML 2025
[i25]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2501-05037
Rujie Wu, Xiaojian Ma, Hai Ci, Yue Fan, Yuxuan Wang, Haozhe Zhao, Qing Li, Yizhou Wang:
LongViTU: Instruction Tuning for Long-Form Video Understanding. CoRR abs/2501.05037 (2025)
[i24]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-18890
Tong Wu, Junzhe Shen, Zixia Jia, Yuxuan Wang, Zilong Zheng:
From Hours to Minutes: Lossless Acceleration of Ultra Long Sequence Generation up to 100K Tokens. CoRR abs/2502.18890 (2025)
[i23]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2503-22952
Yuxuan Wang, Yueqian Wang, Bo Chen, Tong Wu, Dongyan Zhao, Zilong Zheng:
OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts. CoRR abs/2503.22952 (2025)
[i22]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2504-13120
Yongqian Peng, Yuxi Ma, Mengmeng Wang, Yuxuan Wang, Yizhou Wang, Chi Zhang, Yixin Zhu, Zilong Zheng:
Probing and Inducing Combinational Creativity in Vision-Language Models. CoRR abs/2504.13120 (2025)
[i21]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-13308
Hengli Li, Chenxi Li, Tong Wu, Xuekai Zhu, Yuxuan Wang, Zhaoxin Yu, Eric Hanchen Jiang, Song-Chun Zhu, Zixia Jia, Ying Nian Wu, Zilong Zheng:
Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space. CoRR abs/2505.13308 (2025)
[i20]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-19752
Hengli Li, Yuxuan Wang, Song-Chun Zhu, Ying Nian Wu, Zilong Zheng:
Discrete Markov Bridge. CoRR abs/2505.19752 (2025)
[i19]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-02555
Zhitao Zeng, Zhu Zhuo, Xiaojun Jia, Erli Zhang, Junde Wu, Jiaan Zhang, Yuxuan Wang, Chang Han Low, Jian Jiang, Zilong Zheng, Xiaochun Cao, Yutong Ban, Qi Dou, Yang Liu, Yueming Jin:
SurgVLM: A Large Vision-Language Model and Systematic Evaluation Benchmark for Surgical Intelligence. CoRR abs/2506.02555 (2025)
[i18]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-17429
Zhitao Zeng, Guojian Yuan, Junyuan Mao, Yuxuan Wang, Xiaoshuang Jia, Yueming Jin:
Multi-scale Temporal Prediction via Incremental Generation and Multi-agent Collaboration. CoRR abs/2509.17429 (2025)
[i17]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-24445
Jianxin Liang, Tan Yue, Yuxuan Wang, Yueqian Wang, Zhihan Yin, Huishuai Zhang, Dongyan Zhao:
Beyond Isolated Facts: Synthesizing Narrative and Grounded Supervision for VideoQA. CoRR abs/2509.24445 (2025)
[i16]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-25773
Zhengpeng Shi, Hengli Li, Yanpeng Zhao, Jianqun Zhou, Yuxuan Wang, Qinrong Cui, Wei Bi, Song-Chun Zhu, Bo Zhao, Zilong Zheng:
V-HUB: A Visual-Centric Humor Understanding Benchmark for Video LLMs. CoRR abs/2509.25773 (2025)
[i15]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2512-24885
Hengli Li, Zhaoxin Yu, Qi Shen, Chenxi Li, Mengmeng Wang, Tinglang Wu, Yipeng Kang, Yuxuan Wang, Song-Chun Zhu, Zixia Jia, Zilong Zheng:
BEDA: Belief Estimation as Probabilistic Constraints for Performing Strategic Dialogue Acts. CoRR abs/2512.24885 (2025)
2024
[c7]
  persistent URL:
  - https://dblp.org/rec/conf/aaai/WangWC024
Yueqian Wang, Yuxuan Wang, Kai Chen, Dongyan Zhao:
STAIR: Spatial-Temporal Reasoning with Auditable Intermediate Results for Video Question Answering. AAAI 2024: 19215-19223
[c6]
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/WangW0L0LZ24
Yuxuan Wang, Yueqian Wang, Pengfei Wu, Jianxin Liang, Dongyan Zhao, Yang Liu, Zilong Zheng:
Efficient Temporal Extrapolation of Multimodal Large Language Models with Temporal Grounding Bridge. EMNLP 2024: 9972-9987
[i14]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-03901
Yueqian Wang, Yuxuan Wang, Kai Chen, Dongyan Zhao:
STAIR: Spatial-Temporal Reasoning with Auditable Intermediate Results for Video Question Answering. CoRR abs/2401.03901 (2024)
[i13]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-16050
Yuxuan Wang, Yueqian Wang, Pengfei Wu, Jianxin Liang, Dongyan Zhao, Zilong Zheng:
LSTP: Language-guided Spatial-Temporal Prompt Learning for Long-form Video-Text Understanding. CoRR abs/2402.16050 (2024)
[i12]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-10228
Yueqian Wang, Xiaojun Meng, Jianxin Liang, Yuxuan Wang, Qun Liu, Dongyan Zhao:
HawkEye: Training Video-Text LLMs for Grounding Text in Videos. CoRR abs/2403.10228 (2024)
[i11]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-16338
Yuxuan Wang, Yueqian Wang, Dongyan Zhao, Cihang Xie, Zilong Zheng:
VideoHallucer: Evaluating Intrinsic and Extrinsic Hallucinations in Large Video-Language Models. CoRR abs/2406.16338 (2024)
[i10]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-02210
Yuxuan Wang, Alan L. Yuille, Zhuowan Li, Zilong Zheng:
ExoViP: Step-by-step Verification and Exploration with Exoskeleton Modules for Compositional Visual Reasoning. CoRR abs/2408.02210 (2024)
[i9]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-01071
Yuxuan Wang, Cihang Xie, Yang Liu, Zilong Zheng:
VideoLLaMB: Long-context Video Understanding with Recurrent Memory Bridges. CoRR abs/2409.01071 (2024)
[i8]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-01151
Yueqian Wang, Jianxin Liang, Yuxuan Wang, Huishuai Zhang, Dongyan Zhao:
Understanding Multimodal Hallucination with Parameter-Free Representation Alignment. CoRR abs/2409.01151 (2024)
[i7]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2411-17991
Yueqian Wang, Xiaojun Meng, Yuxuan Wang, Jianxin Liang, Jiansheng Wei, Huishuai Zhang, Dongyan Zhao:
VideoLLM Knows When to Speak: Enhancing Time-Sensitive Video Comprehension with Video-Text Duet Interaction Format. CoRR abs/2411.17991 (2024)
[i6]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-17295
Yueqian Wang, Xiaojun Meng, Yuxuan Wang, Jianxin Liang, Qun Liu, Dongyan Zhao:
Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding. CoRR abs/2412.17295 (2024)
2023
[c5]
  persistent URL:
  - https://dblp.org/rec/conf/acl/WangW0Z23
Yuxuan Wang, Jianghui Wang, Dongyan Zhao, Zilong Zheng:
Rethinking Dictionaries and Glyphs for Chinese Language Pre-training. ACL (Findings) 2023: 1089-1101
[c4]
  persistent URL:
  - https://dblp.org/rec/conf/acl/WangZZ0WZ23
Yuxuan Wang, Zilong Zheng, Xueliang Zhao, Jinpeng Li, Yueqian Wang, Dongyan Zhao:
VSTAR: A Video-grounded Dialogue Dataset for Situated Semantic Understanding with Scene and Topic Transitions. ACL (1) 2023: 5036-5048
[c3]
  persistent URL:
  - https://dblp.org/rec/conf/nlpcc/WangWZ23
Yueqian Wang, Yuxuan Wang, Dongyan Zhao:
Overview of the NLPCC 2023 Shared Task 10: Learn to Watch TV: Multimodal Dialogue Understanding and Response Generation. NLPCC (3) 2023: 412-419
[i5]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-18756
Yuxuan Wang, Zilong Zheng, Xueliang Zhao, Jinpeng Li, Yueqian Wang, Dongyan Zhao:
VSTAR: A Video-grounded Dialogue Dataset for Situated Semantic Understanding with Scene and Topic Transitions. CoRR abs/2305.18756 (2023)
[i4]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-18760
Yuxuan Wang, Jianghui Wang, Dongyan Zhao, Zilong Zheng:
Shuo Wen Jie Zi: Rethinking Dictionaries and Glyphs for Chinese Language Pre-training. CoRR abs/2305.18760 (2023)
[i3]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-02252
Jianghui Wang, Yuxuan Wang, Dongyan Zhao, Zilong Zheng:
MoviePuzzle: Visual Narrative Reasoning through Multimodal Order Learning. CoRR abs/2306.02252 (2023)
[i2]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-15516
Xiaowen Sun, Jiazhan Feng, Yuxuan Wang, Yuxuan Lai, Xingyu Shen, Dongyan Zhao:
Teaching Text-to-Image Models to Communicate. CoRR abs/2309.15516 (2023)
2022
[c2]
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/ZhaoWTW022
Xueliang Zhao, Yuxuan Wang, Chongyang Tao, Chenshuo Wang, Dongyan Zhao:
Collaborative Reasoning on Multi-Modal Semantic Graphs for Video-Grounded Dialogue Generation. EMNLP (Findings) 2022: 5988-5998
[c1]
  persistent URL:
  - https://dblp.org/rec/conf/nlpcc/WangZZ22
Yuxuan Wang, Xueliang Zhao, Dongyan Zhao:
Overview of the NLPCC 2022 Shared Task: Multi-modal Dialogue Understanding and Generation. NLPCC (2) 2022: 328-335
[i1]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2210-12460
Xueliang Zhao, Yuxuan Wang, Chongyang Tao, Chenshuo Wang, Dongyan Zhao:
Collaborative Reasoning on Multi-Modal Semantic Graphs for Video-Grounded Dialogue Generation. CoRR abs/2210.12460 (2022)
