


default search action
Yuxuan Wang 0004
Person information
- affiliation: Alibaba Inc., Qwen team, Beijing, China
- affiliation: Peking University, Institute of Computer Technology, Beijing, China
- affiliation: Peking University, Center for Data Science, Beijing, China
- affiliation: Beijing Institute for General Artificial Intelligence (BIGAI), China
Other persons with the same name
- Yuxuan Wang (aka: YuXuan Wang, Yu-xuan Wang, Yu-Xuan Wang) — disambiguation page
- Yuxuan Wang 0001 — Harbin Institute of Technology, School of Computer Science and Technology, China
- Yuxuan Wang 0002 — ByteDance AI Lab, Mountain View, CA, USA (and 2 more)
- Yuxuan Wang 0003
— École Polytechnique Fédérale de Lausanne (EPFL), Embedded System Laboratory (ESL), Switzerland - Yuxuan Wang 0005 — Peking University, School of Computer Science, Academy for Advanced Interdisciplinary Studies, Beijing, China
- Yuxuan Wang 0006 — Southern University of Science and Technology (SUSTech), Key Laboratory of Biomimetic Robotics and Intelligent Systems, Shenzhen, China
- Yuxuan Wang 0007 — Tsinghua University, Beijing Information Science and Technology National Research Center (BNRist), Beijing, China
- Yuxuan Wang 0008
— Boston University, School of Public Health, Department of Biostatistics, Boston, MA, USA - Yuxuan Wang 0009 — Hebei Medical University, School of Public Health, Shijiazhuang, China
- Yuxuan Wang 0010 — ByteDance Seed
- Yuxuan Wang 0011 — Johns Hopkins University, Sol Goldman Pancreatic Cancer Research Center, Baltimore, MD, USA
- Yuxuan Wang 0012
— Beijing Institute of Technology, School of Computer Science and Technology, Beijing, China (and 1 more) - Yuxuan Wang 0013
— University of Macau, tate Key Laboratory of Internet of Things for Smart City (SKL-IOTSC), Macau, China (and 1 more)
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
[j1]Zixia Jia, Jiaqi Li, Yipeng Kang, Yuxuan Wang, Tong Wu, Quansen Wang, Xiaobo Wang, Shuyi Zhang, Junzhe Shen, Qing Li, Siyuan Qi, Yitao Liang, Di He, Zilong Zheng, Song-Chun Zhu:
The AI Hippocampus: How Far are We From Human Memory? Trans. Mach. Learn. Res. 2025 (2025)
[c11]Yueqian Wang, Xiaojun Meng, Yuxuan Wang, Jianxin Liang, Qun Liu, Dongyan Zhao:
Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding. AAAI 2025: 25425-25433
[c10]Yongqian Peng, Yuxi Ma, Mengmeng Wang, Yuxuan Wang, Yizhou Wang, Chi Zhang, Yixin Zhu, Zilong Zheng:
Probing and Inducing Combinational Creativity in Vision-Language Models. CogSci 2025
[c9]Yuxuan Wang, Yueqian Wang, Bo Chen, Tong Wu, Dongyan Zhao, Zilong Zheng:
OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts. CVPR 2025: 18925-18935
[c8]Tong Wu, Junzhe Shen, Zixia Jia, Yuxuan Wang, Zilong Zheng:
TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation. ICML 2025
[i25]Rujie Wu, Xiaojian Ma, Hai Ci, Yue Fan, Yuxuan Wang, Haozhe Zhao, Qing Li, Yizhou Wang:
LongViTU: Instruction Tuning for Long-Form Video Understanding. CoRR abs/2501.05037 (2025)
[i24]Tong Wu, Junzhe Shen, Zixia Jia, Yuxuan Wang, Zilong Zheng:
From Hours to Minutes: Lossless Acceleration of Ultra Long Sequence Generation up to 100K Tokens. CoRR abs/2502.18890 (2025)
[i23]Yuxuan Wang, Yueqian Wang, Bo Chen, Tong Wu, Dongyan Zhao, Zilong Zheng:
OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts. CoRR abs/2503.22952 (2025)
[i22]Yongqian Peng, Yuxi Ma, Mengmeng Wang, Yuxuan Wang, Yizhou Wang, Chi Zhang, Yixin Zhu, Zilong Zheng:
Probing and Inducing Combinational Creativity in Vision-Language Models. CoRR abs/2504.13120 (2025)
[i21]Hengli Li, Chenxi Li, Tong Wu, Xuekai Zhu, Yuxuan Wang, Zhaoxin Yu, Eric Hanchen Jiang, Song-Chun Zhu, Zixia Jia, Ying Nian Wu, Zilong Zheng:
Seek in the Dark: Reasoning via Test-Time Instance-Level Policy Gradient in Latent Space. CoRR abs/2505.13308 (2025)
[i20]Hengli Li, Yuxuan Wang, Song-Chun Zhu, Ying Nian Wu, Zilong Zheng:
Discrete Markov Bridge. CoRR abs/2505.19752 (2025)
[i19]Zhitao Zeng, Zhu Zhuo, Xiaojun Jia, Erli Zhang, Junde Wu, Jiaan Zhang, Yuxuan Wang, Chang Han Low, Jian Jiang, Zilong Zheng, Xiaochun Cao, Yutong Ban, Qi Dou, Yang Liu, Yueming Jin:
SurgVLM: A Large Vision-Language Model and Systematic Evaluation Benchmark for Surgical Intelligence. CoRR abs/2506.02555 (2025)
[i18]Zhitao Zeng, Guojian Yuan, Junyuan Mao, Yuxuan Wang, Xiaoshuang Jia, Yueming Jin:
Multi-scale Temporal Prediction via Incremental Generation and Multi-agent Collaboration. CoRR abs/2509.17429 (2025)
[i17]Jianxin Liang, Tan Yue, Yuxuan Wang, Yueqian Wang, Zhihan Yin, Huishuai Zhang, Dongyan Zhao:
Beyond Isolated Facts: Synthesizing Narrative and Grounded Supervision for VideoQA. CoRR abs/2509.24445 (2025)
[i16]Zhengpeng Shi, Hengli Li, Yanpeng Zhao, Jianqun Zhou, Yuxuan Wang, Qinrong Cui, Wei Bi, Song-Chun Zhu, Bo Zhao, Zilong Zheng:
V-HUB: A Visual-Centric Humor Understanding Benchmark for Video LLMs. CoRR abs/2509.25773 (2025)
[i15]Hengli Li, Zhaoxin Yu, Qi Shen, Chenxi Li, Mengmeng Wang, Tinglang Wu, Yipeng Kang, Yuxuan Wang, Song-Chun Zhu, Zixia Jia, Zilong Zheng:
BEDA: Belief Estimation as Probabilistic Constraints for Performing Strategic Dialogue Acts. CoRR abs/2512.24885 (2025)- 2024
[c7]Yueqian Wang, Yuxuan Wang, Kai Chen, Dongyan Zhao:
STAIR: Spatial-Temporal Reasoning with Auditable Intermediate Results for Video Question Answering. AAAI 2024: 19215-19223
[c6]Yuxuan Wang, Yueqian Wang, Pengfei Wu, Jianxin Liang, Dongyan Zhao, Yang Liu, Zilong Zheng:
Efficient Temporal Extrapolation of Multimodal Large Language Models with Temporal Grounding Bridge. EMNLP 2024: 9972-9987
[i14]Yueqian Wang, Yuxuan Wang, Kai Chen, Dongyan Zhao:
STAIR: Spatial-Temporal Reasoning with Auditable Intermediate Results for Video Question Answering. CoRR abs/2401.03901 (2024)
[i13]Yuxuan Wang, Yueqian Wang, Pengfei Wu, Jianxin Liang, Dongyan Zhao, Zilong Zheng:
LSTP: Language-guided Spatial-Temporal Prompt Learning for Long-form Video-Text Understanding. CoRR abs/2402.16050 (2024)
[i12]Yueqian Wang, Xiaojun Meng, Jianxin Liang, Yuxuan Wang, Qun Liu, Dongyan Zhao:
HawkEye: Training Video-Text LLMs for Grounding Text in Videos. CoRR abs/2403.10228 (2024)
[i11]Yuxuan Wang, Yueqian Wang, Dongyan Zhao, Cihang Xie, Zilong Zheng:
VideoHallucer: Evaluating Intrinsic and Extrinsic Hallucinations in Large Video-Language Models. CoRR abs/2406.16338 (2024)
[i10]Yuxuan Wang, Alan L. Yuille, Zhuowan Li, Zilong Zheng:
ExoViP: Step-by-step Verification and Exploration with Exoskeleton Modules for Compositional Visual Reasoning. CoRR abs/2408.02210 (2024)
[i9]Yuxuan Wang, Cihang Xie, Yang Liu, Zilong Zheng:
VideoLLaMB: Long-context Video Understanding with Recurrent Memory Bridges. CoRR abs/2409.01071 (2024)
[i8]Yueqian Wang, Jianxin Liang, Yuxuan Wang, Huishuai Zhang, Dongyan Zhao:
Understanding Multimodal Hallucination with Parameter-Free Representation Alignment. CoRR abs/2409.01151 (2024)
[i7]Yueqian Wang, Xiaojun Meng, Yuxuan Wang, Jianxin Liang, Jiansheng Wei, Huishuai Zhang, Dongyan Zhao:
VideoLLM Knows When to Speak: Enhancing Time-Sensitive Video Comprehension with Video-Text Duet Interaction Format. CoRR abs/2411.17991 (2024)
[i6]Yueqian Wang, Xiaojun Meng, Yuxuan Wang, Jianxin Liang, Qun Liu, Dongyan Zhao:
Friends-MMC: A Dataset for Multi-modal Multi-party Conversation Understanding. CoRR abs/2412.17295 (2024)- 2023
[c5]Yuxuan Wang, Jianghui Wang, Dongyan Zhao, Zilong Zheng:
Rethinking Dictionaries and Glyphs for Chinese Language Pre-training. ACL (Findings) 2023: 1089-1101
[c4]Yuxuan Wang, Zilong Zheng, Xueliang Zhao, Jinpeng Li, Yueqian Wang, Dongyan Zhao:
VSTAR: A Video-grounded Dialogue Dataset for Situated Semantic Understanding with Scene and Topic Transitions. ACL (1) 2023: 5036-5048
[c3]Yueqian Wang, Yuxuan Wang, Dongyan Zhao:
Overview of the NLPCC 2023 Shared Task 10: Learn to Watch TV: Multimodal Dialogue Understanding and Response Generation. NLPCC (3) 2023: 412-419
[i5]Yuxuan Wang, Zilong Zheng, Xueliang Zhao, Jinpeng Li, Yueqian Wang, Dongyan Zhao:
VSTAR: A Video-grounded Dialogue Dataset for Situated Semantic Understanding with Scene and Topic Transitions. CoRR abs/2305.18756 (2023)
[i4]Yuxuan Wang, Jianghui Wang, Dongyan Zhao, Zilong Zheng:
Shuo Wen Jie Zi: Rethinking Dictionaries and Glyphs for Chinese Language Pre-training. CoRR abs/2305.18760 (2023)
[i3]Jianghui Wang, Yuxuan Wang, Dongyan Zhao, Zilong Zheng:
MoviePuzzle: Visual Narrative Reasoning through Multimodal Order Learning. CoRR abs/2306.02252 (2023)
[i2]Xiaowen Sun, Jiazhan Feng, Yuxuan Wang, Yuxuan Lai, Xingyu Shen, Dongyan Zhao:
Teaching Text-to-Image Models to Communicate. CoRR abs/2309.15516 (2023)- 2022
[c2]Xueliang Zhao, Yuxuan Wang
, Chongyang Tao, Chenshuo Wang, Dongyan Zhao:
Collaborative Reasoning on Multi-Modal Semantic Graphs for Video-Grounded Dialogue Generation. EMNLP (Findings) 2022: 5988-5998
[c1]Yuxuan Wang, Xueliang Zhao, Dongyan Zhao:
Overview of the NLPCC 2022 Shared Task: Multi-modal Dialogue Understanding and Generation. NLPCC (2) 2022: 328-335
[i1]Xueliang Zhao, Yuxuan Wang, Chongyang Tao, Chenshuo Wang, Dongyan Zhao:
Collaborative Reasoning on Multi-Modal Semantic Graphs for Video-Grounded Dialogue Generation. CoRR abs/2210.12460 (2022)
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from
to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the
of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from
,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from
and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from
.
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2026-02-05 00:03 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID







