


default search action
Yiheng Xu
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2026
[i23]Lei Zhang, Mouxiang Chen, Ruisheng Cao, Jiawei Chen, Fan Zhou, Yiheng Xu, Jiaxi Yang, Zeyao Ma, Liang Chen, Changwei Luo, Kai Zhang, Fan Yan, KaShun Shum, Jiajun Zhang, Zeyu Cui, Feng Hu, Junyang Lin, Binyuan Hui, Min Yang:
MegaFlow: Large-Scale Distributed Orchestration System for the Agentic Era. CoRR abs/2601.07526 (2026)- 2025
[c18]Yuchao Chen, Xiangyuan Jiang, Yiheng Xu, Wei Zhou
:
Modality Perception Network for Multi-modal Rumor Detection. ICIC (1) 2025: 354-365
[c17]Yiheng Xu, Dunjie Lu, Zhennan Shen, Junli Wang, Zekun Wang, Yuchen Mao, Caiming Xiong, Tao Yu:
AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials. ICLR 2025
[c16]Yiheng Xu, Zekun Wang, Junli Wang, Dunjie Lu, Tianbao Xie, Amrita Saha, Doyen Sahoo, Tao Yu, Caiming Xiong:
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction. ICML 2025
[i22]Shuai Bai, Keqin Chen, Xuejing Liu, Jialin Wang, Wenbin Ge, Sibo Song, Kai Dang, Peng Wang, Shijie Wang
, Jun Tang, Humen Zhong, Yuanzhi Zhu, Ming-Hsuan Yang, Zhaohai Li, Jianqiang Wan, Pengfei Wang, Wei Ding, Zheren Fu, Yiheng Xu, Jiabo Ye, Xi Zhang, Tianbao Xie, Zesen Cheng, Hang Zhang, Zhibo Yang, Haiyang Xu, Junyang Lin:
Qwen2.5-VL Technical Report. CoRR abs/2502.13923 (2025)
[i21]Tianbao Xie, Jiaqi Deng, Xiaochuan Li, Junlin Yang, Haoyuan Wu, Jixuan Chen, Wenjing Hu, Xinyuan Wang, Yuhui Xu, Zekun Wang, Yiheng Xu, Junli Wang, Doyen Sahoo, Tao Yu, Caiming Xiong:
Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis. CoRR abs/2505.13227 (2025)
[i20]Xinyuan Wang, Bowen Wang, Dunjie Lu, Junlin Yang, Tianbao Xie, Junli Wang, Jiaqi Deng, Xiaole Guo, Yiheng Xu, Chen Henry Wu, Zhennan Shen, Zhuokai Li, Ryan Li, Xiaochuan Li, Junda Chen, Boyuan Zheng, Peihang Li, Fangyu Lei, Ruisheng Cao, Yeqiao Fu, Dongchan Shin, Martin Shin, Jiarui Hu, Yuyan Wang, Jixuan Chen, Yuxiao Ye, Danyang Zhang, Dikang Du, Hao Hu, Huarong Chen, Zaida Zhou, Haotian Yao, Ziwei Chen, Qizheng Gu, Yipu Wang, Heng Wang, Diyi Yang, Victor Zhong, Flood Sung, Y. Charles, Zhilin Yang, Tao Yu:
OpenCUA: Open Foundations for Computer-Use Agents. CoRR abs/2508.09123 (2025)
[i19]Xianzhen Luo, Jinyang Huang, Wenzhen Zheng, Qingfu Zhu, Mingzheng Xu, Yiheng Xu, YuanTao Fan, Libo Qin, Wanxiang Che:
How Many Code and Test Cases Are Enough? Evaluating Test Cases Generation from a Binary-Matrix Perspective. CoRR abs/2510.08720 (2025)
[i18]Dunjie Lu, Yiheng Xu, Junli Wang, Haoyuan Wu, Xinyuan Wang, Zekun Wang, Junlin Yang, Hongjin Su, Jixuan Chen, Junda Chen, Yuchen Mao, Jingren Zhou, Junyang Lin, Binyuan Hui, Tao Yu:
VideoAgentTrek: Computer Use Pretraining from Unlabeled Videos. CoRR abs/2510.19488 (2025)
[i17]Yueqi Song, Ketan Ramaneti, Zaid Sheikh, Ziru Chen, Boyu Gou, Tianbao Xie, Yiheng Xu, Danyang Zhang, Apurva Gandhi, Fan Yang, Joseph Liu, Tianyue Ou, Zhihao Yuan, Frank Xu, Shuyan Zhou, Xingyao Wang, Xiang Yue, Tao Yu, Huan Sun, Yu Su, Graham Neubig:
Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents. CoRR abs/2510.24702 (2025)- 2024
[j1]Quyuan Tao
, Yiheng Xu, Youzhe He
, Ting Luo, Xiaoming Li, Lei Han:
Benchmarking mapping algorithms for cell-type annotating in mouse brain by integrating single-nucleus RNA-seq and Stereo-seq data. Briefings Bioinform. 25(4) (2024)
[c15]Yanpeng Ge
, Bo Liu, Shaoshuai Su, Yiheng Xu, Qiang Sun, Weikai Zhang, Wensheng Gao:
Study of the Vibration Characteristics of 550 kV GIS Circuit Breaker Based on Rigid-Flexible Coupling Model*. AIM 2024: 536-541
[c14]Yiheng Xu, Pranav Sivaraman, Hariharan Devarajan, Kathryn M. Mohror, Abhinav Bhatele:
ML-based Modeling to Predict I/O Performance on Different Storage Sub-systems. HiPC 2024: 221-231
[c13]Yiheng Xu, Hongjin Su, Chen Xing, Boyu Mi, Qian Liu, Weijia Shi, Binyuan Hui, Fan Zhou, Yitao Liu, Tianbao Xie, Zhoujun Cheng, Siheng Zhao, Lingpeng Kong, Bailin Wang, Caiming Xiong, Tao Yu:
Lemur: Harmonizing Natural Language and Code for Language Agents. ICLR 2024
[c12]Tianbao Xie, Danyang Zhang, Jixuan Chen, Xiaochuan Li, Siheng Zhao, Ruisheng Cao, Toh Jing Hua, Zhoujun Cheng, Dongchan Shin, Fangyu Lei, Yitao Liu, Yiheng Xu, Shuyan Zhou, Silvio Savarese, Caiming Xiong, Victor Zhong, Tao Yu:
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments. NeurIPS 2024
[i16]Tianbao Xie, Danyang Zhang
, Jixuan Chen, Xiaochuan Li, Siheng Zhao, Ruisheng Cao, Toh Jing Hua, Zhoujun Cheng, Dongchan Shin, Fangyu Lei, Yitao Liu, Yiheng Xu, Shuyan Zhou, Silvio Savarese, Caiming Xiong, Victor Zhong, Tao Yu:
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments. CoRR abs/2404.07972 (2024)
[i15]Yiheng Xu, Zekun Wang, Junli Wang, Dunjie Lu, Tianbao Xie, Amrita Saha, Doyen Sahoo, Tao Yu, Caiming Xiong:
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction. CoRR abs/2412.04454 (2024)
[i14]Yiheng Xu, Dunjie Lu, Zhennan Shen, Junli Wang, Zekun Wang, Yuchen Mao, Caiming Xiong, Tao Yu:
AgentTrek: Agent Trajectory Synthesis via Guiding Replay with Web Tutorials. CoRR abs/2412.09605 (2024)- 2023
[c11]Yiheng Xu, Mingkun Zhang, Sibo Huang, Dongyu Zhang:
Tooth Segmentation from Cone-Beam CT Images Through Boundary Refinement. ICANN (4) 2023: 190-202
[i13]Mukai Li, Shansan Gong, Jiangtao Feng, Yiheng Xu, Jun Zhang, Zhiyong Wu, Lingpeng Kong:
In-Context Learning with Many Demonstration Examples. CoRR abs/2302.04931 (2023)
[i12]Yiheng Xu, Hongjin Su, Chen Xing, Boyu Mi, Qian Liu, Weijia Shi, Binyuan Hui, Fan Zhou, Yitao Liu, Tianbao Xie, Zhoujun Cheng, Siheng Zhao, Lingpeng Kong, Bailin Wang, Caiming Xiong, Tao Yu:
Lemur: Harmonizing Natural Language and Code for Language Agents. CoRR abs/2310.06830 (2023)
[i11]Tianbao Xie, Fan Zhou, Zhoujun Cheng, Peng Shi, Luoxuan Weng, Yitao Liu, Toh Jing Hua, Junning Zhao
, Qian Liu, Che Liu, Leo Z. Liu, Yiheng Xu, Hongjin Su, Dongchan Shin, Caiming Xiong, Tao Yu:
OpenAgents: An Open Platform for Language Agents in the Wild. CoRR abs/2310.10634 (2023)
[i10]Yiheng Xu, Pranav Sivaraman, Hariharan Devarajan, Kathryn M. Mohror, Abhinav Bhatele:
ML-based Modeling to Predict I/O Performance on Different Storage Sub-systems. CoRR abs/2312.06131 (2023)- 2022
[c10]Yiheng Xu
, Tengchao Lv, Lei Cui, Guoxin Wang, Yijuan Lu, Dinei A. F. Florêncio, Cha Zhang, Furu Wei:
XFUND: A Benchmark Dataset for Multilingual Visually Rich Form Understanding. ACL (Findings) 2022: 3214-3224
[c9]Junlong Li, Yiheng Xu
, Lei Cui, Furu Wei:
MarkupLM: Pre-training of Text and Markup Language for Visually Rich Document Understanding. ACL (1) 2022: 6078-6087
[c8]Junlong Li, Yiheng Xu, Tengchao Lv, Lei Cui, Cha Zhang, Furu Wei:
DiT: Self-supervised Pre-training for Document Image Transformer. ACM Multimedia 2022: 3530-3539
[i9]Junlong Li, Yiheng Xu, Tengchao Lv, Lei Cui, Cha Zhang, Furu Wei:
DiT: Self-supervised Pre-training for Document Image Transformer. CoRR abs/2203.02378 (2022)- 2021
[c7]Yang Xu, Yiheng Xu
, Tengchao Lv, Lei Cui, Furu Wei, Guoxin Wang, Yijuan Lu, Dinei A. F. Florêncio, Cha Zhang, Wanxiang Che, Min Zhang, Lidong Zhou:
LayoutLMv2: Multi-modal Pre-training for Visually-rich Document Understanding. ACL/IJCNLP (1) 2021: 2579-2591
[c6]Zilong Wang, Yiheng Xu
, Lei Cui, Jingbo Shang, Furu Wei:
LayoutReader: Pre-training of Text and Layout for Reading Order Detection. EMNLP (1) 2021: 4735-4744
[i8]Jason Mohoney, Roger Waleffe, Yiheng Xu, Theodoros Rekatsinas, Shivaram Venkataraman:
Learning Massive Graph Embeddings on a Single Machine. CoRR abs/2101.08358 (2021)
[i7]Yiheng Xu
, Tengchao Lv, Lei Cui, Guoxin Wang, Yijuan Lu, Dinei Florêncio, Cha Zhang, Furu Wei:
LayoutXLM: Multimodal Pre-training for Multilingual Visually-rich Document Understanding. CoRR abs/2104.08836 (2021)
[i6]Zilong Wang, Yiheng Xu, Lei Cui, Jingbo Shang, Furu Wei:
LayoutReader: Pre-training of Text and Layout for Reading Order Detection. CoRR abs/2108.11591 (2021)
[i5]Junlong Li, Yiheng Xu, Lei Cui, Furu Wei:
MarkupLM: Pre-training of Text and Markup Language for Visually-rich Document Understanding. CoRR abs/2110.08518 (2021)
[i4]Lei Cui, Yiheng Xu, Tengchao Lv, Furu Wei:
Document AI: Benchmarks, Models and Applications. CoRR abs/2111.08609 (2021)- 2020
[c5]Yongji Wu, Defu Lian, Yiheng Xu
, Le Wu, Enhong Chen:
Graph Convolutional Networks with Markov Random Field Reasoning for Social Spammer Detection. AAAI 2020: 1054-1061
[c4]Minghao Li
, Yiheng Xu
, Lei Cui, Shaohan Huang, Furu Wei, Zhoujun Li
, Ming Zhou:
DocBank: A Benchmark Dataset for Document Layout Analysis. COLING 2020: 949-960
[c3]Yiheng Xu
, Minghao Li
, Lei Cui, Shaohan Huang, Furu Wei, Ming Zhou:
LayoutLM: Pre-training of Text and Layout for Document Image Understanding. KDD 2020: 1192-1200
[i3]Minghao Li, Yiheng Xu, Lei Cui, Shaohan Huang, Furu Wei, Zhoujun Li, Ming Zhou:
DocBank: A Benchmark Dataset for Document Layout Analysis. CoRR abs/2006.01038 (2020)
[i2]Yang Xu, Yiheng Xu, Tengchao Lv, Lei Cui, Furu Wei, Guoxin Wang, Yijuan Lu, Dinei A. F. Florêncio, Cha Zhang, Wanxiang Che, Min Zhang, Lidong Zhou:
LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding. CoRR abs/2012.14740 (2020)
2010 – 2019
- 2019
[i1]Yiheng Xu, Minghao Li, Lei Cui, Shaohan Huang, Furu Wei, Ming Zhou:
LayoutLM: Pre-training of Text and Layout for Document Image Understanding. CoRR abs/1912.13318 (2019)
2000 – 2009
- 2009
[c2]Heping Xiong, Yiheng Xu, Yi Xiao:
Comparative Analysis of Multi-period Portfolio Strategies. BIFE 2009: 266-269- 2008
[c1]Yiheng Xu, Qiangwei Wang, Jinglu Hu:
An Improved Discrete Particle Swarm Optimization Based on Cooperative Swarms. IAT 2008: 79-82
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from
to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the
of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from
,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from
and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from
.
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2026-03-07 23:05 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID







