


default search action
Yi Jiang 0009
Person information
- affiliation: Bytedance Inc., Beijing, China
Other persons with the same name
- Yi Jiang — disambiguation page
- Yi Jiang 0001 — Victoria University of Technology, Department of Computer and Mathematical Sciences, Australia
- Yi Jiang 0002
— Fudan University, Department of Communication Science and Engineering, Shanghai, China (and 8 more) - Yi Jiang 0003
— Chinese Academy of Sciences, Institute of Psychology, Beijing, China (and 1 more) - Yi Jiang 0004
— Yangzhou University, Institute of Information Engineering, China - Yi Jiang 0005
— Northwestern Polytechnical University, Xi'an, China - Yi Jiang 0006
— Dalian Maritime University, Information Science and Technology College, China (and 1 more) - Yi Jiang 0007
— Northeastern University, State Key Laboratory of Synthetical Automation for Process Industries, Shenyang, China (and 2 more) - Yi Jiang 0008
— Wuhan University of Science and Technology, School of Computer Science and Technology, Wuhan, China
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2026
[j2]Junfeng Wu
, Yi Jiang
, Chuofan Ma, Yuliang Liu
, Hengshuang Zhao
, Zehuan Yuan, Song Bai, Xiang Bai
:
Liquid: Language Models are Scalable and Unified Multi-Modal Generators. Int. J. Comput. Vis. 134(1): 39 (2026)
[c26]Shilong Zhang, Wenbo Li, Shoufa Chen, Chongjian Ge, Peize Sun, Yifu Zhang, Yi Jiang, Zehuan Yuan, Bingyue Peng, Ping Luo:
FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation. AAAI 2026: 12735-12743- 2025
[c25]Jian Han, Jinlai Liu, Yi Jiang, Bin Yan, Yuqi Zhang, Zehuan Yuan, Bingyue Peng, Xiaobing Liu:
Infinity: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis. CVPR 2025: 15733-15744
[c24]Shoufa Chen, Chongjian Ge, Yuqi Zhang, Yida Zhang, Fengda Zhu, Hao Yang, Hongxiang Hao, Hui Wu, Zhichao Lai, Yifei Hu, Ting-Che Lin, Shilong Zhang, Fu Li, Chuan Li, Xing Wang, Yanghua Peng, Peize Sun, Ping Luo, Yi Jiang, Zehuan Yuan, Bingyue Peng, Xiaobing Liu:
Goku: Flow Based Video Generative Foundation Models. CVPR 2025: 23516-23527
[i32]Shoufa Chen, Chongjian Ge, Yuqi Zhang, Yida Zhang, Fengda Zhu, Hao Yang, Hongxiang Hao, Hui Wu, Zhichao Lai, Yifei Hu, Ting-Che Lin, Shilong Zhang, Fu Li, Chuan Li, Xing Wang, Yanghua Peng, Peize Sun, Ping Luo, Yi Jiang, Zehuan Yuan, Bingyue Peng, Xiaobing Liu:
Goku: Flow Based Video Generative Foundation Models. CoRR abs/2502.04896 (2025)
[i31]Shilong Zhang, Wenbo Li, Shoufa Chen, Chongjian Ge, Peize Sun, Yida Zhang, Yi Jiang, Zehuan Yuan, Binyue Peng, Ping Luo:
FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation. CoRR abs/2502.05179 (2025)
[i30]Chuofan Ma, Yi Jiang, Junfeng Wu, Jihan Yang, Xin Yu, Zehuan Yuan, Bingyue Peng, Xiaojuan Qi:
UniTok: A Unified Tokenizer for Visual Generation and Understanding. CoRR abs/2502.20321 (2025)
[i29]Yifu Zhang, Hao Yang, Yuqi Zhang, Yifei Hu, Fengda Zhu, Chuang Lin, Xiaofeng Mei, Yi Jiang, Bingyue Peng, Zehuan Yuan:
Waver: Wave Your Way to Lifelike Video Generation. CoRR abs/2508.15761 (2025)
[i28]Jinlai Liu, Jian Han, Bin Yan, Hui Wu, Fengda Zhu, Xing Wang, Yi Jiang, Bingyue Peng, Zehuan Yuan:
InfinityStar: Unified Spacetime AutoRegressive Modeling for Visual Generation. CoRR abs/2511.04675 (2025)- 2024
[c23]Junfeng Wu
, Yi Jiang, Qihao Liu, Zehuan Yuan, Xiang Bai, Song Bai:
General Object Foundation Model for Images and Videos at Scale. CVPR 2024: 3783-3795
[c22]Chuang Lin, Yi Jiang, Lizhen Qu, Zehuan Yuan, Jianfei Cai:
Generative Region-Language Pretraining for Open-Ended Object Detection. CVPR 2024: 13958-13968
[c21]Chuofan Ma, Yi Jiang, Jiannan Wu, Zehuan Yuan, Xiaojuan Qi:
Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models. ECCV (6) 2024: 417-435
[i27]Chuang Lin, Yi Jiang, Lizhen Qu, Zehuan Yuan, Jianfei Cai:
Generative Region-Language Pretraining for Open-Ended Object Detection. CoRR abs/2403.10191 (2024)
[i26]Chuofan Ma, Yi Jiang, Jiannan Wu, Zehuan Yuan, Xiaojuan Qi:
Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models. CoRR abs/2404.13013 (2024)
[i25]Peize Sun, Yi Jiang, Shoufa Chen, Shilong Zhang, Bingyue Peng, Ping Luo, Zehuan Yuan:
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation. CoRR abs/2406.06525 (2024)
[i24]Junfeng Wu, Yi Jiang, Chuofan Ma, Yuliang Liu
, Hengshuang Zhao, Zehuan Yuan, Song Bai, Xiang Bai:
Liquid: Language Models are Scalable Multi-modal Generators. CoRR abs/2412.04332 (2024)
[i23]Jian Han, Jinlai Liu, Yi Jiang, Bin Yan, Yuqi Zhang, Zehuan Yuan, Bingyue Peng, Xiaobing Liu:
Infinity: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis. CoRR abs/2412.04431 (2024)- 2023
[j1]Peize Sun
, Rufeng Zhang
, Yi Jiang, Tao Kong
, Chenfeng Xu
, Wei Zhan
, Masayoshi Tomizuka
, Zehuan Yuan, Ping Luo
:
Sparse R-CNN: An End-to-End Framework for Object Detection. IEEE Trans. Pattern Anal. Mach. Intell. 45(12): 15650-15664 (2023)
[c20]Qihao Liu, Junfeng Wu
, Yi Jiang, Xiang Bai, Alan L. Yuille, Song Bai:
InstMove: Instance Motion for Object-centric Video Segmentation. CVPR 2023: 6344-6354
[c19]Bin Yan, Yi Jiang, Jiannan Wu, Dong Wang, Ping Luo, Zehuan Yuan, Huchuan Lu:
Universal Instance Perception as Object Discovery and Retrieval. CVPR 2023: 15325-15336
[c18]Jiannan Wu, Yi Jiang, Bin Yan, Huchuan Lu, Zehuan Yuan, Ping Luo:
Segment Every Reference Object in Spatial and Temporal Spaces. ICCV 2023: 2538-2550
[c17]Jiannan Wu, Yi Jiang, Bin Yan, Huchuan Lu, Zehuan Yuan, Ping Luo:
Exploring Transformers for Open-world Instance Segmentation. ICCV 2023: 6588-6598
[c16]Qiushan Guo, Chuofan Ma, Yi Jiang, Zehuan Yuan, Yizhou Yu, Ping Luo:
EGC: Image Generation and Classification via a Diffusion Energy-Based Model. ICCV 2023: 22895-22905
[c15]Matej Kristan, Jirí Matas, Martin Danelljan, Michael Felsberg, Hyung Jin Chang, Luka Cehovin Zajc, Alan Lukezic, Ondrej Drbohlav, Zhongqun Zhang, Khanh-Tung Tran, Xuan-Son Vu, Johanna Björklund, Christoph Mayer, Yushan Zhang, Lei Ke, Jie Zhao, Gustavo Fernández, Noor Al-Shakarji, Dong An, Michael Arens, Stefan Becker, Goutam Bhat, Sebastian Bullinger, Antoni B. Chan, Shijie Chang, Hanyuan Chen, Xin Chen, Yan Chen, Zhenyu Chen, Yangming Cheng, Yutao Cui, Chunyuan Deng, Jiahua Dong, Matteo Dunnhofer, Wei Feng, Jianlong Fu, Jie Gao, Ruize Han, Zeqi Hao, Jun-Yan He, Keji He, Zhenyu He, Xiantao Hu, Kaer Huang, Yuqing Huang, Yi Jiang, Ben Kang, Jin-Peng Lan, Hyungjun Lee, Chenyang Li, Jiahao Li, Ning Li, Wangkai Li, Xiaodi Li, Xin Li, Pengyu Liu, Yue Liu, Huchuan Lu, Bin Luo, Ping Luo, Yinchao Ma, Deshui Miao, Christian Micheloni, Kannappan Palaniappan, Hancheol Park, Matthieu Paul, Houwen Peng, Zekun Qian, Gani Rahmon, Norbert Scherer-Negenborn, Pengcheng Shao, Wooksu Shin, Elham Soltani Kazemi, Tianhui Song, Rainer Stiefelhagen, Rui Sun, Chuanming Tang, Zhangyong Tang, Imad Eddine Toubal, Jack Valmadre, Joost van de Weijer, Luc Van Gool, Jash Vira, Stéphane Vujasinovic, Cheng Wan, Jia Wan, Dong Wang, Fei Wang, Feifan Wang, He Wang, Limin Wang, Song Wang, Yaowei Wang, Zhepeng Wang, Gangshan Wu, Jiannan Wu, Qiangqiang Wu
, Xiaojun Wu, Anqi Xiao, Jinxia Xie, Chenlong Xu, Min Xu, Tianyang Xu, Yuanyou Xu, Bin Yan, Dawei Yang, Ming-Hsuan Yang, Tianyu Yang, Yi Yang, Zongxin Yang, Xuanwu Yin, Fisher Yu, Hongyuan Yu, Qianjin Yu, Weichen Yu, Yongsheng Yuan, Zehuan Yuan, Jianlin Zhang, Lu Zhang, Tianzhu Zhang, Guodongfang Zhao, Shaochuan Zhao, Yaozong Zheng, Bineng Zhong, Jiawen Zhu, Xuefeng Zhu, Yueting Zhuang, ChengAo Zong, Kunlong Zuo:
The First Visual Object Tracking Segmentation VOTS2023 Challenge Results. ICCV (Workshops) 2023: 1788-1810
[c14]Chuang Lin, Peize Sun, Yi Jiang, Ping Luo, Lizhen Qu, Gholamreza Haffari, Zehuan Yuan, Jianfei Cai:
Learning Object-Language Alignments for Open-Vocabulary Object Detection. ICLR 2023
[c13]Bingyang Wang
, Tanlin Li
, Jiannan Wu
, Yi Jiang
, Huchuan Lu
, You He
:
A Simple Baseline for Open-World Tracking via Self-training. ACM Multimedia 2023: 2765-2774
[c12]Chuofan Ma, Yi Jiang, Xin Wen, Zehuan Yuan, Xiaojuan Qi:
CoDet: Co-occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection. NeurIPS 2023
[i22]Bin Yan, Yi Jiang, Jiannan Wu, Dong Wang, Ping Luo, Zehuan Yuan, Huchuan Lu:
Universal Instance Perception as Object Discovery and Retrieval. CoRR abs/2303.06674 (2023)
[i21]Qihao Liu, Junfeng Wu, Yi Jiang, Xiang Bai, Alan L. Yuille, Song Bai:
InstMove: Instance Motion for Object-centric Video Segmentation. CoRR abs/2303.08132 (2023)
[i20]Qiushan Guo, Yizhou Yu, Yi Jiang, Jiannan Wu, Zehuan Yuan, Ping Luo:
Multi-Level Contrastive Learning for Dense Prediction Task. CoRR abs/2304.02010 (2023)
[i19]Qiushan Guo, Chuofan Ma, Yi Jiang, Zehuan Yuan, Yizhou Yu, Ping Luo:
EGC: Image Generation and Classification via a Diffusion Energy-Based Model. CoRR abs/2304.02012 (2023)
[i18]Jiannan Wu, Yi Jiang, Bin Yan, Huchuan Lu, Zehuan Yuan, Ping Luo:
Exploring Transformers for Open-world Instance Segmentation. CoRR abs/2308.04206 (2023)
[i17]Chuofan Ma, Yi Jiang, Xin Wen, Zehuan Yuan, Xiaojuan Qi:
CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection. CoRR abs/2310.16667 (2023)
[i16]Junfeng Wu, Yi Jiang, Qihao Liu, Zehuan Yuan, Xiang Bai, Song Bai:
General Object Foundation Model for Images and Videos at Scale. CoRR abs/2312.09158 (2023)
[i15]Jiannan Wu, Yi Jiang, Bin Yan, Huchuan Lu, Zehuan Yuan, Ping Luo:
UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces. CoRR abs/2312.15715 (2023)- 2022
[c11]Jiannan Wu, Yi Jiang, Peize Sun, Zehuan Yuan, Ping Luo:
Language as Queries for Referring Video Object Segmentation. CVPR 2022: 4964-4974
[c10]Peize Sun, Jinkun Cao, Yi Jiang, Zehuan Yuan, Song Bai, Kris Kitani, Ping Luo:
DanceTrack: Multi-Object Tracking in Uniform Appearance and Diverse Motion. CVPR 2022: 20961-20970
[c9]Yifu Zhang, Peize Sun, Yi Jiang, Dongdong Yu, Fucheng Weng, Zehuan Yuan, Ping Luo, Wenyu Liu, Xinggang Wang
:
ByteTrack: Multi-object Tracking by Associating Every Detection Box. ECCV (22) 2022: 1-21
[c8]Chuang Lin
, Yi Jiang
, Jianfei Cai
, Lizhen Qu
, Gholamreza Haffari
, Zehuan Yuan
:
Multimodal Transformer with Variable-Length Memory for Vision-and-Language Navigation. ECCV (36) 2022: 380-397
[c7]Junfeng Wu
, Yi Jiang, Song Bai, Wenqing Zhang, Xiang Bai:
SeqFormer: Sequential Transformer for Video Instance Segmentation. ECCV (28) 2022: 553-569
[c6]Junfeng Wu
, Qihao Liu, Yi Jiang, Song Bai, Alan L. Yuille, Xiang Bai:
In Defense of Online Models for Video Instance Segmentation. ECCV (28) 2022: 588-605
[c5]Bin Yan, Yi Jiang, Peize Sun, Dong Wang, Zehuan Yuan, Ping Luo, Huchuan Lu:
Towards Grand Unification of Object Tracking. ECCV (21) 2022: 733-751
[c4]Shuo Yang, Peize Sun, Yi Jiang, Xiaobo Xia, Ruiheng Zhang, Zehuan Yuan, Changhu Wang, Ping Luo, Min Xu:
Objects in Semantic Topology. ICLR 2022
[c3]Chuofan Ma, Qiushan Guo, Yi Jiang, Ping Luo, Zehuan Yuan, Xiaojuan Qi:
Rethinking Resolution in the Context of Efficient Video Recognition. NeurIPS 2022
[i14]Jiannan Wu, Yi Jiang, Peize Sun, Zehuan Yuan, Ping Luo:
Language as Queries for Referring Video Object Segmentation. CoRR abs/2201.00487 (2022)
[i13]Bin Yan, Yi Jiang, Peize Sun, Dong Wang, Zehuan Yuan, Ping Luo, Huchuan Lu:
Towards Grand Unification of Object Tracking. CoRR abs/2207.07078 (2022)
[i12]Junfeng Wu, Qihao Liu, Yi Jiang, Song Bai, Alan L. Yuille, Xiang Bai:
In Defense of Online Models for Video Instance Segmentation. CoRR abs/2207.10661 (2022)
[i11]Chuofan Ma, Qiushan Guo, Yi Jiang, Zehuan Yuan, Ping Luo, Xiaojuan Qi:
Rethinking Resolution in the Context of Efficient Video Recognition. CoRR abs/2209.12797 (2022)
[i10]Junfeng Wu, Yi Jiang, Qihao Liu, Xiang Bai, Song Bai:
The Runner-up Solution for YouTube-VIS Long Video Challenge 2022. CoRR abs/2211.09973 (2022)
[i9]Chuang Lin, Peize Sun, Yi Jiang, Ping Luo, Lizhen Qu, Gholamreza Haffari, Zehuan Yuan, Jianfei Cai:
Learning Object-Language Alignments for Open-Vocabulary Object Detection. CoRR abs/2211.14843 (2022)- 2021
[c2]Peize Sun, Rufeng Zhang, Yi Jiang, Tao Kong, Chenfeng Xu, Wei Zhan, Masayoshi Tomizuka, Lei Li
, Zehuan Yuan, Changhu Wang, Ping Luo:
Sparse R-CNN: End-to-End Object Detection With Learnable Proposals. CVPR 2021: 14454-14463
[c1]Peize Sun, Yi Jiang, Enze Xie, Wenqi Shao, Zehuan Yuan, Changhu Wang, Ping Luo:
What Makes for End-to-End Object Detection? ICML 2021: 9934-9944
[i8]Shuo Yang, Peize Sun, Yi Jiang, Xiaobo Xia, Ruiheng Zhang, Zehuan Yuan, Changhu Wang, Ping Luo, Min Xu:
Objects in Semantic Topology. CoRR abs/2110.02687 (2021)
[i7]Yifu Zhang, Peize Sun, Yi Jiang, Dongdong Yu, Zehuan Yuan, Ping Luo, Wenyu Liu, Xinggang Wang:
ByteTrack: Multi-Object Tracking by Associating Every Detection Box. CoRR abs/2110.06864 (2021)
[i6]Chuang Lin, Yi Jiang, Jianfei Cai, Lizhen Qu, Gholamreza Haffari, Zehuan Yuan:
Multimodal Transformer with Variable-length Memory for Vision-and-Language Navigation. CoRR abs/2111.05759 (2021)
[i5]Peize Sun, Jinkun Cao, Yi Jiang, Zehuan Yuan, Song Bai, Kris Kitani, Ping Luo:
DanceTrack: Multi-Object Tracking in Uniform Appearance and Diverse Motion. CoRR abs/2111.14690 (2021)
[i4]Junfeng Wu, Yi Jiang, Wenqing Zhang, Xiang Bai, Song Bai:
SeqFormer: a Frustratingly Simple Model for Video Instance Segmentation. CoRR abs/2112.08275 (2021)- 2020
[i3]Peize Sun, Rufeng Zhang, Yi Jiang, Tao Kong, Chenfeng Xu, Wei Zhan, Masayoshi Tomizuka, Lei Li, Zehuan Yuan, Changhu Wang, Ping Luo:
Sparse R-CNN: End-to-End Object Detection with Learnable Proposals. CoRR abs/2011.12450 (2020)
[i2]Peize Sun, Yi Jiang, Enze Xie, Zehuan Yuan, Changhu Wang, Ping Luo:
OneNet: Towards End-to-End One-Stage Object Detection. CoRR abs/2012.05780 (2020)
[i1]Peize Sun, Yi Jiang, Rufeng Zhang, Enze Xie, Jinkun Cao, Xinting Hu, Tao Kong, Zehuan Yuan, Changhu Wang, Ping Luo:
TransTrack: Multiple-Object Tracking with Transformer. CoRR abs/2012.15460 (2020)
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from
to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the
of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from
,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from
and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from
.
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2026-04-03 00:40 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID







