


Size Wu
2020 – today
2026
[i14]Hongyang Wei, Hongbo Liu, Zidong Wang, Yi Peng, Baixin Xu, Size Wu, Xuying Zhang, Xianglong He, Zexiang Liu, Peiyu Wang, Xuchen Song, Yangguang Li, Yang Liu, Yahui Zhou:
Skywork UniPic 3.0: Unified Multi-Image Composition via Sequence Modeling. CoRR abs/2601.15664 (2026)
2025
[j1]Shilin Xu, Xiangtai Li, Size Wu, Wenwei Zhang, Yunhai Tong, Chen Change Loy:
DST-Det: Open-Vocabulary Object Detection via Dynamic Self-Training. IEEE Trans. Circuits Syst. Video Technol. 35(5): 5037-5050 (2025)
[c6]Size Wu, Sheng Jin, Wenwei Zhang, Lumin Xu, Wentao Liu, Wei Li, Chen Change Loy:
F-LMM: Grounding Frozen Large Multimodal Models. CVPR 2025: 24710-24721
[i13]Size Wu, Wenwei Zhang, Lumin Xu, Sheng Jin, Zhonghua Wu, Qingyi Tao, Wentao Liu, Wei Li, Chen Change Loy:
Harmonizing Visual Representations for Unified Multimodal Understanding and Generation. CoRR abs/2503.21979 (2025)
[i12]Size Wu, Zhonghua Wu, Zerui Gong, Qingyi Tao, Sheng Jin, Qinyue Li, Wei Li, Chen Change Loy:
OpenUni: A Simple Baseline for Unified Multimodal Understanding and Generation. CoRR abs/2505.23661 (2025)
[i11]Zujin Guo, Size Wu, Zhongang Cai, Wei Li, Chen Change Loy:
Controllable Human-centric Keyframe Interpolation with Generative Prior. CoRR abs/2506.03119 (2025)
[i10]Kang Liao, Size Wu, Zhonghua Wu, Linyi Jin, Chao Wang, Yikai Wang, Fei Wang, Wei Li, Chen Change Loy:
Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation. CoRR abs/2510.08673 (2025)
[i9]Huiqiang Sun, Liao Shen, Zhan Peng, Kun Wang, Size Wu, Yuhang Zang, Tianqi Liu, Zihao Huang, Xingyu Zeng, Zhiguo Cao, Wei Li, Chen Change Loy:
Generative Photographic Control for Scene-Consistent Video Cinematic Editing. CoRR abs/2511.12921 (2025)
[i8]Qingyu Shi, Size Wu, Jinbin Bai, Kaidong Yu, Yujing Wang, Yunhai Tong, Xiangtai Li, Xuelong Li:
RecTok: Reconstruction Distillation along Rectified Flow. CoRR abs/2512.13421 (2025)
2024
[c5]Size Wu, Wenwei Zhang, Lumin Xu, Sheng Jin, Wentao Liu, Chen Change Loy:
CLIM: Contrastive Language-Image Mosaic for Region Representation. AAAI 2024: 6117-6125
[c4]Xiangtai Li, Haobo Yuan, Wei Li, Henghui Ding, Size Wu, Wenwei Zhang, Yining Li, Kai Chen, Chen Change Loy:
OMG-Seg: Is One Model Good Enough for all Segmentation? CVPR 2024: 27948-27959
[c3]Size Wu, Wenwei Zhang, Lumin Xu, Sheng Jin, Xiangtai Li, Wentao Liu, Chen Change Loy:
CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction. ICLR 2024
[i7]Xiangtai Li, Haobo Yuan, Wei Li, Henghui Ding, Size Wu, Wenwei Zhang, Yining Li, Kai Chen, Chen Change Loy:
OMG-Seg: Is One Model Good Enough For All Segmentation? CoRR abs/2401.10229 (2024)
[i6]Size Wu, Sheng Jin, Wenwei Zhang, Lumin Xu, Wentao Liu, Wei Li, Chen Change Loy:
F-LMM: Grounding Frozen Large Multimodal Models. CoRR abs/2406.05821 (2024)
2023
[c2]Size Wu, Wenwei Zhang, Sheng Jin, Wentao Liu, Chen Change Loy:
Aligning Bag of Regions for Open-Vocabulary Object Detection. CVPR 2023: 15254-15264
[i5]Size Wu, Wenwei Zhang, Sheng Jin, Wentao Liu, Chen Change Loy:
Aligning Bag of Regions for Open-Vocabulary Object Detection. CoRR abs/2302.13996 (2023)
[i4]Shilin Xu, Xiangtai Li, Size Wu, Wenwei Zhang, Yining Li, Guangliang Cheng, Yunhai Tong, Kai Chen, Chen Change Loy:
DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection. CoRR abs/2310.01393 (2023)
[i3]Size Wu, Wenwei Zhang, Lumin Xu, Sheng Jin, Xiangtai Li, Wentao Liu, Chen Change Loy:
CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction. CoRR abs/2310.01403 (2023)
[i2]Size Wu, Wenwei Zhang, Lumin Xu, Sheng Jin, Wentao Liu, Chen Change Loy:
CLIM: Contrastive Language-Image Mosaic for Region Representation. CoRR abs/2312.11376 (2023)
2021
[c1]Size Wu, Sheng Jin, Wentao Liu, Lei Bai, Chen Qian, Dong Liu, Wanli Ouyang:
Graph-Based 3D Multi-Person Pose Estimation Using Multi-View Images. ICCV 2021: 11128-11137
[i1]Size Wu, Sheng Jin, Wentao Liu, Lei Bai, Chen Qian, Dong Liu, Wanli Ouyang:
Graph-Based 3D Multi-Person Pose Estimation Using Multi-View Images. CoRR abs/2109.05885 (2021)
last updated on 2026-02-24 22:40 CET by the dblp team
all metadata released as open data under CC0 1.0 license









