


Pengfei Wan 0001
Person information
- affiliation: Kuaishou Technology, Beijing, China
- affiliation (former): Meitu Inc., Beijing, China
- affiliation (PhD 2015): Hong Kong University of Science and Technology, Hong Kong
Other persons with the same name
- Pengfei Wan (aka: Peng-fei Wan, Peng-Fei Wan): disambiguation page
- Pengfei Wan 0002: Shaanxi Normal University (SNNU), School of Computer Science, Xi'an, China
2020 – today
- 2025
[j16]Tao Zhang, Xingye Tian, Yikang Zhou, Shunping Ji, Xuebo Wang, Xin Tao, Yuan Zhang, Pengfei Wan, Zhongyuan Wang, Yu Wu:
DVIS++: Improved Decoupled Framework for Universal Video Segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 47(7): 5918-5929 (2025)
[j15]Jinchao Zhu, Yuxuan Wang, Siyuan Pan, Pengfei Wan, Di Zhang, Gao Huang:
A-SDM: Accelerating Stable Diffusion Through Model Assembly and Feature Inheritance Strategies. IEEE Trans. Neural Networks Learn. Syst. 36(10): 18478-18491 (2025)
[j14]Kaiwen Jiang, Feng-Lin Liu, Shu-Yu Chen, Pengfei Wan, Yuan Zhang, Yu-Kun Lai, Hongbo Fu, Lin Gao:
NeRFFaceShop: Learning a Photo-Realistic 3D-Aware Generative Model of Animatable and Relightable Heads From Large-Scale in-the-Wild Videos. IEEE Trans. Vis. Comput. Graph. 31(10): 7938-7950 (2025)
[j13]Yuxin Zhang, Weiming Dong, Fan Tang, Nisha Huang, Haibin Huang, Chongyang Ma, Pengfei Wan, Tong-Yee Lee, Changsheng Xu:
MotionCrafter: Plug-and-Play Motion Guidance for Diffusion Models. IEEE Trans. Vis. Comput. Graph. 31(10): 8372-8384 (2025)
[j12]Shaohua Pan, Xinyu Yi, Yan Zhou, Weihua Jian, Yuan Zhang, Pengfei Wan, Feng Xu:
DiffCap: Diffusion-Based Real-Time Human Motion Capture Using Sparse IMUs and a Monocular Camera. IEEE Trans. Vis. Comput. Graph. 31(12): 10272-10283 (2025)
[c61]Wei-Qi Feng, Dong Han, Ze-Kang Zhou, Shunkai Li, Xiaoqiang Liu, Pengfei Wan, Di Zhang, Miao Wang:
GPAvatar: High-fidelity Head Avatars by Learning Efficient Gaussian Projections. CVPR 2025: 250-259
[c60]Zixuan Ye, Huijuan Huang, Xintao Wang, Pengfei Wan, Di Zhang, Wenhan Luo:
StyleMaster: Stylize Your Video with Artistic Generation and Translation. CVPR 2025: 2630-2640
[c59]Qiuheng Wang, Yukai Shi, Jiarong Ou, Rui Chen, Ke Lin, Jiahao Wang, Boyuan Jiang, Haotian Yang, Mingwu Zheng, Xin Tao, Fei Yang, Pengfei Wan, Di Zhang:
Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content. CVPR 2025: 8428-8437
[c58]Zhuoman Liu, Weicai Ye, Yan Luximon, Pengfei Wan, Di Zhang:
Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation. CVPR 2025: 11016-11025
[c57]Shian Du, Menghan Xia, Chang Liu, Xintao Wang, Jing Wang, Pengfei Wan, Di Zhang, Xiangyang Ji:
PatchVSR: Breaking Video Diffusion Resolution Limits with Patch-wise Video Super-Resolution. CVPR 2025: 17799-17809
[c56]Yuanyang Yin, Yaqi Zhao, Mingwu Zheng, Ke Lin, Jiarong Ou, Rui Chen, Victor Shea-Jay Huang, Jiahao Wang, Xin Tao, Pengfei Wan, Di Zhang, Baoqun Yin, Wentao Zhang, Kun Gai:
Towards Precise Scaling Laws for Video Diffusion Transformers. CVPR 2025: 18155-18165
[c55]Feng-Lin Liu, Hongbo Fu, Xintao Wang, Weicai Ye, Pengfei Wan, Di Zhang, Lin Gao:
SketchVideo: Sketch-based Video Generation and Editing. CVPR 2025: 23379-23390
[c54]Yuanyang Yin, Yaqi Zhao, Yajie Zhang, Yuanxing Zhang, Ke Lin, Jiahao Wang, Xin Tao, Pengfei Wan, Wentao Zhang, Feng Zhao:
SEA: Supervised Embedding Alignment for Token-Level Visual-Textual Integration in MLLMs. EMNLP 2025: 1058-1070
[c53]Yuchi Wang, Yishuo Cai, Shuhuai Ren, Sihan Yang, Linli Yao, Yuanxin Liu, Yuanxing Zhang, Pengfei Wan, Xu Sun:
RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruction. EMNLP 2025: 21785-21804
[c52]Jianhong Bai, Menghan Xia, Xintao Wang, Ziyang Yuan, Zuozhu Liu, Haoji Hu, Pengfei Wan, Di Zhang:
SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints. ICLR 2025
[c51]Hejia Chen, Haoxian Zhang, Shoulong Zhang, Xiaoqiang Liu, Sisi Zhuang, Yuan Zhang, Pengfei Wan, Di Zhang, Shuai Li:
Cafe-Talk: Generating 3D Talking Face Animation with Multimodal Coarse- and Fine-grained Control. ICLR 2025
[c50]Qi Fan, Xin Tao, Lei Ke, Mingqiao Ye, Di Zhang, Pengfei Wan, Yu-Wing Tai, Chi-Keung Tang:
Stable Segment Anything Model. ICLR 2025
[c49]Xiao Fu, Xian Liu, Xintao Wang, Sida Peng, Menghan Xia, Xiaoyu Shi, Ziyang Yuan, Pengfei Wan, Di Zhang, Dahua Lin:
3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation. ICLR 2025
[c48]Bohan Zeng, Ling Yang, Jiaming Liu, Minghao Xu, Yuanxing Zhang, Pengfei Wan, Wentao Zhang, Shuicheng Yan:
EditWorld: Simulating World Dynamics for Instruction-Following Image Editing. ACM Multimedia 2025: 12674-12681
[c47]Luozhou Wang, Ziyang Mai, Guibao Shen, Yixun Liang, Xin Tao, Pengfei Wan, Di Zhang, Yijun Li, Ying-Cong Chen:
Motion Inversion for Video Customization. SIGGRAPH (Conference Paper Track) 2025: 4:1-4:12
[c46]Qinghe Wang, Yawen Luo, Xiaoyu Shi, Xu Jia, Huchuan Lu, Tianfan Xue, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai:
CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation. SIGGRAPH (Conference Paper Track) 2025: 60:1-60:10
[c45]Jiwen Yu, Jianhong Bai, Yiran Qin, Quande Liu, Xintao Wang, Pengfei Wan, Di Zhang, Xihui Liu:
Context as Memory: Scene-Consistent Interactive Long Video Generation with Memory Retrieval. SIGGRAPH Asia 2025: 19:1-19:11
[c44]Yawen Luo, Xiaoyu Shi, Jianhong Bai, Menghan Xia, Tianfan Xue, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai:
CamCloneMaster: Enabling Reference-based Camera Control for Video Generation. SIGGRAPH Asia 2025: 20:1-20:10
[i126]Yuzhou Huang, Ziyang Yuan, Quande Liu, Qiulin Wang, Xintao Wang, Ruimao Zhang, Pengfei Wan, Di Zhang, Kun Gai:
ConceptMaster: Multi-Concept Video Customization on Diffusion Transformer Models Without Test-Time Tuning. CoRR abs/2501.04698 (2025)
[i125]Jiwen Yu, Yiran Qin, Xintao Wang, Pengfei Wan, Di Zhang, Xihui Liu:
GameFactory: Creating New Games with Generative Interactive Videos. CoRR abs/2501.08325 (2025)
[i124]Jie Liu, Gongye Liu, Jiajun Liang, Ziyang Yuan, Xiaokun Liu, Mingwu Zheng, Xiele Wu, Qiulin Wang, Wenyu Qin, Menghan Xia, Xintao Wang, Xiaohong Liu, Fei Yang, Pengfei Wan, Di Zhang, Kun Gai, Yujiu Yang, Wanli Ouyang:
Improving Video Generation with Human Feedback. CoRR abs/2501.13918 (2025)
[i123]Qinghe Wang, Yawen Luo, Xiaoyu Shi, Xu Jia, Huchuan Lu, Tianfan Xue, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai:
CineMaster: A 3D-Aware and Controllable Framework for Cinematic Text-to-Video Generation. CoRR abs/2502.08639 (2025)
[i122]Borui Liao, Yulong Xu, Jiao Ou, Kaiyuan Yang, Weihua Jian, Pengfei Wan, Di Zhang:
FlexDuo: A Pluggable System for Enabling Full-Duplex Capabilities in Speech Dialogue Systems. CoRR abs/2502.13472 (2025)
[i121]Zhen Yang, Guibao Shen, Liang Hou, Mushui Liu, Luozhou Wang, Xin Tao, Pengfei Wan, Di Zhang, Ying-Cong Chen:
RectifiedHR: Enable Efficient High-Resolution Image Generation via Energy Rectification. CoRR abs/2503.02537 (2025)
[i120]Xukun Zhou, Fengxin Li, Ming Chen, Yan Zhou, Pengfei Wan, Di Zhang, Yeying Jin, Zhaoxin Fan, Hongyan Liu, Jun He:
ExGes: Expressive Human Motion Retrieval and Modulation for Audio-Driven Gesture Synthesis. CoRR abs/2503.06499 (2025)
[i119]Shiyuan Yang, Zheng Gu, Liang Hou, Xin Tao, Pengfei Wan, Xiaodong Chen, Jing Liao:
MTV-Inpaint: Multi-Task Long Video Inpainting. CoRR abs/2503.11412 (2025)
[i118]Jianhong Bai, Menghan Xia, Xiao Fu, Xintao Wang, Lianrui Mu, Jinwen Cao, Zuozhu Liu, Haoji Hu, Xiang Bai, Pengfei Wan, Di Zhang:
ReCamMaster: Camera-Controlled Generative Rendering from A Single Video. CoRR abs/2503.11647 (2025)
[i117]Minglei Shi, Ziyang Yuan, Haotian Yang, Xintao Wang, Mingwu Zheng, Xin Tao, Wenliang Zhao, Wenzhao Zheng, Jie Zhou, Jiwen Lu, Pengfei Wan, Di Zhang, Kun Gai:
DiffMoE: Dynamic Token Selection for Scalable Diffusion Transformers. CoRR abs/2503.14487 (2025)
[i116]Hejia Chen, Haoxian Zhang, Shoulong Zhang, Xiaoqiang Liu, Sisi Zhuang, Yuan Zhang, Pengfei Wan, Di Zhang, Shuai Li:
Cafe-Talk: Generating 3D Talking Face Animation with Multimodal Coarse- and Fine-grained Control. CoRR abs/2503.14517 (2025)
[i115]Jiwen Yu, Yiran Qin, Haoxuan Che, Quande Liu, Xintao Wang, Pengfei Wan, Di Zhang, Xihui Liu:
Position: Interactive Generative Video as Next-Generation Game Engine. CoRR abs/2503.17359 (2025)
[i114]Cong Liu, Liang Hou, Mingwu Zheng, Xin Tao, Pengfei Wan, Di Zhang, Kun Gai:
Boosting Resolution Generalization of Diffusion Transformers with Randomized Positional Encodings. CoRR abs/2503.18719 (2025)
[i113]Xuan Ju, Weicai Ye, Quande Liu, Qiulin Wang, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai, Qiang Xu:
FullDiT: Multi-Task Video Generative Foundation Model with Full Attention. CoRR abs/2503.19907 (2025)
[i112]Nan Gao, Yihua Bao, Dongdong Weng, Jiayi Zhao, Jia Li, Yan Zhou, Pengfei Wan, Di Zhang:
SARGes: Semantically Aligned Reliable Gesture Generation via Intent Chain. CoRR abs/2503.20202 (2025)
[i111]Feng-Lin Liu, Hongbo Fu, Xintao Wang, Weicai Ye, Pengfei Wan, Di Zhang, Lin Gao:
SketchVideo: Sketch-based Video Generation and Editing. CoRR abs/2503.23284 (2025)
[i110]Zhichao Liao, Xiaokun Liu, Wenyu Qin, Qingyu Li, Qiulin Wang, Pengfei Wan, Di Zhang, Long Zeng, Pingfa Feng:
HumanAesExpert: Advancing a Multi-Modality Foundation Model for Human Image Aesthetic Assessment. CoRR abs/2503.23907 (2025)
[i109]Shengqiong Wu, Weicai Ye, Jiahao Wang, Quande Liu, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai, Shuicheng Yan, Hao Fei, Tat-Seng Chua:
Any2Caption: Interpreting Any Condition to Caption for Controllable Video Generation. CoRR abs/2503.24379 (2025)
[i108]Xiaole Xian, Zhichao Liao, Qingyu Li, Wenyu Qin, Pengfei Wan, Weicheng Xie, Long Zeng, Linlin Shen, Pingfa Feng:
SPF-Portrait: Towards Pure Portrait Customization with Semantic Pollution-Free Fine-tuning. CoRR abs/2504.00396 (2025)
[i107]Ruotong Wang, Mingli Zhu, Jiarong Ou, Rui Chen, Xin Tao, Pengfei Wan, Baoyuan Wu:
BadVideo: Stealthy Backdoor Attack against Text-to-Video Generation. CoRR abs/2504.16907 (2025)
[i106]Jiwen Yu, Yiran Qin, Haoxuan Che, Quande Liu, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai, Hao Chen, Xihui Liu:
A Survey of Interactive Generative Video. CoRR abs/2504.21853 (2025)
[i105]Jie Liu, Gongye Liu, Jiajun Liang, Yangguang Li, Jiaheng Liu, Xintao Wang, Pengfei Wan, Di Zhang, Wanli Ouyang:
Flow-GRPO: Training Flow Matching Models via Online RL. CoRR abs/2505.05470 (2025)
[i104]Tianxiong Zhong, Xingye Tian, Boyuan Jiang, Xuebo Wang, Xin Tao, Pengfei Wan, Zhiwei Zhang:
VFRTok: Variable Frame Rates Video Tokenizer with Duration-Proportional Information Assumption. CoRR abs/2505.12053 (2025)
[i103]Yuechen Zhang, Jinbo Xing, Bin Xia, Shaoteng Liu, Bohao Peng, Xin Tao, Pengfei Wan, Eric Lo, Jiaya Jia:
Training-Free Efficient Video Generation via Dynamic Token Carving. CoRR abs/2505.16864 (2025)
[i102]Haoran He, Jiajun Liang, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai, Ling Pan:
Scaling Image and Video Generation via Test-Time Evolutionary Search. CoRR abs/2505.17618 (2025)
[i101]Yang Shi, Huanqian Wang, Wulin Xie, Huanyao Zhang, Lijie Zhao, Yifan Zhang, Xinfeng Li, Chaoyou Fu, Zhuoer Wen, Wenting Liu, Zhuoran Zhang, Xinlong Chen, Bohan Zeng, Sihan Yang, Yuanxing Zhang, Pengfei Wan, Haotian Wang, Wenjing Yang:
MME-VideoOCR: Evaluating OCR-Based Capabilities of Multimodal LLMs in Video Scenarios. CoRR abs/2505.21333 (2025)
[i100]Ziqiao Peng, Jiwen Liu, Haoxian Zhang, Xiaoqiang Liu, Songlin Tang, Pengfei Wan, Di Zhang, Hongyan Liu, Jun He:
OmniSync: Towards Universal Lip Synchronization via Diffusion Transformers. CoRR abs/2505.21448 (2025)
[i99]Yuchi Wang, Yishuo Cai, Shuhuai Ren, Sihan Yang, Linli Yao, Yuanxin Liu, Yuanxing Zhang, Pengfei Wan, Xu Sun:
RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruction. CoRR abs/2505.22613 (2025)
[i98]Xiao Fu, Xintao Wang, Xian Liu, Jianhong Bai, Runsen Xu, Pengfei Wan, Di Zhang, Dahua Lin:
Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control. CoRR abs/2506.01943 (2025)
[i97]Yawen Luo, Jianhong Bai, Xiaoyu Shi, Menghan Xia, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai, Tianfan Xue:
CamCloneMaster: Enabling Reference-based Camera Control for Video Generation. CoRR abs/2506.03140 (2025)
[i96]Jiwen Yu, Jianhong Bai, Yiran Qin, Quande Liu, Xintao Wang, Pengfei Wan, Di Zhang, Xihui Liu:
Context as Memory: Scene-Consistent Interactive Long Video Generation with Memory Retrieval. CoRR abs/2506.03141 (2025)
[i95]Xuanhua He, Quande Liu, Zixuan Ye, Weicai Ye, Qiulin Wang, Xintao Wang, Qifeng Chen, Pengfei Wan, Di Zhang, Kun Gai:
FullDiT2: Efficient In-Context Conditioning for Video Diffusion Transformers. CoRR abs/2506.04213 (2025)
[i94]Zixuan Ye, Xuanhua He, Quande Liu, Qiulin Wang, Xintao Wang, Pengfei Wan, Di Zhang, Kun Gai, Qifeng Chen, Wenhan Luo:
UNIC: Unified In-Context Video Editing. CoRR abs/2506.04216 (2025)
[i93]Xinlong Chen, Yuanxing Zhang, Yushuo Guan, Bohan Zeng, Yang Shi, Sihan Yang, Pengfei Wan, Qiang Liu, Liang Wang, Tieniu Tan:
VersaVid-R1: A Versatile Video Understanding and Reasoning Model from Question Answering to Captioning Tasks. CoRR abs/2506.09079 (2025)
[i92]Kaiyi Huang, Yukun Huang, Xintao Wang, Zinan Lin, Xuefei Ning, Pengfei Wan, Di Zhang, Yu Wang, Xihui Liu:
FilMaster: Bridging Cinematic Principles and Generative AI for Automated Film Generation. CoRR abs/2506.18899 (2025)
[i91]Liangbin Xie, Yu Li, Shian Du, Menghan Xia, Xintao Wang, Fanghua Yu, Ziyan Chen, Pengfei Wan, Jiantao Zhou, Chao Dong:
SimpleGVR: A Simple Baseline for Latent-Cascaded Video Super-Resolution. CoRR abs/2506.19838 (2025)
[i90]Jianzong Wu, Liang Hou, Haotian Yang, Xin Tao, Ye Tian, Pengfei Wan, Di Zhang, Yunhai Tong:
VMoBA: Mixture-of-Block Attention for Video Diffusion Models. CoRR abs/2506.23858 (2025)
[i89]Yukai Shi, Jiarong Ou, Rui Chen, Haotian Yang, Jiahao Wang, Xin Tao, Pengfei Wan, Di Zhang, Kun Gai:
Imbalance in Balance: Online Concept Balancing in Generation Models. CoRR abs/2507.13345 (2025)
[i88]Liang Hou, Yuan Gao, Boyuan Jiang, Xin Tao, Qi Yan, Renjie Liao, Pengfei Wan, Di Zhang, Kun Gai:
Score Augmentation for Diffusion Models. CoRR abs/2508.07926 (2025)
[i87]Ming Chen, Liyuan Cui, Wenyuan Zhang, Haoxian Zhang, Yan Zhou, Xiaohan Li, Songlin Tang, Jiwen Liu, Borui Liao, Hejia Chen, Xiaoqiang Liu, Pengfei Wan:
MIDAS: Multimodal Interactive Digital-humAn Synthesis via Real-time Autoregressive Video Generation. CoRR abs/2508.19320 (2025)
[i86]Ouxiang Li, Yuan Wang, Xinting Hu, Huijuan Huang, Rui Chen, Jiarong Ou, Xin Tao, Pengfei Wan, Xiaojuan Qi, Fuli Feng:
Easier Painting Than Thinking: Can Text-to-Image Models Set the Stage, but Not Direct the Play? CoRR abs/2509.03516 (2025)
[i85]Yikang Ding, Jiwen Liu, Wenyuan Zhang, Zekun Wang, Wentao Hu, Liyuan Cui, Mingming Lao, Yingchao Shao, Hui Liu, Xiaohan Li, Ming Chen, Xiaoqiang Liu, Yu-Shen Liu, Pengfei Wan:
Kling-Avatar: Grounding Multimodal Instructions for Cascaded Long-Duration Avatar Animation Synthesis. CoRR abs/2509.09595 (2025)
[i84]Yidan Zhang, Mutian Xu, Yiming Hao, Kun Zhou, Jiahao Chang, Xiaoqiang Liu, Pengfei Wan, Hongbo Fu, Xiaoguang Han:
VC-Agent: An Interactive Agent for Customized Video Dataset Collection. CoRR abs/2509.21291 (2025)
[i83]Siyu Cao, Hangting Chen, Peng Chen, Yiji Cheng, Yutao Cui, Xinchi Deng, Ying Dong, Kipper Gong, Tianpeng Gu, Xiusen Gu, Tiankai Hang, Duojun Huang, Jie Jiang, Zhengkai Jiang, Weijie Kong, Changlin Li, Donghao Li, Junzhe Li, Xin Li, Yang Li, Zhenxi Li, Zhimin Li, Jiaxin Lin, Linus, Lucaz Liu, Shu Liu, Songtao Liu, Yu Liu, Yuhong Liu, Yanxin Long, Fanbin Lu, Qinglin Lu, Yuyang Peng, Yuanbo Peng, Xiangwei Shen, Yixuan Shi, Jiale Tao, Yangyu Tao, Qi Tian, Pengfei Wan, Chunyu Wang, Kai Wang, Lei Wang, Linqing Wang, Lucas Wang, Qixun Wang, Weiyan Wang, Hao Wen, Bing Wu, Jianbing Wu, Yue Wu, Senhao Xie, Fang Yang, Miles Yang, Xiaofeng Yang, Xuan Yang, Zhantao Yang, Jingmiao Yu, Zheng Yuan, Chao Zhang, Jian-Wei Zhang, Peizhen Zhang, Shi-Xue Zhang, Tao Zhang, Weigang Zhang, Yepeng Zhang, Yingfang Zhang, Zihao Zhang, Zijian Zhang, Penghao Zhao, Zhiyuan Zhao, Xuefei Zhe, Jianchen Zhu, Zhao Zhong:
HunyuanImage 3.0 Technical Report. CoRR abs/2509.23951 (2025)
[i82]Yang Shi, Yuhao Dong, Yue Ding, Yuran Wang, Xuanyu Zhu, Sheng Zhou, Wenting Liu, Haochen Tian, Rundong Wang, Huanqian Wang, Zuyan Liu, Bohan Zeng, Ruizhe Chen, Qixun Wang, Zhuoran Zhang, Xinlong Chen, Chengzhuo Tong, Bozhou Li, Chaoyou Fu, Qiang Liu, Haotian Wang, Wenjing Yang, Yuanxing Zhang, Pengfei Wan, Yifan Zhang, Ziwei Liu:
RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark. CoRR abs/2509.24897 (2025)
[i81]Zhihong Chen, Xuehai Bai, Yang Shi, Chaoyou Fu, Huanyu Zhang, Haotian Wang, Xiaoyan Sun, Zhang Zhang, Liang Wang, Yuanxing Zhang, Pengfei Wan, Yifan Zhang:
OpenGPT-4o-Image: A Comprehensive Dataset for Advanced Image Generation and Editing. CoRR abs/2509.24900 (2025)
[i80]Jia Jun Cheng Xian, Muchen Li, Haotian Yang, Xin Tao, Pengfei Wan, Leonid Sigal, Renjie Liao:
Free Lunch Alignment of Text-to-Image Diffusion Models without Preference Image Pairs. CoRR abs/2509.25771 (2025)
[i79]Shian Du, Menghan Xia, Chang Liu, Xintao Wang, Jing Wang, Pengfei Wan, Di Zhang, Xiangyang Ji:
PatchVSR: Breaking Video Diffusion Resolution Limits with Patch-wise Video Super-Resolution. CoRR abs/2509.26025 (2025)
[i78]Shian Du, Menghan Xia, Chang Liu, Quande Liu, Xintao Wang, Pengfei Wan, Xiangyang Ji:
UniMMVSR: A Unified Multi-Modal Framework for Cascaded Video Super-Resolution. CoRR abs/2510.08143 (2025)
[i77]Cong Wei, Quande Liu, Zixuan Ye, Qiulin Wang, Xintao Wang, Pengfei Wan, Kun Gai, Wenhu Chen:
UniVideo: Unified Understanding, Generation, and Editing for Videos. CoRR abs/2510.08377 (2025)
[i76]Minghong Cai, Qiulin Wang, Zongli Ye, Wenze Liu, Quande Liu, Weicai Ye, Xintao Wang, Pengfei Wan, Kun Gai, Xiangyu Yue:
VideoCanvas: Unified Video Completion from Arbitrary Spatiotemporal Patches via In-Context Conditioning. CoRR abs/2510.08555 (2025)
[i75]Xinlong Chen, Yue Ding, Weihong Lin, Jingyun Hua, Linli Yao, Yang Shi, Bozhou Li, Yuanxing Zhang, Qiang Liu, Pengfei Wan, Liang Wang, Tieniu Tan:
AVoCaDO: An Audiovisual Video Captioner Driven by Temporal Orchestration. CoRR abs/2510.10395 (2025)
[i74]Qunzhong Wang, Jie Liu, Jiajun Liang, Yilei Jiang, Yuanxing Zhang, Jinyuan Chen, Yaozhi Zheng, Xintao Wang, Pengfei Wan, Xiangyu Yue, Jiaheng Liu:
VR-Thinker: Boosting Video Reward Models through Thinking-with-Image Reasoning. CoRR abs/2510.10518 (2025)
[i73]Yu Li, Menghan Xia, Gongye Liu, Jianhong Bai, Xintao Wang, Conglang Zhang, Yuxuan Lin, Ruihang Chu, Pengfei Wan, Yujiu Yang:
AdaViewPlanner: Adapting Video Diffusion Models for Viewpoint Planning in 4D Scenes. CoRR abs/2510.10670 (2025)
[i72]Jincheng Zhong, Boyuan Jiang, Xin Tao, Pengfei Wan, Kun Gai, Mingsheng Long:
Mitigating the Noise Shift for Denoising Generative Models via Noise Awareness Guidance. CoRR abs/2510.12497 (2025)
[i71]Sihui Ji, Xi Chen, Xin Tao, Pengfei Wan, Hengshuang Zhao:
PhysMaster: Mastering Physical Representation for Video Generation via Reinforcement Learning. CoRR abs/2510.13809 (2025)
[i70]Zhen Yang, Mingyang Zhang, Feng Chen, Ganggui Ding, Liang Hou, Xin Tao, Pengfei Wan, Ying-Cong Chen:
Less is More: Improving LLM Reasoning with Minimal Test-Time Intervention. CoRR abs/2510.13940 (2025)
[i69]Yuanhui Huang, Weiliang Chen, Wenzhao Zheng, Xin Tao, Pengfei Wan, Jie Zhou, Jiwen Lu:
Terra: Explorable Native 3D World Model with Point Latents. CoRR abs/2510.14977 (2025)
[i68]Minglei Shi, Haolin Wang, Wenzhao Zheng, Ziyang Yuan, Xiaoshi Wu, Xintao Wang, Pengfei Wan, Jie Zhou, Jiwen Lu:
Latent Diffusion Model without Variational Autoencoder. CoRR abs/2510.15301 (2025)
[i67]Jing Wang, Jiajun Liang, Jie Liu, Henglin Liu, Gongye Liu, Jun Zheng, Wanyuan Pang, Ao Ma, Zhenyu Xie, Xintao Wang, Meng Wang, Pengfei Wan, Xiaodan Liang:
GRPO-Guard: Mitigating Implicit Over-Optimization in Flow Matching via Regulated Clipping. CoRR abs/2510.22319 (2025)
[i66]Baolu Li, Yiming Zhang, Qinghe Wang, Liqian Ma, Xiaoyu Shi, Xintao Wang, Pengfei Wan, Zhenfei Yin, Yunzhi Zhuge, Huchuan Lu, Xu Jia:
VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning. CoRR abs/2510.25772 (2025)
[i65]Yukun Huang, Jiwen Yu, Yanning Zhou, Jianan Wang, Xintao Wang, Pengfei Wan, Xihui Liu:
OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes. CoRR abs/2510.26800 (2025)
[i64]Weikang Bian, Xiaoyu Shi, Zhaoyang Huang, Jianhong Bai, Qinghe Wang, Xintao Wang, Pengfei Wan, Kun Gai, Hongsheng Li:
RelightMaster: Precise Video Relighting with Multi-plane Light Images. CoRR abs/2511.06271 (2025)
[i63]Tianhao Peng, Haochen Wang, Yuanxing Zhang, Zekun Wang, Zili Wang, Ge Zhang, Jian Yang, Shihao Li, Yanghai Wang, Xintao Wang, Houyi Li, Wei Ji, Pengfei Wan, Wenhao Huang, Zhaoxiang Zhang, Jiaheng Liu:
MVU-Eval: Towards Multi-Video Understanding Evaluation for Multimodal LLMs. CoRR abs/2511.07250 (2025)
[i62]Jingtong Yue, Ziqi Huang, Zhaoxi Chen, Xintao Wang, Pengfei Wan, Ziwei Liu:
Simulating the Visual World with Artificial Intelligence: A Roadmap. CoRR abs/2511.08585 (2025)
[i61]Tianxiong Zhong, Xingye Tian, Xuebo Wang, Boyuan Jiang, Xin Tao, Pengfei Wan:
Decoupling Complexity from Scale in Latent Diffusion Model. CoRR abs/2511.16117 (2025)
[i60]Shengqiong Wu, Weicai Ye, Yuanxing Zhang, Jiahao Wang, Quande Liu, Xintao Wang, Pengfei Wan, Kun Gai, Hao Fei, Tat-Seng Chua:
A Reason-then-Describe Instruction Interpreter for Controllable Video Generation. CoRR abs/2511.20563 (2025)
[i59]Qixun Wang, Yang Shi, Yifei Wang, Yuanxing Zhang, Pengfei Wan, Kun Gai, Xianghua Ying, Yisen Wang:
Monet: Reasoning in Latent Visual Space Beyond Images and Language. CoRR abs/2511.21395 (2025)
[i58]Qinghe Wang, Xiaoyu Shi, Baolu Li, Weikang Bian, Quande Liu, Huchuan Lu, Xintao Wang, Pengfei Wan, Kun Gai, Xu Jia:
MultiShotMaster: A Controllable Multi-Shot Video Generation Framework. CoRR abs/2512.03041 (2025)
[i57]Jiehui Huang, Yuechen Zhang, Xu He, Yuan Gao, Zhi Cen, Bin Xia, Yan Zhou, Xin Tao, Pengfei Wan, Jiaya Jia:
UnityVideo: Unified Multi-Modal Multi-Task Learning for Enhancing World-Aware Video Generation. CoRR abs/2512.07831 (2025)
[i56]Yixuan Zhu, Jiaqi Feng, Wenzhao Zheng, Yuan Gao, Xin Tao, Pengfei Wan, Jie Zhou, Jiwen Lu:
Astra: General Interactive World Model with Autoregressive Denoising. CoRR abs/2512.08931 (2025)
[i55]Xiangyang Luo, Qingyu Li, Xiaokun Liu, Wenyu Qin, Miao Yang, Meng Wang, Pengfei Wan, Di Zhang, Kun Gai, Shao-Lun Huang:
FilmWeaver: Weaving Consistent Multi-Shot Videos with Cache-Guided Autoregressive Diffusion. CoRR abs/2512.11274 (2025)
[i54]Minglei Shi, Haolin Wang, Borui Zhang, Wenzhao Zheng, Bohan Zeng, Ziyang Yuan, Xiaoshi Wu, Yuanxing Zhang, Huan Yang, Xintao Wang, Pengfei Wan, Kun Gai, Jie Zhou, Jiwen Lu:
SVG-T2I: Scaling Up Text-to-Image Latent Diffusion Model Without Variational Autoencoder. CoRR abs/2512.11749 (2025)
[i53]Jialu Chen, Yikang Ding, Zhixue Fang, Kun Gai, Yuan Gao, Kang He, Jingyun Hua, Boyuan Jiang, Mingming Lao, Xiaohan Li, Hui Liu, Jiwen Liu, Xiaoqiang Liu, Yuan Liu, Shun Lu, Yongsen Mao, Yingchao Shao, Huafeng Shi, Xiaoyu Shi, Peiqin Sun, Songlin Tang, Pengfei Wan, Chao Wang, Xuebo Wang, Haoxian Zhang, Yuanxing Zhang, Yan Zhou:
KlingAvatar 2.0 Technical Report. CoRR abs/2512.13313 (2025)
[i52]Sihui Ji, Xi Chen, Shuai Yang, Xin Tao, Pengfei Wan, Hengshuang Zhao:
MemFlow: Flowing Adaptive Memory for Consistent and Efficient Long Video Narratives. CoRR abs/2512.14699 (2025)
[i51]Bozhou Li, Sihan Yang, Yushuo Guan, Ruichuan An, Xinlong Chen, Yang Shi, Pengfei Wan, Wentao Zhang, Yuanxing Zhang:
GRAN-TED: Generating Robust, Aligned, and Nuanced Text Embedding for Diffusion Models. CoRR abs/2512.15560 (2025)
[i50]Zixuan Ye, Quande Liu, Cong Wei, Yuanxing Zhang, Xintao Wang, Pengfei Wan, Kun Gai, Wenhan Luo:
Visual-Aware CoT: Achieving High-Fidelity Visual Consistency in Unified Models. CoRR abs/2512.19686 (2025)
[i49]Jianhong Bai, Xiaoshi Wu, Xintao Wang, Xiao Fu, Yuanxing Zhang, Qinghe Wang, Xiaoyu Shi, Menghan Xia, Zuozhu Liu, Haoji Hu, Pengfei Wan, Kun Gai:
SemanticGen: Video Generation in Semantic Space. CoRR abs/2512.20619 (2025)
[i48]Henglin Liu, Nisha Huang, Chang Liu, Jiangpeng Yan, Huijuan Huang, Jixuan Ying, Tong-Yee Lee, Pengfei Wan, Xiangyang Ji:
Bridging Cognitive Gap: Hierarchical Description Learning for Artistic Image Aesthetics Assessment. CoRR abs/2512.23413 (2025)
[i47]Haoran He, Yuxiao Ye, Jie Liu, Jiajun Liang, Zhiyong Wang, Ziyang Yuan, Xintao Wang, Hangyu Mao, Pengfei Wan, Ling Pan:
GARDO: Reinforcing Diffusion Models without Reward Hacking. CoRR abs/2512.24138 (2025)
[i46]Xu He, Haoxian Zhang, Hejia Chen, Changyuan Zheng, Liyang Chen, Songlin Tang, Jiehui Huang, Xiaoqiang Liu, Pengfei Wan, Zhiyong Wu:
From Inpainting to Editing: A Self-Bootstrapping Framework for Context-Rich Visual Dubbing. CoRR abs/2512.25066 (2025)
- 2024
[c43]Dongchen Han, Tianzhu Ye, Yizeng Han, Zhuofan Xia, Siyuan Pan, Pengfei Wan, Shiji Song, Gao Huang:
Agent Attention: On the Integration of Softmax and Linear Attention. ECCV (50) 2024: 124-140
[c42]Shuo Huang, Shikun Sun, Zixuan Wang, Xiaoyu Qin, Yanmin Xiong, Yuan Zhang, Pengfei Wan, Di Zhang, Jia Jia:
PlacidDreamer: Advancing Harmony in Text-to-3D Generation. ACM Multimedia 2024: 6880-6889
[c41]Ye Tian, Ling Yang, Haotian Yang, Yuan Gao, Yufan Deng, Xintao Wang, Zhaochen Yu, Xin Tao, Pengfei Wan, Di Zhang, Bin Cui:
VideoTetris: Towards Compositional Text-to-Video Generation. NeurIPS 2024
[c40]Haotian Yang, Mingwu Zheng, Chongyang Ma, Yu-Kun Lai, Pengfei Wan, Haibin Huang:
VRMM: A Volumetric Relightable Morphable Head Model. SIGGRAPH (Conference Paper Track) 2024: 46
[c39]Xun Guo, Mingwu Zheng, Liang Hou, Yuan Gao, Yufan Deng, Pengfei Wan, Di Zhang, Yufan Liu, Weiming Hu, Zhengjun Zha, Haibin Huang, Chongyang Ma:
I2V-Adapter: A General Image-to-Video Adapter for Diffusion Models. SIGGRAPH (Conference Paper Track) 2024: 112
[c38]Shiyuan Yang, Liang Hou, Haibin Huang, Chongyang Ma, Pengfei Wan, Di Zhang, Xiaodong Chen, Jing Liao:
Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion. SIGGRAPH (Conference Paper Track) 2024: 113
[c37]Yujian Zheng, Yuda Qiu, Leyang Jin, Chongyang Ma, Haibin Huang, Di Zhang, Pengfei Wan, Xiaoguang Han:
Towards Unified 3D Hair Reconstruction from Single-View Portraits. SIGGRAPH Asia 2024: 114:1-114:11
[i45]Shiyuan Yang, Liang Hou, Haibin Huang, Chongyang Ma, Pengfei Wan, Di Zhang, Xiaodong Chen, Jing Liao:
Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion. CoRR abs/2402.03162 (2024)
[i44]Haotian Yang, Mingwu Zheng, Chongyang Ma, Yu-Kun Lai, Pengfei Wan, Haibin Huang:
VRMM: A Volumetric Relightable Morphable Head Model. CoRR abs/2402.04101 (2024)
[i43]Luozhou Wang, Guibao Shen, Yixun Liang, Xin Tao, Pengfei Wan, Di Zhang, Yijun Li, Yingcong Chen:
Motion Inversion for Video Customization. CoRR abs/2403.20193 (2024)
[i42]Zhaokun Zhou, Qiulin Wang, Bin Lin, Yiwei Su, Rui Chen, Xin Tao, Amin Zheng, Li Yuan, Pengfei Wan, Di Zhang:
UNIAA: A Unified Multi-modal Image Aesthetic Assessment Baseline and Benchmark. CoRR abs/2404.09619 (2024)
[i41]Guibao Shen, Luozhou Wang, Jiantao Lin, Wenhang Ge, Chaozhe Zhang, Xin Tao, Yuan Zhang, Pengfei Wan, Zhongyuan Wang, Guangyong Chen, Yijun Li, Ying-Cong Chen:
SG-Adapter: Enhancing Text-to-Image Generation with Scene Graph Guidance. CoRR abs/2405.15321 (2024)
[i40]Jinchao Zhu, Yuxuan Wang, Siyuan Pan, Pengfei Wan, Di Zhang, Gao Huang:
A-SDM: Accelerating Stable Diffusion through Model Assembly and Feature Inheritance Strategies. CoRR abs/2406.00210 (2024)
[i39]Ye Tian, Ling Yang, Haotian Yang, Yuan Gao, Yufan Deng, Jingmin Chen, Xintao Wang, Zhaochen Yu, Xin Tao, Pengfei Wan, Di Zhang, Bin Cui:
VideoTetris: Towards Compositional Text-to-Video Generation. CoRR abs/2406.04277 (2024)
[i38]Jianzhu Guo, Dingyun Zhang, Xiaoqiang Liu, Zhizhou Zhong, Yuan Zhang, Pengfei Wan, Di Zhang:
LivePortrait: Efficient Portrait Animation with Stitching and Retargeting Control. CoRR abs/2407.03168 (2024)
[i37]Yu-Jie Yuan, Leif Kobbelt, Jiwen Liu, Yuan Zhang, Pengfei Wan, Yu-Kun Lai, Lin Gao:
4Dynamic: Text-to-4D Generation with Hybrid Priors. CoRR abs/2407.12684 (2024)
[i36]Shuo Huang, Shikun Sun, Zixuan Wang, Xiaoyu Qin, Yanmin Xiong, Yuan Zhang, Pengfei Wan, Di Zhang, Jia Jia:
PlacidDreamer: Advancing Harmony in Text-to-3D Generation. CoRR abs/2407.13976 (2024)
[i35]Liangdong Qiu, Chengxing Yu, Yanran Li, Zhao Wang, Haibin Huang, Chongyang Ma, Di Zhang, Pengfei Wan, Xiaoguang Han:
ViMo: Generating Motions from Casual Videos. CoRR abs/2408.06614 (2024)
[i34]Yuanyang Yin, Yaqi Zhao, Yajie Zhang, Ke Lin, Jiahao Wang, Xin Tao, Pengfei Wan, Di Zhang, Baoqun Yin, Wentao Zhang:
SEA: Supervised Embedding Alignment for Token-Level Visual-Textual Integration in MLLMs. CoRR abs/2408.11813 (2024)
[i33]Yujian Zheng, Yuda Qiu, Leyang Jin, Chongyang Ma, Haibin Huang, Di Zhang, Pengfei Wan, Xiaoguang Han:
Towards Unified 3D Hair Reconstruction from Single-View Portraits. CoRR abs/2409.16863 (2024)
[i32]Qiuheng Wang, Yukai Shi, Jiarong Ou, Rui Chen, Ke Lin, Jiahao Wang, Boyuan Jiang, Haotian Yang, Mingwu Zheng, Xin Tao, Fei Yang, Pengfei Wan, Di Zhang:
Koala-36M: A Large-scale Video Dataset Improving Consistency between Fine-grained Conditions and Video Content. CoRR abs/2410.08260 (2024)
[i31]Yuan Wang, Di Huang, Yaqi Zhang, Wanli Ouyang, Jile Jiao, Xuetao Feng, Yan Zhou, Pengfei Wan, Shixiang Tang, Dan Xu:
MotionGPT-2: A General-Purpose Motion-Language Model for Motion Generation and Understanding. CoRR abs/2410.21747 (2024)
[i30]Zhuoman Liu, Weicai Ye, Yan Luximon, Pengfei Wan, Di Zhang:
Unleashing the Potential of Multi-modal Foundation Models and Video Diffusion for 4D Dynamic Physical Scene Simulation. CoRR abs/2411.14423 (2024)
[i29]Jiahao Hu, Tianxiong Zhong, Xuebo Wang, Boyuan Jiang, Xingye Tian, Fei Yang, Pengfei Wan, Di Zhang:
VIVID-10M: A Dataset and Baseline for Versatile and Interactive Video Local Editing. CoRR abs/2411.15260 (2024)
[i28]Yuanyang Yin, Yaqi Zhao, Mingwu Zheng, Ke Lin, Jiarong Ou, Rui Chen, Victor Shea-Jay Huang, Jiahao Wang, Xin Tao, Pengfei Wan, Di Zhang, Baoqun Yin, Wentao Zhang, Kun Gai:
Towards Precise Scaling Laws for Video Diffusion Transformers. CoRR abs/2411.17470 (2024)
[i27]Zixuan Ye, Huijuan Huang, Xintao Wang, Pengfei Wan, Di Zhang, Wenhan Luo:
StyleMaster: Stylize Your Video with Artistic Generation and Translation. CoRR abs/2412.07744 (2024)
[i26]Xiao Fu, Xian Liu, Xintao Wang, Sida Peng, Menghan Xia, Xiaoyu Shi, Ziyang Yuan, Pengfei Wan, Di Zhang, Dahua Lin:
3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation. CoRR abs/2412.07759 (2024)
[i25]Jianhong Bai, Menghan Xia, Xintao Wang, Ziyang Yuan, Xiao Fu, Zuozhu Liu, Haoji Hu, Pengfei Wan, Di Zhang:
SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints. CoRR abs/2412.07760 (2024)
[i24]Yuanhui Huang, Wenzhao Zheng, Yuan Gao, Xin Tao, Pengfei Wan, Di Zhang, Jie Zhou, Jiwen Lu:
Owl-1: Omni World Model for Consistent Long Video Generation. CoRR abs/2412.09600 (2024)
- 2023
[j11]Mengtian Li, Yi Dong, Minxuan Lin, Haibin Huang, Pengfei Wan, Chongyang Ma:
Multi-Modal Face Stylization with a Generative Prior. Comput. Graph. Forum 42(7) (2023)
[j10]Xin Wen, Peng Xiang, Zhizhong Han, Yan-Pei Cao, Pengfei Wan, Wen Zheng, Yu-Shen Liu:
PMP-Net++: Point Cloud Completion by Transformer-Enhanced Multi-Step Point Moving Paths. IEEE Trans. Pattern Anal. Mach. Intell. 45(1): 852-867 (2023)
[j9]Peng Xiang, Xin Wen, Yu-Shen Liu, Yan-Pei Cao, Pengfei Wan, Wen Zheng, Zhizhong Han:
Snowflake Point Deconvolution for Point Cloud Completion and Generation With Skip-Transformer. IEEE Trans. Pattern Anal. Mach. Intell. 45(5): 6320-6338 (2023)
[j8]Ran Yi, Zipeng Ye, Zhiyao Sun, Juyong Zhang, Guo-Xin Zhang, Pengfei Wan, Hujun Bao, Yong-Jin Liu:
Predicting Personalized Head Movement From Short Video and Speech Signal. IEEE Trans. Multim. 25: 6315-6328 (2023)
[j7]Jinchao Zhou, Guoan Li, Feng Shi, Xiaoyan Guo, Pengfei Wan, Miao Wang:
EM-Gaze: eye context correlation and metric learning for gaze estimation. Vis. Comput. Ind. Biomed. Art 6(1): 8 (2023)
[c36]Mengfei Xia, Yezhi Shu, Yuji Wang, Yu-Kun Lai, Qiang Li, Pengfei Wan, Zhongyuan Wang, Yong-Jin Liu:
FEditNet: Few-Shot Editing of Latent Semantics in GAN Spaces. AAAI 2023: 2919-2927
[c35]Tao Zhang, Xingye Tian, Yu Wu, Shunping Ji, Xuebo Wang, Yuan Zhang, Pengfei Wan:
DVIS: Decoupled Video Instance Segmentation Framework. ICCV 2023: 1282-1291
[c34]Mingrui Zhang, Ming Chen, Yan Zhou, Li Chen, Weihua Jian, Pengfei Wan:
Automatic Human Scene Interaction through Contact Estimation and Motion Adaptation. ACM Multimedia 2023: 7628-7637
[c33]Liang Hou, Qi Cao, Yige Yuan, Songtao Zhao, Chongyang Ma, Siyuan Pan, Pengfei Wan, Zhongyuan Wang, Huawei Shen, Xueqi Cheng:
Augmentation-Aware Self-Supervision for Data-Efficient GAN Training. NeurIPS 2023
[c32]Haotian Yang, Mingwu Zheng, Wanquan Feng, Haibin Huang, Yu-Kun Lai, Pengfei Wan, Zhongyuan Wang, Chongyang Ma:
Towards Practical Capture of High-Fidelity Relightable Avatars. SIGGRAPH Asia 2023: 23:1-23:11
[i23]Mengtian Li, Yi Dong, Minxuan Lin, Haibin Huang, Pengfei Wan, Chongyang Ma:
Multi-Modal Face Stylization with a Generative Prior. CoRR abs/2305.18009 (2023)
[i22]Tao Zhang, Xingye Tian, Yu Wu, Shunping Ji, Xuebo Wang, Yuan Zhang, Pengfei Wan:
DVIS: Decoupled Video Instance Segmentation Framework. CoRR abs/2306.03413 (2023)
[i21]Tao Zhang, Xingye Tian, Haoran Wei, Yu Wu, Shunping Ji, Xuebo Wang, Xin Tao, Yuan Zhang, Pengfei Wan:
1st Place Solution for PVUW Challenge 2023: Video Panoptic Segmentation. CoRR abs/2306.04091 (2023)
[i20]Tao Zhang, Xingye Tian, Yikang Zhou, Yu Wu, Shunping Ji, Cilin Yan, Xuebo Wang, Xin Tao, Yuan Zhang, Pengfei Wan:
1st Place Solution for the 5th LSVOS Challenge: Video Instance Segmentation. CoRR abs/2308.14392 (2023)
[i19]Haotian Yang, Mingwu Zheng, Wanquan Feng, Haibin Huang, Yu-Kun Lai, Pengfei Wan, Zhongyuan Wang, Chongyang Ma:
Towards Practical Capture of High-Fidelity Relightable Avatars. CoRR abs/2309.04247 (2023)
[i18]Ming Chen, Yan Zhou, Weihua Jian, Pengfei Wan, Zhongyuan Wang:
Temporal-Aware Refinement for Video-based Human Pose and Shape Recovery. CoRR abs/2311.09543 (2023)
[i17]Qi Fan, Xin Tao, Lei Ke, Mingqiao Ye, Yuan Zhang, Pengfei Wan, Zhongyuan Wang, Yu-Wing Tai, Chi-Keung Tang:
Stable Segment Anything Model. CoRR abs/2311.15776 (2023)
[i16]Tao Zhang, Xingye Tian, Yikang Zhou, Shunping Ji, Xuebo Wang, Xin Tao, Yuan Zhang, Pengfei Wan, Zhongyuan Wang, Yu Wu:
DVIS++: Improved Decoupled Framework for Universal Video Segmentation. CoRR abs/2312.13305 (2023)
[i15]Xun Guo, Mingwu Zheng, Liang Hou, Yuan Gao, Yufan Deng, Chongyang Ma, Weiming Hu, Zhengjun Zha, Haibin Huang, Pengfei Wan, Di Zhang:
I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models. CoRR abs/2312.16693 (2023)
- 2022
[c31]Jiayi Guo, Chaoqun Du, Jiangshan Wang, Huijuan Huang, Pengfei Wan, Gao Huang:
Assessing a Single Image in Reference-Guided Image Synthesis. AAAI 2022: 753-761
[c30]Linfeng Zhang, Xin Chen, Xiaobing Tu, Pengfei Wan, Ning Xu, Kaisheng Ma:
Wavelet Knowledge Distillation: Towards Efficient Image-to-Image Translation. CVPR 2022: 12454-12464
[c29]Zhaoqing Wang, Qiang Li, Guoxin Zhang, Pengfei Wan, Wen Zheng, Nannan Wang, Mingming Gong, Tongliang Liu:
Exploring Set Similarity for Dense Self-supervised Representation Learning. CVPR 2022: 16569-16578
[c28]Xuesong Niu, Jili Gu, Guoxin Zhang, Pengfei Wan, Zhongyuan Wang:
Learning an Inference-accelerated Network from a Pre-trained Model with Frequency-enhanced Feature Distillation. ACM Multimedia 2022: 1847-1856
[c27]Baixu Chen, Junguang Jiang, Ximei Wang, Pengfei Wan, Jianmin Wang, Mingsheng Long:
Debiased Self-Training for Semi-Supervised Learning. NeurIPS 2022
[i14]Peng Xiang, Xin Wen, Yu-Shen Liu, Yan-Pei Cao, Pengfei Wan, Wen Zheng, Zhizhong Han:
Snowflake Point Deconvolution for Point Cloud Completion and Generation with Skip-Transformer. CoRR abs/2202.09367 (2022)
[i13]Xin Wen, Peng Xiang, Zhizhong Han, Yan-Pei Cao, Pengfei Wan, Wen Zheng, Yu-Shen Liu:
PMP-Net++: Point Cloud Completion by Transformer-Enhanced Multi-step Point Moving Paths. CoRR abs/2202.09507 (2022)
[i12]Linfeng Zhang, Xin Chen, Xiaobing Tu, Pengfei Wan, Ning Xu, Kaisheng Ma:
Wavelet Knowledge Distillation: Towards Efficient Image-to-Image Translation. CoRR abs/2203.06321 (2022)
[i11]Wanfeng Zheng, Qiang Li, Guoxin Zhang, Pengfei Wan, Zhongyuan Wang:
ITTR: Unpaired Image-to-Image Translation with Transformers. CoRR abs/2203.16015 (2022)
[i10]Wanfeng Zheng, Qiang Li, Xiaoyan Guo, Pengfei Wan, Zhongyuan Wang:
Bridging CLIP and StyleGAN through Latent Alignment for Image Editing. CoRR abs/2210.04506 (2022)
- 2021
[j6]Jia-Qi Zhang, Xiang Xu, Zhi-Meng Shen, Zehuan Huang, Yang Zhao, Yan-Pei Cao, Pengfei Wan, Miao Wang:
Write-An-Animation: High-level Text-based Animation Editing with Character-Scene Interaction. Comput. Graph. Forum 40(7): 217-228 (2021)
[c26]Xin Wen, Peng Xiang, Zhizhong Han, Yan-Pei Cao, Pengfei Wan, Wen Zheng, Yu-Shen Liu:
PMP-Net: Point Cloud Completion by Learning Multi-Step Point Moving Paths. CVPR 2021: 7443-7452
[c25]Xin Wen, Zhizhong Han, Yan-Pei Cao, Pengfei Wan, Wen Zheng, Yu-Shen Liu:
Cycle4Completion: Unpaired Point Cloud Completion Using Cycle Transformation With Missing Region Coding. CVPR 2021: 13080-13089
[c24]Xingyu Chen, Yufeng Liu, Chongyang Ma, Jianlong Chang, Huayan Wang, Tian Chen, Xiaoyan Guo, Pengfei Wan, Wen Zheng:
Camera-Space Hand Mesh Recovery via Semantic Aggregation and Adaptive 2D-1D Registration. CVPR 2021: 13274-13283
[c23]Peng Xiang, Xin Wen, Yu-Shen Liu, Yan-Pei Cao, Pengfei Wan, Wen Zheng, Zhizhong Han:
SnowflakeNet: Point Cloud Completion by Snowflake Point Deconvolution with Skip-Transformer. ICCV 2021: 5479-5489
[c22]Mingcong Liu, Qiang Li, Zekui Qin, Guoxin Zhang, Pengfei Wan, Wen Zheng:
BlendGAN: Implicitly GAN Blending for Arbitrary Stylized Face Generation. NeurIPS 2021: 29710-29722
[i9]Xingyu Chen, Yufeng Liu, Chongyang Ma, Jianlong Chang, Huayan Wang, Tian Chen, Xiaoyan Guo, Pengfei Wan, Wen Zheng:
Camera-Space Hand Mesh Recovery via Semantic Aggregation and Adaptive 2D-1D Registration. CoRR abs/2103.02845 (2021)
[i8]Xin Wen, Zhizhong Han, Yan-Pei Cao, Pengfei Wan, Wen Zheng, Yu-Shen Liu:
Cycle4Completion: Unpaired Point Cloud Completion using Cycle Transformation with Missing Region Coding. CoRR abs/2103.07838 (2021)
[i7]Zhaoqing Wang, Qiang Li, Guoxin Zhang, Pengfei Wan, Wen Zheng, Nannan Wang, Mingming Gong, Tongliang Liu:
Exploring Set Similarity for Dense Self-supervised Representation Learning. CoRR abs/2107.08712 (2021)
[i6]Peng Xiang, Xin Wen, Yu-Shen Liu, Yan-Pei Cao, Pengfei Wan, Wen Zheng, Zhizhong Han:
SnowflakeNet: Point Cloud Completion by Snowflake Point Deconvolution with Skip-Transformer. CoRR abs/2108.04444 (2021)
[i5]Mingcong Liu, Qiang Li, Zekui Qin, Guoxin Zhang, Pengfei Wan, Wen Zheng:
BlendGAN: Implicitly GAN Blending for Arbitrary Stylized Face Generation. CoRR abs/2110.11728 (2021)
[i4]Jiayi Guo, Chaoqun Du, Jiangshan Wang, Huijuan Huang, Pengfei Wan, Gao Huang:
Assessing a Single Image in Reference-Guided Image Synthesis. CoRR abs/2112.04163 (2021)
- 2020
[j5]Wenhan Yang, Ye Yuan, Wenqi Ren, Jiaying Liu, Walter J. Scheirer, Zhangyang Wang, Taiheng Zhang, Qiaoyong Zhong, Di Xie, Shiliang Pu, Yuqiang Zheng, Yanyun Qu, Yuhong Xie, Liang Chen, Zhonghao Li, Chen Hong, Hao Jiang, Siyuan Yang, Yan Liu, Xiaochao Qu, Pengfei Wan, Shuai Zheng, Minhui Zhong, Taiyi Su, Lingzhi He, Yandong Guo, Yao Zhao, Zhenfeng Zhu, Jinxiu Liang, Jingwen Wang, Tianyi Chen, Yuhui Quan, Yong Xu, Bo Liu, Xin Liu, Qi Sun, Tingyu Lin, Xiaochuan Li, Feng Lu, Lin Gu, Shengdi Zhou, Cong Cao, Shifeng Zhang, Cheng Chi, Chubin Zhuang, Zhen Lei, Stan Z. Li, Shizheng Wang, Ruizhe Liu, Dong Yi, Zheming Zuo, Jianning Chi, Huan Wang, Kai Wang, Yixiu Liu, Xingyu Gao, Zhenyu Chen, Chang Guo, Yongzhou Li, Huicai Zhong, Jing Huang, Heng Guo, Jianfei Yang, Wenjuan Liao, Jiangang Yang, Liguo Zhou, Mingyue Feng, Likun Qin:
Advancing Image Understanding in Poor Visibility Environments: A Collective Benchmark Study. IEEE Trans. Image Process. 29: 5737-5752 (2020)
[i3]Xin Wen, Peng Xiang, Zhizhong Han, Yan-Pei Cao, Pengfei Wan, Wen Zheng, Yu-Shen Liu:
PMP-Net: Point Cloud Completion by Learning Multi-step Point Moving Paths. CoRR abs/2012.03408 (2020)
2010 – 2019
- 2019
[j4]Pengfei Wan, Jiexiang Tan, Xiaocong Lian, Xiangyang Ji:
High Bit-Depth Image Acquisition Framework Using Embedded Quantization Bias. IEEE Trans. Computational Imaging 5(4): 556-569 (2019)
[c21]Andrey Ignatov, Radu Timofte, Xiaochao Qu, Xingguang Zhou, Ting Liu, Pengfei Wan, Syed Waqas Zamir, Aditya Arora, Salman H. Khan, Fahad Shahbaz Khan, Ling Shao, Dongwon Park, Se Young Chun, Pablo Navarrete Michelini, Hanwen Liu, Dan Zhu, Zhiwei Zhong, Xianming Liu, Junjun Jiang, Debin Zhao, Muhammad Haris, Kazutoshi Akita, Tomoki Yoshida, Greg Shakhnarovich, Norimichi Ukita, Jie Liu, Cheolkon Jung, Raimondo Schettini, Simone Bianco, Claudio Cusano, Flavio Piccoli, Pengju Liu, Kai Zhang, Jingdong Liu, Jiye Liu, Hongzhi Zhang, Wangmeng Zuo, Nelson Chong Ngee Bow, Lai-Kuan Wong, John See, Jinghui Qin, Lishan Huang, Yukai Shi, Pengxu Wei, Wushao Wen, Liang Lin, Zheng Hui, Xiumei Wang, Xinbo Gao, Kanti Kumari, Vikas Kumar Anand, Mahendra Khened, Ganapathy Krishnamurthi:
NTIRE 2019 Challenge on Image Enhancement: Methods and Results. CVPR Workshops 2019: 2224-2232
[c20]Codruta O. Ancuti, Cosmin Ancuti, Radu Timofte, Luc Van Gool, Lei Zhang, Ming-Hsuan Yang, Tiantong Guo, Xuelu Li, Venkateswararao Cherukuri, Vishal Monga, Hao Jiang, Siyuan Yang, Yan Liu, Xiaochao Qu, Pengfei Wan, Dongwon Park, Se Young Chun, Ming Hong, Jinying Huang, Yizi Chen, Shuxin Chen, Bomin Wang, Pablo Navarrete Michelini, Hanwen Liu, Dan Zhu, Jing Liu, Sanchayan Santra, Ranjan Mondal, Bhabatosh Chanda, Peter Morales, Tzofi Klinghoffer, Le Manh Quan, Yong-Guk Kim, Xiao Liang, Runde Li, Jinshan Pan, Jinhui Tang, Kuldeep Purohit, Maitreya Suin, A. N. Rajagopalan, Raimondo Schettini, Simone Bianco, Flavio Piccoli, Claudio Cusano, Luigi Celona, Sunhee Hwang, Yu Seung Ma, Hyeran Byun, Subrahmanyam Murala, Akshay Dudhane, Harshjeet Singh Aulakh, Tianxiang Zheng, Tao Zhang, Weining Qin, Runnan Zhou, Shanhu Wang, Jean-Philippe Tarel, Chuansheng Wang, Jiawei Wu:
NTIRE 2019 Image Dehazing Challenge Report. CVPR Workshops 2019: 2241-2253
[i2]Yiming He, Wei Hu, Siyuan Yang, Xiaochao Qu, Pengfei Wan, Zongming Guo:
GraphPoseGAN: 3D Hand Pose Estimation from a Monocular RGB Image via Adversarial Learning on Graphs. CoRR abs/1912.01875 (2019)
- 2018
[c19]Jie Huang, Pengfei Zhu, Mingrui Geng, Jiewen Ran, Xingguang Zhou, Chen Xing, Pengfei Wan, Xiangyang Ji:
Range Scaling Global U-Net for Perceptual Image Enhancement on Mobile Devices. ECCV Workshops (5) 2018: 230-242
- 2016
[j3]Pengfei Wan, Gene Cheung, Dinei A. F. Florêncio, Cha Zhang, Oscar C. Au:
Image Bit-Depth Enhancement via Maximum A Posteriori Estimation of AC Signal. IEEE Trans. Image Process. 25(6): 2896-2909 (2016)
- 2015
[j2]Pengfei Wan, Gene Cheung, Philip A. Chou, Dinei A. F. Florêncio, Cha Zhang, Oscar C. Au:
Precision Enhancement of 3-D Surfaces from Compressed Multiview Depth Maps. IEEE Signal Process. Lett. 22(10): 1676-1680 (2015)
[c18]Amin Zheng, Yuan Yuan, Hong Zhang, Haitao Yang, Pengfei Wan, Oscar C. Au:
Motion vector fields based video coding. ICIP 2015: 2095-2099
- 2014
[c17]Ting Sun, Pengfei Wan, Oscar C. Au, Wei Dai, Luheng Jia, Yuan Yuan, Amin Zheng, Rui Ma:
Fast binary motion estimation for screen content video coding. APSIPA 2014: 1-5
[c16]Rui Ma, Oscar C. Au, Pengfei Wan, Lingfeng Xu, Wenxiu Sun, Wei Hu:
Improved temporal psychovisual modulation for backward-compatible stereoscopic display. GlobalSIP 2014: 1034-1038
[c15]Wei Dai, Oscar C. Au, Wenjing Zhu, Pengfei Wan, Wei Hu, Jiantao Zhou:
SSIM-based rate-distortion optimization in H.264. ICASSP 2014: 7343-7347
[c14]Wenjing Zhu, Oscar C. Au, Wei Dai, Haitao Yang, Rui Ma, Luheng Jia, Jin Zeng, Pengfei Wan:
Palette-based compound image compression in HEVC by exploiting non-local spatial correlation. ICASSP 2014: 7348-7352
[c13]Pengfei Wan, Oscar C. Au, Jiahao Pang, Ketan Tang, Rui Ma:
High bit-precision image acquisition and reconstruction by planned sensor distortion. ICIP 2014: 1773-1777
[c12]Ting Sun, Luhong Liang, King Hung Chiu, Pengfei Wan, Oscar C. Au:
DCT coefficients generation model for film grain noise and its application in super-resolution. ICIP 2014: 3857-3861
[c11]Pengfei Wan, Gene Cheung, Dinei A. F. Florêncio, Cha Zhang, Oscar C. Au:
Image bit-depth enhancement via maximum-a-posteriori estimation of graph AC component. ICIP 2014: 4052-4056
[c10]Luheng Jia, Oscar C. Au, Chi-Ying Tsui, Wei Dai, Pengfei Wan:
A fast intermode decision algorithm based on analysis of inter prediction residual. MMSP 2014: 1-4
[c9]Rui Ma, Oscar C. Au, Pengfei Wan, Wenxiu Sun, Lingfeng Xu, Luheng Jia:
Solving dense stereo matching via quadratic programming. VCIP 2014: 370-373
[i1]Pengfei Wan, Gene Cheung, Philip A. Chou, Dinei A. F. Florêncio, Cha Zhang, Oscar C. Au:
Precision Enhancement of 3D Surfaces from Multiple Compressed Depth Maps. CoRR abs/1405.2062 (2014)
- 2013
[j1]Pengfei Wan, Yunlong Feng, Gene Cheung, Ivan V. Bajic, Oscar C. Au:
3-D Motion Estimation for Visual Saliency Modeling. IEEE Signal Process. Lett. 20(10): 972-975 (2013)
[c8]Pengfei Wan, Yunlong Feng, Gene Cheung, Ivan V. Bajic, Oscar C. Au, Yusheng Ji:
3D motion in visual saliency modeling. ICASSP 2013: 1831-1835
[c7]Chao Pang, Oscar C. Au, Feng Zou, Xingyu Zhang, Wei Hu, Pengfei Wan:
Optimal dependent bit allocation for AVS intra-frame coding via successive convex approximation. ICIP 2013: 1520-1523
[c6]Wei Dai, Oscar C. Au, Wenjing Zhu, Wei Hu, Pengfei Wan, Jiali Li:
A robust interpolation-free approach for sub-pixel accuracy motion estimation. ICIP 2013: 1767-1771
[c5]Ruobing Zou, Oscar C. Au, Guyue Zhou, Wei Dai, Wei Hu, Pengfei Wan:
Personal photo album compression and management. ISCAS 2013: 1428-1431
[c4]Pengfei Wan, Gene Cheung, Philip A. Chou, Dinei Florêncio, Cha Zhang, Oscar C. Au:
Precision enhancement of 3D surfaces from multiple quantized depth maps. IVMSP 2013: 1-4
- 2012
[c3]Ketan Tang, Oscar C. Au, Lu Fang, Yuanfang Guo, Pengfei Wan, Lingfeng Xu:
Super resolution for subpixel-based downsampled images. APSIPA 2012: 1-4
[c2]Pengfei Wan, Oscar C. Au, Ketan Tang, Yuanfang Guo:
Image de-quantization via spatially varying sparsity prior. ICIP 2012: 953-956
[c1]Pengfei Wan, Oscar C. Au, Ketan Tang, Yuanfang Guo, Lu Fang:
From 2D Extrapolation to 1D Interpolation: Content Adaptive Image Bit-Depth Expansion. ICME 2012: 170-175
last updated on 2026-02-03 23:40 CET by the dblp team
all metadata released as open data under CC0 1.0 license