


default search action
Xiangtai Li
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2025
[j25]Jiangning Zhang
, Xuhai Chen, Yabiao Wang
, Chengjie Wang
, Yong Liu, Xiangtai Li
, Ming-Hsuan Yang, Dacheng Tao
:
Exploring plain ViT features for multi-class unsupervised visual anomaly detection. Comput. Vis. Image Underst. 253: 104308 (2025)
[j24]Jiahao Xie, Wei Li, Xiangtai Li, Ziwei Liu, Yew Soon Ong, Chen Change Loy
:
MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation. Int. J. Comput. Vis. 133(4): 1456-1475 (2025)
[j23]Chong Zhou
, Xiangtai Li, Chen Change Loy, Bo Dai:
EdgeSAM: Prompt-In-the-Loop Distillation for SAM. Int. J. Comput. Vis. 133(12): 8452-8468 (2025)
[j22]Hao Zhou
, Lu Qi
, Tiancheng Shen, Hai Huang
, Xu Yang
, Xiangtai Li
, Ming-Hsuan Yang:
Rethinking Evaluation Metrics of Open-Vocabulary Segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 47(8): 6780-6796 (2025)
[j21]Jiangning Zhang
, Teng Hu
, Haoyang He
, Zhucun Xue
, Yabiao Wang
, Chengjie Wang
, Yong Liu
, Xiangtai Li
, Dacheng Tao
:
EMOv2: Pushing 5M Vision Model Frontier. IEEE Trans. Pattern Anal. Mach. Intell. 47(11): 10560-10576 (2025)
[j20]Chunlei Wang
, Wenquan Feng
, Shuchang Lyu
, Guangliang Cheng
, Xiangtai Li
, Binghao Liu
, Qi Zhao
:
A Masked Reference Token Supervision-Based Iterative Visual-Language Framework for Robust Visual Grounding. IEEE Trans. Circuits Syst. Video Technol. 35(1): 75-90 (2025)
[j19]Shilin Xu
, Xiangtai Li, Size Wu, Wenwei Zhang
, Yunhai Tong
, Chen Change Loy
:
DST-Det: Open-Vocabulary Object Detection via Dynamic Self-Training. IEEE Trans. Circuits Syst. Video Technol. 35(5): 5037-5050 (2025)
[c66]Kuan-Chih Huang, Xiangtai Li, Lu Qi, Shuicheng Yan, Ming-Hsuan Yang:
Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model. 3DV 2025: 1177-1186
[c65]Qingdong He, Jiangning Zhang
, Jinlong Peng, Haoyang He, Xiangtai Li, Yabiao Wang
, Chengjie Wang
:
PointRWKV: Efficient RWKV-Like Model for Hierarchical Point Cloud Learning. AAAI 2025: 3410-3418
[c64]Chaoyang Wang, Xiangtai Li, Henghui Ding, Lu Qi, Jiangning Zhang
, Yunhai Tong, Chen Change Loy, Shuicheng Yan:
Explore In-Context Segmentation via Latent Diffusion Models. AAAI 2025: 7545-7553
[c63]Hao Yang, Qianyu Zhou, Haijia Sun, Xiangtai Li, Fengqi Liu, Xuequan Lu, Lizhuang Ma, Shuicheng Yan:
PointDGMamba: Domain Generalization of Point Cloud Classification via Generalized State Space Model. AAAI 2025: 9193-9201
[c62]Tao Zhang, Haobo Yuan, Lu Qi, Jiangning Zhang
, Qianyu Zhou, Shunping Ji, Shuicheng Yan, Xiangtai Li:
Point Cloud Mamba: Point Cloud Learning via State Space Model. AAAI 2025: 10121-10130
[c61]Xin Li, Yeying Jin, Xin Jin, Zongwei Wu, Bingchen Li, Yufei Wang, Wenhan Yang, Yu Li, Zhibo Chen, Bihan Wen, Robby T. Tan, Radu Timofte, Qiyu Rong, Hongyuan Jing, Mengmeng Zhang, Jinglong Li, Xiangyu Lu, Yi Ren, Yuting Liu, Meng Zhang, Xiang Chen, Qiyuan Guan, Jiangxin Dong, Jinshan Pan, Conglin Gou, Qirui Yang, Fangpu Zhang, Yunlong Lin, Sixiang Chen, Guoxi Huang, Ruirui Lin, Yan Zhang, Jingyu Yang, Huanjing Yue, Jiyuan Chen, Qiaosi Yi, Hongjun Wang, Chenxi Xie, Shuai Li, Yuhui Wu, Kaiyi Ma, Jiakui Hu, Juncheng Li, Liwen Pan, Guangwei Gao, Wenjie Li, Zhenyu Jin, Heng Guo, Zhanyu Ma, Yubo Wang, Jinghua Wang, Wangzhi Xing, Anjusree Karnavar, Diqi Chen, Mohammad Aminul Islam, Hao Yang, Ruikun Zhang, Liyuan Pan, Qianhao Luo, Xin Cao, Han Zhou, Yan Min, Wei Dong, Jun Chen, Taoyi Wu, Weijia Dou, Yu Wang, Shengjie Zhao, Yongcheng Huang, Xingyu Han, Anyan Huang, Hongtao Wu, Hong Wang, Yefeng Zheng, Abhijeet Kumar, Aman Kumar, Marcos V. Conde, Paula Garrido, Daniel Feijoo, Juan C. Benito, Guanglu Dong, Xin Lin, Siyuan Liu, Tianheng Zheng, Jiayu Zhong, Shouyi Wang, Xiangtai Li, Lanqing Guo, Lu Qi, Chao Ren, Shuaibo Wang, Shilong Zhang, Wanyu Zhou, Yunze Wu, Qinzhong Tan, Jieyuan Pei, Zhuoxuan Li, Jiayu Wang, Haoyu Bian, Haoran Sun, Subhajit Paul, Ni Tang, Junhao Huang, Zihan Cheng, Hongyun Zhu, Yuehan Wu, Kaixin Deng, Huang Ouyang, Tianxin Xiao, Fan Yang, Zhizun Luo, Zeyu Xiao, Zhuoyuan Li, Pham Hoang Le Nguyen, Dinh Thien An, Luu Thanh Son, Kiet Van Nguyen, Ronghua Xu, Xianmin Tian, Weijian Zhou, Jiacheng Zhang, Yuqian Chen, Yihang Duan, Yujie Wu, Suresh Raikwar, Arsh Garg, Kritika Kritika, Jianhua Zheng, Xiaoshan Ma, Ruolin Zhao, Yongyu Yang, Yongsheng Liang, Guiming Huang, Qiang Li, Hongbin Zhang, Xiangyu Zheng, A. N. Rajagopalan:
NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results. CVPR Workshops 2025: 1172-1183
[c60]Qingyu Shi, Lu Qi, Jianzong Wu, Jinbin Bai, Jingbo Wang, Yunhai Tong, Xiangtai Li:
DreamRelation: Bridging Customization and Relation Generation. CVPR 2025: 15723-15732
[c59]Yicheng Chen, Xiangtai Li, Yining Li, Yanhong Zeng, Jianzong Wu, Xiangyu Zhao, Kai Chen:
Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language. CVPR 2025: 19952-19962
[c58]Shengqiong Wu, Hao Fei, Jingkang Yang, Xiangtai Li, Juncheng Li, Hanwang Zhang, Tat-Seng Chua:
Learning 4D Panoptic Scene Graph Generation from Rich 2D Visual Scene. CVPR 2025: 24539-24549
[c57]Jianzong Wu, Chao Tang, Jingbo Wang, Yanhong Zeng, Xiangtai Li, Yunhai Tong:
DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation. CVPR 2025: 28684-28693
[c56]Zhenglin Huang, Jinwei Hu, Xiangtai Li, Yiwei He, Xingyu Zhao
, Bei Peng, Baoyuan Wu, Xiaowei Huang, Guangliang Cheng:
SIDA: Social Media Image Deepfake Detection, Localization and Explanation with Large Multimodal Model. CVPR 2025: 28831-28841
[c55]Lehan Yang, Lu Qi, Xiangtai Li, Sheng Li, Varun Jampani, Ming-Hsuan Yang:
Unified Dense Prediction of Video Diffusion. CVPR 2025: 28963-28973
[c54]Jinbin Bai, Tian Ye, Wei Chow, Enxin Song, Qing-Guo Chen, Xiangtai Li, Zhen Dong, Lei Zhu, Shuicheng Yan:
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis. ICLR 2025
[c53]Peiwen Sun, Sitong Cheng, Xiangtai Li, Zhen Ye, Huadai Liu, Honggang Zhang, Wei Xue, Yike Guo:
Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation. ICLR 2025
[c52]Shengqiong Wu, Hao Fei, Xiangtai Li, Jiayi Ji, Hanwang Zhang, Tat-Seng Chua, Shuicheng Yan:
Towards Semantic Equivalence of Tokenization in Multimodal LLM. ICLR 2025
[c51]Shilin Xu, Haobo Yuan, Qingyu Shi, Lu Qi, Jingbo Wang, Yibo Yang, Yining Li, Kai Chen, Yunhai Tong, Bernard Ghanem, Xiangtai Li, Ming-Hsuan Yang:
RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything. ICLR 2025
[c50]Jingtong Yue, Zhiwei Lin, Xin Lin, Xiaoyu Zhou, Xiangtai Li, Lu Qi, Yongtao Wang, Ming-Hsuan Yang:
RobuRCDet: Enhancing Robustness of Radar-Camera Fusion in Bird's Eye View for 3D Object Detection. ICLR 2025
[c49]Hao Fei, Yuan Zhou, Juncheng Li, Xiangtai Li, Qingshan Xu, Bobo Li, Shengqiong Wu, Yaoting Wang, Junbao Zhou, Jiahao Meng, Qingyu Shi, Zhiyuan Zhou, Liangtao Shi, Minghe Gao, Daoan Zhang, Zhiqi Ge, Siliang Tang, Kaihang Pan, Yaobo Ye, Haobo Yuan, Tao Zhang, Weiming Wu, Tianjie Ju, Zixiang Meng, Shilin Xu, Liyu Jia, Wentao Hu, Meng Luo, Jiebo Luo, Tat-Seng Chua, Shuicheng Yan, Hanwang Zhang:
On Path to Multimodal Generalist: General-Level and General-Bench. ICML 2025
[c48]Hao Zhou, Xu Yang, Mingyu Fan, Lu Qi, Xiangtai Li, Ming-Hsuan Yang, Fei Luo:
Three-Dimensional Trajectory Prediction with 3DMoTraj Dataset. ICML 2025
[c47]Huadai Liu, Tianyi Luo, Kaicheng Luo, Qikai Jiang, Peiwen Sun, Jialei Wang, Rongjie Huang, Qian Chen, Wen Wang, Xiangtai Li, Shiliang Zhang, Zhijie Yan, Zhou Zhao, Wei Xue:
OmniAudio: Generating Spatial Audio from 360-Degree Video. ICML 2025
[i143]Haobo Yuan, Xiangtai Li, Tao Zhang, Zilong Huang, Shilin Xu, Shunping Ji, Yunhai Tong, Lu Qi, Jiashi Feng, Ming-Hsuan Yang:
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos. CoRR abs/2501.04001 (2025)
[i142]Yikang Zhou, Tao Zhang, Shilin Xu, Shihao Chen, Qianyu Zhou, Yunhai Tong, Shunping Ji, Jiangning Zhang
, Xiangtai Li, Lu Qi:
Are They the Same? Exploring Visual Correspondence Shortcomings of Multimodal LLMs. CoRR abs/2501.04670 (2025)
[i141]Yu Qiu, Xin Lin, Jingbo Wang, Xiangtai Li, Lu Qi, Ming-Hsuan Yang:
UMC: Unified Resilient Controller for Legged Robots with Joint Malfunctions. CoRR abs/2502.03035 (2025)
[i140]Jingtong Yue, Zhiwei Lin, Xin Lin, Xiaoyu Zhou, Xiangtai Li, Lu Qi, Yongtao Wang, Ming-Hsuan Yang:
RobuRCDet: Enhancing Robustness of Radar-Camera Fusion in Bird's Eye View for 3D Object Detection. CoRR abs/2502.13071 (2025)
[i139]Lehan Yang, Lu Qi, Xiangtai Li, Sheng Li, Varun Jampani, Ming-Hsuan Yang:
Unified Dense Prediction of Video Diffusion. CoRR abs/2503.09344 (2025)
[i138]Shengqiong Wu, Hao Fei, Jingkang Yang, Xiangtai Li, Juncheng Li, Hanwang Zhang, Tat-Seng Chua:
Learning 4D Panoptic Scene Graph Generation from Rich 2D Visual Scene. CoRR abs/2503.15019 (2025)
[i137]Qingyu Shi, Jianzong Wu, Jinbin Bai, Jiangning Zhang
, Lu Qi, Xiangtai Li, Yunhai Tong:
Decouple and Track: Benchmarking and Improving Video Diffusion Transformers for Motion Transfer. CoRR abs/2503.17350 (2025)
[i136]Haobo Yuan, Tao Zhang, Xiangtai Li, Lu Qi, Zilong Huang, Shilin Xu, Jiashi Feng, Ming-Hsuan Yang:
4th PVUW MeViS 3rd Place Report: Sa2VA. CoRR abs/2504.00476 (2025)
[i135]Shaocong Long, Qianyu Zhou, Xiangtai Li, Chenhao Ying, Yunhai Tong, Lizhuang Ma, Yuan Luo, Dacheng Tao:
Generative Classifier for Domain Generalization. CoRR abs/2504.02272 (2025)
[i134]Sixiang Chen, Jinbin Bai, Zhuoran Zhao, Tian Ye, Qingyu Shi, Donghao Zhou, Wenhao Chai, Xin Lin, Jianzong Wu, Chao Tang
, Shilin Xu, Tao Zhang, Haobo Yuan, Yikang Zhou, Wei Chow, Linfeng Li, Xiangtai Li, Lei Zhu, Lu Qi:
An Empirical Study of GPT-4o Image Generation Capabilities. CoRR abs/2504.05979 (2025)
[i133]Weixian Lei, Jiacong Wang, Haochen Wang, Xiangtai Li, Jun Hao Liew, Jiashi Feng, Zilong Huang:
The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer. CoRR abs/2504.10462 (2025)
[i132]Tao Zhang, Xiangtai Li, Zilong Huang, Yanwei Li, Weixian Lei, Xueqing Deng, Shihao Chen, Shunping Ji, Jiashi Feng:
Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding. CoRR abs/2504.10465 (2025)
[i131]Henghui Ding, Chang Liu, Nikhila Ravi, Shuting He, Yunchao Wei, Song Bai, Philip Torr, Kehuan Song, Xinglin Xie, Kexin Zhang, Licheng Jiao, Lingling Li, Shuyuan Yang, Xuqiang Cao, Linnan Zhao, Jiaxuan Zhao, Fang Liu, Mengjiao Wang, Junpei Zhang, Xu Liu, Yuting Yang, Mengru Ma, Hao Fang, Runmin Cong, Xiankai Lu, Zhiyang Chen, Wei Zhang, Tianming Liang, Haichao Jiang, Wei-Shi Zheng, Jian-Fang Hu, Haobo Yuan, Xiangtai Li, Tao Zhang, Lu Qi, Ming-Hsuan Yang:
PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild. CoRR abs/2504.11326 (2025)
[i130]Mengshi Qi, Pengfei Zhu, Xiangtai Li, Xiaoyang Bi, Lu Qi, Huadong Ma, Ming-Hsuan Yang:
DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency. CoRR abs/2504.12080 (2025)
[i129]Xin Li, Yeying Jin, Xin Jin, Zongwei Wu, Bingchen Li, Yufei Wang, Wenhan Yang, Yu Li, Zhibo Chen, Bihan Wen, Robby T. Tan, Radu Timofte, Qiyu Rong, Hongyuan Jing, Mengmeng Zhang, Jinglong Li, Xiangyu Lu, Yi Ren, Yuting Liu, Meng Zhang, Xiang Chen, Qiyuan Guan, Jiangxin Dong, Jinshan Pan, Conglin Gou, Qirui Yang, Fangpu Zhang, Yunlong Lin, Sixiang Chen, Guoxi Huang, Ruirui Lin, Yan Zhang, Jingyu Yang, Huanjing Yue, Jiyuan Chen, Qiaosi Yi, Hongjun Wang, Chenxi Xie, Shuai Li, Yuhui Wu, Kaiyi Ma, Jiakui Hu, Juncheng Li, Liwen Pan, Guangwei Gao, Wenjie Li, Zhenyu Jin, Heng Guo, Zhanyu Ma, Yubo Wang, Jinghua Wang, Wangzhi Xing, Anjusree Karnavar, Diqi Chen, Mohammad Aminul Islam, Hao Yang, Ruikun Zhang, Liyuan Pan, Qianhao Luo, XinCao, Han Zhou, Yan Min, Wei Dong, Jun Chen, Taoyi Wu, Weijia Dou, Yu Wang, Shengjie Zhao, Yongcheng Huang, Xingyu Han, Anyan Huang, Hongtao Wu, Hong Wang, Yefeng Zheng, Abhijeet Kumar, Aman Kumar, Marcos V. Conde, Paula Garrido, Daniel Feijoo, Juan C. Benito, Guanglu Dong, Xin Lin, Siyuan Liu, Tianheng Zheng, Jiayu Zhong, Shouyi Wang, Xiangtai Li, Lanqing Guo, Lu Qi, Chao Ren, Shuaibo Wang, Shilong Zhang, Wanyu Zhou, Yunze Wu, Qinzhong Tan, Jieyuan Pei, Zhuoxuan Li, Jiayu Wang, Haoyu Bian, Haoran Sun:
NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results. CoRR abs/2504.12711 (2025)
[i128]Huadai Liu, Tianyi Luo, Qikai Jiang, Kaicheng Luo, Peiwen Sun, Jialei Wang, Rongjie Huang, Qian Chen, Wen Wang, Xiangtai Li, Shiliang Zhang, Zhijie Yan, Zhou Zhao, Wei Xue:
OmniAudio: Generating Spatial Audio from 360-Degree Video. CoRR abs/2504.14906 (2025)
[i127]Muyi Bao, Shuchang Lyu
, Zhaoyang Xu, Huiyu Zhou, Jinchang Ren, Shiming Xiang, Xiangtai Li, Guangliang Cheng:
Vision Mamba in Remote Sensing: A Comprehensive Survey of Techniques, Applications and Outlook. CoRR abs/2505.00630 (2025)
[i126]Hao Fei, Yuan Zhou, Juncheng Li, Xiangtai Li, Qingshan Xu, Bobo Li, Shengqiong Wu, Yaoting Wang, Junbao Zhou, Jiahao Meng, Qingyu Shi, Zhiyuan Zhou, Liangtao Shi, Minghe Gao, Daoan Zhang, Zhiqi Ge, Weiming Wu, Siliang Tang, Kaihang Pan, Yaobo Ye, Haobo Yuan, Tao Zhang, Tianjie Ju, Zixiang Meng, Shilin Xu, Liyu Jia, Wentao Hu, Meng Luo
, Jiebo Luo, Tat-Seng Chua, Shuicheng Yan, Hanwang Zhang:
On Path to Multimodal Generalist: General-Level and General-Bench. CoRR abs/2505.04620 (2025)
[i125]Haiquan Wen, Yiwei He, Zhenglin Huang, Tianxiao Li, Zihan Yu, Xingru Huang, Lu Qi, Baoyuan Wu, Xiangtai Li, Guangliang Cheng:
BusterX: MLLM-Powered AI-Generated Video Forgery Detection and Explanation. CoRR abs/2505.12620 (2025)
[i124]Chaoyang Wang, Xiangtai Li, Lu Qi, Xiaofan Lin, Jinbin Bai, Qianyu Zhou, Yunhai Tong:
Conditional Panoramic Image Generation via Masked Autoregressive Modeling. CoRR abs/2505.16862 (2025)
[i123]Zhenglin Huang, Tianxiao Li, Xiangtai Li, Haiquan Wen, Yiwei He, Jiangning Zhang, Hao Fei, Xi Yang, Xiaowei Huang, Bei Peng, Guangliang Cheng:
So-Fake: Benchmarking and Explaining Social Media Image Forgery Detection. CoRR abs/2505.18660 (2025)
[i122]Zitong Wang, Hang Zhao, Qianyu Zhou, Xuequan Lu, Xiangtai Li, Yiren Song:
DiffDecompose: Layer-Wise Decomposition of Alpha-Composited Images via Diffusion Transformers. CoRR abs/2505.21541 (2025)
[i121]Qingyu Shi, Jinbin Bai, Zhuoran Zhao, Wenhao Chai, Kaidong Yu, Jianzong Wu, Shuangyong Song, Yunhai Tong, Xiangtai Li, Xuelong Li, Shuicheng Yan:
Muddit: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model. CoRR abs/2505.23606 (2025)
[i120]Song Wang, Gongfan Fang, Lingdong Kong, Xiangtai Li, Jianyun Xu, Sheng Yang, Qiang Li, Jianke Zhu, Xinchao Wang:
PixelThink: Towards Efficient Chain-of-Pixel Reasoning. CoRR abs/2505.23727 (2025)
[i119]Shilin Xu, Yanwei Li, Rui Yang, Tao Zhang, Yueyi Sun, Wei Chow, Linfeng Li, Hang Song, Qi Xu, Yunhai Tong, Xiangtai Li, Hao Fei:
Mixed-R1: Unified Reward Perspective For Reasoning Capability in Multimodal Large Language Models. CoRR abs/2505.24164 (2025)
[i118]Wei Chow, Yuan Gao, Linfeng Li, Xian Wang, Qi Xu, Hang Song, Lingdong Kong, Ran Zhou, Yi Zeng, Yidong Cai, Botian Jiang, Shilin Xu, Jiajun Zhang, Minghui Qiu, Xiangtai Li, Tianshu Yang, Siliang Tang, Juncheng Li:
MERIT: Multilingual Semantic Retrieval with Interleaved Multi-Condition Query. CoRR abs/2506.03144 (2025)
[i117]Jiahao Meng, Shuyang Sun, Yue Tan, Lu Qi, Yunhai Tong, Xiangtai Li, Longyin Wen:
CyberV: Cybernetics for Test-time Scaling in Video Understanding. CoRR abs/2506.07971 (2025)
[i116]Zhucun Xue, Jiangning Zhang
, Xurong Xie, Yuxuan Cai, Yong Liu, Xiangtai Li, Dacheng Tao:
AdaVideoRAG: Omni-Contextual Adaptive Retrieval-Augmented Efficient Long Video Understanding. CoRR abs/2506.13589 (2025)
[i115]Zhucun Xue, Jiangning Zhang
, Teng Hu, Haoyang He, Yinan Chen, Yuxuan Cai, Yabiao Wang, Chengjie Wang, Yong Liu, Xiangtai Li, Dacheng Tao:
UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions. CoRR abs/2506.13691 (2025)
[i114]Yikang Zhou, Tao Zhang, Dizhe Zhang, Shunping Ji, Xiangtai Li, Lu Qi:
Dense360: Dense Understanding from Omnidirectional Panoramas. CoRR abs/2506.14471 (2025)
[i113]Yiwei He, Xiangtai Li, Zhenglin Huang, Yi Dong, Hao Fei, Jiangning Zhang
, Baoyuan Wu, Guangliang Cheng:
Towards Explainable Bilingual Multimodal Misinformation Detection and Localization. CoRR abs/2506.22930 (2025)
[i112]Xiangtai Li, Tao Zhang, Yanwei Li, Haobo Yuan, Shihao Chen, Yikang Zhou, Jiahao Meng, Yueyi Sun, Shilin Xu, Lu Qi, Tianheng Cheng, Yi Lin, Zilong Huang, Wenhao Huang, Jiashi Feng, Guang Shi:
DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World. CoRR abs/2506.24102 (2025)
[i111]Qingdong He, Xueqin Chen, Chaoyi Wang, Yanjie Pan, Xiaobin Hu, Zhenye Gan, Yabiao Wang
, Chengjie Wang
, Xiangtai Li, Jiangning Zhang
:
Reasoning to Edit: Hypothetical Instruction-Based Image Editing with Visual Reasoning. CoRR abs/2507.01908 (2025)
[i110]Haochen Wang, Xiangtai Li, Zilong Huang, Anran Wang, Jiacong Wang, Tao Zhang, Jiani Zheng, Sule Bai, Zijian Kang, Jiashi Feng, Zhuochen Wang, Zhaoxiang Zhang:
Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology. CoRR abs/2507.07999 (2025)
[i109]Yuhu Bai, Jiangning Zhang
, Yunkang Cao
, Guangyuan Lu, Qingdong He, Xiangtai Li, Guanzhong Tian:
Bridge Feature Matching and Cross-Modal Alignment with Mutual-filtering for Zero-shot Anomaly Detection. CoRR abs/2507.11003 (2025)
[i108]Mengyuan Liu, Xinshun Wang, Zhongbin Fang, Deheng Ye, Xia Li, Tao Tang, Songtao Wu, Xiangtai Li, Ming-Hsuan Yang:
Human-in-Context: Unified Cross-Domain 3D Human Motion Modeling via In-Context Learning. CoRR abs/2508.10897 (2025)
[i107]Haidong Xu, Guangwei Xu, Zhedong Zheng, Xiatian Zhu, Wei Ji, Xiangtai Li, Ruijie Guo, Meishan Zhang, Min Zhang, Hao Fei:
VimoRAG: Video-based Retrieval-augmented 3D Motion Generation for Motion Language Models. CoRR abs/2508.12081 (2025)
[i106]Hao Yang, Qianyu Zhou, Haijia Sun, Xiangtai Li, Xuequan Lu, Lizhuang Ma, Shuicheng Yan:
PointDGRWKV: Generalizing RWKV-like Architecture to Unseen Domains for Point Cloud Classification. CoRR abs/2508.20835 (2025)
[i105]Xin Lin, Xian Ge, Dizhe Zhang, Zhaoliang Wan, Xianshun Wang, Xiangtai Li, Wenjie Jiang, Bo Du, Dacheng Tao, Ming-Hsuan Yang, Lu Qi:
One Flight Over the Gap: A Survey from Perspective to Panoramic Vision. CoRR abs/2509.04444 (2025)
[i104]Quanzhu Niu, Dengxian Gong, Shihao Chen, Tao Zhang, Yikang Zhou, Haobo Yuan, Lu Qi, Xiangtai Li, Shunping Ji:
The 1st Solution for 7th LSVOS RVOS Track: SaSaSa2VA. CoRR abs/2509.16972 (2025)
[i103]Meixi Song, Xin Lin, Dizhe Zhang, Haodong Li, Xiangtai Li, Bo Du, Lu Qi:
D2GS: Depth-and-Density Guided Gaussian Splatting for Stable and Accurate Sparse-View Reconstruction. CoRR abs/2510.08566 (2025)
[i102]Chang Liu, Henghui Ding, Kaining Ying, Lingyi Hong, Ning Xu, Linjie Yang, Yuchen Fan, Mingqi Gao, Jingkun Chen, Yunqi Miao, Gengshen Wu, Zhijin Qin, Jungong Han, Zhixiong Zhang, Shuangrui Ding, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Jiaqi Wang, Chang Soo Lim, Joonyoung Moon, Donghyeon Cho, Tingmin Li, Yixuan Li, Yang Yang, An Yan, Leilei Cao, Feng Lu, Ran Hong, Youhai Jiang, Fengjie Zhu, Yujie Xie, Hongyang Zhang, Zhihui Liu, Shihai Ruan, Quanzhu Niu, Dengxian Gong, Shihao Chen, Tao Zhang, Yikang Zhou, Haobo Yuan, Lu Qi, Xiangtai Li, Shunping Ji, Alexey Nekrasov, Ali Athar, Daan de Geus, Alexander Hermans, Bastian Leibe:
LSVOS 2025 Challenge Report: Recent Advances in Complex Video Object Segmentation. CoRR abs/2510.11063 (2025)
[i101]Haoran Feng, Dizhe Zhang, Xiangtai Li, Bo Du, Lu Qi:
DiT360: High-Fidelity Panoramic Image Generation via Hybrid Training. CoRR abs/2510.11712 (2025)
[i100]Haochen Wang, Yuhao Wang, Tao Zhang, Yikang Zhou, Yanwei Li, Jiacong Wang, Jiani Zheng, Ye Tian, Jiahao Meng, Zilong Huang, Guangcan Mai, Anran Wang, Yunhai Tong, Zhuochen Wang, Xiangtai Li, Zhaoxiang Zhang:
Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs. CoRR abs/2510.18876 (2025)
[i99]Jiahao Meng, Xiangtai Li, Haochen Wang, Yue Tan, Tao Zhang, Lingdong Kong, Yunhai Tong, Anran Wang, Zhiyang Teng, Yujing Wang, Zhuochen Wang:
Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence. CoRR abs/2510.20579 (2025)
[i98]Jinbin Bai, Yu Lei, Hecong Wu, Yuchen Zhu, Shufan Li, Yi Xin, Xiangtai Li, Molei Tao, Aditya Grover, Ming-Hsuan Yang:
From Masks to Worlds: A Hitchhiker's Guide to World Models. CoRR abs/2510.20668 (2025)
[i97]Jiani Zheng, Zhiyang Teng, Xiangtai Li, Anran Wang, Yu Tian, Kunpeng Qiu, Ye Tian, Haochen Wang, Zhuochen Wang:
PairUni: Pairwise Training for Unified Multimodal Language Models. CoRR abs/2510.25682 (2025)
[i96]Ziyu Guo, Xinyan Chen, Renrui Zhang, Ruichuan An, Yu Qi, Dongzhi Jiang, Xiangtai Li, Manyuan Zhang, Hongsheng Li, Pheng-Ann Heng:
Are Video Models Ready as Zero-Shot Reasoners? An Empirical Study with the MME-CoF Benchmark. CoRR abs/2510.26802 (2025)
[i95]Rui Yang, Ziyu Zhu, Yanwei Li, Jingjia Huang, Shen Yan, Siyuan Zhou, Zhe Liu, Xiangtai Li, Shuangye Li, Wenqian Wang, Yi Lin, Hengshuang Zhao:
Visual Spatial Tuning. CoRR abs/2511.05491 (2025)- 2024
[j18]Zhongbin Fang
, Xia Li
, Xiangtai Li, Shen Zhao, Mengyuan Liu:
ModelNet-O: A large-scale synthetic dataset for occlusion-aware point cloud classification. Comput. Vis. Image Underst. 246: 104060 (2024)
[j17]Xiangtai Li, Jiangning Zhang
, Yibo Yang, Guangliang Cheng, Kuiyuan Yang, Yunhai Tong
, Dacheng Tao
:
Sfnet: Faster and Accurate Semantic Segmentation Via Semantic Flow. Int. J. Comput. Vis. 132(2): 466-489 (2024)
[j16]Jiangning Zhang
, Xiangtai Li, Yabiao Wang
, Chengjie Wang
, Yibo Yang, Yong Liu, Dacheng Tao
:
EATFormer: Improving Vision Transformer Inspired by Evolutionary Algorithm. Int. J. Comput. Vis. 132(9): 3509-3536 (2024)
[j15]Chunlei Wang
, Wenquan Feng, Xiangtai Li
, Guangliang Cheng
, Shuchang Lyu
, Binghao Liu
, Lijiang Chen
, Qi Zhao
:
OV-VG: A benchmark for open-vocabulary visual grounding. Neurocomputing 591: 127738 (2024)
[j14]Jianzong Wu
, Xiangtai Li
, Shilin Xu
, Haobo Yuan
, Henghui Ding
, Yibo Yang
, Xia Li
, Jiangning Zhang
, Yunhai Tong
, Xudong Jiang
, Bernard Ghanem
, Dacheng Tao
:
Towards Open Vocabulary Learning: A Survey. IEEE Trans. Pattern Anal. Mach. Intell. 46(7): 5092-5113 (2024)
[j13]Yue Han
, Jiangning Zhang
, Yabiao Wang
, Chengjie Wang
, Yong Liu
, Lu Qi
, Ming-Hsuan Yang
, Xiangtai Li
:
Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 46(12): 9221-9238 (2024)
[j12]Xiangtai Li
, Henghui Ding
, Haobo Yuan
, Wenwei Zhang
, Jiangmiao Pang
, Guangliang Cheng
, Kai Chen
, Ziwei Liu
, Chen Change Loy
:
Transformer-Based Visual Segmentation: A Survey. IEEE Trans. Pattern Anal. Mach. Intell. 46(12): 10138-10163 (2024)
[j11]Jinghao Wang
, Zhengyu Wen
, Xiangtai Li
, Zujin Guo, Jingkang Yang
, Ziwei Liu
:
Pair Then Relation: Pair-Net for Panoptic Scene Graph Generation. IEEE Trans. Pattern Anal. Mach. Intell. 46(12): 10452-10465 (2024)
[j10]Xiangtai Li
, Shilin Xu
, Yibo Yang
, Haobo Yuan
, Guangliang Cheng
, Yunhai Tong
, Zhouchen Lin
, Ming-Hsuan Yang
, Dacheng Tao
:
Panoptic-PartFormer++: A Unified and Decoupled View for Panoptic Part Segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 46(12): 11087-11103 (2024)
[j9]Guangliang Cheng
, Yunmeng Huang, Xiangtai Li, Shuchang Lyu
, Zhaoyang Xu, Hongbo Zhao
, Qi Zhao
, Shiming Xiang
:
Change Detection Methods for Remote Sensing in the Last Decade: A Comprehensive Review. Remote. Sens. 16(13): 2355 (2024)
[j8]Yangyang Xu
, Xiangtai Li
, Haobo Yuan
, Yibo Yang
, Lefei Zhang
:
Multi-Task Learning With Multi-Query Transformer for Dense Prediction. IEEE Trans. Circuits Syst. Video Technol. 34(2): 1228-1240 (2024)
[j7]Jianzong Wu, Xiangtai Li
, Xia Li
, Henghui Ding
, Yunhai Tong
, Dacheng Tao
:
Toward Robust Referring Image Segmentation. IEEE Trans. Image Process. 33: 1782-1794 (2024)
[c46]Tianmeng Yang
, Jiahao Meng
, Min Zhou
, Yaming Yang
, Yujing Wang
, Xiangtai Li
, Yunhai Tong
:
You Can't Ignore Either: Unifying Structure and Feature Denoising for Robust Graph Learning. CIKM 2024: 4178-4182
[c45]Peng Lu, Tao Jiang, Yining Li, Xiangtai Li, Kai Chen, Wenming Yang:
RTMO: Towards High-Performance One-Stage Real-Time Multi-Person Pose Estimation. CVPR 2024: 1491-1500
[c44]Xinshun Wang, Zhongbin Fang, Xia Li
, Xiangtai Li, Chen Chen, Mengyuan Liu:
Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learning. CVPR 2024: 2436-2446
[c43]Yiran Song, Qianyu Zhou, Xiangtai Li, Deng-Ping Fan, Xuequan Lu, Lizhuang Ma:
BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model. CVPR 2024: 3162-3173
[c42]Jianzong Wu, Xiangtai Li, Chenyang Si, Shangchen Zhou, Jingkang Yang, Jiangning Zhang
, Yining Li, Kai Chen, Yunhai Tong, Ziwei Liu, Chen Change Loy:
Towards Language-Driven Video Inpainting via Multimodal Large Language Models. CVPR 2024: 12501-12511
[c41]Chang Liu, Xiangtai Li, Henghui Ding:
Referring Image Editing: Object-Level Image Editing via Referring Expressions. CVPR 2024: 13128-13138
[c40]Xiangtai Li, Haobo Yuan, Wei Li, Henghui Ding, Size Wu, Wenwei Zhang, Yining Li, Kai Chen, Chen Change Loy:
OMG-Seg: Is One Model Good Enough for all Segmentation? CVPR 2024: 27948-27959
[c39]Yue Han, Junwei Zhu, Keke He, Xu Chen, Yanhao Ge, Wei Li, Xiangtai Li, Jiangning Zhang
, Chengjie Wang
, Yong Liu:
Face-Adapter for Pre-trained Diffusion Models with Fine-Grained ID and Attribute Control. ECCV (50) 2024: 20-36
[c38]Xiaojie Li
, Yibo Yang
, Xiangtai Li
, Jianlong Wu
, Yue Yu
, Bernard Ghanem
, Min Zhang
:
GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning. ECCV (68) 2024: 306-325
[c37]Haobo Yuan
, Xiangtai Li
, Chong Zhou
, Yining Li
, Kai Chen
, Chen Change Loy
:
Open-Vocabulary SAM: Segment and Recognize Twenty-Thousand Classes Interactively. ECCV (43) 2024: 419-437
[c36]Yikang Zhou
, Tao Zhang
, Shunping Ji
, Shuicheng Yan
, Xiangtai Li
:
Improving Video Segmentation via Dynamic Anchor Queries. ECCV (50) 2024: 446-463
[c35]Size Wu, Wenwei Zhang, Lumin Xu, Sheng Jin, Xiangtai Li, Wentao Liu, Chen Change Loy:
CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction. ICLR 2024
[c34]Zhichao Deng, Xiangtai Li, Xia Li
, Yunhai Tong, Shen Zhao, Mengyuan Liu:
VG4D: Vision-Language Model Goes 4D Video Recognition. ICRA 2024: 5014-5020
[c33]Shaocong Long
, Qianyu Zhou
, Xiangtai Li
, Xuequan Lu
, Chenhao Ying
, Yuan Luo
, Lizhuang Ma
, Shuicheng Yan
:
DGMamba: Domain Generalization via Generalized State Space Model. ACM Multimedia 2024: 3607-3616
[c32]Hao Fei
, Xiangtai Li
, Haotian Liu
, Fuxiao Liu
, Zhuosheng Zhang
, Hanwang Zhang
, Shuicheng Yan
:
From Multimodal LLM to Human-level AI: Modality, Instruction, Reasoning and Beyond. ACM Multimedia 2024: 11289-11291
[c31]Haoyang He, Yuhu Bai, Jiangning Zhang, Qingdong He, Hongxu Chen, Zhenye Gan, Chengjie Wang, Xiangtai Li, Guanzhong Tian, Lei Xie:
MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection. NeurIPS 2024
[c30]Chaoyang Wang, Xiangtai Li, Lu Qi, Henghui Ding, Yunhai Tong, Ming-Hsuan Yang:
SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow. NeurIPS 2024
[c29]Jianzong Wu, Xiangtai Li, Yanhong Zeng, Jiangning Zhang, Qianyu Zhou, Yining Li, Yunhai Tong, Kai Chen:
MotionBooth: Motion-Aware Customized Text-to-Video Generation. NeurIPS 2024
[c28]Tao Zhang, Xiangtai Li, Hao Fei, Haobo Yuan, Shengqiong Wu, Shunping Ji, Chen Change Loy, Shuicheng Yan:
OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding. NeurIPS 2024
[c27]Yu Zhao, Hao Fei, Xiangtai Li, Libo Qin, Jiayi Ji, Hongyuan Zhu, Meishan Zhang, Min Zhang, Jianguo Wei:
Synergistic Dual Spatial-aware Generation of Image-to-text and Text-to-image. NeurIPS 2024
[i94]Yue Han, Jiangning Zhang
, Junwei Zhu, Xiangtai Li, Yanhao Ge, Wei Li, Chengjie Wang
, Yong Liu, Xiaoming Liu, Ying Tai:
A Generalist FaceX via Learning Unified Facial Representation. CoRR abs/2401.00551 (2024)
[i93]Yiran Song, Qianyu Zhou, Xiangtai Li, Deng-Ping Fan, Xuequan Lu, Lizhuang Ma:
BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model. CoRR abs/2401.02317 (2024)
[i92]Xiangyu Zhao, Yicheng Chen, Shilin Xu, Xiangtai Li, Xinjiang Wang, Yining Li, Haian Huang:
An Open and Comprehensive Pipeline for Unified Object Grounding and Detection. CoRR abs/2401.02361 (2024)
[i91]Haobo Yuan, Xiangtai Li, Chong Zhou, Yining Li, Kai Chen, Chen Change Loy:
Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively. CoRR abs/2401.02955 (2024)
[i90]Zhongbin Fang, Xia Li
, Xiangtai Li, Shen Zhao, Mengyuan Liu:
ModelNet-O: A Large-Scale Synthetic Dataset for Occlusion-Aware Point Cloud Classification. CoRR abs/2401.08210 (2024)
[i89]Jianzong Wu, Xiangtai Li, Chenyang Si, Shangchen Zhou, Jingkang Yang, Jiangning Zhang
, Yining Li, Kai Chen, Yunhai Tong, Ziwei Liu, Chen Change Loy:
Towards Language-Driven Video Inpainting via Multimodal Large Language Models. CoRR abs/2401.10226 (2024)
[i88]Shilin Xu, Haobo Yuan, Qingyu Shi, Lu Qi, Jingbo Wang, Yibo Yang, Yining Li, Kai Chen, Yunhai Tong, Bernard Ghanem, Xiangtai Li, Ming-Hsuan Yang:
RAP-SAM: Towards Real-Time All-Purpose Segment Anything. CoRR abs/2401.10228 (2024)
[i87]Xiangtai Li, Haobo Yuan, Wei Li, Henghui Ding, Size Wu, Wenwei Zhang, Yining Li, Kai Chen, Chen Change Loy:
OMG-Seg: Is One Model Good Enough For All Segmentation? CoRR abs/2401.10229 (2024)
[i86]Lu Qi, Yi-Wen Chen, Lehan Yang, Tiancheng Shen, Xiangtai Li, Weidong Guo, Yu Xu, Ming-Hsuan Yang:
Generalizable Entity Grounding via Assistance of Large Language Model. CoRR abs/2402.02555 (2024)
[i85]Tao Zhang, Xiangtai Li, Haobo Yuan, Shunping Ji, Shuicheng Yan:
Point Cloud Mamba: Point Cloud Learning via State Space Model. CoRR abs/2403.00762 (2024)
[i84]Chaoyang Wang, Xiangtai Li, Henghui Ding, Lu Qi, Jiangning Zhang
, Yunhai Tong, Chen Change Loy, Shuicheng Yan:
Explore In-Context Segmentation via Latent Diffusion Models. CoRR abs/2403.09616 (2024)
[i83]Xiaojie Li, Yibo Yang, Xiangtai Li, Jianlong Wu, Yue Yu, Bernard Ghanem, Min Zhang:
GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning. CoRR abs/2403.12003 (2024)
[i82]Yikang Zhou, Tao Zhang, Shunping Ji, Shuicheng Yan, Xiangtai Li:
DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries. CoRR abs/2404.00086 (2024)
[i81]Haoyang He, Yuhu Bai, Jiangning Zhang
, Qingdong He, Hongxu Chen, Zhenye Gan, Chengjie Wang
, Xiangtai Li, Guanzhong Tian, Lei Xie:
MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection. CoRR abs/2404.06564 (2024)
[i80]Shaocong Long, Qianyu Zhou, Xiangtai Li, Xuequan Lu, Chenhao Ying, Yuan Luo, Lizhuang Ma, Shuicheng Yan:
DGMamba: Domain Generalization via Generalized State Space Model. CoRR abs/2404.07794 (2024)
[i79]Jiangning Zhang
, Chengjie Wang
, Xiangtai Li, Guanzhong Tian, Zhucun Xue, Yong Liu, Guansong Pang, Dacheng Tao
:
Learning Feature Inversion for Multi-class Anomaly Detection under General-purpose COCO-AD Benchmark. CoRR abs/2404.10760 (2024)
[i78]Zhichao Deng, Xiangtai Li, Xia Li
, Yunhai Tong, Shen Zhao, Mengyuan Liu:
VG4D: Vision-Language Model Goes 4D Video Recognition. CoRR abs/2404.11605 (2024)
[i77]Mengyuan Liu, Zhongbin Fang, Xia Li
, Joachim M. Buhmann, Xiangtai Li, Chen Change Loy:
Point-In-Context: Understanding Point Cloud via In-Context Learning. CoRR abs/2404.12352 (2024)
[i76]Jingkang Yang, Jun Cen, Wenxuan Peng, Shuai Liu, Fangzhou Hong, Xiangtai Li, Kaiyang Zhou, Qifeng Chen, Ziwei Liu:
4D Panoptic Scene Graph Generation. CoRR abs/2405.10305 (2024)
[i75]Yue Han, Junwei Zhu, Keke He, Xu Chen, Yanhao Ge, Wei Li, Xiangtai Li, Jiangning Zhang
, Chengjie Wang
, Yong Liu:
Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control. CoRR abs/2405.12970 (2024)
[i74]Fengfan Zhou, Qianyu Zhou, Xiangtai Li, Xuequan Lu, Lizhuang Ma, Hefei Ling:
Adversarial Attacks on Both Face Recognition and Face Anti-spoofing Models. CoRR abs/2405.16940 (2024)
[i73]Kuan-Chih Huang, Xiangtai Li, Lu Qi, Shuicheng Yan, Ming-Hsuan Yang:
Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model. CoRR abs/2405.17427 (2024)
[i72]Chaoyang Wang, Xiangtai Li, Lu Qi, Henghui Ding, Yunhai Tong, Ming-Hsuan Yang:
SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow. CoRR abs/2405.20282 (2024)
[i71]Zheng Zhou, Hongbo Zhao, Guangliang Cheng, Xiangtai Li, Shuchang Lyu
, Wenquan Feng, Qi Zhao:
BACON: Bayesian Optimal Condensation Framework for Dataset Distillation. CoRR abs/2406.01112 (2024)
[i70]Shengqiong Wu, Hao Fei, Xiangtai Li, Jiayi Ji, Hanwang Zhang, Tat-Seng Chua, Shuicheng Yan:
Towards Semantic Equivalence of Tokenization in Multimodal LLM. CoRR abs/2406.05127 (2024)
[i69]Jianzong Wu, Xiangtai Li, Yanhong Zeng, Jiangning Zhang
, Qianyu Zhou, Yining Li, Yunhai Tong, Kai Chen:
MotionBooth: Motion-Aware Customized Text-to-Video Generation. CoRR abs/2406.17758 (2024)
[i68]Xiangyu Zhao, Xiangtai Li, Haodong Duan, Haian Huang, Yining Li, Kai Chen, Hua Yang:
MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning. CoRR abs/2406.17770 (2024)
[i67]Haobo Yuan, Xiangtai Li, Lu Qi, Tao Zhang, Ming-Hsuan Yang, Shuicheng Yan, Chen Change Loy:
Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model. CoRR abs/2406.19369 (2024)
[i66]Tao Zhang, Xiangtai Li, Hao Fei, Haobo Yuan, Shengqiong Wu, Shunping Ji, Chen Change Loy, Shuicheng Yan:
OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding. CoRR abs/2406.19389 (2024)
[i65]Yicheng Chen, Xiangtai Li, Yining Li, Yanhong Zeng, Jianzong Wu, Xiangyu Zhao, Kai Chen:
Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language. CoRR abs/2406.20085 (2024)
[i64]Shilin Xu, Xiangtai Li, Haobo Yuan, Lu Qi, Yunhai Tong, Ming-Hsuan Yang:
LLAVADI: What Matters For Multimodal Large Language Models Distillation. CoRR abs/2407.19409 (2024)
[i63]Tianmeng Yang, Jiahao Meng, Min Zhou, Yaming Yang, Yujing Wang, Xiangtai Li, Yunhai Tong:
You Can't Ignore Either: Unifying Structure and Feature Denoising for Robust Graph Learning. CoRR abs/2408.00700 (2024)
[i62]Hao Yang, Qianyu Zhou, Haijia Sun, Xiangtai Li, Fengqi Liu, Xuequan Lu, Lizhuang Ma, Shuicheng Yan:
PointDGMamba: Domain Generalization of Point Cloud Classification via Generalized State Space Model. CoRR abs/2408.13574 (2024)
[i61]Yue Han, Junwei Zhu, Yuxiang Feng, Xiaozhong Ji, Keke He, Xiangtai Li, Zhucun Xue, Yong Liu:
MIMAFace: Face Animation via Motion-Identity Modulated Appearance Feature Learning. CoRR abs/2409.15179 (2024)
[i60]Yujin Tang, Lu Qi, Fei Xie, Xiangtai Li, Chao Ma, Ming-Hsuan Yang:
PredFormer: Transformers Are Effective Spatial-Temporal Predictive Learners. CoRR abs/2410.04733 (2024)
[i59]Jinbin Bai, Tian Ye, Wei Chow, Enxin Song, Qing-Guo Chen, Xiangtai Li, Zhen Dong, Lei Zhu, Shuicheng Yan:
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis. CoRR abs/2410.08261 (2024)
[i58]Peiwen Sun, Sitong Cheng
, Xiangtai Li, Zhen Ye, Huadai Liu, Honggang Zhang, Wei Xue, Yike Guo
:
Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation. CoRR abs/2410.10676 (2024)
[i57]Yu Zhao, Hao Fei, Xiangtai Li, Libo Qin, Jiayi Ji, Hongyuan Zhu, Meishan Zhang, Min Zhang, Jianguo Wei:
Synergistic Dual Spatial-aware Generation of Image-to-Text and Text-to-Image. CoRR abs/2410.15312 (2024)
[i56]Qingyu Shi, Lu Qi, Jianzong Wu, Jinbin Bai, Jingbo Wang, Yunhai Tong, Xiangtai Li, Ming-Hsuan Yang:
RelationBooth: Towards Relation-Aware Customized Object Generation. CoRR abs/2410.23280 (2024)
[i55]Qingdong He, Jinlong Peng, Pengcheng Xu, Boyuan Jiang, Xiaobin Hu, Donghao Luo, Yong Liu, Yabiao Wang
, Chengjie Wang
, Xiangtai Li, Jiangning Zhang
:
DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation. CoRR abs/2412.03255 (2024)
[i54]Jinbin Bai, Wei Chow, Ling Yang, Xiangtai Li, Juncheng Li, Hanwang Zhang, Shuicheng Yan:
HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editing. CoRR abs/2412.04280 (2024)
[i53]Zhenglin Huang, Jinwei Hu, Xiangtai Li, Yiwei He, Xingyu Zhao
, Bei Peng, Baoyuan Wu, Xiaowei Huang, Guangliang Cheng:
SIDA: Social Media Image Deepfake Detection, Localization and Explanation with Large Multimodal Model. CoRR abs/2412.04292 (2024)
[i52]Jiangning Zhang
, Teng Hu, Haoyang He, Zhucun Xue, Yabiao Wang
, Chengjie Wang
, Yong Li, Xiangtai Li, Dacheng Tao
:
EMOv2: Pushing 5M Vision Model Frontier. CoRR abs/2412.06674 (2024)
[i51]Jianzong Wu, Chao Tang
, Jingbo Wang, Yanhong Zeng, Xiangtai Li, Yunhai Tong:
DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation. CoRR abs/2412.07589 (2024)- 2023
[j6]Xiangtai Li
, Hao He, Yibo Yang
, Henghui Ding
, Kuiyuan Yang
, Guangliang Cheng
, Yunhai Tong
, Dacheng Tao
:
Improving Video Instance Segmentation via Temporal Pyramid Routing. IEEE Trans. Pattern Anal. Mach. Intell. 45(5): 6594-6601 (2023)
[j5]Qianyu Zhou
, Xiangtai Li
, Lu He, Yibo Yang
, Guangliang Cheng
, Yunhai Tong
, Lizhuang Ma
, Dacheng Tao
:
TransVOD: End-to-End Video Object Detection With Spatial-Temporal Transformers. IEEE Trans. Pattern Anal. Mach. Intell. 45(6): 7853-7869 (2023)
[j4]Yujing Wang
, Yaming Yang, Zhuo Li
, Jiangang Bai, Mingliang Zhang, Xiangtai Li
, Jing Yu, Ce Zhang, Gao Huang
, Yunhai Tong
:
Convolution-Enhanced Evolving Attention Networks. IEEE Trans. Pattern Anal. Mach. Intell. 45(7): 8176-8192 (2023)
[j3]Guozheng Xu
, Xue Jiang
, Xiangtai Li
, Ze Zhang, Xingzhao Liu
:
Exploring Self-Supervised Learning for Multi-Modal Remote Sensing Pre-Training via Asymmetric Attention Fusion. Remote. Sens. 15(24): 5682 (2023)
[c26]Jingkang Yang, Wenxuan Peng, Xiangtai Li, Zujin Guo, Liangyu Chen, Bo Li, Zheng Ma, Kaiyang Zhou, Wayne Zhang
, Chen Change Loy, Ziwei Liu:
Panoptic Video Scene Graph Generation. CVPR 2023: 18675-18685
[c25]Jiangning Zhang
, Xiangtai Li, Jian Li, Liang Liu, Zhucun Xue, Boshen Zhang, Zhengkai Jiang, Tianxin Huang, Yabiao Wang
, Chengjie Wang
:
Rethinking Mobile Block for Efficient Attention-based Models. ICCV 2023: 1389-1400
[c24]Xiangtai Li, Haobo Yuan, Wenwei Zhang, Guangliang Cheng, Jiangmiao Pang, Chen Change Loy:
Tube-Link: A Flexible Cross Tube Framework for Universal Video Segmentation. ICCV 2023: 13877-13887
[c23]Jianzong Wu, Xiangtai Li, Henghui Ding, Xia Li
, Guangliang Cheng, Yunhai Tong, Chen Change Loy:
Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation. ICCV 2023: 21881-21891
[c22]Menghao Li, Chunlei Wang, Wenquan Feng, Shuchang Lyu
, Guangliang Cheng, Xiangtai Li, Binghao Liu, Qi Zhao:
Iterative Robust Visual Grounding with Masked Reference based Centerpoint Supervision. ICCV (Workshops) 2023: 4653-4658
[c21]Yibo Yang, Haobo Yuan, Xiangtai Li, Zhouchen Lin, Philip H. S. Torr, Dacheng Tao:
Neural Collapse Inspired Feature-Classifier Alignment for Few-Shot Class-Incremental Learning. ICLR 2023
[c20]Zhongbin Fang, Xiangtai Li, Xia Li, Joachim M. Buhmann, Chen Change Loy, Mengyuan Liu:
Explore In-Context Learning for 3D Point Cloud Understanding. NeurIPS 2023
[c19]Jingkang Yang, Jun Cen, Wenxuan Peng, Shuai Liu, Fangzhou Hong, Xiangtai Li, Kaiyang Zhou, Qifeng Chen, Ziwei Liu:
4D Panoptic Scene Graph Generation. NeurIPS 2023
[i50]Jianzong Wu, Xiangtai Li, Henghui Ding
, Xia Li, Guangliang Cheng, Yunhai Tong, Chen Change Loy:
Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation. CoRR abs/2301.00805 (2023)
[i49]Xiangtai Li, Shilin Xu, Yibo Yang, Haobo Yuan
, Guangliang Cheng, Yunhai Tong, Zhouchen Lin, Dacheng Tao
:
PanopticPartFormer++: A Unified and Decoupled View for Panoptic Part Segmentation. CoRR abs/2301.00954 (2023)
[i48]Jiangning Zhang
, Xiangtai Li, Jian Li, Liang Liu, Zhucun Xue, Boshen Zhang, Zhengkai Jiang, Tianxin Huang, Yabiao Wang
, Chengjie Wang
:
Rethinking Mobile Block for Efficient Neural Models. CoRR abs/2301.01146 (2023)
[i47]Yue Han, Jiangning Zhang
, Zhucun Xue, Chao Xu, Xintian Shen, Yabiao Wang
, Chengjie Wang
, Yong Liu, Xiangtai Li:
Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation. CoRR abs/2301.01156 (2023)
[i46]Yibo Yang, Haobo Yuan
, Xiangtai Li, Zhouchen Lin, Philip H. S. Torr, Dacheng Tao
:
Neural Collapse Inspired Feature-Classifier Alignment for Few-Shot Class Incremental Learning. CoRR abs/2302.03004 (2023)
[i45]Xiangtai Li, Haobo Yuan
, Wenwei Zhang, Guangliang Cheng, Jiangmiao Pang, Chen Change Loy:
Tube-Link: A Flexible Cross Tube Baseline for Universal Video Segmentation. CoRR abs/2303.12782 (2023)
[i44]Xiangtai Li, Henghui Ding, Wenwei Zhang, Haobo Yuan
, Jiangmiao Pang, Guangliang Cheng, Kai Chen, Ziwei Liu, Chen Change Loy:
Transformer-Based Visual Segmentation: A Survey. CoRR abs/2304.09854 (2023)
[i43]Guangliang Cheng, Yunmeng Huang, Xiangtai Li, Shuchang Lyu, Zhaoyang Xu, Qi Zhao, Shiming Xiang:
Change Detection Methods for Remote Sensing in the Last Decade: A Comprehensive Review. CoRR abs/2305.05813 (2023)
[i42]Zhongbin Fang, Xiangtai Li, Xia Li
, Joachim M. Buhmann, Chen Change Loy, Mengyuan Liu:
Explore In-Context Learning for 3D Point Cloud Understanding. CoRR abs/2306.08659 (2023)
[i41]Jianzong Wu, Xiangtai Li, Shilin Xu, Haobo Yuan, Henghui Ding
, Yibo Yang, Xia Li
, Jiangning Zhang
, Yunhai Tong, Xudong Jiang, Bernard Ghanem, Dacheng Tao
:
Towards Open Vocabulary Learning: A Survey. CoRR abs/2306.15880 (2023)
[i40]Jinghao Wang, Zhengyu Wen, Xiangtai Li, Zujin Guo, Jingkang Yang, Ziwei Liu:
Pair then Relation: Pair-Net for Panoptic Scene Graph Generation. CoRR abs/2307.08699 (2023)
[i39]Menghao Li, Chunlei Wang
, Wenquan Feng, Shuchang Lyu, Guangliang Cheng, Xiangtai Li, Binghao Liu, Qi Zhao:
Iterative Robust Visual Grounding with Masked Reference based Centerpoint Supervision. CoRR abs/2307.12392 (2023)
[i38]Yibo Yang, Haobo Yuan
, Xiangtai Li, Jianlong Wu, Lefei Zhang, Zhouchen Lin, Philip H. S. Torr, Dacheng Tao
, Bernard Ghanem:
Neural Collapse Terminus: A Unified Solution for Class Incremental Learning and Its Variants. CoRR abs/2308.01746 (2023)
[i37]Jiahao Xie, Wei Li, Xiangtai Li, Ziwei Liu, Yew Soon Ong, Chen Change Loy:
MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation. CoRR abs/2309.13042 (2023)
[i36]Shilin Xu, Xiangtai Li, Size Wu, Wenwei Zhang, Yining Li, Guangliang Cheng, Yunhai Tong, Kai Chen, Chen Change Loy:
DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection. CoRR abs/2310.01393 (2023)
[i35]Size Wu, Wenwei Zhang, Lumin Xu, Sheng Jin, Xiangtai Li, Wentao Liu, Chen Change Loy:
CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction. CoRR abs/2310.01403 (2023)
[i34]Chunlei Wang, Wenquan Feng, Xiangtai Li, Guangliang Cheng, Shuchang Lyu
, Binghao Liu, Lijiang Chen, Qi Zhao:
OV-VG: A Benchmark for Open-Vocabulary Visual Grounding. CoRR abs/2310.14374 (2023)
[i33]Hao Zhou, Tiancheng Shen, Xu Yang, Hai Huang, Xiangtai Li, Lu Qi, Ming-Hsuan Yang:
Rethinking Evaluation Metrics of Open-Vocabulary Segmentaion. CoRR abs/2311.03352 (2023)
[i32]Jingkang Yang, Wenxuan Peng, Xiangtai Li, Zujin Guo, Liangyu Chen, Bo Li, Zheng Ma, Kaiyang Zhou, Wayne Zhang
, Chen Change Loy, Ziwei Liu:
Panoptic Video Scene Graph Generation. CoRR abs/2311.17058 (2023)
[i31]Yunhao Liu
, Lu Qi, Yu-Ju Tsai, Xiangtai Li, Kelvin C. K. Chan, Ming-Hsuan Yang:
Effective Adapter for Face Recognition in the Wild. CoRR abs/2312.01734 (2023)
[i30]Xinshun Wang, Zhongbin Fang, Xia Li
, Xiangtai Li, Chen Chen, Mengyuan Liu:
Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learning. CoRR abs/2312.03703 (2023)
[i29]Chong Zhou, Xiangtai Li, Chen Change Loy, Bo Dai:
EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM. CoRR abs/2312.06660 (2023)
[i28]Jiangning Zhang
, Xuhai Chen, Yabiao Wang
, Chengjie Wang
, Yong Liu, Xiangtai Li, Ming-Hsuan Yang, Dacheng Tao
:
Exploring Plain ViT Reconstruction for Multi-class Unsupervised Anomaly Detection. CoRR abs/2312.07495 (2023)
[i27]Peng Lu, Tao Jiang, Yining Li, Xiangtai Li, Kai Chen, Wenming Yang:
RTMO: Towards High-Performance One-Stage Real-Time Multi-Person Pose Estimation. CoRR abs/2312.07526 (2023)- 2022
[c18]Xiangtai Li, Wenwei Zhang, Jiangmiao Pang, Kai Chen, Guangliang Cheng, Yunhai Tong, Chen Change Loy:
Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation. CVPR 2022: 18825-18835
[c17]Shilin Xu, Xiangtai Li, Jingbo Wang, Guangliang Cheng, Yunhai Tong, Dacheng Tao
:
Fashionformer: A Simple, Effective and Unified Baseline for Human Fashion Segmentation and Recognition. ECCV (37) 2022: 545-563
[c16]Haobo Yuan
, Xiangtai Li, Yibo Yang, Guangliang Cheng, Jing Zhang, Yunhai Tong, Lefei Zhang, Dacheng Tao
:
PolyphonicFormer: Unified Query Learning for Depth-Aware Video Panoptic Segmentation. ECCV (27) 2022: 582-599
[c15]Xiangtai Li, Shilin Xu, Yibo Yang, Guangliang Cheng, Yunhai Tong, Dacheng Tao
:
Panoptic-PartFormer: Learning a Unified Model for Panoptic Part Segmentation. ECCV (27) 2022: 729-747
[c14]Shilin Xu, Xiangtai Li, Yibo Yang, Hongyang Li, Guangliang Cheng, Yunhai Tong:
Query Learning of Both Thing and Stuff for Panoptic Segmentation. ICIP 2022: 716-720
[c13]Yibo Yang, Shixiang Chen, Xiangtai Li, Liang Xie, Zhouchen Lin, Dacheng Tao:
Inducing Neural Collapse in Imbalanced Learning: Do We Really Need a Learnable Classifier at the End of Deep Neural Network? NeurIPS 2022
[i26]Qianyu Zhou, Xiangtai Li, Lu He, Yibo Yang, Guangliang Cheng, Yunhai Tong, Lizhuang Ma, Dacheng Tao:
TransVOD: End-to-end Video Object Detection with Spatial-Temporal Transformers. CoRR abs/2201.05047 (2022)
[i25]Yibo Yang, Liang Xie, Shixiang Chen, Xiangtai Li, Zhouchen Lin, Dacheng Tao
:
Do We Really Need a Learnable Classifier at the End of Deep Neural Network? CoRR abs/2203.09081 (2022)
[i24]Shilin Xu, Xiangtai Li, Jingbo B. Wang, Guangliang Cheng, Yunhai Tong, Dacheng Tao
:
Fashionformer: A simple, Effective and Unified Baseline for Human Fashion Segmentation and Recognition. CoRR abs/2204.04654 (2022)
[i23]Xiangtai Li, Shilin Xu, Yibo Yang, Guangliang Cheng, Yunhai Tong, Dacheng Tao
:
Panoptic-PartFormer: Learning a Unified Model for Panoptic Part Segmentation. CoRR abs/2204.04655 (2022)
[i22]Xiangtai Li, Wenwei Zhang, Jiangmiao Pang, Kai Chen, Guangliang Cheng, Yunhai Tong, Chen Change Loy:
Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation. CoRR abs/2204.04656 (2022)
[i21]Yangyang Xu, Xiangtai Li, Haobo Yuan
, Yibo Yang, Jing Zhang, Yunhai Tong, Lefei Zhang, Dacheng Tao:
Multi-Task Learning with Multi-query Transformer for Dense Prediction. CoRR abs/2205.14354 (2022)
[i20]Jiangning Zhang
, Xiangtai Li, Yabiao Wang
, Chengjie Wang
, Yibo Yang, Yong Liu, Dacheng Tao
:
EATFormer: Improving Vision Transformer Inspired by Evolutionary Algorithm. CoRR abs/2206.09325 (2022)
[i19]Xiangtai Li, Jiangning Zhang
, Yibo Yang, Guangliang Cheng, Kuiyuan Yang, Yunhai Tong, Dacheng Tao
:
SFNet: Faster, Accurate, and Domain Agnostic Semantic Segmentation via Semantic Flow. CoRR abs/2207.04415 (2022)
[i18]Jianzong Wu, Xiangtai Li, Xia Li
, Henghui Ding
, Yunhai Tong, Dacheng Tao
:
Towards Robust Referring Image Segmentation. CoRR abs/2209.09554 (2022)
[i17]Yujing Wang, Yaming Yang, Zhuo Li, Jiangang Bai, Mingliang Zhang, Xiangtai Li, Jing Yu, Ce Zhang, Gao Huang, Yunhai Tong:
Convolution-enhanced Evolving Attention Networks. CoRR abs/2212.08330 (2022)- 2021
[j2]Xiangtai Li
, Li Zhang, Guangliang Cheng
, Kuiyuan Yang
, Yunhai Tong, Xiatian Zhu
, Tao Xiang
:
Global Aggregation Then Local Distribution for Scene Parsing. IEEE Trans. Image Process. 30: 6829-6842 (2021)
[j1]Xiangtai Li
, Xia Li
, Ansheng You, Li Zhang, Guangliang Cheng
, Kuiyuan Yang
, Yunhai Tong, Zhouchen Lin
:
Towards Efficient Scene Understanding via Squeeze Reasoning. IEEE Trans. Image Process. 30: 7050-7063 (2021)
[c12]Xiangtai Li, Hao He, Xia Li
, Duo Li, Guangliang Cheng, Jianping Shi, Lubin Weng, Yunhai Tong, Zhouchen Lin:
PointFlow: Flowing Semantics Through Points for Aerial Image Segmentation. CVPR 2021: 4217-4226
[c11]Duo Li, Jie Hu, Changhu Wang, Xiangtai Li, Qi She, Lei Zhu, Tong Zhang, Qifeng Chen:
Involution: Inverting the Inherence of Convolution for Visual Recognition. CVPR 2021: 12321-12330
[c10]Hao He, Xiangtai Li, Guangliang Cheng, Jianping Shi, Yunhai Tong, Gaofeng Meng, Véronique Prinet, Lubin Weng:
Enhanced Boundary Learning for Glass-like Object Segmentation. ICCV 2021: 15839-15848
[c9]Chen Shi, Xiangtai Li, Yanran Wu
, Yunhai Tong, Yi Xu:
Dynamic Dual Sampling Module For Fine-Grained Semantic Segmentation. ICIP 2021: 2269-2273
[c8]Yanran Wu
, Xiangtai Li, Chen Shi, Yunhai Tong, Yang Hua
, Tao Song
, Ruhui Ma, Haibing Guan:
Fast and Accurate Scene Parsing via Bi-Direction Alignment Networks. ICIP 2021: 2508-2512
[c7]Lu He, Qianyu Zhou
, Xiangtai Li, Li Niu, Guangliang Cheng, Xiao Li, Wenxuan Liu, Yunhai Tong, Lizhuang Ma, Liqing Zhang:
End-to-End Video Object Detection with Spatial-Temporal Transformers. ACM Multimedia 2021: 1507-1516
[i16]Duo Li, Jie Hu, Changhu Wang, Xiangtai Li, Qi She, Lei Zhu, Tong Zhang, Qifeng Chen:
Involution: Inverting the Inherence of Convolution for Visual Recognition. CoRR abs/2103.06255 (2021)
[i15]Xiangtai Li, Hao He, Xia Li, Duo Li, Guangliang Cheng, Jianping Shi, Lubin Weng, Yunhai Tong, Zhouchen Lin:
PointFlow: Flowing Semantics Through Points for Aerial Image Segmentation. CoRR abs/2103.06564 (2021)
[i14]Hao He, Xiangtai Li, Guangliang Cheng, Jianping Shi, Yunhai Tong, Gaofeng Meng, Véronique Prinet, Lubin Weng:
Enhanced Boundary Learning for Glass-like Object Segmentation. CoRR abs/2103.15734 (2021)
[i13]Lu He, Qianyu Zhou, Xiangtai Li, Li Niu, Guangliang Cheng, Xiao Li, Wenxuan Liu, Yunhai Tong, Lizhuang Ma, Liqing Zhang:
End-to-End Video Object Detection with Spatial-Temporal Transformers. CoRR abs/2105.10920 (2021)
[i12]Yanran Wu, Xiangtai Li, Chen Shi, Yunhai Tong, Yang Hua, Tao Song, Ruhui Ma, Haibing Guan:
Fast and Accurate Scene Parsing via Bi-direction Alignment Networks. CoRR abs/2105.11651 (2021)
[i11]Chen Shi, Xiangtai Li, Yanran Wu, Yunhai Tong, Yi Xu:
Dynamic Dual Sampling Module for Fine-Grained Semantic Segmentation. CoRR abs/2105.11657 (2021)
[i10]Hao He, Xiangtai Li, Kuiyuan Yang, Guangliang Cheng, Jianping Shi, Yunhai Tong, Zhengjun Zha, Lubin Weng:
BoundarySqueeze: Image Segmentation as Boundary Squeezing. CoRR abs/2105.11668 (2021)
[i9]Xiangtai Li, Li Zhang, Guangliang Cheng, Kuiyuan Yang, Yunhai Tong, Xiatian Zhu, Tao Xiang:
Global Aggregation then Local Distribution for Scene Parsing. CoRR abs/2107.13154 (2021)
[i8]Xiangtai Li, Hao He, Henghui Ding, Kuiyuan Yang, Guangliang Cheng, Jianping Shi, Yunhai Tong:
Improving Video Instance Segmentation via Temporal Pyramid Routing. CoRR abs/2107.13155 (2021)
[i7]Haobo Yuan, Xiangtai Li, Yibo Yang, Guangliang Cheng, Jing Zhang, Yunhai Tong, Lefei Zhang, Dacheng Tao:
PolyphonicFormer: Unified Query Learning for Depth-aware Video Panoptic Segmentation. CoRR abs/2112.02582 (2021)- 2020
[c6]Xiangtai Li, Houlong Zhao, Lei Han, Yunhai Tong, Shaohua Tan, Kuiyuan Yang:
Gated Fully Fusion for Semantic Segmentation. AAAI 2020: 11418-11425
[c5]Xiangtai Li, Xia Li
, Li Zhang, Guangliang Cheng, Jianping Shi, Zhouchen Lin, Shaohua Tan, Yunhai Tong:
Improving Semantic Segmentation via Decoupled Body and Edge Supervision. ECCV (17) 2020: 435-452
[c4]Xiangtai Li, Ansheng You, Zhen Zhu
, Houlong Zhao, Maoke Yang, Kuiyuan Yang, Shaohua Tan, Yunhai Tong:
Semantic Flow for Fast and Accurate Scene Parsing. ECCV (1) 2020: 775-793
[i6]Xiangtai Li, Ansheng You, Zhen Zhu, Houlong Zhao, Maoke Yang, Kuiyuan Yang, Yunhai Tong:
Semantic Flow for Fast and Accurate Scene Parsing. CoRR abs/2002.10120 (2020)
[i5]Xiangtai Li, Xia Li, Li Zhang, Guangliang Cheng, Jianping Shi, Zhouchen Lin, Shaohua Tan, Yunhai Tong:
Improving Semantic Segmentation via Decoupled Body and Edge Supervision. CoRR abs/2007.10035 (2020)
[i4]Xiangtai Li, Xia Li, Ansheng You, Li Zhang, Guangliang Cheng, Kuiyuan Yang, Yunhai Tong, Zhouchen Lin:
Towards Efficient Scene Understanding via Squeeze Reasoning. CoRR abs/2011.03308 (2020)
2010 – 2019
- 2019
[c3]Xiangtai Li, Li Zhang, Ansheng You, Maoke Yang, Kuiyuan Yang, Yunhai Tong:
Global Aggregation then Local Distribution in Fully Convolutional Networks. BMVC 2019: 244
[c2]Li Zhang, Xiangtai Li, Anurag Arnab, Kuiyuan Yang, Yunhai Tong, Philip H. S. Torr:
Dual Graph Convolutional Network for Semantic Segmentation. BMVC 2019: 254
[c1]Xiangtai Li, Jiangang Bai, Kuiyuan Yang, Yunhai Tong:
Flow2Seg: Motion-Aided Semantic Segmentation. ICANN (3) 2019: 225-237
[i3]Xiangtai Li, Houlong Zhao, Lei Han, Yunhai Tong, Kuiyuan Yang:
GFF: Gated Fully Fusion for Semantic Segmentation. CoRR abs/1904.01803 (2019)
[i2]Li Zhang, Xiangtai Li, Anurag Arnab, Kuiyuan Yang, Yunhai Tong, Philip H. S. Torr:
Dual Graph Convolutional Network for Semantic Segmentation. CoRR abs/1909.06121 (2019)
[i1]Xiangtai Li, Li Zhang, Ansheng You, Maoke Yang, Kuiyuan Yang, Yunhai Tong:
Global Aggregation then Local Distribution in Fully Convolutional Networks. CoRR abs/1909.07229 (2019)
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from
to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the
of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from
,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from
and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from
.
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2026-01-03 00:44 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID







