Xiangtai Li

2020 – today

2025
[j25]
https://dblp.org/rec/journals/cviu/ZhangCWWLLYT25
Jiangning Zhang, Xuhai Chen, Yabiao Wang, Chengjie Wang, Yong Liu, Xiangtai Li, Ming-Hsuan Yang, Dacheng Tao:
Exploring plain ViT features for multi-class unsupervised visual anomaly detection. Comput. Vis. Image Underst. 253: 104308 (2025)
[j24]
https://dblp.org/rec/journals/ijcv/XieLLLOL25
Jiahao Xie, Wei Li, Xiangtai Li, Ziwei Liu, Yew Soon Ong, Chen Change Loy:
MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation. Int. J. Comput. Vis. 133(4): 1456-1475 (2025)
[j23]
https://dblp.org/rec/journals/ijcv/ZhouLLD25
Chong Zhou, Xiangtai Li, Chen Change Loy, Bo Dai:
EdgeSAM: Prompt-In-the-Loop Distillation for SAM. Int. J. Comput. Vis. 133(12): 8452-8468 (2025)
[j22]
https://dblp.org/rec/journals/pami/ZhouQSHYLY25
Hao Zhou, Lu Qi, Tiancheng Shen, Hai Huang, Xu Yang, Xiangtai Li, Ming-Hsuan Yang:
Rethinking Evaluation Metrics of Open-Vocabulary Segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 47(8): 6780-6796 (2025)
[j21]
https://dblp.org/rec/journals/pami/ZhangHHXWWLLT25
Jiangning Zhang, Teng Hu, Haoyang He, Zhucun Xue, Yabiao Wang, Chengjie Wang, Yong Liu, Xiangtai Li, Dacheng Tao:
EMOv2: Pushing 5M Vision Model Frontier. IEEE Trans. Pattern Anal. Mach. Intell. 47(11): 10560-10576 (2025)
[j20]
https://dblp.org/rec/journals/tcsv/WangFLCLLZ25
Chunlei Wang, Wenquan Feng, Shuchang Lyu, Guangliang Cheng, Xiangtai Li, Binghao Liu, Qi Zhao:
A Masked Reference Token Supervision-Based Iterative Visual-Language Framework for Robust Visual Grounding. IEEE Trans. Circuits Syst. Video Technol. 35(1): 75-90 (2025)
[j19]
https://dblp.org/rec/journals/tcsv/XuLWZTL25
Shilin Xu, Xiangtai Li, Size Wu, Wenwei Zhang, Yunhai Tong, Chen Change Loy:
DST-Det: Open-Vocabulary Object Detection via Dynamic Self-Training. IEEE Trans. Circuits Syst. Video Technol. 35(5): 5037-5050 (2025)
[c66]
https://dblp.org/rec/conf/3dim/HuangLQYY25
Kuan-Chih Huang, Xiangtai Li, Lu Qi, Shuicheng Yan, Ming-Hsuan Yang:
Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model. 3DV 2025: 1177-1186
[c65]
https://dblp.org/rec/conf/aaai/HeZPHLWW25
Qingdong He, Jiangning Zhang, Jinlong Peng, Haoyang He, Xiangtai Li, Yabiao Wang, Chengjie Wang:
PointRWKV: Efficient RWKV-Like Model for Hierarchical Point Cloud Learning. AAAI 2025: 3410-3418
[c64]
https://dblp.org/rec/conf/aaai/WangLDQZTLY25
Chaoyang Wang, Xiangtai Li, Henghui Ding, Lu Qi, Jiangning Zhang, Yunhai Tong, Chen Change Loy, Shuicheng Yan:
Explore In-Context Segmentation via Latent Diffusion Models. AAAI 2025: 7545-7553
[c63]
https://dblp.org/rec/conf/aaai/Yang0SLLLMY25
Hao Yang, Qianyu Zhou, Haijia Sun, Xiangtai Li, Fengqi Liu, Xuequan Lu, Lizhuang Ma, Shuicheng Yan:
PointDGMamba: Domain Generalization of Point Cloud Classification via Generalized State Space Model. AAAI 2025: 9193-9201
[c62]
https://dblp.org/rec/conf/aaai/ZhangYQZ0JYL25
Tao Zhang, Haobo Yuan, Lu Qi, Jiangning Zhang, Qianyu Zhou, Shunping Ji, Shuicheng Yan, Xiangtai Li:
Point Cloud Mamba: Point Cloud Learning via State Space Model. AAAI 2025: 10121-10130
[c61]
https://dblp.org/rec/conf/cvpr/0082JJWLWYL0WTT25
Xin Li, Yeying Jin, Xin Jin, Zongwei Wu, Bingchen Li, Yufei Wang, Wenhan Yang, Yu Li, Zhibo Chen, Bihan Wen, Robby T. Tan, Radu Timofte, Qiyu Rong, Hongyuan Jing, Mengmeng Zhang, Jinglong Li, Xiangyu Lu, Yi Ren, Yuting Liu, Meng Zhang, Xiang Chen, Qiyuan Guan, Jiangxin Dong, Jinshan Pan, Conglin Gou, Qirui Yang, Fangpu Zhang, Yunlong Lin, Sixiang Chen, Guoxi Huang, Ruirui Lin, Yan Zhang, Jingyu Yang, Huanjing Yue, Jiyuan Chen, Qiaosi Yi, Hongjun Wang, Chenxi Xie, Shuai Li, Yuhui Wu, Kaiyi Ma, Jiakui Hu, Juncheng Li, Liwen Pan, Guangwei Gao, Wenjie Li, Zhenyu Jin, Heng Guo, Zhanyu Ma, Yubo Wang, Jinghua Wang, Wangzhi Xing, Anjusree Karnavar, Diqi Chen, Mohammad Aminul Islam, Hao Yang, Ruikun Zhang, Liyuan Pan, Qianhao Luo, Xin Cao, Han Zhou, Yan Min, Wei Dong, Jun Chen, Taoyi Wu, Weijia Dou, Yu Wang, Shengjie Zhao, Yongcheng Huang, Xingyu Han, Anyan Huang, Hongtao Wu, Hong Wang, Yefeng Zheng, Abhijeet Kumar, Aman Kumar, Marcos V. Conde, Paula Garrido, Daniel Feijoo, Juan C. Benito, Guanglu Dong, Xin Lin, Siyuan Liu, Tianheng Zheng, Jiayu Zhong, Shouyi Wang, Xiangtai Li, Lanqing Guo, Lu Qi, Chao Ren, Shuaibo Wang, Shilong Zhang, Wanyu Zhou, Yunze Wu, Qinzhong Tan, Jieyuan Pei, Zhuoxuan Li, Jiayu Wang, Haoyu Bian, Haoran Sun, Subhajit Paul, Ni Tang, Junhao Huang, Zihan Cheng, Hongyun Zhu, Yuehan Wu, Kaixin Deng, Huang Ouyang, Tianxin Xiao, Fan Yang, Zhizun Luo, Zeyu Xiao, Zhuoyuan Li, Pham Hoang Le Nguyen, Dinh Thien An, Luu Thanh Son, Kiet Van Nguyen, Ronghua Xu, Xianmin Tian, Weijian Zhou, Jiacheng Zhang, Yuqian Chen, Yihang Duan, Yujie Wu, Suresh Raikwar, Arsh Garg, Kritika Kritika, Jianhua Zheng, Xiaoshan Ma, Ruolin Zhao, Yongyu Yang, Yongsheng Liang, Guiming Huang, Qiang Li, Hongbin Zhang, Xiangyu Zheng, A. N. Rajagopalan:
NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results. CVPR Workshops 2025: 1172-1183
[c60]
https://dblp.org/rec/conf/cvpr/ShiQWB0TL25
Qingyu Shi, Lu Qi, Jianzong Wu, Jinbin Bai, Jingbo Wang, Yunhai Tong, Xiangtai Li:
DreamRelation: Bridging Customization and Relation Generation. CVPR 2025: 15723-15732
[c59]
https://dblp.org/rec/conf/cvpr/ChenLLZWZ025
Yicheng Chen, Xiangtai Li, Yining Li, Yanhong Zeng, Jianzong Wu, Xiangyu Zhao, Kai Chen:
Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language. CVPR 2025: 19952-19962
[c58]
https://dblp.org/rec/conf/cvpr/Wu0YL0ZC25
Shengqiong Wu, Hao Fei, Jingkang Yang, Xiangtai Li, Juncheng Li, Hanwang Zhang, Tat-Seng Chua:
Learning 4D Panoptic Scene Graph Generation from Rich 2D Visual Scene. CVPR 2025: 24539-24549
[c57]
https://dblp.org/rec/conf/cvpr/WuT0ZLT25
Jianzong Wu, Chao Tang, Jingbo Wang, Yanhong Zeng, Xiangtai Li, Yunhai Tong:
DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation. CVPR 2025: 28684-28693
[c56]
https://dblp.org/rec/conf/cvpr/HuangHLH00W0C25
Zhenglin Huang, Jinwei Hu, Xiangtai Li, Yiwei He, Xingyu Zhao, Bei Peng, Baoyuan Wu, Xiaowei Huang, Guangliang Cheng:
SIDA: Social Media Image Deepfake Detection, Localization and Explanation with Large Multimodal Model. CVPR 2025: 28831-28841
[c55]
https://dblp.org/rec/conf/cvpr/YangQLLJ025
Lehan Yang, Lu Qi, Xiangtai Li, Sheng Li, Varun Jampani, Ming-Hsuan Yang:
Unified Dense Prediction of Video Diffusion. CVPR 2025: 28963-28973
[c54]
https://dblp.org/rec/conf/iclr/Bai0CSCL00Y25
Jinbin Bai, Tian Ye, Wei Chow, Enxin Song, Qing-Guo Chen, Xiangtai Li, Zhen Dong, Lei Zhu, Shuicheng Yan:
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis. ICLR 2025
[c53]
https://dblp.org/rec/conf/iclr/SunCLYL0XG25
Peiwen Sun, Sitong Cheng, Xiangtai Li, Zhen Ye, Huadai Liu, Honggang Zhang, Wei Xue, Yike Guo:
Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation. ICLR 2025
[c52]
https://dblp.org/rec/conf/iclr/Wu0LJZCY25
Shengqiong Wu, Hao Fei, Xiangtai Li, Jiayi Ji, Hanwang Zhang, Tat-Seng Chua, Shuicheng Yan:
Towards Semantic Equivalence of Tokenization in Multimodal LLM. ICLR 2025
[c51]
https://dblp.org/rec/conf/iclr/XuYSQWYL0TGL025
Shilin Xu, Haobo Yuan, Qingyu Shi, Lu Qi, Jingbo Wang, Yibo Yang, Yining Li, Kai Chen, Yunhai Tong, Bernard Ghanem, Xiangtai Li, Ming-Hsuan Yang:
RMP-SAM: Towards Real-Time Multi-Purpose Segment Anything. ICLR 2025
[c50]
https://dblp.org/rec/conf/iclr/YueLLZLQW025
Jingtong Yue, Zhiwei Lin, Xin Lin, Xiaoyu Zhou, Xiangtai Li, Lu Qi, Yongtao Wang, Ming-Hsuan Yang:
RobuRCDet: Enhancing Robustness of Radar-Camera Fusion in Bird's Eye View for 3D Object Detection. ICLR 2025
[c49]
https://dblp.org/rec/conf/icml/000100L0LWWZMSZ25
Hao Fei, Yuan Zhou, Juncheng Li, Xiangtai Li, Qingshan Xu, Bobo Li, Shengqiong Wu, Yaoting Wang, Junbao Zhou, Jiahao Meng, Qingyu Shi, Zhiyuan Zhou, Liangtao Shi, Minghe Gao, Daoan Zhang, Zhiqi Ge, Siliang Tang, Kaihang Pan, Yaobo Ye, Haobo Yuan, Tao Zhang, Weiming Wu, Tianjie Ju, Zixiang Meng, Shilin Xu, Liyu Jia, Wentao Hu, Meng Luo, Jiebo Luo, Tat-Seng Chua, Shuicheng Yan, Hanwang Zhang:
On Path to Multimodal Generalist: General-Level and General-Bench. ICML 2025
[c48]
https://dblp.org/rec/conf/icml/00140FQL0L25
Hao Zhou, Xu Yang, Mingyu Fan, Lu Qi, Xiangtai Li, Ming-Hsuan Yang, Fei Luo:
Three-Dimensional Trajectory Prediction with 3DMoTraj Dataset. ICML 2025
[c47]
https://dblp.org/rec/conf/icml/LiuLLJSW000LZY025
Huadai Liu, Tianyi Luo, Kaicheng Luo, Qikai Jiang, Peiwen Sun, Jialei Wang, Rongjie Huang, Qian Chen, Wen Wang, Xiangtai Li, Shiliang Zhang, Zhijie Yan, Zhou Zhao, Wei Xue:
OmniAudio: Generating Spatial Audio from 360-Degree Video. ICML 2025
[i143]
https://dblp.org/rec/journals/corr/abs-2501-04001
Haobo Yuan, Xiangtai Li, Tao Zhang, Zilong Huang, Shilin Xu, Shunping Ji, Yunhai Tong, Lu Qi, Jiashi Feng, Ming-Hsuan Yang:
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos. CoRR abs/2501.04001 (2025)
[i142]
https://dblp.org/rec/journals/corr/abs-2501-04670
Yikang Zhou, Tao Zhang, Shilin Xu, Shihao Chen, Qianyu Zhou, Yunhai Tong, Shunping Ji, Jiangning Zhang, Xiangtai Li, Lu Qi:
Are They the Same? Exploring Visual Correspondence Shortcomings of Multimodal LLMs. CoRR abs/2501.04670 (2025)
[i141]
https://dblp.org/rec/journals/corr/abs-2502-03035
Yu Qiu, Xin Lin, Jingbo Wang, Xiangtai Li, Lu Qi, Ming-Hsuan Yang:
UMC: Unified Resilient Controller for Legged Robots with Joint Malfunctions. CoRR abs/2502.03035 (2025)
[i140]
https://dblp.org/rec/journals/corr/abs-2502-13071
Jingtong Yue, Zhiwei Lin, Xin Lin, Xiaoyu Zhou, Xiangtai Li, Lu Qi, Yongtao Wang, Ming-Hsuan Yang:
RobuRCDet: Enhancing Robustness of Radar-Camera Fusion in Bird's Eye View for 3D Object Detection. CoRR abs/2502.13071 (2025)
[i139]
https://dblp.org/rec/journals/corr/abs-2503-09344
Lehan Yang, Lu Qi, Xiangtai Li, Sheng Li, Varun Jampani, Ming-Hsuan Yang:
Unified Dense Prediction of Video Diffusion. CoRR abs/2503.09344 (2025)
[i138]
https://dblp.org/rec/journals/corr/abs-2503-15019
Shengqiong Wu, Hao Fei, Jingkang Yang, Xiangtai Li, Juncheng Li, Hanwang Zhang, Tat-Seng Chua:
Learning 4D Panoptic Scene Graph Generation from Rich 2D Visual Scene. CoRR abs/2503.15019 (2025)
[i137]
https://dblp.org/rec/journals/corr/abs-2503-17350
Qingyu Shi, Jianzong Wu, Jinbin Bai, Jiangning Zhang, Lu Qi, Xiangtai Li, Yunhai Tong:
Decouple and Track: Benchmarking and Improving Video Diffusion Transformers for Motion Transfer. CoRR abs/2503.17350 (2025)
[i136]
https://dblp.org/rec/journals/corr/abs-2504-00476
Haobo Yuan, Tao Zhang, Xiangtai Li, Lu Qi, Zilong Huang, Shilin Xu, Jiashi Feng, Ming-Hsuan Yang:
4th PVUW MeViS 3rd Place Report: Sa2VA. CoRR abs/2504.00476 (2025)
[i135]
https://dblp.org/rec/journals/corr/abs-2504-02272
Shaocong Long, Qianyu Zhou, Xiangtai Li, Chenhao Ying, Yunhai Tong, Lizhuang Ma, Yuan Luo, Dacheng Tao:
Generative Classifier for Domain Generalization. CoRR abs/2504.02272 (2025)
[i134]
https://dblp.org/rec/journals/corr/abs-2504-05979
Sixiang Chen, Jinbin Bai, Zhuoran Zhao, Tian Ye, Qingyu Shi, Donghao Zhou, Wenhao Chai, Xin Lin, Jianzong Wu, Chao Tang, Shilin Xu, Tao Zhang, Haobo Yuan, Yikang Zhou, Wei Chow, Linfeng Li, Xiangtai Li, Lei Zhu, Lu Qi:
An Empirical Study of GPT-4o Image Generation Capabilities. CoRR abs/2504.05979 (2025)
[i133]
https://dblp.org/rec/journals/corr/abs-2504-10462
Weixian Lei, Jiacong Wang, Haochen Wang, Xiangtai Li, Jun Hao Liew, Jiashi Feng, Zilong Huang:
The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer. CoRR abs/2504.10462 (2025)
[i132]
https://dblp.org/rec/journals/corr/abs-2504-10465
Tao Zhang, Xiangtai Li, Zilong Huang, Yanwei Li, Weixian Lei, Xueqing Deng, Shihao Chen, Shunping Ji, Jiashi Feng:
Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding. CoRR abs/2504.10465 (2025)
[i131]
https://dblp.org/rec/journals/corr/abs-2504-11326
Henghui Ding, Chang Liu, Nikhila Ravi, Shuting He, Yunchao Wei, Song Bai, Philip Torr, Kehuan Song, Xinglin Xie, Kexin Zhang, Licheng Jiao, Lingling Li, Shuyuan Yang, Xuqiang Cao, Linnan Zhao, Jiaxuan Zhao, Fang Liu, Mengjiao Wang, Junpei Zhang, Xu Liu, Yuting Yang, Mengru Ma, Hao Fang, Runmin Cong, Xiankai Lu, Zhiyang Chen, Wei Zhang, Tianming Liang, Haichao Jiang, Wei-Shi Zheng, Jian-Fang Hu, Haobo Yuan, Xiangtai Li, Tao Zhang, Lu Qi, Ming-Hsuan Yang:
PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild. CoRR abs/2504.11326 (2025)
[i130]
https://dblp.org/rec/journals/corr/abs-2504-12080
Mengshi Qi, Pengfei Zhu, Xiangtai Li, Xiaoyang Bi, Lu Qi, Huadong Ma, Ming-Hsuan Yang:
DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency. CoRR abs/2504.12080 (2025)
[i129]
https://dblp.org/rec/journals/corr/abs-2504-12711
Xin Li, Yeying Jin, Xin Jin, Zongwei Wu, Bingchen Li, Yufei Wang, Wenhan Yang, Yu Li, Zhibo Chen, Bihan Wen, Robby T. Tan, Radu Timofte, Qiyu Rong, Hongyuan Jing, Mengmeng Zhang, Jinglong Li, Xiangyu Lu, Yi Ren, Yuting Liu, Meng Zhang, Xiang Chen, Qiyuan Guan, Jiangxin Dong, Jinshan Pan, Conglin Gou, Qirui Yang, Fangpu Zhang, Yunlong Lin, Sixiang Chen, Guoxi Huang, Ruirui Lin, Yan Zhang, Jingyu Yang, Huanjing Yue, Jiyuan Chen, Qiaosi Yi, Hongjun Wang, Chenxi Xie, Shuai Li, Yuhui Wu, Kaiyi Ma, Jiakui Hu, Juncheng Li, Liwen Pan, Guangwei Gao, Wenjie Li, Zhenyu Jin, Heng Guo, Zhanyu Ma, Yubo Wang, Jinghua Wang, Wangzhi Xing, Anjusree Karnavar, Diqi Chen, Mohammad Aminul Islam, Hao Yang, Ruikun Zhang, Liyuan Pan, Qianhao Luo, Xin Cao, Han Zhou, Yan Min, Wei Dong, Jun Chen, Taoyi Wu, Weijia Dou, Yu Wang, Shengjie Zhao, Yongcheng Huang, Xingyu Han, Anyan Huang, Hongtao Wu, Hong Wang, Yefeng Zheng, Abhijeet Kumar, Aman Kumar, Marcos V. Conde, Paula Garrido, Daniel Feijoo, Juan C. Benito, Guanglu Dong, Xin Lin, Siyuan Liu, Tianheng Zheng, Jiayu Zhong, Shouyi Wang, Xiangtai Li, Lanqing Guo, Lu Qi, Chao Ren, Shuaibo Wang, Shilong Zhang, Wanyu Zhou, Yunze Wu, Qinzhong Tan, Jieyuan Pei, Zhuoxuan Li, Jiayu Wang, Haoyu Bian, Haoran Sun:
NTIRE 2025 Challenge on Day and Night Raindrop Removal for Dual-Focused Images: Methods and Results. CoRR abs/2504.12711 (2025)
[i128]
https://dblp.org/rec/journals/corr/abs-2504-14906
Huadai Liu, Tianyi Luo, Qikai Jiang, Kaicheng Luo, Peiwen Sun, Jialei Wang, Rongjie Huang, Qian Chen, Wen Wang, Xiangtai Li, Shiliang Zhang, Zhijie Yan, Zhou Zhao, Wei Xue:
OmniAudio: Generating Spatial Audio from 360-Degree Video. CoRR abs/2504.14906 (2025)
[i127]
https://dblp.org/rec/journals/corr/abs-2505-00630
Muyi Bao, Shuchang Lyu, Zhaoyang Xu, Huiyu Zhou, Jinchang Ren, Shiming Xiang, Xiangtai Li, Guangliang Cheng:
Vision Mamba in Remote Sensing: A Comprehensive Survey of Techniques, Applications and Outlook. CoRR abs/2505.00630 (2025)
[i126]
https://dblp.org/rec/journals/corr/abs-2505-04620
Hao Fei, Yuan Zhou, Juncheng Li, Xiangtai Li, Qingshan Xu, Bobo Li, Shengqiong Wu, Yaoting Wang, Junbao Zhou, Jiahao Meng, Qingyu Shi, Zhiyuan Zhou, Liangtao Shi, Minghe Gao, Daoan Zhang, Zhiqi Ge, Weiming Wu, Siliang Tang, Kaihang Pan, Yaobo Ye, Haobo Yuan, Tao Zhang, Tianjie Ju, Zixiang Meng, Shilin Xu, Liyu Jia, Wentao Hu, Meng Luo, Jiebo Luo, Tat-Seng Chua, Shuicheng Yan, Hanwang Zhang:
On Path to Multimodal Generalist: General-Level and General-Bench. CoRR abs/2505.04620 (2025)
[i125]
https://dblp.org/rec/journals/corr/abs-2505-12620
Haiquan Wen, Yiwei He, Zhenglin Huang, Tianxiao Li, Zihan Yu, Xingru Huang, Lu Qi, Baoyuan Wu, Xiangtai Li, Guangliang Cheng:
BusterX: MLLM-Powered AI-Generated Video Forgery Detection and Explanation. CoRR abs/2505.12620 (2025)
[i124]
https://dblp.org/rec/journals/corr/abs-2505-16862
Chaoyang Wang, Xiangtai Li, Lu Qi, Xiaofan Lin, Jinbin Bai, Qianyu Zhou, Yunhai Tong:
Conditional Panoramic Image Generation via Masked Autoregressive Modeling. CoRR abs/2505.16862 (2025)
[i123]
https://dblp.org/rec/journals/corr/abs-2505-18660
Zhenglin Huang, Tianxiao Li, Xiangtai Li, Haiquan Wen, Yiwei He, Jiangning Zhang, Hao Fei, Xi Yang, Xiaowei Huang, Bei Peng, Guangliang Cheng:
So-Fake: Benchmarking and Explaining Social Media Image Forgery Detection. CoRR abs/2505.18660 (2025)
[i122]
https://dblp.org/rec/journals/corr/abs-2505-21541
Zitong Wang, Hang Zhao, Qianyu Zhou, Xuequan Lu, Xiangtai Li, Yiren Song:
DiffDecompose: Layer-Wise Decomposition of Alpha-Composited Images via Diffusion Transformers. CoRR abs/2505.21541 (2025)
[i121]
https://dblp.org/rec/journals/corr/abs-2505-23606
Qingyu Shi, Jinbin Bai, Zhuoran Zhao, Wenhao Chai, Kaidong Yu, Jianzong Wu, Shuangyong Song, Yunhai Tong, Xiangtai Li, Xuelong Li, Shuicheng Yan:
Muddit: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model. CoRR abs/2505.23606 (2025)
[i120]
https://dblp.org/rec/journals/corr/abs-2505-23727
Song Wang, Gongfan Fang, Lingdong Kong, Xiangtai Li, Jianyun Xu, Sheng Yang, Qiang Li, Jianke Zhu, Xinchao Wang:
PixelThink: Towards Efficient Chain-of-Pixel Reasoning. CoRR abs/2505.23727 (2025)
[i119]
https://dblp.org/rec/journals/corr/abs-2505-24164
Shilin Xu, Yanwei Li, Rui Yang, Tao Zhang, Yueyi Sun, Wei Chow, Linfeng Li, Hang Song, Qi Xu, Yunhai Tong, Xiangtai Li, Hao Fei:
Mixed-R1: Unified Reward Perspective For Reasoning Capability in Multimodal Large Language Models. CoRR abs/2505.24164 (2025)
[i118]
https://dblp.org/rec/journals/corr/abs-2506-03144
Wei Chow, Yuan Gao, Linfeng Li, Xian Wang, Qi Xu, Hang Song, Lingdong Kong, Ran Zhou, Yi Zeng, Yidong Cai, Botian Jiang, Shilin Xu, Jiajun Zhang, Minghui Qiu, Xiangtai Li, Tianshu Yang, Siliang Tang, Juncheng Li:
MERIT: Multilingual Semantic Retrieval with Interleaved Multi-Condition Query. CoRR abs/2506.03144 (2025)
[i117]
https://dblp.org/rec/journals/corr/abs-2506-07971
Jiahao Meng, Shuyang Sun, Yue Tan, Lu Qi, Yunhai Tong, Xiangtai Li, Longyin Wen:
CyberV: Cybernetics for Test-time Scaling in Video Understanding. CoRR abs/2506.07971 (2025)
[i116]
https://dblp.org/rec/journals/corr/abs-2506-13589
Zhucun Xue, Jiangning Zhang, Xurong Xie, Yuxuan Cai, Yong Liu, Xiangtai Li, Dacheng Tao:
AdaVideoRAG: Omni-Contextual Adaptive Retrieval-Augmented Efficient Long Video Understanding. CoRR abs/2506.13589 (2025)
[i115]
https://dblp.org/rec/journals/corr/abs-2506-13691
Zhucun Xue, Jiangning Zhang, Teng Hu, Haoyang He, Yinan Chen, Yuxuan Cai, Yabiao Wang, Chengjie Wang, Yong Liu, Xiangtai Li, Dacheng Tao:
UltraVideo: High-Quality UHD Video Dataset with Comprehensive Captions. CoRR abs/2506.13691 (2025)
[i114]
https://dblp.org/rec/journals/corr/abs-2506-14471
Yikang Zhou, Tao Zhang, Dizhe Zhang, Shunping Ji, Xiangtai Li, Lu Qi:
Dense360: Dense Understanding from Omnidirectional Panoramas. CoRR abs/2506.14471 (2025)
[i113]
https://dblp.org/rec/journals/corr/abs-2506-22930
Yiwei He, Xiangtai Li, Zhenglin Huang, Yi Dong, Hao Fei, Jiangning Zhang, Baoyuan Wu, Guangliang Cheng:
Towards Explainable Bilingual Multimodal Misinformation Detection and Localization. CoRR abs/2506.22930 (2025)
[i112]
https://dblp.org/rec/journals/corr/abs-2506-24102
Xiangtai Li, Tao Zhang, Yanwei Li, Haobo Yuan, Shihao Chen, Yikang Zhou, Jiahao Meng, Yueyi Sun, Shilin Xu, Lu Qi, Tianheng Cheng, Yi Lin, Zilong Huang, Wenhao Huang, Jiashi Feng, Guang Shi:
DenseWorld-1M: Towards Detailed Dense Grounded Caption in the Real World. CoRR abs/2506.24102 (2025)
[i111]
https://dblp.org/rec/journals/corr/abs-2507-01908
Qingdong He, Xueqin Chen, Chaoyi Wang, Yanjie Pan, Xiaobin Hu, Zhenye Gan, Yabiao Wang, Chengjie Wang, Xiangtai Li, Jiangning Zhang:
Reasoning to Edit: Hypothetical Instruction-Based Image Editing with Visual Reasoning. CoRR abs/2507.01908 (2025)
[i110]
https://dblp.org/rec/journals/corr/abs-2507-07999
Haochen Wang, Xiangtai Li, Zilong Huang, Anran Wang, Jiacong Wang, Tao Zhang, Jiani Zheng, Sule Bai, Zijian Kang, Jiashi Feng, Zhuochen Wang, Zhaoxiang Zhang:
Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology. CoRR abs/2507.07999 (2025)
[i109]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2507-11003
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2507-11003
Yuhu Bai, Jiangning Zhang, Yunkang Cao, Guangyuan Lu, Qingdong He, Xiangtai Li, Guanzhong Tian:
Bridge Feature Matching and Cross-Modal Alignment with Mutual-filtering for Zero-shot Anomaly Detection. CoRR abs/2507.11003 (2025)
[i108]
  dblp key:
  - journals/corr/abs-2508-10897
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2508-10897
Mengyuan Liu, Xinshun Wang, Zhongbin Fang, Deheng Ye, Xia Li, Tao Tang, Songtao Wu, Xiangtai Li, Ming-Hsuan Yang:
Human-in-Context: Unified Cross-Domain 3D Human Motion Modeling via In-Context Learning. CoRR abs/2508.10897 (2025)
[i107]
  dblp key:
  - journals/corr/abs-2508-12081
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2508-12081
Haidong Xu, Guangwei Xu, Zhedong Zheng, Xiatian Zhu, Wei Ji, Xiangtai Li, Ruijie Guo, Meishan Zhang, Min Zhang, Hao Fei:
VimoRAG: Video-based Retrieval-augmented 3D Motion Generation for Motion Language Models. CoRR abs/2508.12081 (2025)
[i106]
  dblp key:
  - journals/corr/abs-2508-20835
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2508-20835
Hao Yang, Qianyu Zhou, Haijia Sun, Xiangtai Li, Xuequan Lu, Lizhuang Ma, Shuicheng Yan:
PointDGRWKV: Generalizing RWKV-like Architecture to Unseen Domains for Point Cloud Classification. CoRR abs/2508.20835 (2025)
[i105]
  dblp key:
  - journals/corr/abs-2509-04444
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-04444
Xin Lin, Xian Ge, Dizhe Zhang, Zhaoliang Wan, Xianshun Wang, Xiangtai Li, Wenjie Jiang, Bo Du, Dacheng Tao, Ming-Hsuan Yang, Lu Qi:
One Flight Over the Gap: A Survey from Perspective to Panoramic Vision. CoRR abs/2509.04444 (2025)
[i104]
  dblp key:
  - journals/corr/abs-2509-16972
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2509-16972
Quanzhu Niu, Dengxian Gong, Shihao Chen, Tao Zhang, Yikang Zhou, Haobo Yuan, Lu Qi, Xiangtai Li, Shunping Ji:
The 1st Solution for 7th LSVOS RVOS Track: SaSaSa2VA. CoRR abs/2509.16972 (2025)
[i103]
  dblp key:
  - journals/corr/abs-2510-08566
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-08566
Meixi Song, Xin Lin, Dizhe Zhang, Haodong Li, Xiangtai Li, Bo Du, Lu Qi:
D²GS: Depth-and-Density Guided Gaussian Splatting for Stable and Accurate Sparse-View Reconstruction. CoRR abs/2510.08566 (2025)
[i102]
  dblp key:
  - journals/corr/abs-2510-11063
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-11063
Chang Liu, Henghui Ding, Kaining Ying, Lingyi Hong, Ning Xu, Linjie Yang, Yuchen Fan, Mingqi Gao, Jingkun Chen, Yunqi Miao, Gengshen Wu, Zhijin Qin, Jungong Han, Zhixiong Zhang, Shuangrui Ding, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Jiaqi Wang, Chang Soo Lim, Joonyoung Moon, Donghyeon Cho, Tingmin Li, Yixuan Li, Yang Yang, An Yan, Leilei Cao, Feng Lu, Ran Hong, Youhai Jiang, Fengjie Zhu, Yujie Xie, Hongyang Zhang, Zhihui Liu, Shihai Ruan, Quanzhu Niu, Dengxian Gong, Shihao Chen, Tao Zhang, Yikang Zhou, Haobo Yuan, Lu Qi, Xiangtai Li, Shunping Ji, Alexey Nekrasov, Ali Athar, Daan de Geus, Alexander Hermans, Bastian Leibe:
LSVOS 2025 Challenge Report: Recent Advances in Complex Video Object Segmentation. CoRR abs/2510.11063 (2025)
[i101]
  dblp key:
  - journals/corr/abs-2510-11712
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-11712
Haoran Feng, Dizhe Zhang, Xiangtai Li, Bo Du, Lu Qi:
DiT360: High-Fidelity Panoramic Image Generation via Hybrid Training. CoRR abs/2510.11712 (2025)
[i100]
  dblp key:
  - journals/corr/abs-2510-18876
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-18876
Haochen Wang, Yuhao Wang, Tao Zhang, Yikang Zhou, Yanwei Li, Jiacong Wang, Jiani Zheng, Ye Tian, Jiahao Meng, Zilong Huang, Guangcan Mai, Anran Wang, Yunhai Tong, Zhuochen Wang, Xiangtai Li, Zhaoxiang Zhang:
Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs. CoRR abs/2510.18876 (2025)
[i99]
  dblp key:
  - journals/corr/abs-2510-20579
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-20579
Jiahao Meng, Xiangtai Li, Haochen Wang, Yue Tan, Tao Zhang, Lingdong Kong, Yunhai Tong, Anran Wang, Zhiyang Teng, Yujing Wang, Zhuochen Wang:
Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence. CoRR abs/2510.20579 (2025)
[i98]
  dblp key:
  - journals/corr/abs-2510-20668
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-20668
Jinbin Bai, Yu Lei, Hecong Wu, Yuchen Zhu, Shufan Li, Yi Xin, Xiangtai Li, Molei Tao, Aditya Grover, Ming-Hsuan Yang:
From Masks to Worlds: A Hitchhiker's Guide to World Models. CoRR abs/2510.20668 (2025)
[i97]
  dblp key:
  - journals/corr/abs-2510-25682
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-25682
Jiani Zheng, Zhiyang Teng, Xiangtai Li, Anran Wang, Yu Tian, Kunpeng Qiu, Ye Tian, Haochen Wang, Zhuochen Wang:
PairUni: Pairwise Training for Unified Multimodal Language Models. CoRR abs/2510.25682 (2025)
[i96]
  dblp key:
  - journals/corr/abs-2510-26802
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2510-26802
Ziyu Guo, Xinyan Chen, Renrui Zhang, Ruichuan An, Yu Qi, Dongzhi Jiang, Xiangtai Li, Manyuan Zhang, Hongsheng Li, Pheng-Ann Heng:
Are Video Models Ready as Zero-Shot Reasoners? An Empirical Study with the MME-CoF Benchmark. CoRR abs/2510.26802 (2025)
[i95]
  dblp key:
  - journals/corr/abs-2511-05491
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2511-05491
Rui Yang, Ziyu Zhu, Yanwei Li, Jingjia Huang, Shen Yan, Siyuan Zhou, Zhe Liu, Xiangtai Li, Shuangye Li, Wenqian Wang, Yi Lin, Hengshuang Zhao:
Visual Spatial Tuning. CoRR abs/2511.05491 (2025)
2024
[j18]
  dblp key:
  - journals/cviu/FangLLZL24
  persistent URL:
  - https://dblp.org/rec/journals/cviu/FangLLZL24
Zhongbin Fang, Xia Li, Xiangtai Li, Shen Zhao, Mengyuan Liu:
ModelNet-O: A large-scale synthetic dataset for occlusion-aware point cloud classification. Comput. Vis. Image Underst. 246: 104060 (2024)
[j17]
  dblp key:
  - journals/ijcv/LiZYCYTT24
  persistent URL:
  - https://dblp.org/rec/journals/ijcv/LiZYCYTT24
Xiangtai Li, Jiangning Zhang, Yibo Yang, Guangliang Cheng, Kuiyuan Yang, Yunhai Tong, Dacheng Tao:
SFNet: Faster and Accurate Semantic Segmentation via Semantic Flow. Int. J. Comput. Vis. 132(2): 466-489 (2024)
[j16]
  dblp key:
  - journals/ijcv/ZhangLWWYLT24
  persistent URL:
  - https://dblp.org/rec/journals/ijcv/ZhangLWWYLT24
Jiangning Zhang, Xiangtai Li, Yabiao Wang, Chengjie Wang, Yibo Yang, Yong Liu, Dacheng Tao:
EATFormer: Improving Vision Transformer Inspired by Evolutionary Algorithm. Int. J. Comput. Vis. 132(9): 3509-3536 (2024)
[j15]
  dblp key:
  - journals/ijon/WangFLCLLCZ24
  persistent URL:
  - https://dblp.org/rec/journals/ijon/WangFLCLLCZ24
Chunlei Wang, Wenquan Feng, Xiangtai Li, Guangliang Cheng, Shuchang Lyu, Binghao Liu, Lijiang Chen, Qi Zhao:
OV-VG: A benchmark for open-vocabulary visual grounding. Neurocomputing 591: 127738 (2024)
[j14]
  dblp key:
  - journals/pami/WuLXYDY0ZT0GT24
  persistent URL:
  - https://dblp.org/rec/journals/pami/WuLXYDY0ZT0GT24
Jianzong Wu, Xiangtai Li, Shilin Xu, Haobo Yuan, Henghui Ding, Yibo Yang, Xia Li, Jiangning Zhang, Yunhai Tong, Xudong Jiang, Bernard Ghanem, Dacheng Tao:
Towards Open Vocabulary Learning: A Survey. IEEE Trans. Pattern Anal. Mach. Intell. 46(7): 5092-5113 (2024)
[j13]
  dblp key:
  - journals/pami/HanZWWLQYL24
  persistent URL:
  - https://dblp.org/rec/journals/pami/HanZWWLQYL24
Yue Han, Jiangning Zhang, Yabiao Wang, Chengjie Wang, Yong Liu, Lu Qi, Ming-Hsuan Yang, Xiangtai Li:
Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 46(12): 9221-9238 (2024)
[j12]
  dblp key:
  - journals/pami/LiDYZPCCLL24
  persistent URL:
  - https://dblp.org/rec/journals/pami/LiDYZPCCLL24
Xiangtai Li, Henghui Ding, Haobo Yuan, Wenwei Zhang, Jiangmiao Pang, Guangliang Cheng, Kai Chen, Ziwei Liu, Chen Change Loy:
Transformer-Based Visual Segmentation: A Survey. IEEE Trans. Pattern Anal. Mach. Intell. 46(12): 10138-10163 (2024)
[j11]
  dblp key:
  - journals/pami/WangWLGYL24
  persistent URL:
  - https://dblp.org/rec/journals/pami/WangWLGYL24
Jinghao Wang, Zhengyu Wen, Xiangtai Li, Zujin Guo, Jingkang Yang, Ziwei Liu:
Pair Then Relation: Pair-Net for Panoptic Scene Graph Generation. IEEE Trans. Pattern Anal. Mach. Intell. 46(12): 10452-10465 (2024)
[j10]
  dblp key:
  - journals/pami/LiXYYCTLYT24
  persistent URL:
  - https://dblp.org/rec/journals/pami/LiXYYCTLYT24
Xiangtai Li, Shilin Xu, Yibo Yang, Haobo Yuan, Guangliang Cheng, Yunhai Tong, Zhouchen Lin, Ming-Hsuan Yang, Dacheng Tao:
Panoptic-PartFormer++: A Unified and Decoupled View for Panoptic Part Segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 46(12): 11087-11103 (2024)
[j9]
  dblp key:
  - journals/remotesensing/ChengHLLXZZX24
  persistent URL:
  - https://dblp.org/rec/journals/remotesensing/ChengHLLXZZX24
Guangliang Cheng, Yunmeng Huang, Xiangtai Li, Shuchang Lyu, Zhaoyang Xu, Hongbo Zhao, Qi Zhao, Shiming Xiang:
Change Detection Methods for Remote Sensing in the Last Decade: A Comprehensive Review. Remote. Sens. 16(13): 2355 (2024)
[j8]
  dblp key:
  - journals/tcsv/XuLYYZ24
  persistent URL:
  - https://dblp.org/rec/journals/tcsv/XuLYYZ24
Yangyang Xu, Xiangtai Li, Haobo Yuan, Yibo Yang, Lefei Zhang:
Multi-Task Learning With Multi-Query Transformer for Dense Prediction. IEEE Trans. Circuits Syst. Video Technol. 34(2): 1228-1240 (2024)
[j7]
  dblp key:
  - journals/tip/WuLLDTT24
  persistent URL:
  - https://dblp.org/rec/journals/tip/WuLLDTT24
Jianzong Wu, Xiangtai Li, Xia Li, Henghui Ding, Yunhai Tong, Dacheng Tao:
Toward Robust Referring Image Segmentation. IEEE Trans. Image Process. 33: 1782-1794 (2024)
[c46]
  dblp key:
  - conf/cikm/YangM00WLT24
  persistent URL:
  - https://dblp.org/rec/conf/cikm/YangM00WLT24
Tianmeng Yang, Jiahao Meng, Min Zhou, Yaming Yang, Yujing Wang, Xiangtai Li, Yunhai Tong:
You Can't Ignore Either: Unifying Structure and Feature Denoising for Robust Graph Learning. CIKM 2024: 4178-4182
[c45]
  dblp key:
  - conf/cvpr/LuJLLCY24
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/LuJLLCY24
Peng Lu, Tao Jiang, Yining Li, Xiangtai Li, Kai Chen, Wenming Yang:
RTMO: Towards High-Performance One-Stage Real-Time Multi-Person Pose Estimation. CVPR 2024: 1491-1500
[c44]
  dblp key:
  - conf/cvpr/WangF0L0L24
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/WangF0L0L24
Xinshun Wang, Zhongbin Fang, Xia Li, Xiangtai Li, Chen Chen, Mengyuan Liu:
Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learning. CVPR 2024: 2436-2446
[c43]
  dblp key:
  - conf/cvpr/Song0LFLM24
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/Song0LFLM24
Yiran Song, Qianyu Zhou, Xiangtai Li, Deng-Ping Fan, Xuequan Lu, Lizhuang Ma:
BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model. CVPR 2024: 3162-3173
[c42]
  dblp key:
  - conf/cvpr/WuLSZYZLCTLL24
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/WuLSZYZLCTLL24
Jianzong Wu, Xiangtai Li, Chenyang Si, Shangchen Zhou, Jingkang Yang, Jiangning Zhang, Yining Li, Kai Chen, Yunhai Tong, Ziwei Liu, Chen Change Loy:
Towards Language-Driven Video Inpainting via Multimodal Large Language Models. CVPR 2024: 12501-12511
[c41]
  dblp key:
  - conf/cvpr/0072LD24
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/0072LD24
Chang Liu, Xiangtai Li, Henghui Ding:
Referring Image Editing: Object-Level Image Editing via Referring Expressions. CVPR 2024: 13128-13138
[c40]
  dblp key:
  - conf/cvpr/LiY0DWZLCL24
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/LiY0DWZLCL24
Xiangtai Li, Haobo Yuan, Wei Li, Henghui Ding, Size Wu, Wenwei Zhang, Yining Li, Kai Chen, Chen Change Loy:
OMG-Seg: Is One Model Good Enough for all Segmentation? CVPR 2024: 27948-27959
[c39]
  dblp key:
  - conf/eccv/HanZHCGLLZWL24
  persistent URL:
  - https://dblp.org/rec/conf/eccv/HanZHCGLLZWL24
Yue Han, Junwei Zhu, Keke He, Xu Chen, Yanhao Ge, Wei Li, Xiangtai Li, Jiangning Zhang, Chengjie Wang, Yong Liu:
Face-Adapter for Pre-trained Diffusion Models with Fine-Grained ID and Attribute Control. ECCV (50) 2024: 20-36
[c38]
  dblp key:
  - conf/eccv/LiYLWYGZ24
  persistent URL:
  - https://dblp.org/rec/conf/eccv/LiYLWYGZ24
Xiaojie Li, Yibo Yang, Xiangtai Li, Jianlong Wu, Yue Yu, Bernard Ghanem, Min Zhang:
GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning. ECCV (68) 2024: 306-325
[c37]
  dblp key:
  - conf/eccv/YuanLZLCL24
  persistent URL:
  - https://dblp.org/rec/conf/eccv/YuanLZLCL24
Haobo Yuan, Xiangtai Li, Chong Zhou, Yining Li, Kai Chen, Chen Change Loy:
Open-Vocabulary SAM: Segment and Recognize Twenty-Thousand Classes Interactively. ECCV (43) 2024: 419-437
[c36]
  dblp key:
  - conf/eccv/ZhouZJYL24
  persistent URL:
  - https://dblp.org/rec/conf/eccv/ZhouZJYL24
Yikang Zhou, Tao Zhang, Shunping Ji, Shuicheng Yan, Xiangtai Li:
Improving Video Segmentation via Dynamic Anchor Queries. ECCV (50) 2024: 446-463
[c35]
  dblp key:
  - conf/iclr/WuZX0L0L24
  persistent URL:
  - https://dblp.org/rec/conf/iclr/WuZX0L0L24
Size Wu, Wenwei Zhang, Lumin Xu, Sheng Jin, Xiangtai Li, Wentao Liu, Chen Change Loy:
CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction. ICLR 2024
[c34]
  dblp key:
  - conf/icra/DengL0TZL24
  persistent URL:
  - https://dblp.org/rec/conf/icra/DengL0TZL24
Zhichao Deng, Xiangtai Li, Xia Li, Yunhai Tong, Shen Zhao, Mengyuan Liu:
VG4D: Vision-Language Model Goes 4D Video Recognition. ICRA 2024: 5014-5020
[c33]
  dblp key:
  - conf/mm/Long0LLY0MY24
  persistent URL:
  - https://dblp.org/rec/conf/mm/Long0LLY0MY24
Shaocong Long, Qianyu Zhou, Xiangtai Li, Xuequan Lu, Chenhao Ying, Yuan Luo, Lizhuang Ma, Shuicheng Yan:
DGMamba: Domain Generalization via Generalized State Space Model. ACM Multimedia 2024: 3607-3616
[c32]
  dblp key:
  - conf/mm/0001LLLZZY24
  persistent URL:
  - https://dblp.org/rec/conf/mm/0001LLLZZY24
Hao Fei, Xiangtai Li, Haotian Liu, Fuxiao Liu, Zhuosheng Zhang, Hanwang Zhang, Shuicheng Yan:
From Multimodal LLM to Human-level AI: Modality, Instruction, Reasoning and Beyond. ACM Multimedia 2024: 11289-11291
[c31]
  dblp key:
  - conf/nips/HeBZHCGWLT024
  persistent URL:
  - https://dblp.org/rec/conf/nips/HeBZHCGWLT024
Haoyang He, Yuhu Bai, Jiangning Zhang, Qingdong He, Hongxu Chen, Zhenye Gan, Chengjie Wang, Xiangtai Li, Guanzhong Tian, Lei Xie:
MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection. NeurIPS 2024
[c30]
  dblp key:
  - conf/nips/WangLQDT024
  persistent URL:
  - https://dblp.org/rec/conf/nips/WangLQDT024
Chaoyang Wang, Xiangtai Li, Lu Qi, Henghui Ding, Yunhai Tong, Ming-Hsuan Yang:
SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow. NeurIPS 2024
[c29]
  dblp key:
  - conf/nips/WuLZZ0LTC24
  persistent URL:
  - https://dblp.org/rec/conf/nips/WuLZZ0LTC24
Jianzong Wu, Xiangtai Li, Yanhong Zeng, Jiangning Zhang, Qianyu Zhou, Yining Li, Yunhai Tong, Kai Chen:
MotionBooth: Motion-Aware Customized Text-to-Video Generation. NeurIPS 2024
[c28]
  dblp key:
  - conf/nips/ZhangL0YWJLY24
  persistent URL:
  - https://dblp.org/rec/conf/nips/ZhangL0YWJLY24
Tao Zhang, Xiangtai Li, Hao Fei, Haobo Yuan, Shengqiong Wu, Shunping Ji, Chen Change Loy, Shuicheng Yan:
OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding. NeurIPS 2024
[c27]
  dblp key:
  - conf/nips/Zhao0L0JZZ0W24
  persistent URL:
  - https://dblp.org/rec/conf/nips/Zhao0L0JZZ0W24
Yu Zhao, Hao Fei, Xiangtai Li, Libo Qin, Jiayi Ji, Hongyuan Zhu, Meishan Zhang, Min Zhang, Jianguo Wei:
Synergistic Dual Spatial-aware Generation of Image-to-text and Text-to-image. NeurIPS 2024
[i94]
  dblp key:
  - journals/corr/abs-2401-00551
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-00551
Yue Han, Jiangning Zhang, Junwei Zhu, Xiangtai Li, Yanhao Ge, Wei Li, Chengjie Wang, Yong Liu, Xiaoming Liu, Ying Tai:
A Generalist FaceX via Learning Unified Facial Representation. CoRR abs/2401.00551 (2024)
[i93]
  dblp key:
  - journals/corr/abs-2401-02317
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-02317
Yiran Song, Qianyu Zhou, Xiangtai Li, Deng-Ping Fan, Xuequan Lu, Lizhuang Ma:
BA-SAM: Scalable Bias-Mode Attention Mask for Segment Anything Model. CoRR abs/2401.02317 (2024)
[i92]
  dblp key:
  - journals/corr/abs-2401-02361
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-02361
Xiangyu Zhao, Yicheng Chen, Shilin Xu, Xiangtai Li, Xinjiang Wang, Yining Li, Haian Huang:
An Open and Comprehensive Pipeline for Unified Object Grounding and Detection. CoRR abs/2401.02361 (2024)
[i91]
  dblp key:
  - journals/corr/abs-2401-02955
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-02955
Haobo Yuan, Xiangtai Li, Chong Zhou, Yining Li, Kai Chen, Chen Change Loy:
Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively. CoRR abs/2401.02955 (2024)
[i90]
  dblp key:
  - journals/corr/abs-2401-08210
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-08210
Zhongbin Fang, Xia Li, Xiangtai Li, Shen Zhao, Mengyuan Liu:
ModelNet-O: A Large-Scale Synthetic Dataset for Occlusion-Aware Point Cloud Classification. CoRR abs/2401.08210 (2024)
[i89]
  dblp key:
  - journals/corr/abs-2401-10226
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-10226
Jianzong Wu, Xiangtai Li, Chenyang Si, Shangchen Zhou, Jingkang Yang, Jiangning Zhang, Yining Li, Kai Chen, Yunhai Tong, Ziwei Liu, Chen Change Loy:
Towards Language-Driven Video Inpainting via Multimodal Large Language Models. CoRR abs/2401.10226 (2024)
[i88]
  dblp key:
  - journals/corr/abs-2401-10228
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-10228
Shilin Xu, Haobo Yuan, Qingyu Shi, Lu Qi, Jingbo Wang, Yibo Yang, Yining Li, Kai Chen, Yunhai Tong, Bernard Ghanem, Xiangtai Li, Ming-Hsuan Yang:
RAP-SAM: Towards Real-Time All-Purpose Segment Anything. CoRR abs/2401.10228 (2024)
[i87]
  dblp key:
  - journals/corr/abs-2401-10229
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2401-10229
Xiangtai Li, Haobo Yuan, Wei Li, Henghui Ding, Size Wu, Wenwei Zhang, Yining Li, Kai Chen, Chen Change Loy:
OMG-Seg: Is One Model Good Enough For All Segmentation? CoRR abs/2401.10229 (2024)
[i86]
  dblp key:
  - journals/corr/abs-2402-02555
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2402-02555
Lu Qi, Yi-Wen Chen, Lehan Yang, Tiancheng Shen, Xiangtai Li, Weidong Guo, Yu Xu, Ming-Hsuan Yang:
Generalizable Entity Grounding via Assistance of Large Language Model. CoRR abs/2402.02555 (2024)
[i85]
  dblp key:
  - journals/corr/abs-2403-00762
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-00762
Tao Zhang, Xiangtai Li, Haobo Yuan, Shunping Ji, Shuicheng Yan:
Point Cloud Mamba: Point Cloud Learning via State Space Model. CoRR abs/2403.00762 (2024)
[i84]
  dblp key:
  - journals/corr/abs-2403-09616
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-09616
Chaoyang Wang, Xiangtai Li, Henghui Ding, Lu Qi, Jiangning Zhang, Yunhai Tong, Chen Change Loy, Shuicheng Yan:
Explore In-Context Segmentation via Latent Diffusion Models. CoRR abs/2403.09616 (2024)
[i83]
  dblp key:
  - journals/corr/abs-2403-12003
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2403-12003
Xiaojie Li, Yibo Yang, Xiangtai Li, Jianlong Wu, Yue Yu, Bernard Ghanem, Min Zhang:
GenView: Enhancing View Quality with Pretrained Generative Model for Self-Supervised Learning. CoRR abs/2403.12003 (2024)
[i82]
  dblp key:
  - journals/corr/abs-2404-00086
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-00086
Yikang Zhou, Tao Zhang, Shunping Ji, Shuicheng Yan, Xiangtai Li:
DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries. CoRR abs/2404.00086 (2024)
[i81]
  dblp key:
  - journals/corr/abs-2404-06564
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-06564
Haoyang He, Yuhu Bai, Jiangning Zhang, Qingdong He, Hongxu Chen, Zhenye Gan, Chengjie Wang, Xiangtai Li, Guanzhong Tian, Lei Xie:
MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection. CoRR abs/2404.06564 (2024)
[i80]
  dblp key:
  - journals/corr/abs-2404-07794
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-07794
Shaocong Long, Qianyu Zhou, Xiangtai Li, Xuequan Lu, Chenhao Ying, Yuan Luo, Lizhuang Ma, Shuicheng Yan:
DGMamba: Domain Generalization via Generalized State Space Model. CoRR abs/2404.07794 (2024)
[i79]
  dblp key:
  - journals/corr/abs-2404-10760
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-10760
Jiangning Zhang, Chengjie Wang, Xiangtai Li, Guanzhong Tian, Zhucun Xue, Yong Liu, Guansong Pang, Dacheng Tao:
Learning Feature Inversion for Multi-class Anomaly Detection under General-purpose COCO-AD Benchmark. CoRR abs/2404.10760 (2024)
[i78]
  dblp key:
  - journals/corr/abs-2404-11605
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-11605
Zhichao Deng, Xiangtai Li, Xia Li, Yunhai Tong, Shen Zhao, Mengyuan Liu:
VG4D: Vision-Language Model Goes 4D Video Recognition. CoRR abs/2404.11605 (2024)
[i77]
  dblp key:
  - journals/corr/abs-2404-12352
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2404-12352
Mengyuan Liu, Zhongbin Fang, Xia Li, Joachim M. Buhmann, Xiangtai Li, Chen Change Loy:
Point-In-Context: Understanding Point Cloud via In-Context Learning. CoRR abs/2404.12352 (2024)
[i76]
  dblp key:
  - journals/corr/abs-2405-10305
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-10305
Jingkang Yang, Jun Cen, Wenxuan Peng, Shuai Liu, Fangzhou Hong, Xiangtai Li, Kaiyang Zhou, Qifeng Chen, Ziwei Liu:
4D Panoptic Scene Graph Generation. CoRR abs/2405.10305 (2024)
[i75]
  dblp key:
  - journals/corr/abs-2405-12970
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-12970
Yue Han, Junwei Zhu, Keke He, Xu Chen, Yanhao Ge, Wei Li, Xiangtai Li, Jiangning Zhang, Chengjie Wang, Yong Liu:
Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control. CoRR abs/2405.12970 (2024)
[i74]
  dblp key:
  - journals/corr/abs-2405-16940
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-16940
Fengfan Zhou, Qianyu Zhou, Xiangtai Li, Xuequan Lu, Lizhuang Ma, Hefei Ling:
Adversarial Attacks on Both Face Recognition and Face Anti-spoofing Models. CoRR abs/2405.16940 (2024)
[i73]
  dblp key:
  - journals/corr/abs-2405-17427
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-17427
Kuan-Chih Huang, Xiangtai Li, Lu Qi, Shuicheng Yan, Ming-Hsuan Yang:
Reason3D: Searching and Reasoning 3D Segmentation via Large Language Model. CoRR abs/2405.17427 (2024)
[i72]
  dblp key:
  - journals/corr/abs-2405-20282
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2405-20282
Chaoyang Wang, Xiangtai Li, Lu Qi, Henghui Ding, Yunhai Tong, Ming-Hsuan Yang:
SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow. CoRR abs/2405.20282 (2024)
[i71]
  dblp key:
  - journals/corr/abs-2406-01112
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-01112
Zheng Zhou, Hongbo Zhao, Guangliang Cheng, Xiangtai Li, Shuchang Lyu, Wenquan Feng, Qi Zhao:
BACON: Bayesian Optimal Condensation Framework for Dataset Distillation. CoRR abs/2406.01112 (2024)
[i70]
  dblp key:
  - journals/corr/abs-2406-05127
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-05127
Shengqiong Wu, Hao Fei, Xiangtai Li, Jiayi Ji, Hanwang Zhang, Tat-Seng Chua, Shuicheng Yan:
Towards Semantic Equivalence of Tokenization in Multimodal LLM. CoRR abs/2406.05127 (2024)
[i69]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-17758
Jianzong Wu, Xiangtai Li, Yanhong Zeng, Jiangning Zhang, Qianyu Zhou, Yining Li, Yunhai Tong, Kai Chen:
MotionBooth: Motion-Aware Customized Text-to-Video Generation. CoRR abs/2406.17758 (2024)
[i68]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-17770
Xiangyu Zhao, Xiangtai Li, Haodong Duan, Haian Huang, Yining Li, Kai Chen, Hua Yang:
MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning. CoRR abs/2406.17770 (2024)
[i67]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-19369
Haobo Yuan, Xiangtai Li, Lu Qi, Tao Zhang, Ming-Hsuan Yang, Shuicheng Yan, Chen Change Loy:
Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model. CoRR abs/2406.19369 (2024)
[i66]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-19389
Tao Zhang, Xiangtai Li, Hao Fei, Haobo Yuan, Shengqiong Wu, Shunping Ji, Chen Change Loy, Shuicheng Yan:
OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding. CoRR abs/2406.19389 (2024)
[i65]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-20085
Yicheng Chen, Xiangtai Li, Yining Li, Yanhong Zeng, Jianzong Wu, Xiangyu Zhao, Kai Chen:
Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language. CoRR abs/2406.20085 (2024)
[i64]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2407-19409
Shilin Xu, Xiangtai Li, Haobo Yuan, Lu Qi, Yunhai Tong, Ming-Hsuan Yang:
LLAVADI: What Matters For Multimodal Large Language Models Distillation. CoRR abs/2407.19409 (2024)
[i63]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-00700
Tianmeng Yang, Jiahao Meng, Min Zhou, Yaming Yang, Yujing Wang, Xiangtai Li, Yunhai Tong:
You Can't Ignore Either: Unifying Structure and Feature Denoising for Robust Graph Learning. CoRR abs/2408.00700 (2024)
[i62]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2408-13574
Hao Yang, Qianyu Zhou, Haijia Sun, Xiangtai Li, Fengqi Liu, Xuequan Lu, Lizhuang Ma, Shuicheng Yan:
PointDGMamba: Domain Generalization of Point Cloud Classification via Generalized State Space Model. CoRR abs/2408.13574 (2024)
[i61]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-15179
Yue Han, Junwei Zhu, Yuxiang Feng, Xiaozhong Ji, Keke He, Xiangtai Li, Zhucun Xue, Yong Liu:
MIMAFace: Face Animation via Motion-Identity Modulated Appearance Feature Learning. CoRR abs/2409.15179 (2024)
[i60]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-04733
Yujin Tang, Lu Qi, Fei Xie, Xiangtai Li, Chao Ma, Ming-Hsuan Yang:
PredFormer: Transformers Are Effective Spatial-Temporal Predictive Learners. CoRR abs/2410.04733 (2024)
[i59]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-08261
Jinbin Bai, Tian Ye, Wei Chow, Enxin Song, Qing-Guo Chen, Xiangtai Li, Zhen Dong, Lei Zhu, Shuicheng Yan:
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis. CoRR abs/2410.08261 (2024)
[i58]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-10676
Peiwen Sun, Sitong Cheng, Xiangtai Li, Zhen Ye, Huadai Liu, Honggang Zhang, Wei Xue, Yike Guo:
Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation. CoRR abs/2410.10676 (2024)
[i57]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-15312
Yu Zhao, Hao Fei, Xiangtai Li, Libo Qin, Jiayi Ji, Hongyuan Zhu, Meishan Zhang, Min Zhang, Jianguo Wei:
Synergistic Dual Spatial-aware Generation of Image-to-Text and Text-to-Image. CoRR abs/2410.15312 (2024)
[i56]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-23280
Qingyu Shi, Lu Qi, Jianzong Wu, Jinbin Bai, Jingbo Wang, Yunhai Tong, Xiangtai Li, Ming-Hsuan Yang:
RelationBooth: Towards Relation-Aware Customized Object Generation. CoRR abs/2410.23280 (2024)
[i55]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-03255
Qingdong He, Jinlong Peng, Pengcheng Xu, Boyuan Jiang, Xiaobin Hu, Donghao Luo, Yong Liu, Yabiao Wang, Chengjie Wang, Xiangtai Li, Jiangning Zhang:
DynamicControl: Adaptive Condition Selection for Improved Text-to-Image Generation. CoRR abs/2412.03255 (2024)
[i54]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-04280
Jinbin Bai, Wei Chow, Ling Yang, Xiangtai Li, Juncheng Li, Hanwang Zhang, Shuicheng Yan:
HumanEdit: A High-Quality Human-Rewarded Dataset for Instruction-based Image Editing. CoRR abs/2412.04280 (2024)
[i53]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-04292
Zhenglin Huang, Jinwei Hu, Xiangtai Li, Yiwei He, Xingyu Zhao, Bei Peng, Baoyuan Wu, Xiaowei Huang, Guangliang Cheng:
SIDA: Social Media Image Deepfake Detection, Localization and Explanation with Large Multimodal Model. CoRR abs/2412.04292 (2024)
[i52]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-06674
Jiangning Zhang, Teng Hu, Haoyang He, Zhucun Xue, Yabiao Wang, Chengjie Wang, Yong Liu, Xiangtai Li, Dacheng Tao:
EMOv2: Pushing 5M Vision Model Frontier. CoRR abs/2412.06674 (2024)
[i51]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-07589
Jianzong Wu, Chao Tang, Jingbo Wang, Yanhong Zeng, Xiangtai Li, Yunhai Tong:
DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation. CoRR abs/2412.07589 (2024)
2023
[j6]
  persistent URL:
  - https://dblp.org/rec/journals/pami/LiHYDYCTT23
Xiangtai Li, Hao He, Yibo Yang, Henghui Ding, Kuiyuan Yang, Guangliang Cheng, Yunhai Tong, Dacheng Tao:
Improving Video Instance Segmentation via Temporal Pyramid Routing. IEEE Trans. Pattern Anal. Mach. Intell. 45(5): 6594-6601 (2023)
[j5]
  persistent URL:
  - https://dblp.org/rec/journals/pami/ZhouLHYCTMT23
Qianyu Zhou, Xiangtai Li, Lu He, Yibo Yang, Guangliang Cheng, Yunhai Tong, Lizhuang Ma, Dacheng Tao:
TransVOD: End-to-End Video Object Detection With Spatial-Temporal Transformers. IEEE Trans. Pattern Anal. Mach. Intell. 45(6): 7853-7869 (2023)
[j4]
  persistent URL:
  - https://dblp.org/rec/journals/pami/WangYLBZLYZHT23
Yujing Wang, Yaming Yang, Zhuo Li, Jiangang Bai, Mingliang Zhang, Xiangtai Li, Jing Yu, Ce Zhang, Gao Huang, Yunhai Tong:
Convolution-Enhanced Evolving Attention Networks. IEEE Trans. Pattern Anal. Mach. Intell. 45(7): 8176-8192 (2023)
[j3]
  persistent URL:
  - https://dblp.org/rec/journals/remotesensing/XuJLZL23
Guozheng Xu, Xue Jiang, Xiangtai Li, Ze Zhang, Xingzhao Liu:
Exploring Self-Supervised Learning for Multi-Modal Remote Sensing Pre-Training via Asymmetric Attention Fusion. Remote. Sens. 15(24): 5682 (2023)
[c26]
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/YangPLGCL0ZZLL23
Jingkang Yang, Wenxuan Peng, Xiangtai Li, Zujin Guo, Liangyu Chen, Bo Li, Zheng Ma, Kaiyang Zhou, Wayne Zhang, Chen Change Loy, Ziwei Liu:
Panoptic Video Scene Graph Generation. CVPR 2023: 18675-18685
[c25]
  persistent URL:
  - https://dblp.org/rec/conf/iccv/ZhangLL0XZ0HWW23
Jiangning Zhang, Xiangtai Li, Jian Li, Liang Liu, Zhucun Xue, Boshen Zhang, Zhengkai Jiang, Tianxin Huang, Yabiao Wang, Chengjie Wang:
Rethinking Mobile Block for Efficient Attention-based Models. ICCV 2023: 1389-1400
[c24]
  persistent URL:
  - https://dblp.org/rec/conf/iccv/LiYZCPL23
Xiangtai Li, Haobo Yuan, Wenwei Zhang, Guangliang Cheng, Jiangmiao Pang, Chen Change Loy:
Tube-Link: A Flexible Cross Tube Framework for Universal Video Segmentation. ICCV 2023: 13877-13887
[c23]
  persistent URL:
  - https://dblp.org/rec/conf/iccv/WuLDLCTL23
Jianzong Wu, Xiangtai Li, Henghui Ding, Xia Li, Guangliang Cheng, Yunhai Tong, Chen Change Loy:
Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation. ICCV 2023: 21881-21891
[c22]
  persistent URL:
  - https://dblp.org/rec/conf/iccvw/LiWFLCLLZ23
Menghao Li, Chunlei Wang, Wenquan Feng, Shuchang Lyu, Guangliang Cheng, Xiangtai Li, Binghao Liu, Qi Zhao:
Iterative Robust Visual Grounding with Masked Reference based Centerpoint Supervision. ICCV (Workshops) 2023: 4653-4658
[c21]
  persistent URL:
  - https://dblp.org/rec/conf/iclr/YangYLLTT23
Yibo Yang, Haobo Yuan, Xiangtai Li, Zhouchen Lin, Philip H. S. Torr, Dacheng Tao:
Neural Collapse Inspired Feature-Classifier Alignment for Few-Shot Class-Incremental Learning. ICLR 2023
[c20]
  persistent URL:
  - https://dblp.org/rec/conf/nips/FangL0BLL23
Zhongbin Fang, Xiangtai Li, Xia Li, Joachim M. Buhmann, Chen Change Loy, Mengyuan Liu:
Explore In-Context Learning for 3D Point Cloud Understanding. NeurIPS 2023
[c19]
  persistent URL:
  - https://dblp.org/rec/conf/nips/YangCPLHLZCL23
Jingkang Yang, Jun Cen, Wenxuan Peng, Shuai Liu, Fangzhou Hong, Xiangtai Li, Kaiyang Zhou, Qifeng Chen, Ziwei Liu:
4D Panoptic Scene Graph Generation. NeurIPS 2023
[i50]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-00805
Jianzong Wu, Xiangtai Li, Henghui Ding, Xia Li, Guangliang Cheng, Yunhai Tong, Chen Change Loy:
Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation. CoRR abs/2301.00805 (2023)
[i49]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-00954
Xiangtai Li, Shilin Xu, Yibo Yang, Haobo Yuan, Guangliang Cheng, Yunhai Tong, Zhouchen Lin, Dacheng Tao:
PanopticPartFormer++: A Unified and Decoupled View for Panoptic Part Segmentation. CoRR abs/2301.00954 (2023)
[i48]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-01146
Jiangning Zhang, Xiangtai Li, Jian Li, Liang Liu, Zhucun Xue, Boshen Zhang, Zhengkai Jiang, Tianxin Huang, Yabiao Wang, Chengjie Wang:
Rethinking Mobile Block for Efficient Neural Models. CoRR abs/2301.01146 (2023)
[i47]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2301-01156
Yue Han, Jiangning Zhang, Zhucun Xue, Chao Xu, Xintian Shen, Yabiao Wang, Chengjie Wang, Yong Liu, Xiangtai Li:
Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation. CoRR abs/2301.01156 (2023)
[i46]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2302-03004
Yibo Yang, Haobo Yuan, Xiangtai Li, Zhouchen Lin, Philip H. S. Torr, Dacheng Tao:
Neural Collapse Inspired Feature-Classifier Alignment for Few-Shot Class Incremental Learning. CoRR abs/2302.03004 (2023)
[i45]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2303-12782
Xiangtai Li, Haobo Yuan, Wenwei Zhang, Guangliang Cheng, Jiangmiao Pang, Chen Change Loy:
Tube-Link: A Flexible Cross Tube Baseline for Universal Video Segmentation. CoRR abs/2303.12782 (2023)
[i44]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2304-09854
Xiangtai Li, Henghui Ding, Wenwei Zhang, Haobo Yuan, Jiangmiao Pang, Guangliang Cheng, Kai Chen, Ziwei Liu, Chen Change Loy:
Transformer-Based Visual Segmentation: A Survey. CoRR abs/2304.09854 (2023)
[i43]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2305-05813
Guangliang Cheng, Yunmeng Huang, Xiangtai Li, Shuchang Lyu, Zhaoyang Xu, Qi Zhao, Shiming Xiang:
Change Detection Methods for Remote Sensing in the Last Decade: A Comprehensive Review. CoRR abs/2305.05813 (2023)
[i42]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-08659
Zhongbin Fang, Xiangtai Li, Xia Li, Joachim M. Buhmann, Chen Change Loy, Mengyuan Liu:
Explore In-Context Learning for 3D Point Cloud Understanding. CoRR abs/2306.08659 (2023)
[i41]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2306-15880
Jianzong Wu, Xiangtai Li, Shilin Xu, Haobo Yuan, Henghui Ding, Yibo Yang, Xia Li, Jiangning Zhang, Yunhai Tong, Xudong Jiang, Bernard Ghanem, Dacheng Tao:
Towards Open Vocabulary Learning: A Survey. CoRR abs/2306.15880 (2023)
[i40]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-08699
Jinghao Wang, Zhengyu Wen, Xiangtai Li, Zujin Guo, Jingkang Yang, Ziwei Liu:
Pair then Relation: Pair-Net for Panoptic Scene Graph Generation. CoRR abs/2307.08699 (2023)
[i39]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-12392
Menghao Li, Chunlei Wang, Wenquan Feng, Shuchang Lyu, Guangliang Cheng, Xiangtai Li, Binghao Liu, Qi Zhao:
Iterative Robust Visual Grounding with Masked Reference based Centerpoint Supervision. CoRR abs/2307.12392 (2023)
[i38]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-01746
Yibo Yang, Haobo Yuan, Xiangtai Li, Jianlong Wu, Lefei Zhang, Zhouchen Lin, Philip H. S. Torr, Dacheng Tao, Bernard Ghanem:
Neural Collapse Terminus: A Unified Solution for Class Incremental Learning and Its Variants. CoRR abs/2308.01746 (2023)
[i37]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2309-13042
Jiahao Xie, Wei Li, Xiangtai Li, Ziwei Liu, Yew Soon Ong, Chen Change Loy:
MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation. CoRR abs/2309.13042 (2023)
[i36]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-01393
Shilin Xu, Xiangtai Li, Size Wu, Wenwei Zhang, Yining Li, Guangliang Cheng, Yunhai Tong, Kai Chen, Chen Change Loy:
DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection. CoRR abs/2310.01393 (2023)
[i35]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-01403
Size Wu, Wenwei Zhang, Lumin Xu, Sheng Jin, Xiangtai Li, Wentao Liu, Chen Change Loy:
CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction. CoRR abs/2310.01403 (2023)
[i34]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2310-14374
Chunlei Wang, Wenquan Feng, Xiangtai Li, Guangliang Cheng, Shuchang Lyu, Binghao Liu, Lijiang Chen, Qi Zhao:
OV-VG: A Benchmark for Open-Vocabulary Visual Grounding. CoRR abs/2310.14374 (2023)
[i33]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-03352
Hao Zhou, Tiancheng Shen, Xu Yang, Hai Huang, Xiangtai Li, Lu Qi, Ming-Hsuan Yang:
Rethinking Evaluation Metrics of Open-Vocabulary Segmentaion. CoRR abs/2311.03352 (2023)
[i32]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-17058
Jingkang Yang, Wenxuan Peng, Xiangtai Li, Zujin Guo, Liangyu Chen, Bo Li, Zheng Ma, Kaiyang Zhou, Wayne Zhang, Chen Change Loy, Ziwei Liu:
Panoptic Video Scene Graph Generation. CoRR abs/2311.17058 (2023)
[i31]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-01734
Yunhao Liu, Lu Qi, Yu-Ju Tsai, Xiangtai Li, Kelvin C. K. Chan, Ming-Hsuan Yang:
Effective Adapter for Face Recognition in the Wild. CoRR abs/2312.01734 (2023)
[i30]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-03703
Xinshun Wang, Zhongbin Fang, Xia Li, Xiangtai Li, Chen Chen, Mengyuan Liu:
Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learning. CoRR abs/2312.03703 (2023)
[i29]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-06660
Chong Zhou, Xiangtai Li, Chen Change Loy, Bo Dai:
EdgeSAM: Prompt-In-the-Loop Distillation for On-Device Deployment of SAM. CoRR abs/2312.06660 (2023)
[i28]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-07495
Jiangning Zhang, Xuhai Chen, Yabiao Wang, Chengjie Wang, Yong Liu, Xiangtai Li, Ming-Hsuan Yang, Dacheng Tao:
Exploring Plain ViT Reconstruction for Multi-class Unsupervised Anomaly Detection. CoRR abs/2312.07495 (2023)
[i27]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2312-07526
Peng Lu, Tao Jiang, Yining Li, Xiangtai Li, Kai Chen, Wenming Yang:
RTMO: Towards High-Performance One-Stage Real-Time Multi-Person Pose Estimation. CoRR abs/2312.07526 (2023)
2022
[c18]
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/LiZPCCTL22
Xiangtai Li, Wenwei Zhang, Jiangmiao Pang, Kai Chen, Guangliang Cheng, Yunhai Tong, Chen Change Loy:
Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation. CVPR 2022: 18825-18835
[c17]
  persistent URL:
  - https://dblp.org/rec/conf/eccv/XuLWCTT22
Shilin Xu, Xiangtai Li, Jingbo Wang, Guangliang Cheng, Yunhai Tong, Dacheng Tao:
Fashionformer: A Simple, Effective and Unified Baseline for Human Fashion Segmentation and Recognition. ECCV (37) 2022: 545-563
[c16]
  persistent URL:
  - https://dblp.org/rec/conf/eccv/YuanLYC0TZT22
Haobo Yuan, Xiangtai Li, Yibo Yang, Guangliang Cheng, Jing Zhang, Yunhai Tong, Lefei Zhang, Dacheng Tao:
PolyphonicFormer: Unified Query Learning for Depth-Aware Video Panoptic Segmentation. ECCV (27) 2022: 582-599
[c15]
  persistent URL:
  - https://dblp.org/rec/conf/eccv/LiXYCTT22
Xiangtai Li, Shilin Xu, Yibo Yang, Guangliang Cheng, Yunhai Tong, Dacheng Tao:
Panoptic-PartFormer: Learning a Unified Model for Panoptic Part Segmentation. ECCV (27) 2022: 729-747
[c14]
  persistent URL:
  - https://dblp.org/rec/conf/icip/XuLYLCT22
Shilin Xu, Xiangtai Li, Yibo Yang, Hongyang Li, Guangliang Cheng, Yunhai Tong:
Query Learning of Both Thing and Stuff for Panoptic Segmentation. ICIP 2022: 716-720
[c13]
  persistent URL:
  - https://dblp.org/rec/conf/nips/YangCLXLT22
Yibo Yang, Shixiang Chen, Xiangtai Li, Liang Xie, Zhouchen Lin, Dacheng Tao:
Inducing Neural Collapse in Imbalanced Learning: Do We Really Need a Learnable Classifier at the End of Deep Neural Network? NeurIPS 2022
[i26]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2201-05047
Qianyu Zhou, Xiangtai Li, Lu He, Yibo Yang, Guangliang Cheng, Yunhai Tong, Lizhuang Ma, Dacheng Tao:
TransVOD: End-to-end Video Object Detection with Spatial-Temporal Transformers. CoRR abs/2201.05047 (2022)
[i25]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2203-09081
Yibo Yang, Liang Xie, Shixiang Chen, Xiangtai Li, Zhouchen Lin, Dacheng Tao:
Do We Really Need a Learnable Classifier at the End of Deep Neural Network? CoRR abs/2203.09081 (2022)
[i24]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-04654
Shilin Xu, Xiangtai Li, Jingbo B. Wang, Guangliang Cheng, Yunhai Tong, Dacheng Tao:
Fashionformer: A simple, Effective and Unified Baseline for Human Fashion Segmentation and Recognition. CoRR abs/2204.04654 (2022)
[i23]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-04655
Xiangtai Li, Shilin Xu, Yibo Yang, Guangliang Cheng, Yunhai Tong, Dacheng Tao:
Panoptic-PartFormer: Learning a Unified Model for Panoptic Part Segmentation. CoRR abs/2204.04655 (2022)
[i22]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2204-04656
Xiangtai Li, Wenwei Zhang, Jiangmiao Pang, Kai Chen, Guangliang Cheng, Yunhai Tong, Chen Change Loy:
Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation. CoRR abs/2204.04656 (2022)
[i21]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2205-14354
Yangyang Xu, Xiangtai Li, Haobo Yuan, Yibo Yang, Jing Zhang, Yunhai Tong, Lefei Zhang, Dacheng Tao:
Multi-Task Learning with Multi-query Transformer for Dense Prediction. CoRR abs/2205.14354 (2022)
[i20]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2206-09325
Jiangning Zhang, Xiangtai Li, Yabiao Wang, Chengjie Wang, Yibo Yang, Yong Liu, Dacheng Tao:
EATFormer: Improving Vision Transformer Inspired by Evolutionary Algorithm. CoRR abs/2206.09325 (2022)
[i19]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2207-04415
Xiangtai Li, Jiangning Zhang, Yibo Yang, Guangliang Cheng, Kuiyuan Yang, Yunhai Tong, Dacheng Tao:
SFNet: Faster, Accurate, and Domain Agnostic Semantic Segmentation via Semantic Flow. CoRR abs/2207.04415 (2022)
[i18]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2209-09554
Jianzong Wu, Xiangtai Li, Xia Li, Henghui Ding, Yunhai Tong, Dacheng Tao:
Towards Robust Referring Image Segmentation. CoRR abs/2209.09554 (2022)
[i17]
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2212-08330
Yujing Wang, Yaming Yang, Zhuo Li, Jiangang Bai, Mingliang Zhang, Xiangtai Li, Jing Yu, Ce Zhang, Gao Huang, Yunhai Tong:
Convolution-enhanced Evolving Attention Networks. CoRR abs/2212.08330 (2022)
2021
[j2]
  persistent URL:
  - https://dblp.org/rec/journals/tip/LiZCYTZX21
Xiangtai Li, Li Zhang, Guangliang Cheng, Kuiyuan Yang, Yunhai Tong, Xiatian Zhu, Tao Xiang:
Global Aggregation Then Local Distribution for Scene Parsing. IEEE Trans. Image Process. 30: 6829-6842 (2021)
[j1]
Xiangtai Li, Xia Li, Ansheng You, Li Zhang, Guangliang Cheng, Kuiyuan Yang, Yunhai Tong, Zhouchen Lin:
Towards Efficient Scene Understanding via Squeeze Reasoning. IEEE Trans. Image Process. 30: 7050-7063 (2021)
[c12]
Xiangtai Li, Hao He, Xia Li, Duo Li, Guangliang Cheng, Jianping Shi, Lubin Weng, Yunhai Tong, Zhouchen Lin:
PointFlow: Flowing Semantics Through Points for Aerial Image Segmentation. CVPR 2021: 4217-4226
[c11]
Duo Li, Jie Hu, Changhu Wang, Xiangtai Li, Qi She, Lei Zhu, Tong Zhang, Qifeng Chen:
Involution: Inverting the Inherence of Convolution for Visual Recognition. CVPR 2021: 12321-12330
[c10]
Hao He, Xiangtai Li, Guangliang Cheng, Jianping Shi, Yunhai Tong, Gaofeng Meng, Véronique Prinet, Lubin Weng:
Enhanced Boundary Learning for Glass-like Object Segmentation. ICCV 2021: 15839-15848
[c9]
Chen Shi, Xiangtai Li, Yanran Wu, Yunhai Tong, Yi Xu:
Dynamic Dual Sampling Module For Fine-Grained Semantic Segmentation. ICIP 2021: 2269-2273
[c8]
Yanran Wu, Xiangtai Li, Chen Shi, Yunhai Tong, Yang Hua, Tao Song, Ruhui Ma, Haibing Guan:
Fast and Accurate Scene Parsing via Bi-Direction Alignment Networks. ICIP 2021: 2508-2512
[c7]
Lu He, Qianyu Zhou, Xiangtai Li, Li Niu, Guangliang Cheng, Xiao Li, Wenxuan Liu, Yunhai Tong, Lizhuang Ma, Liqing Zhang:
End-to-End Video Object Detection with Spatial-Temporal Transformers. ACM Multimedia 2021: 1507-1516
[i16]
Duo Li, Jie Hu, Changhu Wang, Xiangtai Li, Qi She, Lei Zhu, Tong Zhang, Qifeng Chen:
Involution: Inverting the Inherence of Convolution for Visual Recognition. CoRR abs/2103.06255 (2021)
[i15]
Xiangtai Li, Hao He, Xia Li, Duo Li, Guangliang Cheng, Jianping Shi, Lubin Weng, Yunhai Tong, Zhouchen Lin:
PointFlow: Flowing Semantics Through Points for Aerial Image Segmentation. CoRR abs/2103.06564 (2021)
[i14]
Hao He, Xiangtai Li, Guangliang Cheng, Jianping Shi, Yunhai Tong, Gaofeng Meng, Véronique Prinet, Lubin Weng:
Enhanced Boundary Learning for Glass-like Object Segmentation. CoRR abs/2103.15734 (2021)
[i13]
Lu He, Qianyu Zhou, Xiangtai Li, Li Niu, Guangliang Cheng, Xiao Li, Wenxuan Liu, Yunhai Tong, Lizhuang Ma, Liqing Zhang:
End-to-End Video Object Detection with Spatial-Temporal Transformers. CoRR abs/2105.10920 (2021)
[i12]
Yanran Wu, Xiangtai Li, Chen Shi, Yunhai Tong, Yang Hua, Tao Song, Ruhui Ma, Haibing Guan:
Fast and Accurate Scene Parsing via Bi-direction Alignment Networks. CoRR abs/2105.11651 (2021)
[i11]
Chen Shi, Xiangtai Li, Yanran Wu, Yunhai Tong, Yi Xu:
Dynamic Dual Sampling Module for Fine-Grained Semantic Segmentation. CoRR abs/2105.11657 (2021)
[i10]
Hao He, Xiangtai Li, Kuiyuan Yang, Guangliang Cheng, Jianping Shi, Yunhai Tong, Zhengjun Zha, Lubin Weng:
BoundarySqueeze: Image Segmentation as Boundary Squeezing. CoRR abs/2105.11668 (2021)
[i9]
Xiangtai Li, Li Zhang, Guangliang Cheng, Kuiyuan Yang, Yunhai Tong, Xiatian Zhu, Tao Xiang:
Global Aggregation then Local Distribution for Scene Parsing. CoRR abs/2107.13154 (2021)
[i8]
Xiangtai Li, Hao He, Henghui Ding, Kuiyuan Yang, Guangliang Cheng, Jianping Shi, Yunhai Tong:
Improving Video Instance Segmentation via Temporal Pyramid Routing. CoRR abs/2107.13155 (2021)
[i7]
Haobo Yuan, Xiangtai Li, Yibo Yang, Guangliang Cheng, Jing Zhang, Yunhai Tong, Lefei Zhang, Dacheng Tao:
PolyphonicFormer: Unified Query Learning for Depth-aware Video Panoptic Segmentation. CoRR abs/2112.02582 (2021)
2020
[c6]
Xiangtai Li, Houlong Zhao, Lei Han, Yunhai Tong, Shaohua Tan, Kuiyuan Yang:
Gated Fully Fusion for Semantic Segmentation. AAAI 2020: 11418-11425
[c5]
Xiangtai Li, Xia Li, Li Zhang, Guangliang Cheng, Jianping Shi, Zhouchen Lin, Shaohua Tan, Yunhai Tong:
Improving Semantic Segmentation via Decoupled Body and Edge Supervision. ECCV (17) 2020: 435-452
[c4]
Xiangtai Li, Ansheng You, Zhen Zhu, Houlong Zhao, Maoke Yang, Kuiyuan Yang, Shaohua Tan, Yunhai Tong:
Semantic Flow for Fast and Accurate Scene Parsing. ECCV (1) 2020: 775-793
[i6]
Xiangtai Li, Ansheng You, Zhen Zhu, Houlong Zhao, Maoke Yang, Kuiyuan Yang, Yunhai Tong:
Semantic Flow for Fast and Accurate Scene Parsing. CoRR abs/2002.10120 (2020)
[i5]
Xiangtai Li, Xia Li, Li Zhang, Guangliang Cheng, Jianping Shi, Zhouchen Lin, Shaohua Tan, Yunhai Tong:
Improving Semantic Segmentation via Decoupled Body and Edge Supervision. CoRR abs/2007.10035 (2020)
[i4]
Xiangtai Li, Xia Li, Ansheng You, Li Zhang, Guangliang Cheng, Kuiyuan Yang, Yunhai Tong, Zhouchen Lin:
Towards Efficient Scene Understanding via Squeeze Reasoning. CoRR abs/2011.03308 (2020)

2010 – 2019

2019
[c3]
Xiangtai Li, Li Zhang, Ansheng You, Maoke Yang, Kuiyuan Yang, Yunhai Tong:
Global Aggregation then Local Distribution in Fully Convolutional Networks. BMVC 2019: 244
[c2]
Li Zhang, Xiangtai Li, Anurag Arnab, Kuiyuan Yang, Yunhai Tong, Philip H. S. Torr:
Dual Graph Convolutional Network for Semantic Segmentation. BMVC 2019: 254
[c1]
Xiangtai Li, Jiangang Bai, Kuiyuan Yang, Yunhai Tong:
Flow2Seg: Motion-Aided Semantic Segmentation. ICANN (3) 2019: 225-237
[i3]
Xiangtai Li, Houlong Zhao, Lei Han, Yunhai Tong, Kuiyuan Yang:
GFF: Gated Fully Fusion for Semantic Segmentation. CoRR abs/1904.01803 (2019)
[i2]
Li Zhang, Xiangtai Li, Anurag Arnab, Kuiyuan Yang, Yunhai Tong, Philip H. S. Torr:
Dual Graph Convolutional Network for Semantic Segmentation. CoRR abs/1909.06121 (2019)
[i1]
Xiangtai Li, Li Zhang, Ansheng You, Maoke Yang, Kuiyuan Yang, Yunhai Tong:
Global Aggregation then Local Distribution in Fully Convolutional Networks. CoRR abs/1909.07229 (2019)
