


International Journal of Computer Vision, Volume 133
Volume 133, Number 1, January 2025
- Lv Tang, Peng-Tao Jiang, Haoke Xiao, Bo Li:
Towards Training-Free Open-World Segmentation via Image Prompt Foundation Models. 1-15
- Xin Jin, Longhai Wu, Jie Chen, Youxin Chen, Jayoon Koo, Cheul-Hee Hahm, Zhao-Min Chen:
UPR-Net: A Unified Pyramid Recurrent Network for Video Frame Interpolation. 16-30
- Jian Liang, Ran He, Tieniu Tan:
A Comprehensive Survey on Test-Time Adaptation Under Distribution Shifts. 31-64
- Shiyu Xuan, Ming Yang, Shiliang Zhang:
Incremental Model Enhancement via Memory-based Contrastive Learning. 65-83
- Xixi Wang, Bo Jiang, Xiao Wang, Bin Luo:
Learning Dynamic Batch-Graph Representation for Deep Representation Learning. 84-105
- Ruikang Xu, Mingde Yao, Chang Chen, Lizhi Wang, Zhiwei Xiong:
Continuous Spatial-Spectral Reconstruction via Implicit Neural Representation. 106-128
- Zhen Wang, Jun Xiao, Yueting Zhuang, Fei Gao, Jian Shao, Long Chen:
Learning Combinatorial Prompts for Universal Controllable Image Captioning. 129-150
- Hao Lu, Wenze Liu, Hongtao Fu, Zhiguo Cao:
FADE: A Task-Agnostic Upsampling Operator for Encoder-Decoder Architectures. 151-172
- Feng Li, Runmin Cong, Jingjing Wu, Huihui Bai, Meng Wang, Yao Zhao:
SRConvNet: A Transformer-Style ConvNet for Lightweight Image Super-Resolution. 173-189
- Jin Zeng, Qingpeng Zhu, Tongxuan Tian, Wenxiu Sun, Lin Zhang, Shengjie Zhao:
Deep Unrolled Weighted Graph Laplacian Regularization for Depth Completion. 190-210
- Zitai Wang, Qianqian Xu, Zhiyong Yang, Peisong Wen, Yuan He, Xiaochun Cao, Qingming Huang:
Top-K Pairwise Ranking: Bridging the Gap Among Ranking-Based Measures for Multi-label Classification. 211-253
- Qi Zheng, Daqing Liu, Chaoyue Wang, Jing Zhang, Dadong Wang, Dacheng Tao:
ESceme: Vision-and-Language Navigation with Episodic Scene Memory. 254-274
- Zeqi Xiao, Wenwei Zhang, Tai Wang, Chen Change Loy, Dahua Lin, Jiangmiao Pang:
Position-Guided Point Cloud Panoptic Segmentation Transformer. 275-290
- Florinel-Alin Croitoru, Nicolae-Catalin Ristea, Radu Tudor Ionescu, Nicu Sebe:
Learning Rate Curriculum. 291-314
- Jing Yang, Xiatian Zhu, Adrian Bulat, Brais Martínez, Georgios Tzimiropoulos:
Knowledge Distillation Meets Open-Set Semi-supervised Learning. 315-334
- Yawei Luo, Ping Liu, Yi Yang:
Kill Two Birds with One Stone: Domain Generalization for Semantic Segmentation via Network Pruning. 335-352
- Xiao Yang, Longlong Xu, Tianyu Pang, Yinpeng Dong, Yikai Wang, Hang Su, Jun Zhu:
Face3DAdv: Exploiting Robust Adversarial 3D Patches on Physical Face Recognition. 353-371
- Yang Shen, Xuhao Sun, Xiu-Shen Wei, Anqi Xu, Lingyan Gao:
Equiangular Basis Vectors: A Novel Paradigm for Classification Tasks. 372-397
- Omkar Thawakar, Sanath Narayan, Hisham Cholakkal, Rao Muhammad Anwer, Salman H. Khan, Jorma Laaksonen, Mubarak Shah, Fahad Shahbaz Khan:
Video Instance Segmentation in an Open-World. 398-409
- Yongxing Dai, Yifan Sun, Jun Liu, Zekun Tong, Ling-Yu Duan:
Bridging the Source-to-Target Gap for Cross-Domain Person Re-identification with Intermediate Domains. 410-434
- Songnan Lin, Ye Ma, Jing Chen, Bihan Wen:
Compressed Event Sensing (CES) Volumes for Event Cameras. 435-455
- Zhuo Huang, Muyang Li, Li Shen, Jun Yu, Chen Gong, Bo Han, Tongliang Liu:
Winning Prize Comes from Losing Tickets: Improve Invariant Learning by Exploring Variant Parameters for Out-of-Distribution Generalization. 456-474
- Jingjing Ren, Haoyu Chen, Tian Ye, Hongtao Wu, Lei Zhu:
Triplane-Smoothed Video Dehazing with CLIP-Enhanced Generalization. 475-488
- Hanrong Shi, Lin Li, Jun Xiao, Yueting Zhuang, Long Chen:
From Easy to Hard: Learning Curricular Shape-Aware Features for Robust Panoptic Scene Graph Generation. 489-508
- Ming Li, Pan Zhou, Jia-Wei Liu, Jussi Keppo, Min Lin, Shuicheng Yan, Xiangyu Xu:
Correction: Instant3D: Instant Text-to-3D Generation. 509
Volume 133, Number 2, February 2025
- Chen Xu, Yuhan Zhu, Haocheng Shen, Boheng Chen, Yixuan Liao, Xiaoxin Chen, Limin Wang:
Progressive Visual Prompt Learning with Contrastive Feature Re-formation. 511-526
- Luigi Riz, Cristiano Saltori, Yiming Wang, Elisa Ricci, Fabio Poiesi:
Novel Class Discovery Meets Foundation Models for 3D Semantic Segmentation. 527-548
- Daehwan Kim, Kwangrok Ryoo, Hansang Cho, Seungryong Kim:
SplitNet: Learnable Clean-Noisy Label Splitting for Learning with Noisy Labels. 549-566
- Chang Liu, Yinpeng Dong, Wenzhao Xiang, Xiao Yang, Hang Su, Jun Zhu, Yuefeng Chen, Yuan He, Hui Xue, Shibao Zheng:
A Comprehensive Study on Robustness of Image Classification Models: Benchmarking and Rethinking. 567-589
- Shiyun Mao, Ruolin Chen, Huibin Li:
Weighted Joint Distribution Optimal Transport Based Domain Adaptation for Cross-Scenario Face Anti-Spoofing. 590-610
- Xingxing Zuo, Pouya Samangouei, Yunwen Zhou, Yan Di, Mingyang Li:
FMGS: Foundation Model Embedded 3D Gaussian Splatting for Holistic 3D Scene Understanding. 611-627
- Zhimin Sun, Shen Chen, Taiping Yao, Ran Yi, Shouhong Ding, Lizhuang Ma:
Rethinking Open-World DeepFake Attribution with Multi-perspective Sensory Learning. 628-651
- Haoliang Sun, Qi Wei, Lei Feng, Yupeng Hu, Fan Liu, Hehe Fan, Yilong Yin:
Variational Rectification Inference for Learning with Noisy Labels. 652-671
- Junxian Duan, Yuang Ai, Jipeng Liu, Shenyuan Huang, Huaibo Huang, Jie Cao, Ran He:
Test-time Forgery Detection with Spatial-Frequency Prompt Learning. 672-687
- Bin Chen, Xuanyu Zhang, Shuai Liu, Yongbing Zhang, Jian Zhang:
Self-supervised Scalable Deep Compressed Sensing. 688-723
- Jun Nie, Yadan Luo, Shanshan Ye, Yonggang Zhang, Xinmei Tian, Zhen Fang:
Out-of-Distribution Detection with Virtual Outlier Smoothing. 724-741
- Hengcan Shi, Son Duy Dao, Jianfei Cai:
LLMFormer: Large Language Model for Open-Vocabulary Semantic Segmentation. 742-759
- Yang Liu, Xinlong Wang, Muzhi Zhu, Yue Cao, Tiejun Huang, Chunhua Shen:
Masked Channel Modeling for Bootstrapping Visual Pre-training. 760-780
- Oriane Siméoni, Éloi Zablocki, Spyros Gidaris, Gilles Puy, Patrick Pérez:
Unsupervised Object Localization in the Era of Self-Supervised ViTs: A Survey. 781-808
- Yuanye Liu, Renwei Dian, Shutao Li:
Low-Rank Transformer for High-Resolution Hyperspectral Computational Imaging. 809-824
- Yuhang Zang, Wei Li, Jun Han, Kaiyang Zhou, Chen Change Loy:
Contextual Object Detection with Multimodal Large Language Models. 825-843
- Wenyu Zhang, Li Shen, Chuan-Sheng Foo:
Source-Free Domain Adaptation Guided by Vision and Vision-Language Pre-training. 844-866
- Huanyu He, Weiyao Lin, Yuang Zhang, Tianyao He, Yuxi Li, Jianguo Li:
Toward Accurate and Robust Pedestrian Detection via Variational Inference. 867-889
- Qiang Qi, Zhenyu Qiu, Yan Yan, Yang Lu, Hanzi Wang:
IMC-Det: Intra-Inter Modality Contrastive Learning for Video Object Detection. 890-909
- Muhammad Atif Butt, Hassan Ali, Adnan Qayyum, Waqas Sultani, Ala I. Al-Fuqaha, Junaid Qadir:
R2S100K: Road-Region Segmentation Dataset for Semi-supervised Autonomous Driving in the Wild. 910-928
- Lin Li, Jianing Qiu, Michael W. Spratling:
AROID: Improving Adversarial Robustness Through Online Instance-Wise Data Augmentation. 929-950
- Tao Wang, Li Yuan, Xinchao Wang, Jiashi Feng:
Learning Box Regression and Mask Segmentation Under Long-Tailed Distribution with Gradient Transfusing. 951-967
- Xu Zhang, Zhe Chen, Jing Zhang, Tongliang Liu, Dacheng Tao:
Learning General and Specific Embedding with Transformer for Few-Shot Object Detection. 968-984
- Zhun Zhong, Hong Liu, Yin Cui, Shin'ichi Satoh, Nicu Sebe, Ming-Hsuan Yang:
Guest Editorial: Special Issue on Open-World Visual Recognition. 985-988
- Kaiduo Zhang, Muyi Sun, Jianxin Sun, Kunbo Zhang, Zhenan Sun, Tieniu Tan:
Correction: Open-Vocabulary Text-Driven Human Image Generation. 989
Volume 133, Number 3, March 2025
- Zhihong Zhang, Runzhao Yang, Jinli Suo, Yuxiao Cheng, Qionghai Dai:
Lightweight High-Speed Photography Built on Coded Exposure and Implicit Neural Representation of Videos. 991-1011
- Da-Wei Zhou, Zi-Wen Cai, Han-Jia Ye, De-Chuan Zhan, Ziwei Liu:
Revisiting Class-Incremental Learning with Pre-Trained Models: Generalizability and Adaptivity are All You Need. 1012-1032
- Denis Huseljic, Marek Herde, Paul Hahn, Mehmet Muejde, Bernhard Sick:
Systematic Evaluation of Uncertainty Calibration in Pretrained Object Detectors. 1033-1047
- Pengchong Qiao, Yu Wang, Chang Liu, Lei Shang, Baigui Sun, Zhennan Wang, Xiawu Zheng, Rongrong Ji, Jie Chen:
Adaptive Fuzzy Positive Learning for Annotation-Scarce Semantic Segmentation. 1048-1066
- Ke Sun, Shen Chen, Taiping Yao, Xiaoshuai Sun, Shouhong Ding, Rongrong Ji:
Continual Face Forgery Detection via Historical Distribution Preserving. 1067-1084
- Lianghui Zhu, Xinggang Wang, Jiapei Feng, Tianheng Cheng, Yingyue Li, Bo Jiang, Dingwen Zhang, Junwei Han:
WeakCLIP: Adapting CLIP for Weakly-Supervised Semantic Segmentation. 1085-1105
- Zixin Wang, Yadan Luo, Liang Zheng, Zhuoxiao Chen, Sen Wang, Zi Huang:
In Search of Lost Online Test-Time Adaptation: A Survey. 1106-1139
- Haohao Hu, Tianyu Han, Yuerong Wang, Wanjun Zhong, Jingwei Yue, Peng Zan:
Hierarchical Active Learning for Low-Altitude Drone-View Object Detection. 1140-1152
- Anirudh Srinivasan Chakravarthy, Meghana Reddy Ganesina, Peiyun Hu, Laura Leal-Taixé, Shu Kong, Deva Ramanan, Aljosa Osep:
Lidar Panoptic Segmentation in an Open World. 1153-1174
- Guangxuan Xiao, Tianwei Yin, William T. Freeman, Frédo Durand, Song Han:
FastComposer: Tuning-Free Multi-subject Image Generation with Localized Attention. 1175-1194
- Zhen Cheng, Fei Zhu, Xu-Yao Zhang, Chenglin Liu:
Breaking the Limits of Reliable Prediction via Generated Data. 1195-1221
- Shuai Zhao, Linchao Zhu, Xiaohan Wang, Yi Yang:
Slimmable Networks for Contrastive Self-supervised Learning. 1222-1237
- Shuai Jia, Chao Ma, Yibing Song, Xiaokang Yang, Ming-Hsuan Yang:
Robust Deep Object Tracking against Adversarial Attacks. 1238-1257
- Sifan Long, Zhen Zhao, Junkun Yuan, Zichang Tan, Jiangjiang Liu, Jingyuan Feng, Sheng-Sheng Wang, Jingdong Wang:
Mutual Prompt Leaning for Vision Language Models. 1258-1276
- Yaohui Wang, Xin Ma, Xinyuan Chen, Cunjian Chen, Antitza Dantcheva, Bo Dai, Yu Qiao:
LEO: Generative Latent Image Animator for Human Video Synthesis. 1277-1289
- Ruicong Liu, Haofei Wang, Feng Lu:
From Gaze Jitter to Domain Adaptation: Generalizing Gaze Estimation by Manipulating High-Frequency Components. 1290-1305
- Xianzhu Liu, Haozhe Xie, Shengping Zhang, Hongxun Yao, Rongrong Ji, Liqiang Nie, Dacheng Tao:
2D Semantic-Guided Semantic Scene Completion. 1306-1325
- Hongjun Wang, Sagar Vaze, Kai Han:
Dissecting Out-of-Distribution Detection and Open-Set Recognition: A Critical Analysis of Methods and Benchmarks. 1326-1351
- Bencheng Liao, Shaoyu Chen, Yunchi Zhang, Bo Jiang, Qian Zhang, Wenyu Liu, Chang Huang, Xinggang Wang:
MapTRv2: An End-to-End Framework for Online Vectorized HD Map Construction. 1352-1374
- Wenting Chen, Jie Liu, Tianming Liu, Yixuan Yuan:
Bi-VLGM: Bi-Level Class-Severity-Aware Vision-Language Graph Matching for Text Guided Medical Image Segmentation. 1375-1391
- Huan Liu, Zichang Tan, Qiang Chen, Yunchao Wei, Yao Zhao, Jingdong Wang:
Unified Frequency-Assisted Transformer Framework for Detecting and Grounding Multi-modal Manipulation. 1392-1409
- Yuxuan Li, Xiang Li, Yimain Dai, Qibin Hou, Li Liu, Yongxiang Liu, Ming-Ming Cheng, Jian Yang:
LSKNet: A Foundation Lightweight Backbone for Remote Sensing. 1410-1431
- Bastian Goldluecke:
Editor's Note: Special Issue on German Conference on Pattern Recognition (DAGM GCPR). 1432
- Editor's Note: Special Issue on Computer Vision Approaches for Animal Tracking and Modeling 2023. 1433
- Haoliang Sun, Qi Wei, Lei Feng, Yupeng Hu, Fan Liu, Hehe Fan, Yilong Yin:
Correction: Variational Rectification Inference for Learning with Noisy Labels. 1434
Volume 133, Number 4, April 2025
- Jiuniu Wang, Wenjia Xu, Qingzhong Wang, Antoni B. Chan:
Group-Based Distinctive Image Captioning with Memory Difference Encoding and Attention. 1435-1455
- Jiahao Xie, Wei Li, Xiangtai Li, Ziwei Liu, Yew Soon Ong, Chen Change Loy:
MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instance Segmentation. 1456-1475
- Haochen Wang, Yuchao Wang, Yujun Shen, Junsong Fan, Yuxi Wang, Zhaoxiang Zhang:
Using Unreliable Pseudo-Labels for Label-Efficient Semantic Segmentation. 1476-1498
- Ahmed R. El-gabri, Hussein A. Aly, Tarek Elsaid Ghoniemy, Mohamed A. Elshafey:
DLRA-Net: Deep Local Residual Attention Network with Contextual Refinement for Spectral Super-Resolution. 1499-1531
- Yang Yu, Rongrong Ni, Siyuan Yang, Yu Ni, Yao Zhao, Alex C. Kot:
Mining Generalized Multi-timescale Inconsistency for Detecting Deepfake Videos. 1532-1548
- Saihui Hou, Zengbin Wang, Man Zhang, Chunshui Cao, Xu Liu, Yongzhen Huang:
Edge-Oriented Adversarial Attack for Deep Gait Recognition. 1549-1563
- Patrick Wenzel, Nan Yang, Rui Wang, Niclas Zeller, Daniel Cremers:
4Seasons: Benchmarking Visual SLAM and Long-Term Localization for Autonomous Driving in Challenging Conditions. 1564-1586
- Mengyue Geng, Lizhi Wang, Lin Zhu, Wei Zhang, Ruiqin Xiong, Yonghong Tian:
Towards Ultra High-Speed Hyperspectral Imaging by Integrating Compressive and Neuromorphic Sampling. 1587-1610
- Sheng Xu, Yanjing Li, Chuanjian Liu, Baochang Zhang:
Learning Accurate Low-bit Quantization towards Efficient Computational Imaging. 1611-1643
- Jinxing Zhou, Xuyang Shen, Jianyuan Wang, Jiayi Zhang, Weixuan Sun, Jing Zhang, Stan Birchfield, Dan Guo, Lingpeng Kong, Meng Wang, Yiran Zhong:
Audio-Visual Segmentation with Semantics. 1644-1664
- Miaohui Wang, Zhuowei Xu, Mai Xu, Weisi Lin:
Blind Multimodal Quality Assessment of Low-Light Images. 1665-1688
- Rizhao Cai, Cecelia Soh, Zitong Yu, Haoliang Li, Wenhan Yang, Alex C. Kot:
Towards Data-Centric Face Anti-spoofing: Improving Cross-Domain Generalization via Physics-Based Data Synthesis. 1689-1710
- Zhiwen Shao, Hancheng Zhu, Yong Zhou, Xiang Xiang, Bing Liu, Rui Yao, Lizhuang Ma:
Facial Action Unit Detection by Adaptively Constraining Self-Attention and Causally Deconfounding Sample. 1711-1726
- Wenwen Qiang, Zeen Song, Ziyin Gu, Jiangmeng Li, Changwen Zheng, Fuchun Sun, Hui Xiong:
On the Generalization and Causal Explanation in Self-Supervised Learning. 1727-1754
- Arindam Sikdar, Yonghuai Liu, Siddhardha Kedarisetty, Yitian Zhao, Amr Ahmed, Ardhendu Behera:
Interweaving Insights: High-Order Feature Interaction for Fine-Grained Visual Recognition. 1755-1779
- Hanbo Bi, Yingchao Feng, Yongqiang Mao, Jianning Pei, Wenhui Diao, Hongqi Wang, Xian Sun:
AgMTR: Agent Mining Transformer for Few-Shot Segmentation in Remote Sensing. 1780-1807
- Zhongyang Zhu, Jie Tang:
CogCartoon: Towards Practical Story Visualization. 1808-1833
- Lucas Ventura, Cordelia Schmid, Gül Varol:
Learning Text-to-Video Retrieval from Image Captioning. 1834-1854
- Edoardo Mello Rella, Ajad Chhatkuli, Ender Konukoglu, Luc Van Gool:
Neural Vector Fields for Implicit Surface Representation and Inference. 1855-1878
- David Junhao Zhang, Jay Zhangjie Wu, Jia-Wei Liu, Rui Zhao, Lingmin Ran, Yuchao Gu, Difei Gao, Mike Zheng Shou:
Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation. 1879-1893
- Zhouxia Wang, Xintao Wang, Liangbin Xie, Zhongang Qi, Ying Shan, Wenping Wang, Ping Luo:
StyleAdapter: A Unified Stylized Image Generation Model. 1894-1911
- Jiyang Guan, Jian Liang, Yanbo Wang, Ran He:
Sample Correlation for Fingerprinting Deep Face Recognition. 1912-1926
- Guifang Zhang, Shijun Tan, Zhe Ji, Yuming Fang:
Dynamic Attention Vision-Language Transformer Network for Person Re-identification. 1927-1939
- Tianshan Liu, Kin-Man Lam, Bing-Kun Bao:
A Memory-Assisted Knowledge Transferring Framework with Curriculum Anticipation for Weakly Supervised Online Activity Detection. 1940-1963
- Hongbin Xu, Junduan Huang, Yuer Ma, Zifeng Li, Wenxiong Kang:
Improving 3D Finger Traits Recognition via Generalizable Neural Rendering. 1964-1998
- Emmanuel Hartman, Emery Pierson, Martin Bauer, Mohamed Daoudi, Nicolas Charon:
Basis Restricted Elastic Shape Analysis on the Space of Unregistered Surfaces. 1999-2024
- Jingzhi Li, Changjiang Luo, Hua Zhang, Yang Cao, Xin Liao, Xiaochun Cao:
Anti-Fake Vaccine: Safeguarding Privacy Against Face Swapping via Visual-Semantic Dual Degradation. 2025-2043
- Tao Zhou, Qi Ye, Wenhan Luo, Haizhou Ran, Zhiguo Shi, Jiming Chen:
APPTracker+: Displacement Uncertainty for Occlusion Handling in Low-Frame-Rate Multiple Object Tracking. 2044-2069
- Tianyao He, Huabin Liu, Zelin Ni, Yuxi Li, Xiao Ma, Cheng Zhong, Yang Zhang, Yingxue Wang, Weiyao Lin:
Achieving Procedure-Aware Instructional Video Correlation Learning Under Weak Supervision from a Collaborative Perspective. 2070-2095
- Huimin Ma, Sheng Yi, Shijie Chen, Jiansheng Chen, Yu Wang:
Few Annotated Pixels and Point Cloud Based Weakly Supervised Semantic Segmentation of Driving Scenes. 2096-2110
- Qing Guo, Hua Qi, Jingyang Sun, Felix Juefei-Xu, Lei Ma, Di Lin, Wei Feng, Song Wang:
EfficientDeRain+: Learning Uncertainty-Aware Filtering via RainMix Augmentation for High-Efficiency Deraining. 2111-2135
- Yunhua Zhang, Hazel Doughty, Cees G. M. Snoek:
Day2Dark: Pseudo-Supervised Activity Recognition Beyond Silent Daylight. 2136-2157
- Yen-Lung Lai, Xingbo Dong, Zhe Jin, Wei Jia, Massimo Tistarelli, Xuejun Li:
Rethinking Contemporary Deep Learning Techniques for Error Correction in Biometric Data. 2158-2175
- Yukang Zhang, Yan Yan, Yang Lu, Hanzi Wang:
Adaptive Middle Modality Alignment Learning for Visible-Infrared Person Re-identification. 2176-2196
- Abdullah Hamdi, Faisal AlZahrani, Silvio Giancola, Bernard Ghanem:
MVTN: Learning Multi-view Transformations for 3D Understanding. 2197-2226
- Fangrui Zhu, Yiming Xie, Weidi Xie, Huaizu Jiang:
Diagnosing Human-Object Interaction Detectors. 2227-2244
- Sicheng Zhao, Huizai Yao, Chuang Lin, Yue Gao, Guiguang Ding:
Correction: Multi-source-free Domain Adaptive Object Detection. 2245
- Ke Sun, Shen Chen, Taiping Yao, Xiaoshuai Sun, Shouhong Ding, Rongrong Ji:
Correction: Continual Face Forgery Detection via Historical Distribution Preserving. 2246
