


default search action
35th BMVC 2024: Glasgow, UK
- 35th British Machine Vision Conference, BMVC 2024, Glasgow, UK, November 25-28, 2024. BMVA Press 2024

- Hansol Kim, Hoyeol Choi, Youngjun Kwak:

Federated Learning for Face Recognition via Intra-subject Self-supervised Learning. - Alexey Kravets, Vinay P. Namboodiri:

CLIP Adaptation by Intra-Modal Overlap Reduction. - Zekun Zhang, Vu Quang Truong, Minh Hoai:

Efficiency-preserving Scene-adaptive Object Detection. - Jiayang Ao, Qiuhong Ke, Krista A. Ehinger:

Sequential Amodal Segmentation via Cumulative Occlusion Learning. - Kodai Kawamura, Shunya Yamagami, Go Irie:

Region-based Entropy Separation for One-shot Test-Time Adaptation. - Kim Yu-Ji, Hyunwoo Ha, Kim Youwang, Jaeheung Surh, Hyowon Ha, Tae-Hyun Oh:

MeTTA: Single-View to 3D Textured Mesh Reconstruction with Test-Time Adaptation. - Dilith Jayakody, Thanuja D. Ambegoda:

Few-shot Multispectral Segmentation with Representations Generated by Reinforcement Learning. - Shreyas Singh, Aryan Garg, Kaushik Mitra:

HDRSplat: Gaussian Splatting for High Dynmaic Range 3D Scene Reconstruction from Raw Images. - Ban Chen, Xin Jin, Longhai Wu, Jie Chen, Ilhyun Cho, Cheul-Hee Hahm:

Alignment-aware Patch-level Routing for Dynamic Video Frame Interpolation. - Damian Sójka, Bartlomiej Twardowski, Tomasz Trzcinski, Sebastian Cygert:

AR-TTA: A Simple Method for Real-World Continual Test-Time Adaptation. - Jiawei Yao, Tong Wu, Xiaofeng Zhang:

Improving Depth Gradient Continuity in Transformers: A Comparative Study on Monocular Depth Estimation with CNN. - Shohei Tanaka, Hao Wang, Yoshitaka Ushiku:

SciPostLayout: A Dataset for Layout Analysis and Layout Generation of Scientific Posters. - Munish Monga, Sachin Kumar Giroh, Ankit Jha, Mainak Singha, Biplab Banerjee, Jocelyn Chanussot:

COSMo: CLIP Talks on Open-Set Multi-Target Domain Adaptation. - Cristian Sbrolli, Matteo Matteucci:

No Captions, No Problem: Captionless 3D-CLIP Alignment with Hard Negatives via CLIP Knowledge and LLMs. - Shaoyu Wang, Changze Zhou, Bolin Song, Yiyang Wang:

Self-Supervised Real-World Denoising by Jointly Learning Visible and Invisible Noise. - Jack R. Saunders, Vinay P. Namboodiri:

TalkLoRA: Low-Rank Adaptation for Speech-Driven Animation. - Zhihan Cai, Kailu Wu, Dapeng Cao, Feng Chen, Kaisheng Ma:

DRAFT: Direct Radiance Fields Editing with Composable Operations. - Ryota Ishizaki, Shunya Yamagami, Yuta Goto, Go Irie:

Linear Calibration Approach to Knowledge-free Group Robust Classification. - Haoyu Zhao, Xingyue Zhao, Lingting Zhu, Weixi Zheng, Yongchao Xu:

HFGS: 4D Gaussian Splatting with Emphasis on Spatial and Temporal High-Frequency Components for Endoscopic Scene Reconstruction. - Minghong Duan, Linhao Qu, Shaolei Liu, Manning Wang:

Local Implicit Wavelet Transformer for Arbitrary-Scale Super-Resolution. - Matthew Lee, Felix John Samuel Bragman, Ricardo Sanchez-Matilla, Imanol Luengo, Danail Stoyanov:

Spatial-Temporal NAS for Fast Surgical Segmentation. - Jian Gao, Niall McLaughlin, Joanna Sara Valson, Neil Anderson, Ruth F. Hunter:

Learning to Segment Publicly Accessible Green Spaces with Visual and Semantic Data. - Aditya Nalgunda Ganesh, Gowri Srinivasa:

D³Nav: Data-Driven Driving Agents for Autonomous Vehicles in Unstructured Traffic. - Weixin Xu:

FFR-UNet: Feature Filter-Refinement UNet for Medical Image Segmentation. - Haoting He, Yaochen Li, Yutong Wang, Gaojie Li, Wei Guo, Runlin Zou:

Group Activity Recognition via Spatio-Temporal Reasoning of Key Instances. - Amin Ranem, John Kalkhof, Anirban Mukhopadhyay:

NCA-Morph: Medical Image Registration with Neural Cellular Automata. - Babak Ehteshami Bejnordi, Gaurav Kumar, Amelie Royer, Christos Louizos, Tijmen Blankevoort, Mohsen Ghafoorian:

InterroGate: Learning to Share, Specialize, and Prune Representations for Multi-task Learning. - Sungmin Kang, Jaeha Song, Jihie Kim:

Advancing Medical Image Segmentation: Morphology-Driven Learning with Diffusion Transformer. - Pauline Bourigault, Emmanuelle Bourigault, Danilo P. Mandic:

Multi-Modal Information Bottleneck Attribution with Cross-Attention Guidance. - Eman Ali, Muhammad Haris Khan:

Noise-Tolerant Few-Shot Unsupervised Adapter for Vision-Language Models. - Alexander D. J. Taylor, Jonathan James Morrison, Phillip Tregidgo, Neill D. F. Campbell:

Advancing Anomaly Detection: The IDW dataset and MC algorithm. - Yeongtak Oh, Jooyoung Choi, Yongsung Kim, Minjun Park, Chaehun Shin, Sungroh Yoon:

ControlDreamer: Blending Geometry and Style in Text-to-3D. - Yongseon Yoo, Seonggyu Kim, Jong-Min Lee:

SagaGAN: Style Applied using Gram matrix Attribution based on StarGAN v2. - Yiheng Xiong, Angela Dai:

PT43D: A Probabilistic Transformer for Generating 3D Shapes from Single Highly-Ambiguous RGB Images. - Tae-Min Choi, Inug Yoon, Jong-Hwan Kim, Juyoun Park:

Textual Attention RPN for Open-Vocabulary Object Detection. - Zhangliang Sun, Hui Zhang:

Painterly Image Harmonization via Bi-Transformation with Dynamic Kernels. - Qiaoqiao Wei, Hui Zhang, Jun-Hai Yong:

Interactive Image Segmentation with Temporal Information Augmented. - Donghao Zhou, Jialin Li, Jinpeng Li, Jiancheng Huang, Qiang Nie, Yong Liu, Bin-Bin Gao, Qiong Wang, Pheng-Ann Heng, Guangyong Chen:

Distribution-Aware Calibration for Object Detection with Noisy Bounding Boxes. - Rui Gong, Martin Danelljan, Han Sun, Julio Delgado Mangas, Nikolay Marin, Luc Van Gool:

Prompting Diffusion Representations for Cross-Domain Semantic Segmentation. - Sudip Das, Kaixin Xu, Nushrat Hussain, Ziyuan Zhao, Arindam Das, Weisi Lin, Ujjwal Bhattacharya:

MMPrune4U: Regularizing Multimodal Feature Distortion in Weight Pruning for Deep Neural Network Compression. - Ziqiang Dang, Tianxing Fan, Boming Zhao, Xujie Shen, Lei Wang, Guofeng Zhang, Zhaopeng Cui:

MoManifold: Learning to Measure 3D Human Motion via Decoupled Joint Acceleration Manifolds. - Maximilian Krahn, Michele Sasdelli, Frances Fengyi Yang, Vladislav Golyanik, Juho Kannala, Tat-Jun Chin, Tolga Birdal:

Projected Stochastic Gradient Descent with Quantum Annealed Binary Gradients. - Hiya Roy, Björn Stenger:

Text Removal In E-Commerce Images: A Comparison Of Inpainting Methods. - SeokHwan Oh, Guil Jung, Myeong-Gee Kim, Sang-Yun Kim, Young-Min Kim, Hyeon-Jik Lee, Hyuksool Kwon, Hyeon-Min Bae:

Key-point Guided Deformable Image Manipulation Using Diffusion Model. - Chenhao Wang, Xiaopeng Hong, Zhiheng Ma, Yupeng Wei, Yabin Wang, Xiaopeng Fan:

Multi-modal Crowd Counting via Modal Emulation. - Renwu Li, Wenjing Ke, Dong Li, Lu Tian, Emad Barsoum:

MonoGS++: Fast and Accurate Monocular RGB Gaussian SLAM. - Yusuke Oumi, Yuto Shibata, Go Irie, Akisato Kimura, Yoshimitsu Aoki, Mariko Isogawa:

Acoustic-based 3D Human Pose Estimation Robust to Human Position. - Joaquim Comas, Antònia Alomar, Adria Ruiz, Federico Sukno:

PhysFlow: Skin tone transfer for remote heart rate estimation through conditional normalizing flows. - Cho-Ying Wu, Quankai Gao, Chin-Cheng Hsu, Te-Lin Wu, Jing-Wen Chen, Ulrich Neumann:

InSpaceType: Dataset and Benchmark for Reconsidering Cross-Space Type Performance in Indoor Monocular Depth. - Junho Lee, Jeongwoo Shin, Seung Woo Ko, Seongsu Ha, Joonseok Lee:

Scalable Frame Sampling for Video Classification: A Semi-Optimal Policy Approach with Reduced Search Space. - Ziyu Yao:

Recovering Global Data Distribution Locally in Federated Learning. - Francesco Di Salvo, David Tafler, Sebastian Doerrich, Christian Ledig:

Privacy-preserving datasets by capturing feature distributions with Conditional VAEs. - Angel Villar-Corrales, Moritz Austermann, Sven Behnke:

MCDS-VSS: Moving Camera Dynamic Scene Video Semantic Segmentation by Filtering with Self-Supervised Geometry and Motion. - Evgeny Tsykunov, Wonju Lee, Minje Park:

AISE: Adaptive Input Sampling for Explanation of Black-box Models. - Ruiqi Mao, Rongxin Cui:

Retinex-Inspired Cooperative Game Through Multi-Level Feature Fusion for Robust, Universal Image Enhancement. - Yuyang Zhao, Na Zhao, Gim Hee Lee:

Synthetic-to-Real Domain Generalized Semantic Segmentation for 3D Indoor Point Clouds. - Yibin Wang, Yuchao Feng, Jianwei Zheng:

Learning Object Placement via Convolution Scoring Attention. - Yunsong Wang, Na Zhao, Gim Hee Lee:

Syn-to-Real Unsupervised Domain Adaptation for Indoor 3D Object Detection. - Xiaoyue Mi, Fan Tang, Yepeng Weng, Danding Wang, Juan Cao, Sheng Tang, Peng Li, Yang Liu:

Topology-preserving Adversarial Training for Alleviating Natural Accuracy Degradation. - Sai Tanmay Reddy Chakkera, Aggelina Chatziagapi, Dimitris Samaras:

JEAN: Joint Expression and Audio-guided NeRF-based Talking Face Generation. - Xuhan Zhu, Yifei Xing, Ruiping Wang, Yaowei Wang, Xiangyuan Lan:

Hierarchical Prompt Learning for Scene Graph Generation. - Róisín Luo, Alexandru Drimbarean, James McDermott, Colm O'Riordan:

Reclaiming Residual Knowledge: A Novel Paradigm to Low-Bit Quantization. - Zeyu Zhang, Yiran Wang, Biao Wu, Shuo Chen, Zhiyuan Zhang, Shiya Huang, Wenbo Zhang, Meng Fang, Ling Chen, Yang Zhao:

Motion Avatar: Generate Human and Animal Avatars with Arbitrary Motion. - Peichao Li, Oscar MacCormac, Jonathan Shapey, Tom Vercauteren:

A self-supervised and adversarial approach to hyperspectral demosaicking and RGB reconstruction in surgical imaging. - Seung Woo Ko, Joopyo Hong, Suyoung Kim, Seungjai Bang, Sungzoon Cho, Nojun Kwak, Hyung-Sin Kim, Joonseok Lee:

A Revisit to the Decoder for Camouflaged Object Detection. - Soumitri Chattopadhyay, Sanket Biswas, Emanuele Vivoli, Josep Lladós:

Towards Generative Class Prompt Learning for Fine-grained Visual Recognition. - Kang Zhang, Xinnian Guo:

Infrared and Visible Image Fusion Using Multi-level Adaptive Fractional Differential. - Shizhen Li, Jingcheng Liu, Jianwu Fang, Dezheng Gao, Jianru Xue:

S³-Match: Common-View Aligned Image Matching via Self-Supervised Keypoint Selection. - Huan Bao, Kaimin Wei, Yao Chen, Hanting Hou, Jinpeng Chen, Yongdong Wu:

From Black-box to Label-only: a Plug-and-Play Attack Network for Model Inversion. - Tomás Berriel Martins, Javier Civera:

Feature Splatting for Better Novel View Synthesis with Low Overlap. - Kieran Saunders, Luis J. Manso, George Vogiatzis:

BaseBoostDepth: Exploiting Larger Baselines For Self-supervised Monocular Depth Estimation. - Zhi Cai, Songtao Liu, Guodong Wang, Zeming Li, Zheng Ge, Xiangyu Zhang, Di Huang:

Align-DETR: Enhancing End-to-end Object Detection with Aligned Loss. - Luyao Tang, Yuxuan Yuan, Chaoqi Chen, Xinghao Ding, Yue Huang:

Mixstyle-Entropy: Whole Process Domain Generalization with Causal Intervention and Perturbation. - Theodoros Kouzelis, Emmanouil Plitsis, Mihalis Nicolaou, Yannis Panagakis:

Enabling Local Editing in Diffusion Models by Joint and Individual Component Analysis. - Krzysztof Baron-Lis, Matthias Rottmann, Annika Mütze, Sina Honari, Pascal Fua, Mathieu Salzmann:

AttEntropy: On the Generalization Ability of Supervised Semantic Segmentation Transformers to New Objects in New Domains. - Masane Fuchi, Tomohiro Takagi:

Erasing Concepts from Text-to-Image Diffusion Models with Few-shot Unlearning. - Maxim Khomiakov, Michael Riis Andersen, Jes Frellsen:

GeoFormer: A Multi-Polygon Segmentation Transformer. - Avideep Mukherjee, Soumya Banerjee, Piyush Rai, Vinay P. Namboodiri:

RISSOLE: Parameter-efficient Diffusion Models via Block-wise Generation and Retrieval-Guidance. - João P. C. Bertoldo, Dick Ameln, Ashwin Vaidya, Samet Akcay:

AUPIMO: Redefining Anomaly Localization Benchmarks with High Speed and Low Tolerance. - Zhanzhong Pang, Fadime Sener, Shrinivas Ramasubramanian, Angela Yao:

Cost-Sensitive Learning for Long-Tailed Temporal Action Segmentation. - Ziyang Ren, Ping Wei, Haowen Tang, Huan Li, Jin Yang:

Learning Scene-Goal-Aware Motion Representation for Trajectory Prediction. - Kensuke Taguchi, Takehiko Kawai, Wataru Imaeda, Hironobu Fujiyoshi:

SAM Helps SSL: Mask-guided Attention Bias for Self-supervised Learning. - Yamin Mao, Zhihua Liu, Weiming Li, SoonYong Cho, Qiang Wang, Xiaoshuai Hao:

Enhancing 3D Hand Pose Estimation via Dense Ordinal Regression Network. - Yang Sui, Zhuohang Li, Ding Ding, Xiang Pan, Xiaozhong Xu, Shan Liu, Zhenzhong Chen:

Transferable Learned Image Compression-Resistant Adversarial Perturbations. - Mengjiao Zhao, Mengting Ma, Xiangdong Li, Ao Gao, Siyang Song, Wei Zhang:

Deep Unfolding Network with Spatial-spectral Perception Enhanced for Pan-sharpening. - Xulong Bai, Hainan Cui, Shuhan Shen:

IncreLM: Incremental 3D Line Mapping. - Jordan Lam:

Motion Tracking with Rotated Bounding Boxes on Overhead Fisheye Imagery. - Xin Feng, Junxian Zeng, Siping Wang, Zhenwei He:

Toward Highly Efficient Semantic-Guided Machine Vision for Low-Light Object Detection. - Danai Triantafyllidou, Sarah Parisot, Ales Leonardis, Steven McDonagh:

Improving Object Detection via Local-global Contrastive Learning. - Heejoon Moon, Jongwoo Lee, Jeong-Gon Kim, Je Hyeong Hong:

Depth-Guided Privacy-Preserving Visual Localization Using 3D Sphere Clouds. - Shizhan Gong, Jingwei Zhang, Qi Dou, Farzan Farnia:

A Super-pixel-based Approach to the Stable Interpretation of Neural Networks. - Anandavardhan Hegde, Sudha Velusamy, Narayan Kothari, Aman Bahuguna, Apnesh Rawat, Hema Sathiamurthy, Ankit Raja:

PawFACS: Leveraging Semi-Supervised Learning for Pet Facial Action Recognition. - Qiao Xiao, Boqian Wu, Lu Yin, Christopher Neil Gadzinski, Tianjin Huang, Mykola Pechenizkiy, Decebal Constantin Mocanu:

Are Sparse Neural Networks Better Hard Sample Learners? - Shuang Chen, Amir Atapour-Abarghouei, Haozheng Zhang, Hubert P. H. Shum:

MxT: Mamba x Transformer for Image Inpainting. - Kuluhan Binici, Weiming Wu, Tulika Mitra:

Generalizing Teacher Networks for Effective Knowledge Distillation Across Student Architectures. - Mihnea Bogdan Jurca, Remco Royen, Ion Giosan, Adrian Munteanu:

RT-GS2: Real-Time Generalizable Semantic Segmentation for 3D Gaussian Representations of Radiance Fields. - Kirill Vishniakov, Eric P. Xing, Zhiqiang Shen:

MixMask: Revisiting Masking Strategy for Siamese ConvNets. - Marian Longa, João F. Henriques:

Interpretable Representation Learning from Videos using Nonlinear Priors. - Hasib Zunair, Abdessamad Ben Hamza:

PEEKABOO: Hiding Parts of an Image for Unsupervised Object Localization. - Ziteng Cui, Lin Gu, Tatsuya Harada:

Discovering an Image-Adaptive Coordinate System for Photography Processing. - Yu Gao, Xuchong Qiu, Zihan Ye:

Effective Message Hiding with Order-Preserving Mechanisms. - Zicheng Pan, Xiaohan Yu, Yongsheng Gao:

EIANet: A Novel Domain Adaptation Approach to Maximize Class Distinction with Neural Collapse Principles. - Ying Zhang, Yuezun Li, Bo Peng, Jiaran Zhou, Huiyu Zhou, Junyu Dong:

Mumpy: Multilateral Temporal-view Pyramid Transformer for Video Inpainting Detection. - Qing En, Yuhong Guo:

Annotation by Clicks: A Point-Supervised Contrastive Variance Method for Medical Semantic Segmentation. - Myeong-Yeon Yi, DongJae Lee, Naeun Ko, Yonghyun Jeong, Sang-goo Lee, Seunggyu Chang:

Complete the Feature Space: Diffusion-Based Fictional ID Generation for Face Recognition. - Dino Ienco, Cássio Fraga Dantas:

DisCoM-KD: Cross-Modal Knowledge Distillation via Disentanglement Representation and Adversarial Learning. - Ameera Ali Bawazir, Kebin Wu, Wenbin Li:

Uni-Mlip: Unified Self-Supervision for Medical Vision Language Pre-training. - Jiyao Gao, Chengxin He, Lei Duan, Jie Zuo:

Towards Better Zero-Shot Anomaly Detection under Distribution Shift with CLIP. - Hao Chen, Jiaze Wang, Ziyu Guo, Jinpeng Li, Donghao Zhou, Bian Wu, Chenyong Guan, Guangyong Chen, Pheng-Ann Heng:

SignVTCL: Multi-Modal Continuous Sign Language Recognition Enhanced by Visual-Textual Contrastive Learning. - Yangxiang Zhang, Yuezun Li, Ao Luo, Jiaran Zhou, Junyu Dong:

FastForensics: Efficient Two-Stream Design for Real-Time Image Manipulation Detection. - Yuxiang An, Dongnan Liu, Weidong Cai:

Unsupervised Domain Adaptation for Tubular Structure Segmentation Across Different Anatomical Sources. - Ivan Sabolic, Ivan Grubisic, Sinisa Segvic:

Backdoor Defense through Self-Supervised and Generative Learning. - Raquel Vidaurre, Elena Garces, Dan Casas:

DiffusedWrinkles: A Diffusion-Based Model for Data-Driven Garment Animation. - Muhammad Salman Ali, Maryam Qamar, Sung-Ho Bae, Enzo Tartaglione:

Trimming the Fat: Efficient Compression of 3D Gaussian Splats through Pruning. - Debjyoti Mondal, Rahul Mishra, Chandan Kumar Pandey:

Seg-HGNN: Unsupervised and Light-Weight Image Segmentation with Hyperbolic Graph Neural Networks. - Nadezda Kirillova, Muhammad Jehanzeb Mirza, Horst Bischof, Horst Possegger:

Into the Fog: Evaluating Robustness of Multiple Object Tracking. - Xie Yu, Wentao Zhang:

Anchor-Based Masked Generative Distillation for Pixel-Level Prediction Tasks. - Kai Pan, Yapeng Tian, Yinhe Han, Yiming Gan:

Benchmarking and Optimizing Federated Learning with Hardware-related Metrics. - Richard Franklin, Jiawei Yao, Deyang Zhong, Qi Qian, Juhua Hu:

Text-Guided Mixup Towards Long-Tailed Image Categorization. - Wei Zhao, Xiao-Jun Zeng, Chengdong Shi, Ching-Hsun Tseng, Yue Chang:

A Novel Divide and Merge Approach for Improved Classification of Functional Data. - Zane Durante, Robathan Harries, Edward Vendrow, Zelun Luo, Yuta Kyuragi, Kazuki Kozuka, Li Fei-Fei, Ehsan Adeli:

Few-Shot Classification of Interactive Activities of Daily Living (InteractADL). - Aditya R. Bhattacharya, Debanjan Goswami, Shayok Chakraborty:

ACIL: Active Class Incremental Learning for Image Classification. - Sachin Chhabra, Hemanth Venkateswara, Baoxin Li:

PatchRot: Self-Supervised Training of Vision Transformers by Rotation Prediction. - Sachin Chhabra, Hemanth Venkateswara, Baoxin Li:

Label Smoothing++: Enhanced Label Regularization for Training Neural Networks. - Wei Ye, Xinan He, Feng Ding:

Decoupling Forgery Semantics for Generalizable Deepfake Detection. - Adam Goodge, Bryan Hooi, Wee Siong Ng:

When Text and Images Don't Mix: Bias-Correcting Language-Image Similarity Scores for Anomaly Detection. - Sree Rama Vamsidhar S., Gorthi Rama Krishna Sai Subrahmanyam:

NSSR-DIL: Null-Shot Image Super-Resolution Using Deep Identity Learning. - Pankhi Kashyap, Pavni Tandon, Sunny Gupta, Abhishek Tiwari, Ritwik Kulkarni, Kshitij Sharad Jadhav:

Taming the Tail: Leveraging Asymmetric Loss and Padé Approximation to Overcome Long-Tailed Class Imbalance. - Yichen Zhou, Teck Khim Ng:

Kernel Representation for Dynamic Networks. - Rameshwar Mishra, A. Venkata Subramanyam:

Layout Free Scene Graph to Image Generation. - Canran Li, Dongnan Liu, Weidong Cai:

Rethinking Domain Adaptive Optic Disc and Cup Segmentation in Fundus Image through Dynamic Diffusion Flow. - Khanh-Binh Nguyen, Chae Jung Park:

RETRO: Reusing teacher projection head for efficient embedding distillation on Lightweight Models via Self-supervised Learning. - Shuo Wang, Xieenlong, Jinda Lu, Jinghan Li, Yanbin Hao:

GLCM-Adapter: Global-Local Content Matching for Few-shot CLIP Adaptation. - Tuyen Tran, Thao Minh Le, Duy Hung Tran, Truyen Tran:

Unified Compositional Query Machine with Multimodal Consistency for Video-based Human Activity Recognition. - Hao Xu, Shengye Yan, Wei Zheng:

Lightweight Human Pose Estimation with Enhanced Knowledge Review. - Dinh Phu Tran, Dao Duy Hung, Daeyoung Kim:

Channel-Partitioned Windowed Attention And Frequency Learning for Single Image Super-Resolution. - Dongyoung Kim, Jeong-Gun Lee, WonSook Lee:

Separated and Independent Contrastive Learning on Labeled and Unlabeled Samples: Boosting Performance on Long-tail Semi-supervised Learning. - Tianwen Zhou, Qihao Duan, Zitong Yu:

Difflare: Removing Image Lens Flare with Latent Diffusion Models. - Loris Giulivi, Giacomo Boracchi:

Explaining Multi-modal Large Language Models by Analyzing their Vision Perception. - Dylan Auty, Roy Miles, Benedikt Kolbeinsson, Krystian Mikolajczyk:

Learning to Project for Cross-Task Knowledge Distillation. - Saining Zhang, Baijun Ye, Xiaoxue Chen, Yuantao Chen, Zongzheng Zhang, Cheng Peng, Yongliang Shi, Hao Zhao:

Drone-assisted Road Gaussian Splatting with Cross-view Uncertainty. - Andrey Palaev, Adil Khan, Syed M. Ahsan Kazmi:

LLM-guided Instance-level Image Manipulation with Diffusion U-Net Cross-Attention Maps. - Quoc-Huy Trinh, Hai-Dang Nguyen, Bao-Tram Nguyen Ngoc, Debesh Jha, Ulas Bagci, Minh-Triet Tran:

SAM-EG: Segment Anything Model with Egde Guidance framework for efficient Polyp Segmentation. - Zhuofeng Wu, Doehyung Lee, Zihua Liu, Kazunori Yoshizaki, Yusuke Monno, Masatoshi Okutomi:

Disparity Estimation Using a Quad-Pixel Sensor. - Sungeun Kim, Jongbin Ryu:

Unsupervised Hashing Network with Hyper Quantization Tree. - Ahmet Serdar Karadeniz, Dimitrios Mallis, Nesryne Mejri, Kseniya Cherenkova, Anis Kacem, Djamila Aouada:

DAVINCI: A Single-Stage Architecture for Constrained CAD Sketch Inference. - Shane Josias, Willie Brink:

Multimodal base distributions in conditional flow matching generative models. - Xinxu Lin, Mingxuan Liu, Kezhuo Liu, Hong Chen:

Spike-SLR: An Energy-efficient Parallel Spiking Transformer for Event-based Sign Language Recognition. - Haosen Yang, Deng Huang, Bin Wen, Jiannan Wu, Hongxun Yao, Yi Jiang, Xiatian Zhu, Zehuan Yuan:

MotionMAE: Self-supervised Video Representation Learning with Motion-Aware Masked Autoencoders. - Rui Yu, Runkai Zhao, Cong Nie, Heng Wang, Siyu Li, Songhao Zhu:

Future Does Matter: Boosting 3D Object Detection with Temporal Motion Estimation in Point Cloud Sequences. - Mohammed Talha Alam, Raza Imam, Mohsen Guizani, Fakhri Karray:

FLARE up your data: Diffusion-based Augmentation Method in Astronomical Imaging. - Xuhui Zhu, Feng Jiang, Jing Wen, Yi Wang, Qiang Gao:

Semantic Image Synthesis of Anime Characters Based on Conditional Generative Adversarial Networks. - Kehang Jia, Gaorui Zhang, Yixuan Yang, Guangwei Huang, Penghuan Wang, Cheng Cheng:

ML-2SN: A Hybrid Two-Stream System for Sitting Posture Detection. - Xu Dong, Xinran Liu, Wanqing Li, Anthony Adeyemi-Ejeye, Andrew Gilbert:

Interpretable Long-term Action Quality Assessment. - Dragos Costea, Alina Marcu, Marius Leordeanu:

A self-supervised cyclic neural-analytic approach for novel view synthesis and 3D reconstruction. - Sebastian Janampa, Marios Pattichis:

SOFI: Multi-Scale Deformable Transformer for Camera Calibration with Enhanced Line Queries. - Jochem Loedeman, Maarten C. Stol, Tengda Han, Yuki M. Asano:

Prompt Generation Networks for Input-Space Adaptation of Frozen Vision Transformers. - Li Li, Tanqiu Qiao, Hubert P. H. Shum, Toby P. Breckon:

TraIL-Det: Transformation-Invariant Local Feature Networks for 3D LiDAR Object Detection with Unsupervised Pre-Training. - Francesco Girlanda, Olga V. Demler, Bjoern H. Menze, Neda Davoudi:

Enhancing Cardiovascular Disease Prediction through Multi-Modal Self-Supervised Learning. - Liuyuan Wen:

Out-Of-Distribution Detection for Audio-visual Generalized Zero-Shot Learning: A General Framework. - Christian Fruhwirth-Reisinger, Wei Lin, Dusan Malic, Horst Bischof, Horst Possegger:

Vision-Language Guidance for LiDAR-based Unsupervised 3D Object Detection. - Linghong Yao, Denis Hadjivelichkov, Andromachi Maria Delfaki, Yuanchang Liu, Brooks Paige, Dimitrios Kanoulas:

Balancing Calibration and Performance: Stochastic Depth in Segmentation BNNs. - Shanlin Sun, Tung Le, Pooya Khosravi, Chenyu You, Kun Han, Haoyu Ma, Deying Kong, Xiangyi Yan, Xiaohui Xie:

Hybrid-CSR: Coupling Explicit and Implicit Reconstruction of Cortical Surface. - Anjun Hu, Jindong Gu, Francesco Pinto, Konstantinos Kamnitsas, Philip Torr:

As Firm As Their Foundations: Creating Transferable Adversarial Examples Across Downstream Tasks with CLIP. - Xiangyu Chen, Jing Liu, Ye Wang, Pu Perry Wang, Matthew Brand, Guanghui Wang, Toshiaki Koike-Akino:

SuperLoRA: Parameter-Efficient Unified Adaptation of Large Foundation Models. - Piotr Kluska, Florian Scheidegger, A. Cristiano I. Malossi, Enrique S. Quintana-Ortí:

Beyond Static and Dynamic Quantization - Hybrid Quantization of Vision Transformers. - Jiageng Zhu, Hanchen Xie, Jianhua Wu, Mohamed E. Hussein, Mahyar Khayatkhoei, Jiazhi Li, Wael AbdAlmageed:

Multi-Scope Representation Learning for Causal Relation Discovery with new Challenging Datasets. - Rong Liu, Rui Xu, Yue Hu, Meida Chen, Andrew Feng:

AtomGS: Atomizing Gaussian Splatting for High-Fidelity Radiance Field. - Antoine Montmaur, Nicolas Larue, Ngoc-Son Vu:

Neural Collapse Inspired Contrastive Continual Learning. - Inderjeet Singh, Roman Vainshtein, Alon Zolfi, Asaf Shabtai, Tu Bui, Jonathan Brokman, Omer Hofman, Fumiyoshi Kasahara, Kentaro Tsuji, Hisashi Kojima:

ATLANTIS: A Framework for Automated Targeted Language-guided Augmentation Training for Robust Image Search. - Jaehoon Cho, Minjung Yoo, Jini Yang, Sunok Kim:

A Prototype Unit for Image De-raining using Time-Lapse Data. - Yuanwei Li, Elizaveta Ivanova, Martins Bruveris:

FADE: Few-shot/zero-shot Anomaly Detection Engine using Large Vision-Language Model. - Changkang Li, Yalong Jiang:

VLAVAD: Vision-Language Models Assisted Unsupervised Video Anomaly Detection. - Yuantian Huang, Satoshi Iizuka, Kazuhiro Fukui:

Training-Free Zero-Shot Semantic Segmentation with LLM Refinement. - Susmija Jabbireddy, Davit Soselia, Max Ehrlich, Christopher A. Metzler, Amitabh Varshney:

VEMIC: View-aware Entropy model for Multi-view Image Compression. - Tatsuhiro Eguchi, Shumpei Takezaki, Mihoko Shimano, Takayuki Yagi, Ryoma Bise:

Guidance-base Diffusion Models for Improving Photoacoustic Image Quality. - Shihao Chen, Xiaobing Li, Keduo Yan, Yong Li, Dongxu Gao:

STPose: 6D object pose estimation network based on sparse attention and cross-layer connection. - Nathan Louis, Mahzad Khoshlessan, Jason J. Corso:

Measuring Physical Plausibility of 3D Human Poses Using Physics Simulation. - Ching-Yi Lai, Chiou-Ting Hsu, Chih-Chung Hsu, Chia-Wen Lin:

Prompt-guided Multi-modal contrastive learning for Cross-compression-rate Deepfake Detection. - Ziyi Cao, Shengye Yan, Wei Zheng:

The Attempt on Combining Three Talents by KD with Enhanced Boundary in Co-Salient Object Detection. - Yufei Gao, Bin Fu, Lei Shi, Chengming Liu, Yucheng Shi:

GLPI: A Global Layered Prompt Integration approach for Explicit Visual Prompt. - Yijie Li, Hewei Wang, Aggelos K. Katsaggelos:

CPDR: Towards Highly-Efficient Salient Object Detection via Crossed Post-decoder Refinement. - Amrijit Biswas, Md. Ismail Hossain, Mirza M. Lutfe Elahi, Ali Cheraghian, Fuad Rahman, Nabeel Mohammed, Shafin Rahman:

3D Point Cloud Network Pruning: When Some Weights Do not Matter. - Zhaowei Gao, Mingyang Song, Christopher Schroers, Yang Zhang:

Revitalizing Legacy Video Content: Deinterlacing with Bidirectional Information Propagation. - Yongchao Lin, Xiangdong Su, Yuhan Yang:

3D Blur Kernel on Gaussian Splatting. - Sam Titarsolej, Neil Cohn, Nanne van Noord:

Drawing Insights: Sequential Representation Learning in Comics. - Alireza Javanmardi, Alain Pagani, Didier Stricker:

G3FA: Geometry-guided GAN for Face Animation. - Gopi Raju Matta, Rahul Siddartha, Rongali Simhachala Venkata Girish, Sumit Sharma, Kaushik Mitra:

GN-FR: Generalizable Neural Radinace Fields for Flare Removal. - Christian Löwens, Thorben Funke, André Wagner, Alexandru Paul Condurache:

Unsupervised Point Cloud Registration with Self-Distillation. - Wenbo Xu, Li Zhang, Qiankun Li, Qi Wu, Lin Yuanbo Wu, Liu Liu:

ICAF-4: An Integrated Framework of Category-level Articulated Object Perception and Manipulation for Embodied Intelligence. - Jungmin Ha, Euihyun Yoon, Sungsik Kim, Jinkyu Kim, Jaekoo Lee:

Leveraging Inductive Bias in ViT for Medical Image Diagnosis. - Qingju Liu, Hyeongwoo Kim, Gaurav Bharaj:

Content and Style Aware Audio-Driven Facial Animation. - Monica Millunzi, Lorenzo Bonicelli, Angelo Porrello, Jacopo Credi, Petter N. Kolm, Simone Calderara:

May the Forgetting Be with You: Alternate Replay for Learning with Noisy Labels. - Hashmat Shadab Malik, Numan Saeed, Asif Hanif, Muzammal Naseer, Mohammad Yaqub, Salman Khan, Fahad Shahbaz Khan:

On Evaluating Adversarial Robustness of Volumetric Medical Segmentation Models. - Satoshi Kamiya, Kota Yamashita, Kazuhiro Hotta:

Boundary Contrastive Learning for Label-Efficient Medical Image Segmentation. - Niraj Prakash Kini, Ruey-Horng Shiue, Ryan Chandra, Wen-Hsiao Peng, Ching-Wen Ma, Jenq-Neng Hwang:

TransHuPR: Cross-View Fusion Transformer for Human Pose Estimation Using mmWave Radar. - Jayateja Kalla, Soma Biswas:

AggSS: An Aggregated Self-Supervised Approach for Class Incremental Learning. - Cheng Chen, Jiang Liu, Liaoyuan Zeng, Fang Duan, Sean McGrath, Tian Dan:

Spatio-Temporal Transformer with Rotary Position Embedding and Bone Priors for 3D Human Pose Estimation. - Marcella Astrid, Enjie Ghorbel, Djamila Aouada:

Detecting Audio-Visual Deepfakes with Fine-Grained Inconsistencies. - Xiaoxue Chen, Hao Zhao, Guyue Zhou, Ya-Qin Zhang:

Time-conditioned Illumination for Inverse Rendering of Outdoor Scenes. - Jan Niklas Kolf, Naser Damer, Fadi Boutros:

QUD: Unsupervised Knowledge Distillation for Deep Face Recognition. - Harry Walsh, Ben Saunders, Richard Bowden:

Sign Stitching: A Novel Approach to Sign Language Production. - Di Cheng, YingJie Shi, ShiXin Sun, JiaFu Zhang, WeiJing Wang, Yu Liu:

ControlEdit: A MultiModal Local Clothing Image Editing Method. - Victoria Porter, Richard Gault, Stephanie G. Craig, Jacqueline A. James:

Optimising Diffusion Models for Histopathology Image Synthesis. - Erol Ozgur, Mohammad Alkhatib, Youcef Mezouar, Adrien Bartoli:

Reconstructing Spheres by Fitting Planes. - Pushpendu Ghosh, Aniket Joshi, Soumyajit Chowdhury, Promod Yenigalla:

AutoDOM: Automated Dimension Overlay for Enhanced Measurement-Guidance. - Hongjing Niu, Hanting Li, Guoping Wu, Bin Li, Feng Zhao:

Rectifying Shortcut Learning through Cellular Differentiation in Deep Learning Neurons. - Srinivasa Rao Nandam, Sara Atito, Zhenhua Feng, Josef Kittler, Muhammad Awais:

Pseudo Labelling for Enhanced Masked Auto Encoders. - Rajeev Ranjan Dwivedi, Priyadarshini Kumari, Vinod K. Kurmi:

CosFairNet: A Parameter-Space based Approach for Bias Free Learning. - Hongjing Niu, Qingyue Yang, Pengfei Xia, Wei Zhang, Bin Li, Feng Zhao:

Frequency Decomposition to Tap the Potential of Single Domain for Generalization. - Chunli Sun, Feng Zhao:

Task-Related Feature Enhancement Network for Neuronal Morphology Classification. - Valéry Dewil, Zhe Zheng, Arnaud Barral, Lara Raad, Nao Nicolas, Ioannis Cassagne, Jean-Michel Morel, Gabriele Facciolo, Bruno Galerne, Pablo Arias:

Adapting MIMO video restoration networks to low latency constraints. - Hoàng-Ân Lê, Paul Berg, Minh-Tan Pham:

Box for Mask and Mask for Box: weak losses for multi-task partially supervised learning. - Nicholas Moratelli, Davide Caffagni, Marcella Cornia, Lorenzo Baraldi, Rita Cucchiara:

Revisiting Image Captioning Training Paradigm via Direct CLIP-based Optimization. - Chenhongyi Yang, Zehui Chen, Miguel Espinosa, Linus Ericsson, Zhenyu Wang, Jiaming Liu, Elliot J. Crowley:

PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition. - Shijia Xu, Lin Zhao, Jialiang Tang, Guangyu Li, Chen Gong:

Open-World Semi-Supervised Learning under Compound Distribution Shifts. - Paul Berg, Björn Michele, Minh-Tan Pham, Laetitia Chapel, Nicolas Courty:

Horospherical Learning with Smart Prototypes. - Abu Taib Mohammed Shahjahan, Abdessamad Ben Hamza:

Flexible Graph Convolutional Network for 3D Human Pose Estimation. - Martin Ferianc, Hongxiang Fan, Miguel R. D. Rodrigues:

SAE: Single Architecture Ensemble Neural Networks. - Anja Delic, Matej Grcic, Sinisa Segvic:

Outlier detection by ensembling uncertainty with negative objectness. - Sina Ghorbani Kolahi, Seyed Kamal Chaharsooghi, Toktam Khatibi, Afshin Bozorgpour, Reza Azad, Moein Heidari, Ilker Hacihaliloglu, Dorit Merhof:

MSA2Net: Multi-scale Adaptive Attention-guided Network for Medical Image Segmentation. - Mona Ahmadian, Frank Guerin, Andrew Gilbert:

FILS: Self-Supervised Video Feature Prediction In Semantic Language Space. - Tamás Tófalvi, Bandó Kovács, Levente Hajder:

Calibration of 2D LiDAR sensors using cylindrical target. - Riya Verma, Sukhendu Das:

Multi-Scale Semantic Enrichment and Dual Angular Margin Contrast for Few-Shot Class Incremental Learning. - Hiroki Kobayashi, Naoki Murakami, Naoto Hiramatsu, Takahiro Suzuki, Manabu Hashimoto:

Anomaly Detection Based on Semi-Formula Driven Pre-training Dataset to Represent Subtle Difference and Anomaly Score. - Georgios Zampokas, Christos-Savvas Bouganis, Dimitrios Tzovaras:

Budget-aware Dynamic Spatially Adaptive Inference. - Yu Hsuan Hsieh, Shang-Hong Lai:

CSAD: Unsupervised Component Segmentation for Logical Anomaly Detection. - Sergio Sánchez Santiesteban, Muhammad Awais, Yi-Zhe Song, Josef Kittler:

Enhancing Radiology Report Generation: The Impact of Locally Grounded Vision and Language Training. - Dmitry Demidov, Abduragim Shtanchaev, Mihail Mihaylov, Mohammad Almansoori:

Extract More from Less: Efficient Fine-Grained Visual Recognition in Low-Data Regimes. - Emanuele Frascaroli, Aniello Panariello, Pietro Buzzega, Lorenzo Bonicelli, Angelo Porrello, Simone Calderara:

CLIP with Generative Latent Replay: a Strong Baseline for Incremental Learning. - Qing-Wen Yang, Kai-Wen Duan, Ting-Yi Lu, Kevin Lin, Cheng-Yen Yang, Lijuan Wang, Jenq-Neng Hwang, Shang-Hong Lai:

APTPose: Anatomy-aware Pre-Training for 3D Human Pose Estimation. - Sally Khaidem, Mansi Sharma:

A Deep Belief Network Approach to Scalable Compression of Light Field Data for Auto-Stereoscopic Displays. - Victor Enescu, Hichem Sahbi:

Learning conditionally untangled latent spaces using Fixed Point Iteration. - Haizhao Sun, Yu Ning, Xu Ji, Chuang Zhang, Ming Wu:

A Multimodal Network on Handwritten Chinese Character Error Correction. - Jakob Gawlikowski, Nina Maria Gottschling:

Efficient Data Source Relevance Quantification for Multi-Source Neural Networks. - Bin Fu, Qiyang Wan, Jialin Li, Ruiping Wang, Xilin Chen:

Blocks as Probes: Dissecting Categorization Ability of Large Multimodal Models. - Sadra Safadoust, Fabio Tosi, Fatma Güney, Matteo Poggi:

Self-Evolving Depth-Supervised 3D Gaussian Splatting from Rendered Stereo Pairs. - Seyed Mohsen Hosseini:

topK dice loss for medical image segmentation. - Takumi Kobayashi:

Direct-Sum Approach to Integrate Losses Via Classifier Subspace. - Kaushik Bhargav Sivangi, Fani Deligianni:

Knowledge Distillation with Global Filters for Efficient Human Pose Estimation. - Anqi Liu, Shiyi Mu, Shugong Xu:

A Learnable Color Correction Matrix for RAW Reconstruction. - Ankita Raj, Deepankar Varma, Chetan Arora:

Examining the Threat Landscape: Foundation Models and Model Stealing. - Kovvuri Sai Gopal Reddy, Bodduluri Saran, A. Mudit Adityaja, Saurabh J. Shigwan, Nitin Kumar, Snehasis Mukherjee:

UnSeGArmaNet: Unsupervised Image Segmentation using Graph Neural Networks with Convolutional ARMA Filters. - Shubham Dokania, Vasudev Singh, Shuaib Ahmed:

GazeHELL: Gaze Estimation with Hybrid Encoders and Localised Losses with weighing. - Nitish Agarwal, Steven Cadavid:

TrakAthlete4D: Multi-View On-Field Player Position Tracking in Sports. - Behnam Kazemivash, Armin Iraji, Sergey M. Plis, Vince D. Calhoun:

Spatiotemporal Vision Transformer for Weakly Supervised Dense Prediction of Dynamic Brain Maps. - Julius Körner, Dogu Tamgac, Dávid Rozenberszki:

SceneSAM: Integrating 2D Labels for Weakly Supervised 3D Scene Understanding. - Ashok Bandyopadhyay, Pranjal Baranwal, Arijit Sur, U. P. Rajeev:

PV-SLAM: Panoptic Visual SLAM with Loop Closure and Online Bundle Adjustment. - Christopher Beam, Andrew R. Willis, Kevin M. Brink:

Deep Learning for GPS-Denied SAR Image Focusing and Vehicle Trajectory Estimation. - Zihan Wang, Shuzhe Wang, Matias Turkulainen, Junyuan Fang, Juho Kannala:

Gaussian Splatting in Mirrors: Reflection-aware Rendering via Virtual Camera Optimization. - Melika Sadeghi Tabrizi, Ali Karimi, Ahmad Kalhor, Babak Nadjar Araabi, Mona Ahmadian:

Layer-wise Learning of CNNs by Self-tuning Learning Rate and Early Stopping at Each Layer. - Hariprasath Govindarajan, Per Sidén, Jacob Roll, Fredrik Lindsten:

On Partial Prototype Collapse in the DINO Family of Self-Supervised Methods. - Robero Leyva, Praveen Selvaraj, Andrew Elliott, Gregory Epiphaniou, Carsten Maple:

Beyond Face Matching: A Facial Traits based Privacy Score for Synthetic Face Datasets. - Oliver Mills, Nishant Ravikumar, Philip G. Conaghan, Samuel D. Relton:

Putting the Segment Anything Model to the Test with 3D Knee MRI - A Comparison with State-of-the-Art Performance. - Evgeney Bogatyrev, Ivan Molodetskikh, Dmitriy S. Vatolin:

SR+Codec: a Benchmark of Super-Resolution for Video Compression Bitrate Reduction. - Jianyu Zhao, Wei Quan, Bogdan J. Matuszewski:

CVAM-Pose: Conditional Variational Autoencoder for Multi-Object Monocular Pose Estimation. - Konstantinos Kontras, Christos Chatzichristos, Matthew B. Blaschko, Maarten De Vos:

Improving Multimodal Learning with Multi-Loss Gradient Modulation. - Abdullah Alchihabi, Marzi Heidari, Yuhong Guo:

Adaptive Weighted Co-Learning for Cross-Domain Few-Shot Learning. - Karim Radouane, Julien Lagarde, Sylvie Ranwez, Andon Tchechmedjiev:

Guided Attention for Interpretable Motion Captioning. - Xi Li, Jing Zhang, Ziheng Duan, Yi Dai, Siwei Xu:

iHAST: Integrating Hybrid Attention for Super-Resolution in Spatial Transcriptomics. - Jinhui Yi, Yanan Luo, Marion Deichmann, Gabriel Schaaf, Juergen Gall:

MV-Match: Multi-View Matching for Domain-Adaptive Identification of Plant Nutrient Deficiencies. - Akshita Gupta, Aditya Arora, Sanath Narayan, Salman Khan, Fahad Shahbaz Khan, Graham W. Taylor:

Open-Vocabulary Temporal Action Localization using Multimodal Guidance. - Chiang-Heng Chien, Ahmad Abdelfattah, Benjamin B. Kimia:

Recovering SLAM Tracking Lost by Trifocal Pose Estimation using GPU-HC++.

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














