default search action
CVPR 2023: Vancouver, BC, Canada - Workshops
- IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2023 - Workshops, Vancouver, BC, Canada, June 17-24, 2023. IEEE 2023, ISBN 979-8-3503-0249-3
- Ruggero Ragonesi, Pietro Morerio, Vittorio Murino:
Learning unbiased classifiers from biased data with meta-learning. 1-9 - Teng-Yok Lee, Yusuke Nagai, Akira Minezawa:
Memory-efficient and GPU-oriented visual anomaly detection with incremental dimension reduction. 1-9 - Juewen Peng, Zhiyu Pan, Chengxin Liu, Xianrui Luo, Huiqiang Sun, Liao Shen, Ke Xian, Zhiguo Cao:
Selective Bokeh Effect Transformation. 1-9 - Bilal Porgali, Vítor Albiero, Jordan Ryda, Cristian Canton-Ferrer, Caner Hazirbas:
The Casual Conversations v2 Dataset : A diverse, large benchmark for measuring fairness and robustness in audio/vision/speech models. 10-17 - Hannah Kirkland, Sanjeev J. Koppal:
Schrödinger's Camera: First Steps Towards a Quantum-Based Privacy Preserving Camera. 18-27 - Akshay Agarwal, Nalini K. Ratha, Richa Singh, Mayank Vatsa:
Robustness Against Gradient based Attacks through Cost Effective Network Fine-Tuning. 28-37 - Linzhi Huang, Mei Wang, Jiahao Liang, Weihong Deng, Hongzhi Shi, Dongchao Wen, Yingjie Zhang, Jian Zhao:
Gradient Attention Balance Network: Mitigating Face Recognition Racial Bias via Gradient Attention. 38-47 - Aman Shrivastava, Yanjun Qi, Vicente Ordonez:
Estimating and Maximizing Mutual Information for Knowledge Distillation. 48-57 - Shreyank N. Gowda:
Synthetic Sample Selection for Generalized Zero-Shot Learning. 58-67 - Yuhao Chen, Hayden Gunraj, E. Zhixuan Zeng, Robbie Meyer, Maximilian Gilles, Alexander Wong:
MMRNet: Improving Reliability for Multimodal Object Detection and Segmentation for Bin Picking via Multimodal Redundancy. 68-77 - Yang Zheng, Oles Andrienko, Yonglei Zhao, Minwoo Park, Trung Pham:
DPPD: Deformable Polar Polygon Object Detection. 78-87 - Oliver Zendel, Johannes Huemer, Markus Murschitz, Gustavo Fernández Domínguez, Amadeus Lobe:
Joint Camera and LiDAR Risk Analysis. 88-97 - Adriano Cardace, Pierluigi Zama Ramirez, Samuele Salti, Luigi Di Stefano:
Exploiting the Complementarity of 2D and 3D Networks to Address Domain-Shift in 3D Semantic Segmentation. 98-109 - Apoorv Singh:
Training Strategies for Vision Transformers for Object Detection. 110-118 - Yunxiao Shi, Hong Cai, Amin Ansari, Fatih Porikli:
EGA-Depth: Efficient Guided Attention for Self-Supervised Multi-Camera Depth Estimation. 119-129 - Vickram Rajendran, Chuck Tang, Frits van Paasschen:
Improving Rare Classes on nuScenes LiDAR segmentation Through Targeted Domain Adaptation. 130-139 - Håkon Hukkelås, Frank Lindseth:
Does Image Anonymization Impact Computer Vision Training? 140-150 - Ce Zhang, Chengjie Zhang, Yiluan Guo, Lingji Chen, Michael Happold:
MotionTrack: End-to-End Transformer-based Multi-Object Tracking with LiDAR-Camera Fusion. 151-160 - Tae Eun Choe, Jane Wu, Xiaolin Lin, Karen Kwon, Minwoo Park:
HazardNet: Road Debris Detection by Augmentation of Synthetic Models. 161-171 - Xuanyao Chen, Tianyuan Zhang, Yue Wang, Yilun Wang, Hang Zhao:
FUTR3D: A Unified Sensor Fusion Framework for 3D Detection. 172-181 - Felix Fent, Philipp Bauerschmidt, Markus Lienkamp:
RadarGNN: Transformation Invariant Graph Neural Network for Radar-based Perception. 182-191 - Ruphan Swaminathan, Pradyot V. N. Korupolu:
MobileDeRainGAN: An Efficient Semi-Supervised Approach to Single Image Rain Removal for Task-Driven Applications. 192-201 - Haotian Tang, Shang Yang, Zhijian Liu, Ke Hong, Zhongming Yu, Xiuyu Li, Guohao Dai, Yu Wang, Song Han:
TorchSparse++: Efficient Point Cloud Engine. 202-209 - Tommaso Nesti, Santhosh Boddana, Burhaneddin Yaman:
Ultra-Sonic Sensor based Object Detection for Autonomous Vehicles. 210-218 - Andreas Bär, Daniel Kusuma, Tim Fingscheidt:
Improvements to Image Reconstruction-Based Performance Prediction for Semantic Segmentation in Highly Automated Driving. 219-229 - Sheng-Cheng Lee, Victor Lu, Chieh-Chih Wang, Wen-Chieh Lin:
LiDAR-Based Localization on Highways Using Raw Data and Pole-Like Object Features. 230-237 - Matías Molina:
Zero-shot Classification at Different Levels of Granularity. 238-244 - Octavio Arriaga, Sebastian Palacio, Matias Valdenegro-Toro:
Difficulty Estimation with Action Scores for Computer Vision Tasks. 245-253 - Juan Luis Gonzalez Bello, Jaeho Moon, Munchurl Kim:
Detail-Preserving Self-Supervised Monocular Depth with Self-Supervised Structural Sharpening. 254-264 - Emmanuel Martinez, Roman Jacome, Alejandra Hernandez-Rojas, Henry Arguello:
LD-GAN: Low-Dimensional Generative Adversarial Network for Spectral Image Generation with Variance Regularization. 265-275 - David Laines, Miguel González-Mendoza, Gilberto Ochoa-Ruiz, Gissella Bejarano:
Isolated Sign Language Recognition based on Tree Structure Skeleton Images. 276-284 - Rafael Martinez Garcia Peña, Mansoor Ali Teevno, Gilberto Ochoa-Ruiz, Sharib Ali:
SUPRA: Superpixel Guided Loss for Improved Multi-modal Segmentation in Endoscopy. 285-294 - Daniel Flores-Araiza, Francisco Javier Lopez-Tiro, Jonathan El Beze, Jacques Hubert, Miguel González-Mendoza, Gilberto Ochoa-Ruiz, Christian Daul:
Deep Prototypical-Parts Ease Morphological Kidney Stone Identification and are Competitively Robust to Photometric Perturbations. 295-304 - Yoshio Rubio, Marco A. Contreras-Cruz:
Wildlife Image Generation from Scene Graphs. 305-314 - Juan C. Pérez, Motasem Alfarra, Ali K. Thabet, Pablo Arbeláez, Bernard Ghanem:
Towards Characterizing the Semantic Robustness of Face Recognition. 315-325 - Willams de Lima Costa, Estefania Talavera Martínez, Lucas Silva Figueiredo, Veronica Teichrieb:
High-level context representation for emotion recognition in images. 326-334 - Kshitij Nikhal, Nkiruka Uzuegbunam, Bridget Kennedy, Benjamin S. Riggan:
Mitigating Catastrophic Interference using Unsupervised Multi-Part Attention for RGB-IR Face Recognition. 335-344 - Alicja Kwasniewska, Anastacia MacAllister, Rey Nicolas, Javier Garzás:
Multi-sensor Ensemble-guided Attention Network for Aerial Vehicle Perception Beyond Visible Spectrum. 345-353 - Abel A. Reyes, Sidike Paheding, A. Rajaneesh, K. S. Sajinkumar, Thomas Oommen:
C-PLES: Contextual Progressive Layer Expansion with Self-attention for Multi-class Landslide Segmentation on Mars using Multimodal Satellite Imagery. 354-364 - Wassim A. El Ahmar, Yahya Massoud, Dhanvin Kolhatkar, Hamzah Alghamdi, Mohammad Al Ja'afreh, Robert Laganière, Riad I. Hammoud:
Enhanced Thermal-RGB Fusion for Robust Object Detection. 365-374 - Rhythm Vohra, Femina Senjaliya, Melissa Cote, Amanda Dash, Alexandra Branzan Albu, Julek Chawarski, Steve Pearce, Kaan Ersahin:
Detecting Underwater Discrete Scatterers in Echograms with Deep Learning-Based Semantic Segmentation. 375-384 - Eleni Kamenou, Jesús Martínez del Rincón, Paul Miller, Patricia Devlin-Hill:
A Meta-learning Approach for Domain Generalisation across Visual Modalities in Vehicle Re-identification. 385-393 - Noreen Anwar, Philippe Duplessis-Guindon, Guillaume-Alexandre Bilodeau, Wassim Bouachir:
VisiTherS: Visible-thermal infrared stereo disparity estimation of human silhouette. 394-402 - Yue Cao, Junchi Bin, Jozsef Hamari, Erik Blasch, Zheng Liu:
Multimodal Object Detection by Channel Switching and Spatial Attention. 403-411 - Spencer Low, Oliver Nina, Angel Domingo Sappa, Erik Blasch, Nathan Inkawhich:
Multi-modal Aerial View Object Classification Challenge Results - PBVS 2023. 412-421 - Meryem Mine Gündogan, Tolga Aksoy, Alptekin Temizel, Ugur Halici:
IR Reasoner: Real-time Infrared Object Detection by Visual Reasoning. 422-430 - Jincheng Zhang, Andrew R. Willis, Kevin M. Brink:
Photometric Correction for Infrared Sensors. 431-439 - Jasmine Bayrooti, Noah D. Goodman, Alex Tamkin:
Multispectral Contrastive Learning with Viewmaker Networks. 440-448 - Berkcan Ustun, Ahmet Kagan Kaya, Ezgi Cakir Ayerden, Fazil Altinel:
Spectral Transfer Guided Active Domain Adaptation For Thermal Imagery. 449-458 - Fabian Erlenbusch, Constanze Merkt, Bernardo de Oliveira, Alexander Gatter, Friedhelm Schwenker, Ulrich Klauck, Michael Teutsch:
Thermal Infrared Single Image Dehazing and Blind Image Quality Assessment. 459-469 - Rafael E. Rivadeneira, Angel Domingo Sappa, Boris Xavier Vintimilla, Chenyang Wang, Junjun Jiang, Xianming Liu, Zhiwei Zhong, Dai Bin, Li Ruodi, Shengye Li:
Thermal Image Super-Resolution Challenge Results - PBVS 2023. 470-478 - Feng Cai, Keyu Wu, Haipeng Wang, Feng Wang:
A Three-Stage Framework with Reliable Sample Pool for Long-Tailed Classification. 479-486 - Aniruddh Sikdar, Sumanth Udupa, Prajwal Gurunath, Suresh Sundaram:
DeepMAO: Deep Multi-scale Aware Overcomplete Network for Building Segmentation in Satellite Imagery. 487-496 - Ahmed Zgaren, Wassim Bouachir, Nizar Bouguila, Riad I. Hammoud:
MoundCount: A detection-based approach for automatic counting of planting microsites on UAV images. 497-506 - Aditya Kasliwal, Pratinav Seth, Sriya Rallabandi, Sanchit Singhal:
CoReFusion: Contrastive Regularized Fusion for Guided Thermal Super-Resolution. 507-514 - Spencer Low, Oliver Nina, Angel Domingo Sappa, Erik Blasch, Nathan Inkawhich:
Multi-modal Aerial View Image Challenge: Translation from Synthetic Aperture Radar to Electro-Optical Domain Results - PBVS 2023. 515-523 - Brian K. S. Isaac-Medina, Seyma Yucer, Neelanjan Bhowmik, Toby P. Breckon:
Seeing Through the Data: A Statistical Evaluation of Prohibited Item Detection Benchmark Datasets for X-ray Security Screening. 524-533 - Raghunath Sai Puttagunta, Zhu Li, Shuvra S. Bhattacharyya, George York:
Appearance Label Balanced Triplet Loss for Multi-modal Aerial View Object Classification. 534-542 - Ainkaran Santhirasekaram, Mathias Winkler, Andrea G. Rockall, Ben Glocker:
Topology Preserving Compositionality for Robust Medical Image Segmentation. 543-552 - Yi Tang Chen, Sebastian Kurtek:
Shape and Intensity Analysis of Glioblastoma Multiforme Tumors. 553-560 - Ainkaran Santhirasekaram, Avinash Kori, Mathias Winkler, Andrea G. Rockall, Francesca Toni, Ben Glocker:
Robust Hierarchical Symbolic Explanations in Hyperbolic Space for Image Classification. 561-570 - Kalyan Varma Nadimpalli, Amit Chattopadhyay, Bastian Rieck:
Euler Characteristic Transform Based Topological Loss for Reconstructing 3D Images from Single 2D Slices. 571-579 - Andac Demir, Elie Massaad, Bulent Kiziltan:
Topology-Aware Focal Loss for 3D Image Segmentation. 580-589 - Huma Jamil, Yajing Liu, Turgay Caglar, Christina M. Cole, Nathaniel Blanchard, Christopher Peterson, Michael Kirby:
Hamming Similarity and Graph Laplacians for Class Partitioning and Adversarial Image Detection. 590-599 - Audun Myers, Henry Kvinge, Tegan Emerson:
TopFusion: Using Topological Feature Space for Fusion and Imputation in Multi-Modal Data. 600-609 - Francisco Acosta, Sophia Sanborn, Khanh Dao Duc, Manu S. Madhav, Nina Miolane:
Quantifying Extrinsic Curvature in Neural Manifolds. 610-619 - Davis Brown, Henry Kvinge:
Making Corgis Important for Honeycomb Classification: Adversarial Attacks on Concept-based Explainability Tools. 620-627 - Bohan Zeng, Xuhui Liu, Sicheng Gao, Boyu Liu, Hong Li, Jianzhuang Liu, Baochang Zhang:
Face Animation with an Attribute-Guided Diffusion Model. 628-637 - Shaobo Lin, Kun Wang, Xingyu Zeng, Rui Zhao:
Explore the Power of Synthetic Data on Few-shot Object Detection. 638-647 - Noa Alkobi, Tamar Rott Shaham, Tomer Michaeli:
Internal Diverse Image Completion. 648-658 - Hazrat Ali, Christer Grönlund, Zubair Shah:
Leveraging GANs for data scarcity of COVID-19: Beyond the hype. 659-667 - Kaiwen Cui, Rongliang Wu, Fangneng Zhan, Shijian Lu:
Face Transformer: Towards High Fidelity and Accurate Face Swapping. 668-677 - René Haas, Stella Graßhof, Sami S. Brandt:
Controllable GAN Synthesis Using Non-Rigid Structure-from-Motion. 678-687 - Edgar Schönfeld, Julio Borges, Vadim Sushko, Bernt Schiele, Anna Khoreva:
Discovering Class-Specific GAN Controls for Semantic Image Synthesis. 688-697 - Yasser Benigmim, Subhankar Roy, Slim Essid, Vicky Kalogeiton, Stéphane Lathuilière:
One-shot Unsupervised Domain Adaptation with Personalized Diffusion Models. 698-708 - Shiyao Xu, Lingzhi Li, Li Shen, Zhouhui Lian:
DeSRF: Deformable Stylized Radiance Field. 709-718 - Heng Yu, Zoltan A. Milacski, László A. Jeni:
Unsupervised Style-based Explicit 3D Face Reconstruction from Single Image. 719-729 - Nitish Shukla, Sudipta Banerjee:
Generating Adversarial Attacks in the Latent Space. 730-739 - Kangmin Bae, Hyung-Il Kim, Yongjin Kwon, Jinyoung Moon:
Unsupervised Bidirectional Style Transfer Network using Local Feature Transform Module. 740-749 - Samy Chali, Inna Kucher, Marc Duranton, Jacques-Olivier Klein:
Improving Normalizing Flows with the Approximate Mass for Out-of-Distribution Detection. 750-758 - Tripti Shukla, Paridhi Maheshwari, Rajhans Singh, Ankita Shukla, Kuldeep Kulkarni, Pavan K. Turaga:
Scene Graph Driven Text-Prompt Generation for Image Inpainting. 759-768 - Jordan Shipard, Arnold Wiliem, Kien Nguyen Thanh, Wei Xiang, Clinton Fookes:
Diversity is Definitely Needed: Improving Model-Agnostic Zero-shot Classification via Stable Diffusion. 769-778 - Mohammadreza Mofayezi, Yasamin Medghalchi:
Benchmarking Robustness to Text-Guided Corruptions. 779-786 - Edgardo Solano-Carrillo, Ángel Bueno Rodríguez, Borja Carrillo-Perez, Yannik Steiniger, Jannis Stoppe:
Look ATME: The Discriminator Mean Entropy Needs Attention. 787-796 - Mark Hamazaspyan, Shant Navasardyan:
Diffusion-Enhanced PatchMatch: A Framework for Arbitrary Style Transfer with Diffusion Models. 797-805 - Jan Niklas Kolf, Tim Rieber, Jurek Elliesen, Fadi Boutros, Arjan Kuijper, Naser Damer:
Identity-driven Three-Player Generative Adversarial Network for Synthetic-based Face Recognition. 806-816 - Mohamed Amine Marnissi, Abir Fathallah:
GAN-based Vision Transformer for High-Quality Thermal Image Enhancement. 817-825 - Yutong Zhou, Nobutaka Shimada:
Vision + Language Applications: A Survey. 826-842 - Arpit Bansal, Hong-Min Chu, Avi Schwarzschild, Soumyadip Sengupta, Micah Goldblum, Jonas Geiping, Tom Goldstein:
Universal Guidance for Diffusion Models. 843-852 - Changhao Shi, Haomiao Ni, Kai Li, Shaobo Han, Mingfu Liang, Martin Renqiang Min:
Exploring Compositional Visual Generation with Latent Classifier Guidance. 853-862 - Matyás Bohácek, Hany Farid:
A Geometric and Photometric Exploration of GAN and Diffusion Synthesized Faces. 874-883 - Shivansh Mundra, Gonzalo J. Aniano Porcile, Smit Marvaniya, James R. Verbus, Hany Farid:
Exposing GAN-Generated Profile Photos from Compact Embeddings. 884-892 - Shan Jia, Mingzhen Huang, Zhou Zhou, Yan Ju, Jialing Cai, Siwei Lyu:
AutoSplice: A Text-prompt Manipulated Image Dataset for Media Forensics. 893-903 - Chengzhe Sun, Shan Jia, Shuwei Hou, Siwei Lyu:
AI-Synthesized Voice Detection Using Neural Vocoder Artifacts. 904-912 - Kar Balan, Shruti Agarwal, Simon Jenni, Andy Parsons, Andrew Gilbert, John P. Collomosse:
EKILA: Synthetic Media Provenance and Attribution for Generative Art. 913-922 - Hao Chen, Peng Zheng, Xin Wang, Shu Hu, Bin Zhu, Jinrong Hu, Xi Wu, Siwei Lyu:
Harnessing the Power of Text-image Contrastive Models for Automatic Detection of Online Misinformation. 923-932 - Tu Bui, Shruti Agarwal, Ning Yu, John P. Collomosse:
RoSteALS: Robust Steganography using Autoencoder Latent Space. 933-942 - Davide Cozzolino, Alessandro Pianese, Matthias Nießner, Luisa Verdoliva:
Audio-Visual Person-of-Interest DeepFake Detection. 943-952 - Jun Wang, Omran Alamayreh, Benedetta Tondi, Mauro Barni:
Open Set Classification of GAN-based Image Manipulations via a ViT-based Hybrid Architecture. 953-962 - Ziyue Xiang, Amit Kumar Singh Yadav, Paolo Bestagini, Stefano Tubaro, Edward J. Delp:
MTN: Forensic Analysis of MP4 Video Files Using Graph Neural Networks. 963-972 - Riccardo Corvi, Davide Cozzolino, Giovanni Poggi, Koki Nagano, Luisa Verdoliva:
Intriguing properties of synthetic images: from generative adversarial networks to diffusion models. 973-982 - Danial Samadi Vahdati, Tai D. Nguyen, Matthew C. Stamm:
Defending Low-Bandwidth Talking Head Videoconferencing Systems From Real-Time Puppeteering Attacks. 983-992 - Muhammad Anas Raza, Khalid Mahmood Malik:
Multimodaltrace: Deepfake Detection using Audiovisual Representation Learning. 993-1000 - Songlin Yang, Wei Wang, Chenye Xu, Ziwen He, Bo Peng, Jing Dong:
Exposing Fine-Grained Adversarial Vulnerability of Face Anti-Spoofing Models. 1001-1010 - Yufei Zhang, Rui Zhao, Ziyi Zhao, Naveen Ramakrishnan, Manoj Aggarwal, Gerard Medioni, Qiang Ji:
Robust Partial Fingerprint Recognition. 1011-1020 - Pedro C. Neto, Ana Filipa Sequeira, Jaime S. Cardoso, Philipp Terhörst:
PIC-Score: Probabilistic Interpretable Comparison Score for Optimal Matching Confidence in Single- and Multi-Biometric Face Recognition. 1021-1029 - Chi Xu, Yasushi Makihara, Xiang Li, Yasushi Yagi:
Gait Recognition from Fisheye Images. 1030-1040 - Haiyu Wu, Vítor Albiero, K. S. Krishnapriya, Michael C. King, Kevin W. Bowyer:
Face Recognition Accuracy Across Demographics: Shining a Light Into the Problem. 1041-1050 - Daniel DeAlcala, Aythami Morales, Ruben Tolosana, Alejandro Acien, Julian Fiérrez, Santiago Hernandez, Miguel A. Ferrer, Moisés Díaz:
BeCAPTCHA-Type: Biometric Keystroke Data Generation for Improved Bot Detection. 1051-1060 - Meiling Fang, Marco Huber, Naser Damer:
SynthASpoof: Developing Face Presentation Attack Detection Based on Privacy-friendly Synthetic Data. 1061-1070 - Sandipan Banerjee, Ajjen Joshi, Jay Turcot:
The Universal Face Encoder: Learning Disentangled Representations Across Different Attributes. 1071-1080 - Chih-Jung Chang, Yaw-Chern Lee, Shih-Hsuan Yao, Min-Hung Chen, Chien-Yi Wang, Shang-Hong Lai, Trista Pei-Chun Chen:
A Closer Look at Geometric Temporal Dynamics for Face Anti-Spoofing. 1081-1091 - Chongyi Li, Chunle Guo, Shangchen Zhou, Qiming Ai, Ruicheng Feng, Chen Change Loy:
FlexiCurve: Flexible Piecewise Curves Estimation for Photo Retouching. 1092-1101 - Qixin Yan, Chunle Guo, Jixin Zhao, Yuekun Dai, Chen Change Loy, Chongyi Li:
BeautyREC: Robust, Efficient, and Component-Specific Makeup Transfer. 1102-1110 - Iman Abbasnejad, Fabio Zambetta, Flora D. Salim, Timothy Wiley, Jeffrey Chan, Russell Gallagher, Ehsan Abbasnejad:
SCONE-GAN: Semantic Contrastive learning-based Generative Adversarial Network for an end-to-end image translation. 1111-1120 - Wei Jiang, Hyomin Choi, Fabien Racapé:
Adaptive Human-Centric Video Compression for Humans and Machines. 1121-1129 - Ali Hojjat, Janek Haberer, Olaf Landsiedel:
ProgDTD: Progressive Learned Image Compression with Double-Tail-Drop Training. 1130-1139 - Peter Buckel, Timo Oksanen, Thomas Dietmueller:
RB-Dust - A Reference-based Dataset for Vision-based Dust Removal. 1140-1149 - Han Yao Choong, Suryansh Kumar, Luc Van Gool:
Quantum Annealing for Single Image Super-Resolution. 1150-1159 - Yinhuai Wang, Jiwen Yu, Runyi Yu, Jian Zhang:
Unlimited-Size Diffusion Restoration. 1160-1167