


default search action
WACV 2023: Waikoloa, HI, USA
- IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2023, Waikoloa, HI, USA, January 2-7, 2023. IEEE 2023, ISBN 978-1-6654-9346-8

- Vivek Trivedy, Longin Jan Latecki:

CNN2Graph: Building Graphs for Image Classification. 1-11 - Dmitrii Marin, Jen-Hao Rick Chang, Anurag Ranjan, Anish Prabhu, Mohammad Rastegari, Oncel Tuzel:

Token Pooling in Vision Transformers for Image Classification. 12-21 - Yuting Wang, Ricardo Guerrero, Vladimir Pavlovic:

D2F2WOD: Learning Object Proposals for Weakly-Supervised Object Detection via Progressive Domain Adaptation. 22-31 - Tal Ridnik, Gilad Sharir, Avi Ben-Cohen, Emanuel Ben Baruch, Asaf Noy:

ML-Decoder: Scalable and Versatile Classification Head. 32-41 - Andres Palechor, Annesha Bhoumik, Manuel Günther

:
Large-Scale Open-Set Classification Protocols for ImageNet. 42-51 - George Adaimi, David Mizrahi, Alexandre Alahi:

Composite Relationship Fields with Transformers for Scene Graph Generation. 52-64 - Yutong Bai, Angtian Wang, Adam Kortylewski, Alan L. Yuille:

CoKe: Contrastive Learning for Robust Keypoint Detection. 65-74 - Quentin Bouniot

, Angélique Loesch, Amaury Habrard, Romaric Audigier
:
Towards Few-Annotation Learning for Object Detection: Are Transformer-based Models More Efficient? 75-84 - Tyler LaBonte, Yale Song, Xin Wang, Vibhav Vineet, Neel Joshi:

Scaling Novel Object Detection with Weakly Supervised Detection Transformers. 85-96 - Yung-Hsu Yang, Thomas E. Huang, Min Sun, Samuel Rota Bulò, Peter Kontschieder, Fisher Yu:

Dense Prediction with Attentive Feature Aggregation. 97-106 - Chull Hwan Song, Jooyoung Yoon, Shunghyun Choi, Yannis Avrithis:

Boosting vision transformers for image retrieval. 107-117 - Paul Albert

, Eric Arazo, Tarun Krishna, Noel E. O'Connor
, Kevin McGuinness:
Is your noise correction noisy? PLS: Robustness to label noise with two stage detection. 118-127 - Martin Engilberge, Haixin Shi, Zhiye Wang, Pascal Fua:

Two-level Data Augmentation for Calibrated Multi-view Detection. 128-136 - Soufiane Belharbi, Ismail Ben Ayed, Luke McCaffrey, Eric Granger

:
TCAM: Temporal Class Activation Maps for Object Localization in Weakly-Labeled Unconstrained Videos. 137-146 - Islam Nassar, Munawar Hayat, Ehsan Abbasnejad, Hamid Rezatofighi, Mehrtash Harandi, Gholamreza Haffari:

LAVA:Label-efficient Visual Learning and Adaptation. 147-156 - Rishi Agarwal, Tirupati Saketh Chandra, Vaidehi Patil, Aniruddha Mahapatra, Kuldeep Kulkarni, Vishwa Vinay:

GEMS: Scene Expansion using Generative Models of Graphs. 157-166 - Mingjie Wang, Hao Cai, Yong Dai, Minglun Gong

:
Dynamic Mixture of Counter Network for Location-Agnostic Crowd Counting. 167-177 - Teppei Kurita, Yuhi Kondo, Legong Sun, Yusuke Moriuchi:

Simultaneous Acquisition of High Quality RGB Image and Polarization Information using a Sparse Polarization Sensor. 178-188 - Bingchuan Li, Shaofei Cai, Wei Liu, Peng Zhang, Qian He, Miao Hua, Zili Yi:

DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editings. 189-197 - Zhihao Duan, Ming Lu, Zhan Ma, Fengqing Zhu:

Lossy Image Compression with Quantized Hierarchical VAEs. 198-207 - Jitesh Jain, Yuqian Zhou, Ning Yu, Humphrey Shi

:
Keys to Better Image Inpainting: Structure and Texture Go Hand in Hand. 208-217 - Pedro Figueirêdo, Avinash Paliwal

, Nima Khademi Kalantari:
Frame Interpolation for Dynamic Scenes with Implicit Flow Encoding. 218-228 - Jiwan Hur, Jae Young Lee, Jaehyun Choi, Junmo Kim:

I See-Through You: A Framework for Removing Foreground Occlusion in Both Sparse and Dense Light Field Images. 229-238 - B. H. Pawan Prasad, Green Rosh K. S, R. B. Lokesh, Kaushik Mitra:

Burst Reflection Removal using Reflection Motion Aggregation Cues. 239-248 - Tai-Yin Chiu, Danna Gurari:

Line Search-Based Feature Transformation for Fast, Stable, and Tunable Content-Style Control in Photorealistic Style Transfer. 249-258 - Liyun Zhang

, Photchara Ratsamee
, Bowen Wang
, Zhaojie Luo, Yuki Uranishi, Manabu Higashida, Haruo Takemura:
Panoptic-aware Image-to-Image Translation. 259-268 - Abhishek Jha, Soroush Seifi, Tinne Tuytelaars

:
SimGlim: Simplifying glimpse based active visual reconstruction. 269-278 - Lorenzo Luzi, Carlos Ortiz Marrero, Nile Wynar, Richard G. Baraniuk, Michael J. Henry:

Evaluating generative networks using Gaussian mixtures of image features. 279-288 - Xihui Liu

, Dong Huk Park, Samaneh Azadi, Gong Zhang, Arman Chopikyan, Yuxiao Hu, Humphrey Shi
, Anna Rohrbach, Trevor Darrell:
More Control for Free! Image Synthesis with Semantic Diffusion Guidance. 289-299 - James F. Mullen Jr.

, Divya Kothandaraman, Aniket Bera, Dinesh Manocha:
Placing Human Animations into 3D Scenes by Learning Interaction- and Geometry-Driven Keyframes. 300-310 - Takafumi Iwaguchi, Hiroshi Kawasaki:

Surface normal estimation from optimized and distributed light sources using DNN-based photometric stereo. 311-320 - David Hart, Michael Whitney, Bryan S. Morse

:
Interpolated SelectionConv for Spherical Images and Surfaces. 321-330 - Yingnan Ma, Chenqiu Zhao, Anup Basu, Xudong Li:

RAST: Restorable Arbitrary Style Transfer via Multi-restoration. 331-340 - Cameron Gordon, Shin-Fang Ch'ng, Lachlan E. MacDonald, Simon Lucey

:
On Quantizing Implicit Neural Representations. 341-350 - Lydia Lindner, Alexander Effland

, Filip Ilic, Thomas Pock, Erich Kobler:
Lightweight Video Denoising using Aggregated Shifted Window Attention. 351-360 - Junming Chen, Meirui Jiang, Qi Dou, Qifeng Chen:

Federated Domain Generalization for Image Recognition via Cross-Client Style Transfer. 361-370 - Shahar Mahpod, Noam Gaash, Hay Hoffman, Gil Ben-Artzi:

CTrGAN: Cycle Transformers GAN for Gait Transfer. 371-381 - Divya Kothandaraman, Sumit Shekhar, Abhilasha Sancheti, Manoj Ghuhan, Tripti Shukla, Dinesh Manocha:

SALAD : Source-free Active Label-Agnostic Domain Adaptation for Classification, Segmentation and Detection. 382-391 - Thomas Westfechtel, Hao-Wei Yeh, Qier Meng, Yusuke Mukuta, Tatsuya Harada:

Backprop Induced Feature Weighting for Adversarial Domain Adaptation with Iterative Label Distribution Alignment. 392-401 - Md Mahmudur Rahman, Rameswar Panda, Mohammad Arif Ul Alam

:
Semi-Supervised Domain Adaptation with Auto-Encoder via Simultaneous Learning. 402-411 - Haomiao Ni, Yihao Liu, Sharon X. Huang, Yuan Xue

:
Cross-identity Video Motion Retargeting with Joint Transformation and Synthesis. 412-422 - Giulio Mattolin, Luca Zanella

, Elisa Ricci
, Yiming Wang
:
ConfMix: Unsupervised Domain Adaptation for Object Detection via Confidence-based Mixing. 423-433 - Tejas Gokhale, Rushil Anirudh, Jayaraman J. Thiagarajan, Bhavya Kailkhura

, Chitta Baral, Yezhou Yang:
Improving Diversity with Adversarially Learned Transformations for Domain Generalization. 434-443 - Donald Shenaj

, Eros Fanì, Marco Toldo, Debora Caldarola, Antonio Tavera
, Umberto Michieli
, Marco Ciccone, Pietro Zanuttigh, Barbara Caputo:
Learning Across Domains and Devices: Style-Driven Source-Free Domain Adaptation in Clustered Federated Learning. 444-454 - Matthew R. Keaton, Ram J. Zaveri, Gianfranco Doretto:

CellTranspose: Few-shot Domain Adaptation for Cellular Instance Segmentation. 455-466 - Swati Jindal, Xin Eric Wang:

CUDA-GHR: Controllable Unsupervised Domain Adaptation for Gaze and Head Redirection. 467-477 - Vibashan VS, Poojan Oza, Vishal M. Patel:

Towards Online Domain Adaptive Object Detection. 478-488 - Kyusik Cho

, Suhyeon Lee
, Hongje Seong, Euntai Kim:
Domain Adaptive Video Semantic Segmentation via Cross-Domain Moving Object Mixing. 489-498 - Fabrizio J. Piva, Daan de Geus

, Gijs Dubbelman:
Empirical Generalization Study: Unsupervised Domain Adaptation vs. Domain Generalization Methods for Semantic Segmentation in the Wild. 499-508 - Yumeng Li, Dan Zhang, Margret Keuper, Anna Khoreva:

Intra-Source Style Augmentation for Improved Domain Generalization. 509-519 - Jinyu Yang, Jingjing Liu, Ning Xu, Junzhou Huang

:
TVT: Transferable Vision Transformer for Unsupervised Domain Adaptation. 520-530 - Sungsu Hur, Inkyu Shin, Kwanyong Park, Sanghyun Woo, In So Kweon:

Learning Classifiers of Prototypes and Reciprocal Points for Universal Domain Adaptation. 531-540 - Michael Essich

, Markus Rehmann
, Cristóbal Curio
:
Auxiliary Task-Guided CycleGAN for Black-Box Model Domain Adaptation. 541-550 - Weiwei Sun, Daniel Rebain, Renjie Liao, Vladimir Tankovich, Soroosh Yazdani, Kwang Moo Yi, Andrea Tagliasacchi:

NeuralBF: Neural Bilateral Filtering for Top-down Instance Segmentation on Point Clouds. 551-560 - Brent Griffin:

Mobile Robot Manipulation using Pure Object Detection. 561-571 - Driton Salihu, Eckehard G. Steinbach

:
SGPCR: Spherical Gaussian Point Cloud Representation and its Application to Object Registration and Retrieval. 572-581 - Min Seok Lee, Seok Woo Yang, Sung Won Han:

GaIA: Graphical Information Gain based Attention Network for Weakly Supervised Point Cloud Semantic Segmentation. 582-591 - Jaeyeon Kim, Binh-Son Hua, Duc Thanh Nguyen, Sai-Kit Yeung:

PointInverter: Point Cloud Reconstruction and Editing via a Generative Model with Shape Priors. 592-601 - Maximilian Pittner, Alexandru Condurache, Joel Janai:

3D-SpLineNet: 3D Traffic Line Detection using Parametric Spline Representations. 602-611 - Jinlong Li, Runsheng Xu, Jin Ma, Qin Zou, Jiaqi Ma, Hongkai Yu:

Domain Adaptive Object Detection for Autonomous Driving under Foggy Weather. 612-622 - Dusan Malic, Christian Fruhwirth-Reisinger, Horst Possegger

, Horst Bischof
:
SAILOR: Scaling Anchors via Insights into Latent Object Representation. 623-632 - Zhimin Chen, Longlong Jing, Liang Yang, Yingwei Li, Bing Li:

Class-Level Confidence Based 3D Semi-Supervised Learning. 633-642 - Minghan Zhu, Lingting Ge, Panqu Wang, Huei Peng:

MonoEdge: Monocular 3D Object Detection Using Local Perspectives. 643-652 - Minmin Yang, Jiajing Chen, Senem Velipasalar:

Cross-Modality Feature Fusion Network for Few-Shot 3D Point Cloud Classification. 653-662 - Anas Mahmoud, Jordan S. K. Hu, Steven L. Waslander:

Dense Voxel Fusion for 3D Object Detection. 663-672 - Nagma S. Khan, Kazumine Ogura, Eric Cosatto, Masayuki Ariyoshi:

Real-time Concealed Weapon Detection on 3D Radar Images for Walk-through Screening System. 673-681 - Daeun Lee, Jinkyu Kim:

Resolving Class Imbalance for LiDAR-based Object Detector by Dynamic Weight Average and Contextual Ground Truth Sampling. 682-691 - Shubham Gupta, Jeet Kanjani, Mengtian Li, Francesco Ferroni, James Hays, Deva Ramanan

, Shu Kong:
Far3Det: Towards Far-Field 3D Detection. 692-701 - Dmitrii Torbunov, Yi Huang, Haiwang Yu, Jin Huang, Shinjae Yoo, Meifeng Lin, Brett Viren, Yihui Ren

:
UVCGAN: UNet Vision Transformer cycle-consistent GAN for unpaired image-to-image translation. 702-712 - Simon Niklaus, Ping Hu, Jiawen Chen:

Splatting-based Synthesis for Video Frame Interpolation. 713-723 - Kyungmin Jo, Gyumin Shim, Sanghun Jung, Soyoung Yang, Jaegul Choo:

CG-NeRF: Conditional Generative Neural Radiance Fields for 3D-aware Image Synthesis. 724-733 - Nikola Popovic, Ritika Chakraborty, Danda Pani Paudel

, Thomas Probst, Luc Van Gool:
Spatially Multi-conditional Image Generation. 734-743 - Min Woo Kim, Nam Ik Cho:

WHFL: Wavelet-Domain High Frequency Loss for Sketch-to-Image Translation. 744-754 - David Dadon, Ohad Fried, Yacov Hel-Or:

DDNeRF: Depth Distribution Neural Radiance Fields. 755-763 - Hanbit Lee, Youna Kim, Sang-Goo Lee:

Multi-scale Contrastive Learning for Complex Scene Generation. 764-774 - Pol Caselles, Eduard Ramon, Jaime García, Xavier Giró-i-Nieto, Francesc Moreno-Noguer, Gil Triginer:

SIRA: Relightable Avatars from a Single Image. 775-784 - Aditya Chattopadhyay, Xi Zhang, David Paul Wipf, Himanshu Arora, René Vidal:

Learning Graph Variational Autoencoders with Constraints and Structured Priors for Conditional Indoor 3D Scene Generation. 785-794 - Mingtong Zhang, Shuhong Zheng, Zhipeng Bao, Martial Hebert, Yu-Xiong Wang:

Beyond RGB: Scene-Property Synthesis with Neural Radiance Fields. 795-805 - Kai-En Lin, Yen-Chen Lin, Wei-Sheng Lai, Tsung-Yi Lin, Yi-Chang Shih, Ravi Ramamoorthi:

Vision Transformer for NeRF-Based View Synthesis from a Single Input Image. 806-815 - Luca De Luigi, Damiano Bolognini, Federico Domeniconi, Daniele De Gregorio, Matteo Poggi

, Luigi Di Stefano:
ScanNeRF: a Scalable Benchmark for Neural Radiance Fields. 816-825 - Fariborz Taherkhani, Aashish Rai, Quankai Gao, Shaunak Srivastava, Xuanbai Chen, Fernando De la Torre, Steven Song, Aayush Prakash, Daeil Kim:

Controllable 3D Generative Adversarial Face Model via Disentangling Shape and Appearance. 826-836 - Inwoo Hwang, Junho Kim, Young Min Kim:

Ev-NeRF: Event Based Neural Radiance Field. 837-847 - Chaerin Kong, Dong Hyeon Jeon, Ohjoon Kwon, Nojun Kwak:

Leveraging Off-the-shelf Diffusion Model for Multi-attribute Fashion Image Manipulation. 848-857 - Samia Shafique, Bailey Kong, Shu Kong, Charless C. Fowlkes:

Creating a Forensic Database of Shoeprints from Online Shoe-Tread Photos. 858-868 - Safa C. Medin, Amir Weiss, Frédo Durand, William T. Freeman, Gregory W. Wornell

:
Can Shadows Reveal Biometric Informationƒ. 869-879 - Qiaomu Miao, Minh Hoai, Dimitris Samaras:

Patch-level Gaze Distribution Prediction for Gaze Following. 880-889 - Vikrant Nagpure, Kenji Okuma:

Searching Efficient Neural Architecture with Multi-resolution Fusion Transformer for Appearance-based Gaze Estimation. 890-899 - Siamul Karim Khan

, Patrick Tinsley, Adam Czajka:
DeformIrisNet: An Identity-Preserving Model of Iris Texture Deformation. 900-908 - Haidong Zhu, Zhaoheng Zheng, Ram Nevatia:

Gait Recognition Using 3-D Human Body Shape Inference. 909-918 - Wes Robbins, Steven Zhou, Aman Bhatta, Chad Mello, Vítor Albiero, Kevin W. Bowyer, Terrance E. Boult:

CAST: Conditional Attribute Subsampling Toolkit for Fine-grained Evaluation. 919-929 - Ziyuan Huang, Zhengping Zhou, Yung-Yu Chuang, Jiajun Wu, C. Karen Liu:

Physically Plausible Animation of Human Upper Body from a Single Image. 930-939 - Manh Huynh, Gita Alaghband:

Online Adaptive Temporal Memory with Certainty Estimation for Human Trajectory Prediction. 940-949 - Igor Vozniak, Philipp Müller, Lorena Hell, Nils Lipp, Ahmed Abouelazm, Christian Müller:

Context-empowered Visual Attention Prediction in Pedestrian Scenarios. 950-960 - Akshay Agarwal

, Nalini K. Ratha
, Afzel Noore, Richa Singh, Mayank Vatsa:
Misclassifications of Contact Lens Iris PAD Algorithms: Is it Gender Bias or Environmental Conditions? 961-970 - André Brasil Vieira Wyzykowski, Anil K. Jain:

Synthetic Latent Fingerprint Generator. 971-980 - Andreas Specker

, Mickael Cormier, Jürgen Beyerer:
UPAR: Unified Pedestrian Attribute Recognition and Person Retrieval. 981-990 - Takahiro Toizumi, Koichi Takahashi, Masato Tsukada:

Segmentation-free Direct Iris Localization Networks. 991-1000 - Ahmed Tawfik Aboukhadra, Jameel Malik, Ahmed Elhayek

, Nadia Robertini, Didier Stricker:
THOR-Net: End-to-end Graformer-based Realistic Two Hands and Object Reconstruction with Self-supervision. 1001-1010 - Yuxin Tian, Shawn D. Newsam, Kofi Boakye:

Fashion Image Retrieval with Text Feedback by Additive Attention Compositional Learning. 1011-1021 - Xuri Ge, Fuhai Chen, Songpei Xu, Fuxiang Tao, Joemon M. Jose:

Cross-modal Semantic Enhanced Interaction for Image-Sentence Retrieval. 1022-1031 - Ruoyue Shen, Nakamasa Inoue, Koichi Shinoda:

Text-Guided Object Detector for Multi-modal Video Question Answering. 1032-1042 - Srikanth Malla, Chiho Choi, Isht Dwivedi, Joon Hee Choi, Jiachen Li:

DRAMA: Joint Risk Localization and Captioning in Driving. 1043-1052 - Ryugo Morita, Zhiqiang Zhang, Man M. Ho

, Jinjia Zhou:
Interactive Image Manipulation with Complex Text Instructions. 1053-1062 - Konstantin Kobs, Michael Steininger, Andreas Hotho:

InDiReCT: Language-Guided Zero-Shot Deep Metric Learning for Images. 1063-1072 - Tzu-Jui Julius Wang, Jorma Laaksonen

, Tomas Langer, Heikki Arponen, Tom E. Bishop:
Learning by Hallucinating: Vision-Language Pre-training with Weak Supervision. 1073-1083 - Abhishek Jha, Badri N. Patro, Luc Van Gool, Tinne Tuytelaars

:
Barlow constrained optimization for Visual Question Answering. 1084-1093 - Jason Armitage

, Leonardo Impett
, Rico Sennrich:
A Priority Map for Vision-and-Language Navigation with Trajectory Plans and Feature-Location Cues. 1094-1103 - Chia-Wen Kuo, Chih-Yao Ma, Judy Hoffman, Zsolt Kira:

Structure-Encoding Auxiliary Tasks for Improved Visual Representation in Vision-and-Language Navigation. 1104-1113 - Jihyeon Lee, Woo-Young Kang, Eun-Sol Kim:

Dense but Efficient VideoQA for Intricate Compositional Reasoning. 1114-1123 - Ukyo Honda, Taro Watanabe, Yuji Matsumoto:

Switching to Discriminative Image Captioning by Relieving a Bottleneck of Reinforcement Learning. 1124-1134 - Bhavin Jawade

, Deen Dayal Mohan, Naji Mohamed Ali, Srirangaraj Setlur
, Venu Govindaraju
:
NAPReg: Nouns As Proxies Regularization for Semantically Aware Cross-Modal Embeddings. 1135-1144 - Mark Hubenthal, Suren Kumar

:
Image-Text Pre-Training for Logo Recognition. 1145-1154 - Sahithya Ravi, Aditya Chinchure, Leonid Sigal, Renjie Liao, Vered Shwartz:

VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge. 1155-1165 - Jonas Theiner, Ralph Ewerth:

TVCalib: Camera Calibration for Sports Field Registration in Soccer. 1166-1175 - Yue Qiu, Shintaro Yamamoto, Ryosuke Yamada, Ryota Suzuki, Hirokatsu Kataoka, Kenji Iwata, Yutaka Satoh:

3D Change Localization and Captioning from Dynamic Scans of Indoor Scenes. 1176-1185 - Donghao Qiao, Farhana H. Zulkernine:

Adaptive Feature Fusion for Cooperative Perception using LiDAR Point Clouds. 1186-1195 - Hanzhe Teng, Dimitrios Chatziparaschis, Xinyue Kan, Amit K. Roy-Chowdhury, Konstantinos Karydis:

Centroid Distance Keypoint Detector for Colored Point Clouds. 1196-1205 - Linh Trinh

, Phuong Pham, Hoang Trinh, Nguyen Bach, Dung Nguyen, Giang Nguyen, Huy Nguyen:
PP4AV: A benchmarking Dataset for Privacy-preserving Autonomous Driving. 1206-1215 - Mohamed El Banani, Ignacio Rocco, David Novotný, Andrea Vedaldi, Natalia Neverova, Justin Johnson, Benjamin Graham:

Self-supervised Correspondence Estimation via Multiview Registration. 1216-1225 - Jeonghyun Kim, Kaichun Mo, Minhyuk Sung

, Woontack Woo:
Seg&Struct: The Interplay Between Part Segmentation and Structure Inference for 3D Shape Parsing. 1226-1235 - Chenxi Lola Deng, Enzo Tartaglione:

Compressing Explicit Voxel Grid Representations: fast NeRFs become also small. 1236-1245 - Renrui Zhang, Liuhui Wang, Ziyu Guo, Jianbo Shi:

Nearest Neighbors Meet Deep Neural Networks for Point Cloud Analysis. 1246-1255 - Kazuto Nakashima, Yumi Iwashita, Ryo Kurazume:

Generative Range Imaging for Learning Scene Priors of 3D LiDAR Data. 1256-1266 - Marwane Hariat, Antoine Manzanera, David Filliat:

Rebalancing gradient to improve self-supervised co-training of depth, odometry and optical flow predictions. 1267-1276 - David Deng, Avideh Zakhor:

RSF: Optimizing Rigid Scene Flow From 3D Point Clouds Without Labels. 1277-1286 - Xingyi Li, Wenxuan Wu, Xiaoli Z. Fern, Fuxin Li:

Improving the Robustness of Point Convolution on k-Nearest Neighbor Neighborhoods with a Viewpoint-Invariant Coordinate Transform. 1287-1297 - Tunhou Zhang, Mingyuan Ma, Feng Yan, Hai Li, Yiran Chen:

: Joint Point Interaction-Dimension Search for 3D Point Cloud. 1298-1307 - Abhishek Aich, Shasha Li, Chengyu Song

, M. Salman Asif, Srikanth V. Krishnamurthy
, Amit K. Roy-Chowdhury:
Leveraging Local Patch Differences in Multi-Object Scenes for Generative Adversarial Attacks. 1308-1318 - William Theisen, Daniel Gonzalez Cedre, Zachariah Carmichael, Daniel Moreira

, Tim Weninger, Walter J. Scheirer:
Motif Mining: Finding and Summarizing Remixed Image Content. 1319-1328 - Håkon Hukkelås, Frank Lindseth:

DeepPrivacy2: Towards Realistic Full-Body Anonymization. 1329-1338 - Chuqiao Li, Zhiwu Huang

, Danda Pani Paudel
, Yabin Wang
, Mohamad Shahbazi, Xiaopeng Hong, Luc Van Gool:
A Continual Deepfake Detection Benchmark: Dataset, Methods, and Essentials. 1339-1349 - Radhika Dua, Seongjun Yang, Yixuan Li, Edward Choi

:
Task Agnostic and Post-hoc Unseen Distribution Detection. 1350-1359 - Futa Waseda, Sosuke Nishikawa, Trung-Nghia Le

, Huy H. Nguyen, Isao Echizen:
Closer Look at the Transferability of Adversarial Examples: How They Fool Different Models Differently. 1360-1368 - Umur A. Ciftci, Gokturk Yuksek, Ilke Demir

:
My Face My Choice: Privacy Enhancing Deepfakes for Social Media Anonymization. 1369-1379 - Nathan Drenkow, Max Lennon, I-Jeng Wang, Philippe Burlina:

Do Adaptive Active Attacks Pose Greater Risk Than Static Attacks? 1380-1389 - Jian Jiang, Oya Çeliktutan:

Neural Weight Search for Scalable Task Incremental Learning. 1390-1399 - Thanh Vu, Yanqi Zhou, Chunfeng Wen, Yueqi Li, Jan-Michael Frahm:

Toward Edge-Efficient Dense Predictions with Synergistic Multi-Task Neural Architecture Search. 1400-1410 - Tianhong Li, Lijie Fan, Yuan Yuan, Hao He, Yonglong Tian, Rogério Feris, Piotr Indyk, Dina Katabi:

Addressing Feature Suppression in Unsupervised Visual Representations. 1411-1420 - Hamed Behzadi Khormuji, José Oramas

:
A Protocol for Evaluating Model Interpretation Methods from Visual Explanations. 1421-1429 - Håkon Hukkelås, Morten Smebye, Rudolf Mester, Frank Lindseth:

Realistic Full-Body Anonymization with Surface-Guided GANs. 1430-1440 - Tim Lebailly, Tinne Tuytelaars

:
Global-Local Self-Distillation for Visual Representation Learning. 1441-1450 - Pavel Suma, Giorgos Tolias

:
Large-to-small Image Resolution Asymmetry in Deep Metric Learning. 1451-1460 - Matthew Watson, Bashar Awwad Shiekh Hasan, Noura Al Moubayed:

Learning How to MIMIC: Using Model Explanations to Guide Deep Learning Training. 1461-1470 - Johannes Gilg, Torben Teepe, Fabian Herzog, Gerhard Rigoll:

The Box Size Confidence Bias Harms Your Object Detector. 1471-1480 - Mikolaj Sacha, Dawid Rymarczyk, Lukasz Struski, Jacek Tabor, Bartosz Zielinski:

ProtoSeg: Interpretable Semantic Segmentation with Prototypical Parts. 1481-1492 - Niccolò Cavagnero

, Luca Robbiano
, Barbara Caputo, Giuseppe Averta:
FreeREA: Training-Free Evolution-based Architecture Search. 1493-1502 - Zhewen Yu, Christos-Savvas Bouganis

:
SVD-NAS: Coupling Low-Rank Approximation and Neural Architecture Search. 1503-1512 - Tomoki Uchiyama, Naoya Sogi, Koichiro Niinuma, Kazuhiro Fukui:

Visually explaining 3D-CNN predictions for video classification with an adaptive occlusion sensitivity analysis. 1513-1522 - Jaspreet Singh

, Chandan Singh, Ankur Rana:
Orthogonal Transforms For Learning Invariant Representations In Equivariant Neural Networks. 1523-1530 - Shentong Mo, Zhun Sun, Chao Li:

Representation Disentanglement in Generative Models with Contrastive Learning. 1531-1540 - Rishabh Patra, Ramya Hebbalaguppe, Tirtharaj Dash, Gautam Shroff, Lovekesh Vig:

Calibrating Deep Neural Networks using Explicit Regularisation and Dynamic Data Pruning. 1541-1549 - Mitsuhiro Mabuchi, Tetsuya Ishikawa:

Patch-based Privacy Preserving Neural Network for Vision Tasks. 1550-1559 - Brian Chen, Ramprasaath R. Selvaraju, Shih-Fu Chang, Juan Carlos Niebles, Nikhil Naik:

PreViTS: Contrastive Pretraining with Video Tracking Supervision. 1560-1570 - Philippe Blatter, Menelaos Kanakis, Martin Danelljan, Luc Van Gool:

Efficient Visual Tracking with Exemplar Transformers. 1571-1581 - Martin Engilberge, Weizhe Liu, Pascal Fua:

Multi-view Tracking Using Weakly Supervised Human Motion Prediction. 1582-1592 - Jonás Serých

, Jirí Matas:
Planar Object Tracking via Weighted Optical Flow. 1593-1602 - Minjung Kim, MyeongAh Cho

, Sangyoun Lee:
Feature Disentanglement Learning with Switching and Aggregation for Video-based Person Re-Identification. 1603-1612 - Vladimir Somers, Christophe De Vleeschouwer, Alexandre Alahi:

Body Part-Based Representation Learning for Occluded Person Re-Identification. 1613-1623 - Djebril Mekhazni, Maximilien Dufau, Christian Desrosiers, Marco Pedersoli, Eric Granger

:
Camera Alignment and Weighted Contrastive Learning for Domain Adaptation in Video Person ReID. 1624-1633 - Daniel Davila, Dawei Du, Bryon Lewis, Christopher Funk

, Joseph Van Pelt, Roderic Collins, Kellie Corona, Matt S. Brown, Scott McCloskey, Anthony Hoogs, Brian Clipp:
MEVID: Multi-view Extended Videos with Identities for Video Person Re-Identification. 1634-1643 - Thomas Kreutz

, Max Mühlhäuser, Alejandro Sánchez Guinea:
Unsupervised 4D LiDAR Moving Object Segmentation in Stationary Settings with Multivariate Occupancy Time Series. 1644-1653 - Keivan Nalaie, Rong Zheng:

AttTrack: Online Deep Attention Transfer for Multi-object Tracking. 1654-1663 - Takanori Asanomi, Kazuya Nishimura, Ryoma Bise:

Multi-Frame Attention with Feature-Level Warping for Drone Crowd Tracking. 1664-1673 - Ali Athar, Jonathon Luiten, Paul Voigtlaender, Tarasha Khurana, Achal Dave, Bastian Leibe, Deva Ramanan

:
BURST: A Benchmark for Unifying Object Recognition, Segmentation and Tracking in Video. 1674-1683 - Lucas Jaffe

, Avideh Zakhor:
Gallery Filter Network for Person Search. 1684-1693 - Xiaoyu Xiang, Jon Morton, Fitsum A. Reda, Lucas D. Young, Federico Perazzi, Rakesh Ranjan, Amit Kumar, Andrea Colaco, Jan P. Allebach:

HIME: Efficient Headshot Image Super-Resolution with Multiple Exemplars. 1694-1704 - Jeya Maria Jose Valanarasu, Vishal M. Patel:

Fine-Context Shadow Detection using Shadow Removal. 1705-1714 - Haoyu Ren, Yi Fan, Stephen Huang:

Robust Real-world Image Enhancement Based on Multi-Exposure LDR Images. 1715-1723 - Runsheng Xu, Zhengzhong Tu, Yuanqi Du, Xiaoyu Dong, Jinlong Li, Zibo Meng, Jiaqi Ma, Alan C. Bovik, Hongkai Yu:

Pik-Fix: Restoring and Colorizing Old Photos. 1724-1734 - Dario Fuoli, Martin Danelljan, Radu Timofte

, Luc Van Gool:
Fast Online Video Super-Resolution with Deformable Attention Pyramid. 1735-1744 - Jonghwa Yim, Minjae Kim:

Style-Guided Inference of Transformer for High-resolution Image Synthesis. 1745-1755 - Hue Nguyen, Diep Tran, Khoi Nguyen, Rang Nguyen:

PSENet: Progressive Self-Enhancement Network for Unsupervised Extreme-Light Image Enhancement. 1756-1765 - Eugene Lee, Lien-Feng Hsu, Evan Chen

, Chen-Yi Lee:
Cross-Resolution Flow Propagation for Foveated Video Super-Resolution. 1766-1775 - Yunhan Zhao, Connelly Barnes, Yuqian Zhou, Eli Shechtman, Sohrab Amirghodsi, Charless C. Fowlkes:

GeoFill: Reference-Based Image Inpainting with Better Geometric Understanding. 1776-1786 - Jooyeol Yun, Sanghyeon Lee, Minho Park, Jaegul Choo:

iColoriT: Towards Propagating Local Hints to the Right Region in Interactive Colorization by Leveraging Vision Transformer. 1787-1796 - Charles Laroche, Andrés Almansa

, Matias Tassano:
Deep Model-Based Super-Resolution with Non-uniform Blur. 1797-1808 - Mrinmoy Sen, Sai Pradyumna Chermala, Nazrinbanu Nurmohammad Nagori, Venkat Peddigari, Praful Mathur, B. H. Pawan Prasad, Moon-Hwan Jeong:

SHARDS: Efficient SHAdow Removal using Dual Stage Network for High-Resolution Images. 1809-1817 - Youngin Cho, Junsoo Lee, Soyoung Yang, Juntae Kim, Yeojeong Park, Haneol Lee, Mohammad Azam Khan, Daesik Kim, Jaegul Choo:

Guiding Users to Where to Give Color Hints for Efficient Interactive Sketch Colorization via Unsupervised Region Prioritization. 1818-1827 - Youngrae Kim, Jinsu Lim, Hoonhee Cho, Minji Lee, Dongman Lee, Kuk-Jin Yoon, Ho-Jin Choi:

Efficient Reference-based Video Super-Resolution (ERVSR): Single Reference Image Is All You Need. 1828-1837 - Stavros Tsogkas, Fengjia Zhang, Allan D. Jepson, Alex Levinshtein:

Efficient Flow-Guided Multi-frame De-fencing. 1838-1847 - Marcos V. Conde, Florin-Alexandru Vasluianu, Javier Vazquez-Corral

, Radu Timofte
:
Perceptual Image Enhancement for Smartphone Real-Time Applications. 1848-1858 - Ting-Wei Wu, Jia-Hong Huang, Joseph Lin, Marcel Worring

:
Expert-defined Keywords Improve Interpretability of Retinal Image Captioning. 1859-1868 - Kun Han, Shanlin Sun, Xiangyi Yan, Chenyu You, Hao Tang, Junayed Naushad, Haoyu Ma, Deying Kong, Xiaohui Xie:

Diffeomorphic Image Registration with Neural Velocity Field. 1869-1879 - Ali Mirzazadeh, Florian Dubost, Maxwell Pike, Krish Maniar, Max Zuo, Christopher Lee-Messer, Daniel L. Rubin:

ATCON: Attention Consistency for Vision Models. 1880-1889 - Florian Dubost, Erin Hong, Siyi Tang, Nandita Bhaskhar, Christopher Lee-Messer, Daniel L. Rubin:

Semi-Supervised Learning for Sparsely-Labeled Sequential Data: Application to Healthcare Video Processing. 1890-1899 - Huy H. Nguyen, Trung-Nghia Le

, Junichi Yamagishi, Isao Echizen:
Analysis of Master Vein Attacks on Finger Vein Recognition Systems. 1900-1908 - Xiaofei Huang, Michael Wan

, Lingfei Luan
, Bethany Tunik, Sarah Ostadabbas:
Computer Vision to the Rescue: Infant Postural Symmetry Estimation from Incongruent Annotations. 1909-1917 - Kechun Liu, Beibin Li, Wenjun Wu, Caitlin J. May, Oliver Chang, Stevan Knezevich, Lisa M. Reisch, Joann G. Elmore

, Linda G. Shapiro:
VSGD-Net: Virtual Staining Guided Melanocyte Detection on Histopathological Images. 1918-1927 - Bowen Song

, Liyue Shen, Lei Xing
:
PINER: Prior-informed Implicit Neural Representation Learning for Test-time Adaptation in Sparse-view CT Reconstruction. 1928-1937 - Junmo Cho, Seungjae Han

, Eun-Seo Cho
, Kijung Shin, Young-Gyu Yoon
:
Robust and Efficient Alignment of Calcium Imaging Data through Simultaneous Low Rank and Sparse Decomposition. 1938-1947 - Jiyoon Shin, Jungwoo Lee:

MRI Imputation based on Fused Index- and Intensity-Registration. 1948-1957 - Joshua Peters, Léo Lebrat

, Rodrigo Santa Cruz, Aaron Nicolson, Gregg Belous, Salamata Konate, Parnesh Raniga, Vincent Doré, Pierrick Bourgeat, Jurgen Mejan-Fripp, Clinton Fookes, Olivier Salvado
:
DBCE : A Saliency Method for Medical Deep Learning Through Anatomically-Consistent Free-Form Deformations. 1958-1968 - Zekai Chen

, Devansh Agarwal
, Kshitij Aggarwal, Wiem Safta, Mariann Micsinai Balan, Kevin Brown:
Masked Image Modeling Advances 3D Medical Image Analysis. 1969-1979 - Tien-Phat Nguyen, Trong-Thang Pham, Tri Nguyen, Hieu Le, Dung Nguyen, Hau Lam, Phong Nguyen, Jennifer Fowler

, Minh-Triet Tran, Ngan Le:
EmbryosFormer: Deformable Transformer and Collaborative Encoding-Decoding for Embryos Stage Development Classification. 1980-1989 - Ella Lan:

Performer: A Novel PPG-to-ECG Reconstruction Transformer for a Digital Biomarker of Cardiovascular Disease Detection. 1990-1998 - Puria Azadi Moghadam, Sanne Van Dalen, Karina C. Martin, Jochen K. Lennerz

, Stephen Yip, Hossein Farahani, Ali Bashashati:
A Morphology Focused Diffusion Probabilistic Model for Synthesis of Histopathology Images. 1999-2008 - Zongshang Pang, Yuta Nakashima, Mayu Otani, Hajime Nagahara:

Contrastive Losses Are Natural Criteria for Unsupervised Video Summarization. 2009-2018 - Lukas Mehl, Azin Jahedi, Jenny Schmalfuss, Andrés Bruhn:

M-FUSE: Multi-frame Fusion for Scene Flow Estimation. 2019-2028 - Khurram Azeem Hashmi, Alain Pagani, Didier Stricker, Muhammad Zeshan Afzal

:
BoxMask: Revisiting Bounding Box Supervision for Video Object Detection. 2029-2039 - Jasdeep Singh, Subrahmanyam Murala, G. Sankara Raju Kosuru:

Lightweight Network For Video Motion Magnification. 2040-2049 - Feiyan Hu

, Simone Palazzo, Federica Proietto Salanitri
, Giovanni Bellitto
, Morteza Moradi, Concetto Spampinato, Kevin McGuinness:
TinyHD: Efficient Video Saliency Prediction with Heterogeneous Decoders using Hierarchical Maps Distillation. 2050-2059 - Rémi Marsal, Florian Chabot, Angélique Loesch, Hichem Sahbi:

BrightFlow: Brightness-Change-Aware Unsupervised Learning of Optical Flow. 2060-2069 - Tarun Kalluri, Deepak Pathak, Manmohan Chandraker, Du Tran:

FLAVR: Flow-Agnostic Video Representations for Fast Frame Interpolation. 2070-2081 - Digbalay Bose

, Rajat Hebbar, Krishna Somandepalli, Haoyang Zhang, Yin Cui, Kree Cole-McLaughlin, Huisheng Wang, Shrikanth Narayanan:
MovieCLIP: Visual Scene Recognition in Movies. 2082-2091 - Florian Hofherr, Lukas Koestler, Florian Bernard, Daniel Cremers

:
Neural Implicit Representations for Physical Parameter Inference from a Single Video. 2092-2102 - Florian Kadner, Tobias Thomas, David Hoppe, Constantin A. Rothkopf

:
Improving saliency models' predictions of the next fixation with humans' intrinsic cost of gaze shifts. 2103-2113 - Boris Chen, Amir Ziai, Rebecca S. Tucker, Yuchen Xie:

Match Cutting: Finding Cuts with Smooth Visual Transitions. 2114-2124 - David Osowiechi, Gustavo Adolfo Vargas Hakim, Mehrdad Noori, Milad Cheraghalikhani, Ismail Ben Ayed, Christian Desrosiers:

TTTFlow: Unsupervised Test-Time Training with Normalizing Flow. 2125-2126 - Michael Schelling

, Pedro Hermosilla
, Timo Ropinski
:
Weakly-Supervised Optical Flow Estimation for Time-of-Flight. 2134-2143 - Chaerin Min, Tae Hyun Kim, Jongwoo Lim:

Meta-Learning for Adaptation of Deep Optical Flow Networks. 2144-2153 - Zecheng Yu, Yifei Huang, Ryosuke Furuta, Takuma Yagi, Yusuke Goutsu, Yoichi Sato:

Fine-grained Affordance Annotation for Egocentric Hand-Object Interaction Videos. 2154-2162 - Hong Xuan, Xi Stephen Chen:

Dissecting Deep Metric Learning Losses for Image-Text Retrieval. 2163-2172 - Takayuki Nakatsuka

, Masahiro Hamasaki, Masataka Goto
:
Content-Based Music-Image Retrieval Using Self- and Cross-Modal Feature Embedding Memory. 2173-2183 - Cagri Gungor, Adriana Kovashka:

Complementary Cues from Audio Help Combat Noise in Weakly-Supervised Object Detection. 2184-2193 - Mariana-Iuliana Georgescu, Radu Tudor Ionescu, Andreea-Iuliana Miron

, Olivian Savencu, Nicolae-Catalin Ristea, Nicolae Verga, Fahad Shahbaz Khan:
Multimodal Multi-Head Convolutional Attention with Various Kernel Sizes for Medical Image Super-Resolution. 2194-2204 - Chunjin Song, Yuchi Zhang, Willis Peng, Parmis Mohaghegh, Bastian Wandt, Helge Rhodin:

AudioViewer: Learning to Visualize Sounds. 2205-2215 - Aditya Agarwal, Bipasha Sen, Rudrabha Mukhopadhyay, Vinay P. Namboodiri, C. V. Jawahar:

Towards MOOCs for Lipreading: Using Synthetic Talking Heads to Train Humans in Lipreading at Scale. 2216-2225 - Zudi Lin, Erhan Bas, Kunwar Yashraj Singh, Gurumurthy Swaminathan, Rahul Bhotika:

Relaxing Contrastiveness in Multimodal Representation Learning. 2226-2235 - Arda Senocak, Junsik Kim, Tae-Hyun Oh, Dingzeyu Li, In So Kweon:

Event-Specific Audio-Visual Fusion Layers: A Simple and New Perspective on Video Understanding. 2236-2246 - Youshan Zhang, Jialu Li:

BirdSoundsDenoising: Deep Visual Audio Denoising for Bird Sounds. 2247-2256 - Maxime Burchi, Radu Timofte

:
Audio-Visual Efficient Conformer for Robust Speech Recognition. 2257-2266 - Prateksha Udhayanan, Suryateja BV, Parth Laturia, Dev Chauhan, Darshan Khandelwal, Stefano Petrangeli, Balaji Vasan Srinivasan:

Recipe2Video: Synthesizing Personalized Videos from Recipe Texts. 2267-2276 - Dennis Fedorishin, Deen Dayal Mohan, Bhavin Jawade

, Srirangaraj Setlur
, Venu Govindaraju
:
Hear The Flow: Optical Flow-Based Self-Supervised Visual Sound Source Localization. 2277-2286 - Arpit Garg

, Cuong Nguyen
, Rafael Felix, Thanh-Toan Do, Gustavo Carneiro
:
Instance-Dependent Noisy Label Learning via Graphical Modelling. 2287-2297 - Menelaos Kanakis, Thomas E. Huang, David Brüggemann, Fisher Yu, Luc Van Gool:

Composite Learning for Robust and Effective Dense Predictions. 2298-2307 - HyunJae Lee, Gihyeon Lee, Junhwan Kim, Sungjun Cho

, Dohyun Kim, Donggeun Yoo:
Improving Multi-fidelity Optimization with a Recurring Learning Rate for Hyperparameter Tuning. 2308-2317 - Hongjun Choi

, Eun Som Jeon, Ankita Shukla, Pavan K. Turaga:
Understanding the Role of Mixup in Knowledge Distillation: An Empirical Study. 2318-2327 - Ivan Lopes, Tuan-Hung Vu, Raoul de Charette:

Cross-task Attention Mechanism for Dense Multi-task Learning. 2328-2337 - Dupati Srikar Chandra, Sakshi Varshney, P. K. Srijith, Sunil Gupta:

Continual Learning with Dependency Preserving Hypernetworks. 2338-2347 - Souvik Kundu, Sairam Sundaresan, Massoud Pedram, Peter A. Beerel

:
FLOAT: Fast Learnable Once-for-All Adversarial Training for Tunable Trade-off between Accuracy and Robustness. 2348-2357 - Geethu Miriam Jacob, Vishal Agarwal, Björn Stenger:

Online Knowledge Distillation for Multi-task Learning. 2358-2367 - Timon Höfer, Benjamin Kiefer, Martin Messmer, Andreas Zell:

HyperPosePDF Hypernetworks Predicting the Probability Distribution on SO(3). 2368-2378 - Ahmed Ben Saad, Kristina Prokopetc, Josselin Kherroubi, Axel Davy, Adrien Courtois, Gabriele Facciolo:

Improving Pixel-Level Contrastive Learning by Leveraging Exogenous Depth Information. 2379-2388 - Olivier Risser-Maroix, Benjamin Chamand:

What can we Learn by Predicting Accuracy? 2389-2398 - Eva Feillet, Grégoire Petit, Adrian Popescu, Marina Reyboz, Céline Hudelot:

AdvisIL - A Class-Incremental Learning Advisor. 2399-2408 - Daehyun Ahn, Hyungjun Kim, Taesu Kim, Eunhyeok Park, Jae-Joon Kim:

Searching for Robust Binary Neural Networks via Bimodal Parameter Perturbation. 2409-2418 - Ben Usman, Dina Bashkirova, Kate Saenko:

RIFT: Disentangled Unsupervised Image Translation via Restricted Information Flow. 2419-2428 - Gourav Datta, Zeyu Liu, Zihan Yin, Linyu Sun, Akhilesh R. Jaiswal, Peter A. Beerel

:
Enabling ISPless Low-Power Computer Vision. 2429-2438 - Benoit Brummer

, Christophe De Vleeschouwer:
On the Importance of Denoising when Learning to Compress Images. 2439-2447 - Khanh Quoc Dinh, Kwang Pyo Choi

:
End-to-End Single-Frame Image Signal Processing for High Dynamic Range Scenes. 2448-2457 - Nithin C. Babu

, Vignesh Kannan, Rajiv Soundararajan:
No Reference Opinion Unaware Quality Assessment of Authentically Distorted Images. 2458-2467 - Marcin Sendera, Marcin Przewiezlikowski, Konrad Karanowski, Maciej Zieba, Jacek Tabor, Przemyslaw Spurek:

HyperShot: Few-Shot Learning by Kernel HyperNetworks. 2468-2477 - Rakshith Subramanyam, Mark Heimann, T. S. Jayram, Rushil Anirudh, Jayaraman J. Thiagarajan:

Contrastive Knowledge-Augmented Meta-Learning for Few-Shot Classification. 2478-2486 - Hao Ding, Changchang Sun, Hao Tang, Dawen Cai, Yan Yan:

Few-shot Medical Image Segmentation with Cycle-resemblance Attention. 2487-2496 - Nitish Mital, Ezgi Özyilkan

, Ali Garjani, Deniz Gündüz:
Neural Distributed Image Compression with Cross-Attention Feature Alignment. 2497-2506 - Huanle Zhang, Hamed Pirsiavash, Xin Liu:

MASTAF: A Model-Agnostic Spatio-Temporal Attention Fusion Network for Few-shot Video Classification. 2507-2516 - Hao-Wei Chen, Ting-Hsuan Liao, Hsuan-Kung Yang, Chun-Yi Lee:

Pixel-Wise Prediction based Visual Odometry via Uncertainty Estimation. 2517-2527 - Koki Tsubota, Hiroaki Akutsu, Kiyoharu Aizawa:

Universal Deep Image Compression via Content-Adaptive Optimization with Adapters. 2528-2537 - Dahye Kim, Jungin Park, Jiyoung Lee

, Seongheon Park, Kwanghoon Sohn:
Language-free Training for Zero-shot Video Grounding. 2538-2547 - Soma Kajiyama, Taihe Piao, Ryo Kawahara

, Takahiro Okabe:
Separating Partially-Polarized Diffuse and Specular Reflection Components under Unpolarized Light Sources. 2548-2557 - Alper Kayabasi, Gülin Tüfekci, Ilkay Ulusoy:

Elimination of Non-Novel Segments at Multi-Scale for Few-Shot Segmentation. 2558-2566 - Jihyun Kim, Seong-Hun Jeong, Kyeongbo Kong, Suk-Ju Kang:

An Unified Framework for Language Guided Image Completion. 2567-2577 - Abhishek Aich, Kuan-Chuan Peng, Amit K. Roy-Chowdhury:

Cross-Domain Video Anomaly Detection without Target Domain Adaptation. 2578-2590 - Marco Rudolph, Tom Wehrbein, Bodo Rosenhahn, Bastian Wandt:

Asymmetric Student-Teacher Networks for Industrial Anomaly Detection. 2591-2601 - Julia Hornauer, Vasileios Belagiannis:

Heatmap-based Out-of-Distribution Detection. 2602-2611 - Paul Bergmann, David Sattlegger:

Anomaly Detection in 3D Point Clouds using Deep Geometric Descriptors. 2612-2622 - Wonwoo Cho, Jeonghoon Park, Jaegul Choo:

Training Auxiliary Prototypical Classifiers for Explainable Anomaly Detection in Medical Image Segmentation. 2623-2632 - Hanqiu Deng, Zhaoxiang Zhang, Shihao Zou, Xingyu Li:

Bi-directional Frame Interpolation for Unsupervised Video Anomaly Detection. 2633-2642 - Samuel Wilson, Tobias Fischer

, Niko Sünderhauf
, Feras Dayoub
:
Hyperdimensional Feature Fusion for Out-of-Distribution Detection. 2643-2653 - Keval Doshi, Yasin Yilmaz:

Towards Interpretable Video Anomaly Detection. 2654-2663 - Seongheon Park, Hanjae Kim, Minsu Kim, Dahye Kim, Kwanghoon Sohn:

Normality Guided Multiple Instance Learning for Weakly Supervised Video Anomaly Detection. 2664-2673 - Changhwa Park, Junho Yim, Eunji Jun:

Mutual Learning for Long-Tailed Recognition. 2674-2683 - Xiangyi Yan, Junayed Naushad, Shanlin Sun, Kun Han, Hao Tang, Deying Kong, Haoyu Ma, Chenyu You, Xiaohui Xie:

Representation Recovering for Self-Supervised Pre-training on Medical Images. 2684-2694 - Cheng-Yen Hsieh, Chih-Jung Chang, Fu-En Yang, Yu-Chiang Frank Wang:

Self-Supervised Pyramid Representation Learning for Multi-Label Visual Analysis and Beyond. 2695-2704 - Julien Denize, Jaonary Rabarisoa, Astrid Orcesi, Romain Hérault

, Stéphane Canu
:
Similarity Contrastive Estimation for Self-Supervised Soft Contrastive Learning. 2705-2715 - Prakash Chandra Chhipa

, Richa Upadhyay, Gustav Grund Pihlgren
, Rajkumar Saini, Seiichi Uchida, Marcus Liwicki:
Magnification Prior: A Self-Supervised Method for Learning Representations on Breast Cancer Histopathological Images. 2716-2726 - Ayush K. Rai, Tarun Krishna, Julia Dietlmeier

, Kevin McGuinness, Alan F. Smeaton, Noel E. O'Connor
:
Motion Aware Self-Supervision for Generic Event Boundary Detection. 2727-2738 - So Hasegawa, Masayuki Hiromoto, Akira Nakagawa

, Yuhei Umeda:
Improving Predicate Representation in Scene Graph Generation by Self-Supervised Learning. 2739-2748 - Suhong Moon, Domas Buracas, Seunghyun Park, Jinkyu Kim, John F. Canny:

An Embedding-Dynamic Approach to Self-Supervised Learning. 2749-2757 - Samarth Sinha, Peter V. Gehler, Francesco Locatello, Bernt Schiele

:
TeST: Test-time Self-Training under Distribution Shift. 2758-2768 - Wei-Chi Chen, Wei-Ta Chu:

SSSD: Self-Supervised Self Distillation. 2769-2776 - Shentong Mo, Zhun Sun, Chao Li:

Multi-level Contrastive Learning for Self-Supervised Vision Transformers. 2777-2786 - Srikrishna Jaganathan, Maximilian Kukla, Jian Wang, Karthik Shetty, Andreas K. Maier:

Self-Supervised 2D/3D Registration for X-Ray to CT Image Fusion. 2787-2797 - Salman Mohamadi, Gianfranco Doretto, Donald A. Adjeroh:

FUSSL: Fuzzy Uncertain Self Supervised Learning. 2798-2807 - Atsuyuki Miyai, Qing Yu, Daiki Ikami, Go Irie, Kiyoharu Aizawa:

Rethinking Rotation in Self-Supervised Contrastive Learning: Adaptive Positive or Negative Data Augmentation. 2808-2817 - Michael Mu, Sreyasee Das Bhattacharjee

, Junsong Yuan
:
Self-Supervised Distilled Learning for Multi-modal Misinformation Identification. 2818-2827 - Jiho Jang, Seonhoon Kim, KiYoon Yoo, Chaerin Kong, Jangho Kim, Nojun Kwak:

Self-Distilled Self-supervised Representation Learning. 2828-2838 - Yifan Xu

, Pourya Shamsolmoali, Eric Granger
, Claire Nicodeme, Laurent Gardes, Jie Yang:
TransVLAD: Multi-Scale Attention-Based Global Descriptors for Visual Geo-Localization. 2839-2848 - Sungho Chun, Sungbum Park, Ju Yong Chang:

Learnable Human Mesh Triangulation for 3D Human Pose and Shape Estimation. 2849-2858 - Stefan Thalhammer

, Timothy Patten, Markus Vincze:
COPE: End-to-end trainable Constant Runtime Object Pose Estimation. 2859-2869 - Christian Grund, Julian Tanke

, Juergen Gall:
ElliPose: Stereoscopic 3D Human Pose Estimation by Fitting Ellipsoids. 2870-2880 - Snehal Bhayani, Torsten Sattler, Viktor Larsson, Janne Heikkilä, Zuzana Kukelova:

Partially calibrated semi-generalized pose from hybrid point correspondences. 2881-2890 - Arthur Moreau, Thomas Gilles, Nathan Piasco, Dzmitry Tsishkou, Bogdan Stanciulescu, Arnaud de La Fortelle:

ImPosing: Implicit Pose Encoding for Efficient Visual Localization. 2891-2901 - Moritz Einfalt, Katja Ludwig, Rainer Lienhart:

Uplift and Upsample: Efficient 3D Human Pose Estimation with Uplifting Transformers. 2902-2912 - Xiaohan Zhang, Waqas Sultani, Safwan Wshah:

Cross-View Image Sequence Geo-localization. 2913-2922 - Cheng-Yen Yang, Jiajia Luo, Lu Xia, Yuyin Sun, Nan Qiao, Ke Zhang, Zhongyu Jiang, Jenq-Neng Hwang, Cheng-Hao Kuo:

CameraPose: Weakly-Supervised Monocular 3D Human Pose Estimation by Leveraging In-the-wild 2D Annotations. 2923-2932 - Seongyeong Lee, Hansoo Park, Dong Uk Kim, Jihyeon Kim, Muhammadjon Boboev, Seungryul Baek:

Image-free Domain Generalization via CLIP for 3D Hand Pose Estimation. 2933-2943 - Lauri Suomela, Jussi Kalliola, Atakan Dag, Harry Edelman

, Joni-Kristian Kämäräinen:
Benchmarking Visual Localization for Autonomous Navigation. 2944-2954 - István Sárándi

, Alexander Hermans, Bastian Leibe:
Learning 3D Human Pose Estimation from Dozens of Datasets using a Geometry-Aware Autoencoder to Bridge Between Skeleton Formats. 2955-2965 - Jaehoon Ko, Kyusun Cho, Daewon Choi, Kwangrok Ryoo, Seungryong Kim:

3D GAN Inversion with Pose Optimization. 2966-2975 - Erwin Wu, Hayato Nishioka, Shinichi Furuya, Hideki Koike

:
Marker-removal Networks to Collect Precise 3D Hand Data for RGB-based Estimation and its Application in Piano. 2976-2985 - Michaël Soumm, Adrian Popescu, Bertrand Delezoide:

Vis2Rec: A Large-Scale Visual Dataset for Visit Recommendation. 2986-2996 - Amar Ali-bey, Brahim Chaib-draa, Philippe Giguère:

MixVPR: Feature Mixing for Visual Place Recognition. 2997-3006 - Porter Jenkins, Kyle Armstrong, Stephen Nelson, Siddhesh Gotad, J. Stockton Jenkins, Wade Wilkey, Tanner Watts:

CountNet3D: A 3D Computer Vision Approach to Infer Counts of Occluded Objects. 3007-3016 - Zhaoshuo Li, Wei Ye, Dilin Wang, Francis X. Creighton, Russell H. Taylor, Ganesh Venkatesh, Mathias Unberath:

Temporally Consistent Online Depth Estimation in Dynamic Scenes. 3017-3026 - Kensuke Taguchi

, Shogo Morita, Yusuke Hayashi, Wataru Imaeda, Hironobu Fujiyoshi:
Uncertainty-Aware Interactive LiDAR Sampling for Deep Depth Completion. 3027-3035 - Kohei Yamashita

, Yuto Enyo, Shohei Nobuhara, Ko Nishino:
nLMVS-Net: Deep Non-Lambertian Multi-View Stereo. 3036-3045 - Gustav Bredell, Ertunc Erdil, Bruno Weber

, Ender Konukoglu:
Wiener Guided DIP for Unsupervised Blind Image Deconvolution. 3046-3055 - Ching-Ya Chiu, Yu-Ting Wu, I-Chao Shen

, Yung-Yu Chuang:
360MVSNet: Deep Multi-view Stereo Network with 360° Images for Indoor Scene Reconstruction. 3056-3065 - Markus Plack, Clara Callenberg, Monika Schneider, Matthias B. Hullin:

Fast Differentiable Transient Rendering for Non-Line-of-Sight Reconstruction. 3066-3075 - Andra Petrovai, Sergiu Nedevschi

:
MonoDVPS: A Self-Supervised Monocular Depth Estimation Approach to Depth-aware Video Panoptic Segmentation. 3076-3085 - Christian Sormann

, Emanuele Santellani, Mattia Rossi, Andreas Kuhn, Friedrich Fraundorfer:
DELS-MVS: Deep Epipolar Line Search for Multi-View Stereo. 3086-3095 - Antoni Rosinol, John J. Leonard, Luca Carlone:

Probabilistic Volumetric Fusion for Dense Monocular SLAM. 3096-3104 - Lu Sang, Bjoern Haefner, Xingxing Zuo, Daniel Cremers

:
High-Quality RGB-D Reconstruction via Multi-View Uncalibrated Photometric Stereo and Gradient-SDF. 3105-3114 - Chi-Han Peng, Jiayao Zhang:

High-Resolution Depth Estimation for 360° Panoramas through Perspective and Panoramic Depth Images Registration. 3115-3124 - Berk Kaya, Suryansh Kumar

, Carlos Eduardo Porto de Oliveira, Vittorio Ferrari, Luc Van Gool:
Multi-View Photometric Stereo Revisited. 3125-3134 - Hari Santhanam, Nehal Doiphode, Jianbo Shi:

Automated Line Labelling: Dataset for Contour Detection and 3D Reconstruction. 3135-3144 - Mohammad Farazi, Wenhui Zhu, Zhangsihao Yang, Yalin Wang:

Anisotropic Multi-Scale Graph Convolutional Network for Dense Shape Correspondence. 3145-3154 - Stefan Ainetter, Sinisa Stekovic, Friedrich Fraundorfer, Vincent Lepetit:

Automatically Annotating Indoor Images with CAD Models via RGB-D Scans. 3155-3163 - Daan de Geus

, Gijs Dubbelman:
Intra-Batch Supervision for Panoptic Segmentation on High-Resolution Images. 3164-3172 - David Brüggemann, Christos Sakaridis

, Prune Truong, Luc Van Gool:
Refign: Align and Refine for Adaptation of Semantic Segmentation to Adverse Conditions. 3173-3183 - Kazuya Nishimura, Ryoma Bise:

Weakly Supervised Cell-Instance Segmentation with Two Types of Weak Labels by Single Instance Pasting. 3184-3193 - Dipam Goswami, René Schuster, Joost van de Weijer

, Didier Stricker:
Attribution-aware Weight Transfer: A Warm-Start Initialization for Class-Incremental Semantic Segmentation. 3194-3203 - Kazuki Endo, Masayuki Tanaka, Masatoshi Okutomi:

Semantic Segmentation of Degraded Images Using Layer-Wise Feature Adjustor. 3204-3212 - Matthias Rottmann, Marco Reese:

Automated Detection of Label Errors in Semantic Segmentation Datasets via Deep Learning and Uncertainty Quantification. 3213-3222 - Loic Themyr, Clément Rambour, Nicolas Thome, Toby Collins, Alexandre Hostettler:

Full Contextual Attention for Multi-resolution Transformers in Semantic Segmentation. 3223-3232 - Subba Reddy Oota, Vijay Rowtula, Shahid Saleem Mohammed, Minghsun Liu, Manish Gupta:

WSNet: Towards An Effective Method for Wound Image Segmentation. 3233-3242 - Bruno Sauvalle, Arnaud de La Fortelle:

Autoencoder-based background reconstruction and foreground segmentation with background noise estimation. 3243-3254 - Fengyi Shen, Zador Pataki, Akhil Gurram, Ziyuan Liu, He Wang, Alois C. Knoll:

LoopDA: Constructing Self-loops to Adapt Nighttime Semantic Segmentation. 3255-3265 - Sandra Kara, Hejer Ammar, Florian Chabot, Quoc Cuong Pham:

Image Segmentation-based Unsupervised Multiple Objects Discovery. 3276-3285 - Shubhankar Borse

, Marvin Klingner, Varun Ravi Kumar, Hong Cai, Abdulaziz Almuzairee, Senthil Kumar Yogamani, Fatih Porikli
:
X-Align: Cross-Modal Cross-View Alignment for Bird's-Eye-View Segmentation. 3286-3296 - Sumin Lee, Sangmin Woo, Yeonju Park, Muhammad Adi Nugroho, Changick Kim:

Modality Mixer for Multi-modal Action Recognition. 3297-3306 - Jeffrey Byrne, Greg Castañón, Zhongheng Li, Gil J. Ettinger:

Fine-grained Activities of People Worldwide. 3307-3318 - Dasom Ahn, Sangwon Kim, Hyunsu Hong, ByoungChul Ko:

STAR-Transformer: A Spatio-temporal Cross Attention Transformer for Human Action Recognition. 3319-3328 - Gueter Josmy Faure

, Min-Hung Chen, Shang-Hong Lai:
Holistic Interaction Transformer Network for Action Detection. 3329-3339 - Yue Qiu, Yoshiki Nagasaki, Kensho Hara, Hirokatsu Kataoka, Ryota Suzuki, Kenji Iwata, Yutaka Satoh:

VirtualHome Action Genome: A Simulated Spatio-Temporal Scene Graph Dataset with Consistent Relationship Labels. 3340-3349 - Jong-Hyeon Seon, Jaedong Hwang, Jonghwan Mun, Bohyung Han:

Stop or Forward: Dynamic Layer Skipping for Efficient Action Recognition. 3350-3359 - Dawei Du, Ameya Shringi, Anthony Hoogs, Christopher Funk

:
Reconstructing Humpty Dumpty: Multi-feature Graph Autoencoder for Open Set Action Recognition. 3360-3369 - Ketul Shah, Anshul Shah, Chun Pong Lau

, Celso M. de Melo, Rama Chellappa:
Multi-View Action Recognition using Contrastive Learning. 3370-3380 - Tanay Agrawal, Michal Balazia, Philipp Müller, François Brémond:

Multimodal Vision Transformers with Forced Attention for Behavior Analysis. 3381-3391 - Min-Seok Kang, Dongoh Kang, Hansaem Kim:

Efficient Skeleton-Based Action Recognition via Joint-Mapping strategies. 3392-3401 - Samrudhdhi B. Rangrej, Kevin J. Liang, Tal Hassner, James J. Clark:

GliTr: Glimpse Transformers with Spatiotemporal Consistency for Online Action Prediction. 3402-3412 - Siqi Deng, Yuanjun Xiong, Meng Wang, Wei Xia, Stefano Soatto:

Harnessing Unrecognizable Faces for Improving Face Recognition. 3413-3422 - Byungho Jo, Donghyeon Cho, In Kyu Park, Sungeun Hong:

IFQA: Interpretable Face Quality Assessment. 3433-3442 - Felix Rosberg

, Eren Erdal Aksoy, Fernando Alonso-Fernandez, Cristofer Englund:
FaceDancer: Pose- and Occlusion-Aware High Fidelity Face Swapping. 3443-3452 - Sangjin Park, Dae Ha Kim, Byung Cheol Song:

Fine Gaze Redirection Learning with Gaze Hardness-aware Transformation. 3453-3462 - Frank Yu, Sid Fels

, Helge Rhodin:
Scaling Neural Face Synthesis to High FPS and Low Latency by Neural Caching. 3463-3472 - Philipp Terhörst, Malte Ihlefeld, Marco Huber, Naser Damer, Florian Kirchbuchner, Kiran B. Raja, Arjan Kuijper:

QMagFace: Simple and Accurate Quality-Aware Face Recognition. 3473-3483 - Aditya Agarwal, Bipasha Sen, Rudrabha Mukhopadhyay, Vinay P. Namboodiri, C. V. Jawahar:

FaceOff: A Video-to-Video Face Swapping System. 3484-3493 - Tingyu Qu, Tinne Tuytelaars

, Marie-Francine Moens:
Weakly Supervised Face Naming with Symmetry-Enhanced Contrastive Loss. 3494-3503 - Chirag Raman, Charlie Hewitt, Erroll Wood, Tadas Baltrusaitis:

Mesh-Tension Driven Expression-Based Wrinkles for Synthetic Faces. 3504-3514 - Gwangbin Bae, Martin de La Gorce, Tadas Baltrusaitis, Charlie Hewitt, Dong Chen, Julien P. C. Valentin, Roberto Cipolla, Jingjing Shen:

DigiFace-1M: 1 Million Digital Face Images for Face Recognition. 3515-3524 - Stathis Galanakis

, Baris Gecer, Alexandros Lattas, Stefanos Zafeiriou:
3DMM-RF: Convolutional Radiance Fields for 3D Face Modeling. 3525-3536 - Yang Zhang, Simao Herdade, Kapil Thadani, Eric Dodds, Jack Culpepper, Yueh-Ning Ku:

Unifying Margin-Based Softmax Losses in Face Recognition. 3537-3546 - Sahng-Min Yoo, Tae-Min Choi, Jae-Woo Choi

, Jong-Hwan Kim:
FastSwap: A Lightweight One-Stage Framework for Real-Time Face Swapping. 3547-3556 - Youssef Dawoud, Arij Bouazizi, Katharina Ernst, Gustavo Carneiro

, Vasileios Belagiannis:
Knowing What to Label for Few Shot Microscopy Image Cell Segmentation. 3557-3566 - Zongyi Liu:

A Deep Neural Framework to Detect Individual Advertisement (Ad) from Videos. 3567-3576 - Junfei Xiao, Yutong Bai, Alan L. Yuille, Zongwei Zhou:

Delving into Masked Autoencoders for Multi-Label Thorax Disease Classification. 3577-3589 - Rohan Sarkar, Navaneeth Bodla, Mariya I. Vasileva, Yen-Liang Lin, Anurag Beniwal, Alan Lu, Gérard G. Medioni:

OutfitTransformer: Learning Outfit Representations for Fashion Recommendation. 3590-3598 - Puneet Mathur, Rajiv Jain, Ashutosh Mehra, Jiuxiang Gu, Franck Dernoncourt, Anandhavelu Natarajan, Quan Hung Tran, Verena Kaynig-Fittkau, Ani Nenkova, Dinesh Manocha, Vlad I. Morariu:

LayerDoc: Layer-wise Extraction of Spatial Hierarchical Structure in Visually-Rich Documents. 3599-3609 - Qianru Qiu, Xueting Wang, Mayu Otani, Yuki Iwazaki:

Color Recommendation for Vector Graphic Documents based on Multi-Palette Representation. 3610-3618 - Tom van Sonsbeek, Xiantong Zhen, Dwarikanath Mahapatra, Marcel Worring

:
Probabilistic Integration of Object Level Annotations in Chest X-ray Classification. 3619-3629 - Pushpendu Ghosh, Nancy Wang, Promod Yenigalla:

D-Extract: Extracting Dimensional Attributes From Product Images. 3630-3638 - Kotaro Kikuchi, Naoto Inoue, Mayu Otani, Edgar Simo-Serra, Kota Yamaguchi

:
Generative Colorization of Structured Mobile Web Pages. 3639-3648 - Athanasios Tragakis, Chaitanya Kaul

, Roderick Murray-Smith, Dirk Husmeier:
The Fully Convolutional Transformer for Medical Image Segmentation. 3649-3658 - Yuzhi Shi, Mijung Kim, Yeongnam Chae:

Multi-scale Cell-based Layout Representation for Document Understanding. 3659-3668 - Axel De Nardin

, Silvia Zottin
, Matteo Paier, Gian Luca Foresti, Emanuela Colombi
, Claudio Piciarelli:
Efficient few-shot learning for pixel-precise handwritten document layout analysis. 3669-3677 - Juan A. Rodríguez, David Vázquez, Issam H. Laradji, Marco Pedersoli, Pau Rodríguez:

OCR-VQGAN: Taming Text-within-Image Generation. 3678-3687 - Alexander Gillert, Bo Peters, Uwe Freiherr von Lukas

, Jürgen Kreyling, Gesche Blume-Werry:
Tracking Growth and Decay of Plant Roots in Minirhizotron Images. 3688-3697 - Scott Workman, Armin Hadzic, Muhammad Usman Rafique:

Handling Image and Label Resolution Mismatch in Remote Sensing. 3698-3707 - Rebbapragada V. C. Sairam, Monish Keswani, Uttaran Sinha, Nishit Shah, Vineeth N. Balasubramanian:

ARUBA: An Architecture-Agnostic Balanced Loss for Aerial Object Detection. 3708-3717 - Daniel Steininger, Andreas Trondl, Gerardus Croonen, Julia Simon

, Verena Widhalm
:
The CropAndWeed Dataset: a Multi-Modal Learning Approach for Efficient Crop and Weed Manipulation. 3718-3727 - Dabing Yu, Qingwu Li, Xiaolin Wang

, Zhiliang Zhang, Yixi Qian, Chang Xu:
DSTrans: Dual-Stream Transformer for Hyperspectral Image Restoration. 3728-3738 - Byeolyi Han, Tae-Hyun Oh:

Learning Few-shot Segmentation from Bounding Box Annotations. 3739-3748 - Karim Guirguis, Mohamed Abdelsamad, George Eskandar, Ahmed Hendawy, Matthias Kayser, Bin Yang, Juergen Beyerer:

Towards Discriminative and Transferable One-Stage Few-Shot Object Detectors. 3749-3758 - Prasad P. Iyer

, Saaketh Desai
, Sadhvikas Addamane, Rémi Dingreville, Igal Brener:
Learning incoherent light emission steering from metasurfaces using generative models. 3759-3766 - Francesco Luzi, Aneesh Gupta, Leslie M. Collins, Kyle Bradbury

, Jordan M. Malof:
Transformers For Recognition In Overhead Imagery: A Reality Check. 3767-3776 - Leon Amadeus Varga, Martin Messmer, Nuri Benbarka, Andreas Zell:

Wavelength-aware 2D Convolutions for Hyperspectral Imaging. 3777-3786 - Maofeng Tang, Konstantinos Georgiou, Hairong Qi, Cody Champion, Marc Bosch:

Semantic Segmentation in Aerial Imagery Using Multi-level Contrastive Learning with Local Consistency. 3787-3796 - Antoine Vanderschueren, Christophe De Vleeschouwer:

Are Straight-Through gradients and Soft-Thresholding all you need for Sparse Training? 3797-3806 - Muhammad Gul Zain Ali Khan, Muhammad Ferjad Naeem, Luc Van Gool, Alain Pagani, Didier Stricker, Muhammad Zeshan Afzal

:
Learning Attention Propagation for Compositional Zero-Shot Learning. 3817-3826 - Trevor Ortega, Thomas Nelson, Skyler Crane, Josh Myers-Dean, Scott Wehrwein:

Computer Vision for International Border Legibility. 3827-3836 - Lukasz Struski, Tomasz Danel

, Marek Smieja, Jacek Tabor, Bartosz Zielinski:
SONGs: Self-Organizing Neural Graphs. 3837-3846 - Guojun Wu, Xin Zhang, Ziming Zhang, Yanhua Li, Xun Zhou, Christopher G. Brinton

, Zhenming Liu:
Learning Lightweight Neural Networks via Channel-Split Recurrent Convolution. 3847-3857 - Edouard Yvinec, Arnaud Dapogny, Matthieu Cord, Kevin Bailly:

SPIQ: Data-Free Per-Channel Static Input Quantization. 3858-3867 - Thomas Verelst, Paul K. Rubenstein, Marcin Eichner, Tinne Tuytelaars

, Maxim Berman:
Spatial Consistency Loss for Training Multi-Label Classifiers from Single-Label Annotations. 3868-3878 - Ju He, Adam Kortylewski, Alan L. Yuille:

CORL: Compositional Representation Learning for Few-Shot Classification. 3879-3888 - Jiayun Wang

, Yubei Chen, Stella X. Yu, Brian Cheung, Yann LeCun:
Compact and Optimal Deep Learning with Recurrent Parameter Generators. 3889-3899 - Grégoire Petit, Adrian Popescu, Hugo Schindler, David Picard, Bertrand Delezoide:

FeTrIL: Feature Translation for Exemplar-Free Class-Incremental Learning. 3900-3909 - Tobias Riedlinger, Matthias Rottmann, Marius Schubert, Hanno Gottschalk:

Gradient-Based Quantification of Epistemic Uncertainty for Deep Object Detectors. 3910-3920 - Deep Patel, P. S. Sastry:

Adaptive Sample Selection for Robust Learning under Label Noise. 3921-3931 - Yilin Ji, Daniel Kästner, Oliver Wirth, Christian Wressnegger:

Randomness is the Root of All Evil: More Reliable Evaluation of Deep Active Learning. 3932-3941 - Yue Liu, Christos Matsoukas, Fredrik Strand

, Hossein Azizpour, Kevin Smith:
PatchDropout: Economizing Vision Transformers Using Patch Dropout. 3942-3951 - Bo Sun, Jason Kuen, Zhe Lin, Philippos Mordohai, Simon Chen:

PRN: Panoptic Refinement Network. 3952-3962 - Fang Chen, Gourav Datta, Souvik Kundu, Peter A. Beerel

:
Self-Attentive Pooling for Efficient Deep Learning. 3963-3972 - Xiangyu Chen, Qinghao Hu, Kaidong Li, Cuncong Zhong, Guanghui Wang:

Accumulated Trivial Attention Matters in Vision Transformers on Small Datasets. 3973-3981 - Ragav Sachdeva, Andrew Zisserman:

The Change You Want to See. 3982-3991 - Xiwen Dengxiong, Yu Kong:

Ancestor Search: Generalized Open Set Recognition via Hyperbolic Side Information Learning. 3992-4001 - Chang Chen, Jiaming Zhang, Kailun Yang, Kunyu Peng, Rainer Stiefelhagen:

Trans4Map: Revisiting Holistic Bird's-Eye-View Mapping from Egocentric Images to Allocentric Semantics with Vision Transformers. 4002-4011 - Zhanwen Chen

, Saed Rezayi, Sheng Li:
More Knowledge, Less Bias: Unbiasing Scene Graph Generation with Explicit Ontological Adjustment. 4012-4021 - Njuod Alsudays, Jing Wu, Yu-Kun Lai, Ze Ji:

AFPSNet: Multi-Class Part Parsing based on Scaled Attention and Feature Fusion. 4022-4031 - Zhiwei Lin, Zengyu Yang, Yongtao Wang:

Foreground Guidance and Multi-Layer Feature Fusion for Unsupervised Object Discovery with Transformers. 4032-4042 - Shin-I Cheng, Yu-Jie Chen, Wei-Chen Chiu, Hung-Yu Tseng, Hsin-Ying Lee:

Adaptively-Realistic Image Generation from Stroke and Sketch with Diffusion Model. 4043-4051 - Phuoc-Hieu Le

, Quynh Le, Rang Nguyen, Binh-Son Hua:
Single-Image HDR Reconstruction by Multi-Exposure Generation. 4052-4061 - Michail Christos Doukas, Stylianos Ploumpis, Stefanos Zafeiriou:

Dynamic Neural Portraits. 4062-4072 - Jie An, Tao Li, Hao-Zhi Huang, Jinwen Ma, Jiebo Luo

:
Is Bigger Always Better? An Empirical Study on Efficient Architectures for Style Transfer and Beyond. 4073-4083 - Aradhya Neeraj Mathur, Anish Madan, Ojaswa Sharma:

SLI-pSp: Injecting Multi-Scale Spatial Layout in pSp. 4084-4093 - Sameer Malik, Rajiv Soundararajan:

Semi-Supervised Learning for Low-light Image Restoration through Quality Assisted Pseudo-Labeling. 4094-4103 - Aishwarya Agarwal, Srikrishna Karanam, Balaji Vasan Srinivasan, Biplab Banerjee:

Contrastive Learning of Semantic Concepts for Open-set Cross-domain Retrieval. 4104-4113 - Sachin Chhabra, Hemanth Venkateswara, Baoxin Li:

Generative Alignment of Posterior Probabilities for Source-free Domain Adaptation. 4114-4123 - Zijian Wang

, Yadan Luo
, Zi Huang
, Mahsa Baktashmotlagh
:
FFM: Injecting Out-of-Domain Knowledge via Factorized Frequency Modification. 4124-4133 - Yifan Lu, Gurkirt Singh, Suman Saha, Luc Van Gool:

Exploiting Instance-based Mixed Sampling via Auxiliary Source Domain Supervision for Domain-adaptive Action Detection. 4134-4145 - Tianle Chen, Mahsa Baktashmotlagh

, Zijian Wang
, Mathieu Salzmann:
Center-aware Adversarial Augmentation for Single Domain Generalization. 4146-4154 - Adriano Cardace, Riccardo Spezialetti, Pierluigi Zama Ramirez

, Samuele Salti, Luigi Di Stefano:
Self-Distillation for Unsupervised 3D Domain Adaptation. 4155-4166 - Vikash Kumar, Rohit Lal, Himanshu Patil, Anirban Chakraborty:

CoNMix for Source-free Single and Multi-target Domain Adaptation. 4167-4177 - Jitender Maurya, Keyur Ruganathbhai Ranipa, Osamu Yamaguchi, Tomoyuki Shibata, Daisuke Kobayashi:

Domain Adaptation using Self-Training with Mixup for One-Stage Object Detection. 4178-4187 - Sofia Broomé, Ernest Pokropek, Boyu Li, Hedvig Kjellström

:
Recur, Attend or Convolve? On Whether Temporal Modeling Matters for Cross-Domain Robustness in Action Recognition. 4188-4198 - Aadarsh Sahoo, Rameswar Panda, Rogério Feris, Kate Saenko, Abir Das:

Select, Label, and Mix: Learning Discriminative Invariant Feature Representations for Partial Domain Adaptation. 4199-4208 - Gaurav Bhatt, Vineeth N. Balasubramanian:

Learning Style Subspaces for Controllable Unpaired Domain Translation. 4209-4218 - Zhipeng Luo, Gongjie Zhang, Changqing Zhou, Tianrui Liu, Shijian Lu, Liang Pan

:
TransPillars: Coarse-to-Fine Aggregation for Multi-Frame 3D Object Detection. 4219-4228 - Gopi Krishna Erabati

, Helder Araújo
:
Li3DeTr: A LiDAR based 3D Detection Transformer. 4239-4248 - Xuelin Qian, Li Wang, Yi Zhu, Li Zhang, Yanwei Fu

, Xiangyang Xue:
ImpDet: Exploring Implicit Fields for 3D Object Detection. 4249-4259 - Heming Du, Xin Yu

, Farookh Khadeer Hussain, Mohammad Ali Armin, Lars Petersson
, Weihao Li:
Weakly-supervised Point Cloud Instance Segmentation with Geometric Priors. 4260-4269 - Xuepeng Shi, Zhixiang Chen, Tae-Kyun Kim

:
Multivariate Probabilistic Monocular 3D Object Detection. 4270-4279 - Ruixin Liu, Zhihao Guan, Zejian Yuan, Ao Liu, Tong Zhou, Tang Kun, Erlong Li, Chao Zheng, Shuqi Mei:

Learning to Detect 3D Lanes by Shape Matching and Embedding. 4280-4288 - Debtanu Gupta, Shubh Maheshwari, Sai Shashank Kalakonda, Manasvi Vaidyula, Ravi Kiran Sarvadevabhatla

:
DSAG: A Scalable Deep Framework for Action-Conditioned Multi-Actor Full Body Motion Synthesis. 4289-4297 - Pavel Solovev, Taras Khakhulin, Denis Korzhenkov:

Self-improving Multiplane-to-layer Images for Novel View Synthesis. 4298-4307 - Zirui An, Jingbo Yu

, Runtao Liu, Chuang Wang, Qian Yu:
SketchInverter: Multi-Class Sketch-Based Image Generation via GAN Inversion. 4308-4318 - Decai Chen, Peng Zhang, Ingo Feldmann, Oliver Schreer

, Peter Eisert:
Recovering Fine Details for Neural Implicit Surface Reconstruction. 4319-4328 - Verica Lazova, Vladimir Guzov, Kyle Olszewski, Sergey Tulyakov, Gerard Pons-Moll:

Control-NeRF: Editable Feature Volumes for Scene Rendering and Manipulation. 4329-4339 - Weijian Deng, Yumin Suh, Xiang Yu, Masoud Faraki, Liang Zheng

, Manmohan Chandraker:
Split to Learn: Gradient Split for Multi-Task Human Image Analysis. 4340-4349 - Devansh Gupta, Aditya Saini, Sarthak Bhagat, Shagun Uppal, Rishi Raj Jain, Drishti Bhasin, Ponnurangam Kumaraguru, Rajiv Ratn Shah

:
A Suspect Identification Framework using Contrastive Relevance Feedback. 4350-4358 - Anil Kunchala, Mélanie Bouroche, Bianca Schoen-Phelan

:
Towards A Framework for Privacy-Preserving Pedestrian Analysis. 4359-4369 - Thao Minh Le

, Vuong Le, Sunil Gupta, Svetha Venkatesh, Truyen Tran:
Guiding Visual Question Answering with Attention Priors. 4370-4379 - Aditay Tripathi, Anand Mishra, Anirban Chakraborty:

Grounding Scene Graphs on Natural Images via Visio-Lingual Message Passing. 4380-4389 - Kohei Uehara, Tatsuya Harada:

K-VQG: Knowledge-aware Visual Question Generation for Common-sense Acquisition. 4390-4398 - Zineng Tang, Jaemin Cho

, Jie Lei, Mohit Bansal:
PERCEIVER-VL: Efficient Vision-and-Language Modeling with Iterative Latent Attention. 4399-4409 - Zehranaz Canfes, M. Furkan Atasoy, Alara Dirik, Pinar Yanardag:

Text and Image Guided 3D Avatar Generation and Manipulation. 4410-4420 - Yuxiao Chen, Jianbo Yuan, Long Zhao, Tianlang Chen, Rui Luo, Larry Davis, Dimitris N. Metaxas:

More Than Just Attention: Improving Cross-Modal Attentions with Contrastive Constraints for Image-Text Matching. 4421-4429 - Soumya Jahagirdar, Minesh Mathew, Dimosthenis Karatzas

, C. V. Jawahar:
Watching the News: Towards VideoQA Models that can Read. 4430-4439 - Mingda Zhang, Rebecca Hwa

, Adriana Kovashka:
How to Practice VQA on a Resource-limited Target Domain. 4440-4449 - Zhihong Pan, Xin Zhou, Hao Tian:

Arbitrary Style Guidance for Enhanced Diffusion-Based Text-to-Image Generation. 4450-4460 - Lokender Tiwari, Brojeshwar Bhowmick:

GarSim: Particle Based Neural Garment Simulator. 4461-4470 - Shubham Dokania, A. H. Abdul Hafez, Anbumani Subramanian, Manmohan Chandraker, C. V. Jawahar:

IDD-3D: Indian Driving Dataset for 3D Unstructured Road Scenes. 4471-4480 - Yu Feng, Patrick Hansen, Paul N. Whatmough, Guoyu Lu

, Yuhao Zhu:
Fast and Accurate: Video Enhancement Using Sparse Depth. 4481-4489 - Zishuo Zheng, Chunyu Lin

, Lang Nie, Kang Liao, Zhijie Shen, Yao Zhao:
Complementary Bi-directional Feature Compression for Indoor 360° Semantic Segmentation with Self-distillation. 4490-4499 - Guofeng Mei

, Fabio Poiesi, Cristiano Saltori, Jian Zhang
, Elisa Ricci
, Nicu Sebe
:
Overlap-guided Gaussian Mixture Models for Point Cloud Registration. 4500-4509 - Petros Tzathas, Petros Maragos, Anastasios Roussos

:
3D Neural Sculpting (3DNS): Editing Neural Signed Distance Functions. 4510-4519 - Chengzhi Wu, Xuelei Bi, Julius Pfrommer, Alexander Cebulla, Simon Mangold

, Jürgen Beyerer:
Sim2real Transfer Learning for Point Cloud Segmentation: An Industrial Application Case on Autonomous Disassembly. 4520-4529 - Zhihao Zheng, Xiaowen Ying, Zhen Yao

, Mooi Choo Chuah:
Robustness of Trajectory Prediction Models Under Map-Based Attacks. 4530-4539 - Saehyung Lee, Hyungyu Lee:

Inducing Data Amplification Using Auxiliary Datasets in Adversarial Training. 4540-4549 - Kazuya Kakizaki

, Kazuto Fukuchi, Jun Sakuma:
Certified Defense for Content Based Image Retrieval. 4550-4559 - Avishag Shapira, Alon Zolfi, Luca Demetrio

, Battista Biggio, Asaf Shabtai:
Phantom Sponges: Exploiting Non-Maximum Suppression to Attack Deep Object Detectors. 4560-4569 - Hanxiao Tan, Helena Kotthaus:

Explainability-Aware One Point Attack for Point Cloud Neural Networks. 4570-4579 - Xingqian Xu, Shant Navasardyan, Vahram Tadevosyan, Andranik Sargsyan

, Yadong Mu, Humphrey Shi
:
Image Completion with Heterogeneously Filtered Spectral Hints. 4580-4590 - Yuan Zhao, Bo Liu

, Ming Ding, Baoping Liu, Tianqing Zhu, Xin Yu
:
Proactive Deepfake Defence via Identity Watermarking. 4591-4600 - Lei Fan, Ying Wu:

Avoiding Lingering in Learning Active Recognition by Adversarial Disturbance. 4601-4610 - Gaurav Kumar Nayak

, Ruchit Rawal, Anirban Chakraborty:
DE-CROP: Data-efficient Certified Robustness for Pretrained Classifiers. 4611-4620 - Ke Xu

, Yao Xiao, Zhaoheng Zheng, Kaijie Cai, Ram Nevatia:
PatchZero: Defending against Adversarial Patch Attacks by Detecting and Zeroing the Patch. 4621-4630 - Fahim Faisal Niloy, Kishor Kumar Bhaumik, Simon S. Woo:

CFL-Net: Image Forgery Localization Using Contrastive Learning. 4631-4640 - Rui Yang, Duc Minh Vo, Hideki Nakayama:

Indirect Adversarial Losses via an Intermediate Distribution for Training GANs. 4641-4650 - Rahul Mysore Venkatesh, Eric Wong

, Zico Kolter:
Adversarial robustness in discontinuous spaces via alternating sampling & descent. 4651-4660 - Taras Rumezhak, Francisco Girbal Eiras, Philip H. S. Torr, Adel Bibi

:
RANCER: Non-Axis Aligned Anisotropic Certification with Randomized Smoothing. 4661-4669 - Thanh Nguyen-Duc, Trung Le, He Zhao, Jianfei Cai, Dinh Phung:

Adversarial local distribution regularization for knowledge distillation. 4670-4679 - Baoping Liu, Bo Liu

, Ming Ding, Tianqing Zhu, Xin Yu
:
TI2Net: Temporal Identity Inconsistency Network for Deepfake Detection. 4680-4689 - Likun Zhang, Yahong Chen, Ang Li, Binghui Wang, Yiran Chen, Fenghua Li, Jin Cao, Ben Niu:

Interpreting Disparate Privacy-Utility Tradeoff in Adversarial Learning via Attribute Correlation. 4690-4698 - Shruti Agarwal, Liwen Hu, Evonne Ng, Trevor Darrell, Hao Li, Anna Rohrbach:

Watch Those Words: Video Falsification Detection Using Word-Conditioned Facial Motion. 4699-4708 - Sumedha Singla, Nihal Murali, Forough Arabshahi, Sofia Triantafyllou, Kayhan Batmanghelich

:
Augmentation by Counterfactual Explanation -Fixing an Overconfident Classifier. 4709-4719 - Enis Simsar, Umut Kocasari, Ezgi Gülperi Er, Pinar Yanardag:

Fantastic Style Channels and Where to Find Them: A Submodular Framework for Discovering Diverse Directions in GANs. 4720-4729 - Hanxiao Tan:

Visualizing Global Explanations of Point Cloud DNNs. 4730-4739 - Taojiannan Yang, Linjie Yang, Xiaojie Jin, Chen Chen:

Revisiting Training-free NAS Metrics: An Efficient Training-based Method. 4740-4749 - Yunhao Ge, Zhi Xu, Yao Xiao, Gan Xin, Yunkui Pang, Laurent Itti:

Encouraging Disentangled and Convex Representation with Controllable Interpolation Regularization. 4750-4758 - Monu Verma, Priyanka Lubal, Santosh Kumar Vipparthi

, Mohamed Abdel-Mottaleb:
RNAS-MER: A Refined Neural Architecture Search with Hybrid Spatiotemporal Operations for Micro-Expression Recognition. 4759-4768 - Lena Heidemann, Maureen Monnet, Karsten Roscher:

Concept Correlation and Its Effects on Concept-Based Models. 4769-4777 - Yuqiao Xian, Jinrui Yang, Fufu Yu, Jun Zhang, Xing Sun

:
Graph-Based Self-Learning for Robust Person Re-identification. 4778-4787 - Fan Yang

, Shigeyuki Odashima, Shoichi Masui, Shan Jiang:
Hard to Track Objects with Irregular Motions and Similar Appearances? Make It Easier by Buffering the Matching Space. 4788-4797 - Wen Guo

, Yuming Du, Xi Shen, Vincent Lepetit, Xavier Alameda-Pineda, Francesc Moreno-Noguer:
Back to MLP: A Simple Baseline for Human Motion Prediction. 4798-4808 - Mustansar Fiaz

, Hisham Cholakkal, Rao Muhammad Anwer
, Fahad Shahbaz Khan:
SAT: Scale-Augmented Transformer for Person Search. 4809-4818 - Gozde Sahin, Laurent Itti:

HOOT: Heavy Occlusions in Object Tracking Benchmark. 4819-4828 - Adhiraj Ghosh, Kuruparan Shanmugalingam, Wen-Yan Lin:

Relation Preserving Triplet Mining for Stabilising the Triplet Loss in Re-identification Systems. 4829-4838 - Jeongseok Hyun, Myunggu Kang, Dongyoon Wee, Dit-Yan Yeung:

Detection Recovery in Online Multi-Object Tracking with Sparse Graph Tracker. 4839-4848 - Xiaotian Han, Quanzeng You, Chunyu Wang, Zhizheng Zhang, Peng Chu, Houdong Hu, Jiang Wang, Zicheng Liu:

MMPTRACK: Large-scale Densely Annotated Multi-camera Multiple People Tracking Benchmark. 4849-4858 - Peng Chu, Jiang Wang, Quanzeng You, Haibin Ling, Zicheng Liu:

TransMOT: Spatial-Temporal Graph Transformer for Multiple Object Tracking. 4859-4869 - Luca Piano

, Filippo Gabriele Pratticò
, Alessandro Sebastian Russo, Lorenzo Lanari, Lia Morra, Fabrizio Lamberti:
Bent & Broken Bicycles: Leveraging synthetic data for damaged object re-identification. 4870-4880 - Furkan Kinli

, Doga Yilmaz
, Baris Özcan
, Furkan Kiraç:
Modeling the Lighting in Scenes as Style for Auto White-Balance Correction. 4892-4902 - Mohit Lamba, M. V. A. Suhas Kumar, Kaushik Mitra:

Real-Time Restoration of Dark Stereo Images. 4903-4913 - Mehmet Kerim Yücel

, Valia Dimaridou, Bruno Manganelli, Mete Ozay, Anastasios Drosou, Albert Saà-Garriga:
LRA&LDRA: Rethinking Residual Predictions for Efficient Shadow Detection and Removal. 4914-4924 - Quan H. Nguyen, William J. Beksi:

Single Image Super-Resolution via a Dual Interactive Implicit Neural Network. 4925-4934 - Akash Gupta, Sudhir Kumar Singh, Amit K. Roy-Chowdhury:

Joint Video Rolling Shutter Correction and Super-Resolution. 4935-4944 - Jinsu Yoo, Taehoon Kim, Sihaeng Lee, Seung Hwan Kim, Honglak Lee, Tae Hyun Kim:

Enriched CNN-Transformer Feature Aggregation Networks for Super-Resolution. 4945-4954 - Bo Zhou

, Neel Dey, Jo Schlemper, Seyed Sadegh Mohseni Salehi, Chi Liu, James S. Duncan, Michal Sofka:
DSFormer: A Dual-domain Self-supervised Transformer for Accelerated Multi-contrast MRI Reconstruction. 4955-4964 - Anup Kumar Gupta

, Rupesh Kumar, Lokendra Birla, Puneet Gupta:
RADIANT: Better rPPG estimation using signal embeddings and Transformer. 4965-4975 - Ajay Jaiswal, Tianlong Chen, Justin F. Rousseau

, Yifan Peng, Ying Ding, Zhangyang Wang:
Attend Who is Weak: Pruning-assisted Medical Image Localization under Sophisticated and Implicit Imbalances. 4976-4985 - Georg Wölflein

, In Hwa Um
, David J. Harrison
, Ognjen Arandjelovic
:
HoechstGAN: Virtual Lymphocyte Staining Using Generative Adversarial Networks. 4986-4996 - Xin Liu, Brian L. Hill, Ziheng Jiang, Shwetak N. Patel, Daniel McDuff:

EfficientPhys: Enabling Simple, Fast and Accurate Camera-Based Cardiac Measurement. 4997-5006 - Alexander Hustinx, Fabio Hellmann, Ömer Sümer, Behnam Javanmardi

, Elisabeth André
, Peter M. Krawitz, Tzung-Chien Hsieh
:
Improving Deep Facial Phenotyping for Ultra-rare Disorder Verification Using Model Ensembles. 5007-5017 - Lokendra Birla, Sneha Shukla

, Anup Kumar Gupta
, Puneet Gupta:
ALPINE: Improving Remote Heart Rate Estimation using Contrastive Learning. 5018-5027 - Yan Yang, Md. Zakir Hossain, Eric A. Stone, Shafin Rahman:

Exemplar Guided Deep Neural Network for Spatial Transcriptomics Analysis of Gene Expression Prediction. 5028-5037 - Xin Jin, Longhai Wu, Guotao Shen, Youxin Chen, Jie Chen, Jayoon Koo, Cheul-Hee Hahm:

Enhanced Bi-directional Motion Estimation for Video Frame Interpolation. 5038-5046 - Wenjie Yin, Hang Yin, Kim Baraka

, Danica Kragic, Mårten Björkman:
Dance Style Transfer with Cross-modal Transformer. 5047-5056 - Yonghu Chen, Dongchen Zhu, Wenjun Shi, Guanghui Zhang, Tianyu Zhang, Xiaolin Zhang, Jiamao Li:

MFCFlow: A Motion Feature Compensated Multi-Frame Recurrent Network for Optical Flow Estimation. 5057-5066 - Jerin Geo James, Devansh Jain

, Ajit Rajwade:
GlobalFlowNet: Video Stabilization using Deep Distilled Global Motion Estimates. 5067-5076 - Stefano Savian, Pietro Morerio, Alessio Del Bue

, Andrea A. Janes, Tammam Tillo:
Towards Equivariant Optical Flow Estimation with Deep Learning. 5077-5086 - Apoorva Agarwal, Rishabh Dabral, Arjun Jain, Ganesh Ramakrishnan:

Skew-Robust Human-Object Interactions in Videos. 5087-5096 - Valéry Dewil, Adrien Courtois, Mariano Rodríguez, Thibaud Ehret, Nicola Brandonisio, Denis Bujoreanu, Gabriele Facciolo, Pablo Arias:

Video joint denoising and demosaicing with recurrent CNNs. 5097-5108 - Yumeng Wang, Bo Xu

, Ziwen Li, Han Huang, Cheng Lu, Yandong Guo:
Video Object Matting via Hierarchical Space-Time Semantic Guidance. 5109-5118 - Shengyu Feng, Hesham Mostafa, Marcel Nassar, Somdeb Majumdar, Subarna Tripathi:

Exploiting Long-Term Dependencies for Generating Dynamic Scene Graphs. 5119-5128 - Suhwan Cho

, Minhyeok Lee, Seunghoon Lee, Chaewon Park, Donghyeong Kim, Sangyoun Lee:
Treating Motion as Option to Reduce Motion Dependency in Unsupervised Video Object Segmentation. 5129-5138 - Huaizu Jiang, Erik G. Learned-Miller:

DCVNet: Dilated Cost Volume Networks for Fast Optical Flow. 5139-5146 - Tanvir Mahmud, Diana Marculescu

:
AVE-CLIP: AudioCLIP-based Multi-window Temporal Transformer for Audio Visual Event Localization. 5147-5156 - Xinchi Zhou, Dongzhan Zhou, Wanli Ouyang

, Hang Zhou, Di Hu:
SeCo: Separating Unknown Musical Visual Sounds with Consistency Guidance. 5157-5166 - Madhav Agarwal, Rudrabha Mukhopadhyay, Vinay P. Namboodiri, C. V. Jawahar:

Audio-Visual Face Reenactment. 5167-5176 - Jielin Qiu, Franck Dernoncourt, Trung Bui, Zhaowen Wang, Ding Zhao

, Hailin Jin:
LiveSeg: Unsupervised Multimodal Temporal Segmentation of Long Livestream Videos. 5177-5187 - Xinchi Zhou, Dongzhan Zhou, Di Hu, Hang Zhou, Wanli Ouyang

:
Exploiting Visual Context Semantics for Sound Source Localization. 5188-5197 - Anchit Gupta, Rudrabha Mukhopadhyay, Sindhu Balachandra, Faizan Farooq Khan, Vinay P. Namboodiri, C. V. Jawahar:

Towards Generating Ultra-High Resolution Talking-Face Videos with Lip synchronization. 5198-5207 - Tianwei Ni, Kiana Ehsani, Luca Weihs, Jordi Salvador:

Towards Disturbance-Free Visual Mobile Manipulation. 5208-5220 - Darshan Singh S, Anchit Gupta, C. V. Jawahar, Makarand Tapaswi:

Unsupervised Audio-Visual Lecture Segmentation. 5221-5230 - Onkar Susladkar, Gayatri Deshmukh, Dhruv Makwana

, Sparsh Mittal, R. Sai Chandra Teja, Rekha Singhal:
GAFNet: A Global Fourier Self Attention Based Novel Network for multi-modal downstream tasks. 5231-5240 - Shih-Yun Chu, Ming-Sui Lee:

MT-DETR: Robust End-to-end Multimodal Detection with Confidence Fusion. 5241-5250 - Dan Liu, Xi Chen, Chen Ma

, Xue Liu:
Hyperspherical Quantization: Toward Smaller and More Accurate Models. 5251-5261 - Gobinda Saha, Kaushik Roy:

Saliency Guided Experience Packing for Replay in Continual Learning. 5262-5272 - Shiv Ram Dubey

, Satish Kumar Singh, Bidyut Baran Chaudhuri:
AdaNorm: Adaptive Gradient Norm Correction based Optimizer for CNNs. 5273-5282 - Matthew Dutson, Yin Li, Mohit Gupta:

Spike-Based Anytime Perception. 5283-5293 - Ze Wang, Yue Lu, Qiang Qiu:

Meta-OLE: Meta-learned Orthogonal Low-Rank Embedding. 5294-5303 - Varad Pimpalkhute, Shruti Kunde, Rekha Singhal:

GEMS: Generating Efficient Meta-Subnets. 5304-5312 - Sayan Nag, Mayukh Bhattacharyya, Anuraag Mukherjee, Rohit Kundu:

Serf: Towards better training of deep neural networks using log-Softplus ERror activation Function. 5313-5322 - Shaogang Ren, Hongliang Fei, Dingcheng Li, Ping Li:

Learning Latent Structural Relations with Message Passing Prior. 5323-5332 - Brandon Smart, Gustavo Carneiro

:
Bootstrapping the Relationship Between Images and Their Clean and Noisy Labels. 5333-5343 - Reza Pourreza, Hoang Le, Amir Said, Guillaume Sautière, Auke J. Wiggers:

Boosting neural video codecs by exploiting hierarchical redundancy. 5344-5353 - Noor Fathima Ghouse, Jens Petersen, Guillaume Sautière, Auke J. Wiggers, Reza Pourreza:

A neural video codec with spatial rate-distortion control. 5354-5363 - Sizhuo Ma, Paul Mos, Edoardo Charbon, Mohit Gupta:

Burst Vision Using Single-Photon Cameras. 5364-5374 - Xinyu Jiang, Zhengjia Li, Maoqing Tian, Jianbo Liu, Shuai Yi, Duoqian Miao:

Few-shot Object Detection via Improved Classification Features. 5375-5384 - Ze Huang, Li Sun, Cheng Zhao, Song Li, Songzhi Su:

EventPoint: Self-Supervised Interest Point Detection and Description for Event-based Camera. 5385-5394 - Qi Rao, Xin Yu

, Shant Navasardyan, Humphrey Shi
:
Sim2RealVS: A New Benchmark for Video Stabilization with a Strong Baseline. 5395-5404 - Zhihong Pan, Baopu Li, Dongliang He, Wenhao Wu

, Errui Ding:
Effective Invertible Arbitrary Image Rescaling. 5405-5414 - Ojas Kishorkumar Shirekar, Anuj Singh, Hadi Jamali Rad:

Self-Attention Message Passing for Contrastive Few-Shot Learning. 5414-5425 - Abhinav Java, Shripad V. Deshmukh, Milan Aggarwal, Surgan Jandial, Mausoom Sarkar, Balaji Krishnamurthy:

One-Shot Doc Snippet Detection: Powering Search in Document Beyond Text. 5426-5435 - Fengyuan Yang, Ruiping Wang, Xilin Chen:

Semantic Guided Latent Parts Embedding for Few-Shot Learning. 5436-5446 - Seyed Ehsan Marjani Bajestani, Giovanni Beltrame:

Event-based RGB sensing with structured light. 5447-5456 - Xinyu Li, Yanyi Zhang, Jianbo Yuan, Hanlin Lu, Yibo Zhu:

Discrete Cosin TransFormer: Image Modeling From Frequency Domain. 5457-5467 - Kihyuk Sohn, Jinsung Yoon, Chun-Liang Li, Chen-Yu Lee, Tomas Pfister:

Anomaly Clustering: Grouping Images into Coherent Clusters of Anomaly Types. 5468-5479 - Tomás Vojír, Jirí Matas:

Image-Consistent Detection of Road Anomalies as Unpredictable Patches. 5480-5489 - Aitor Artola, Yannis Kolodziej, Jean-Michel Morel

, Thibaud Ehret:
GLAD: A Global-to-Local Anomaly Detector. 5490-5499 - Mohamed Yousef, Marcel Ackermann, Unmesh Kurup, Tom E. Bishop:

No Shifted Augmentations (NSA): compact distributions for robust self-supervised Anomaly Detection. 5500-5509 - Mu Cai, Yixuan Li:

Out-of-distribution Detection via Frequency-regularized Generative Models. 5510-5519 - Jingyang Zhang, Nathan Inkawhich, Randolph Linderman, Yiran Chen, Hai Li:

Mixture Outlier Exposure: Towards Out-of-Distribution Detection in Fine-grained Environments. 5520-5529 - Kamalakar Vijay Thakare, Yash Raghuwanshi, Debi Prosad Dogra, Heeseung Choi, Ig-Jae Kim:

DyAnNet: A Scene Dynamicity Guided Self-Trained Video Anomaly Detection Network. 5530-5539 - Genki Osada, Tsubasa Takahashi, Budrul Ahsan, Takashi Nishide:

Out-of-Distribution Detection with Reconstruction Error and Typicality-based Penalty. 5540-5552 - Toshimichi Aota, Lloyd Teh Tzer Tong, Takayuki Okatani:

Zero-shot versus Many-shot: Unsupervised Texture Anomaly Detection. 5553-5561 - Srijan Das

, Michael S. Ryoo:
ViewCLR: Learning Self-supervised Video Representation for Unseen Viewpoints. 5562-5572 - Haopeng Li

, Qiuhong Ke, Mingming Gong, Tom Drummond:
Progressive Video Summarization via Multimodal Self-supervised Learning. 5573-5582 - Amani Almalki

, Longin Jan Latecki:
Self-Supervised Learning with Masked Image Modeling for Teeth Numbering, Detection of Dental Restorations, and Instance Segmentation in Dental Panoramic Radiographs. 5583-5592 - Yangsong Zhang, Subhankar Roy

, Hongtao Lu, Elisa Ricci
, Stéphane Lathuilière:
Cooperative Self-Training for Multi-Target Adaptive Semantic Segmentation. 5593-5602 - Sutanu Bera, Prabir Kumar Biswas:

Self Supervised Low Dose Computed Tomography Image Denoising Using Invertible Network Exploiting Inter Slice Congruence. 5603-5612 - Ashraful Islam, Ben Lundell, Harpreet Sawhney, Sudipta Sinha, Peter Morales, Richard J. Radke

:
Self-supervised Learning with Local Contrastive Loss for Detection and Semantic Segmentation. 5613-5622 - Leonardo Tadeu Lopes

, Daniel Carlos Guimarães Pedronette:
Self-Supervised Clustering based on Manifold Learning and Graph Convolutional Networks. 5623-5632 - Justin Lazarow, Kihyuk Sohn, Chen-Yu Lee, Chun-Liang Li, Zizhao Zhang, Tomas Pfister:

Unifying Distribution Alignment as a Loss for Imbalanced Semi-supervised Learning. 5633-5642 - Mustafa Taha Koçyigit

, Timothy M. Hospedales, Hakan Bilen
:
Accelerating Self-Supervised Learning via Efficient Training Strategies. 5643-5653 - Hao Zhang, Xin Chen, Heming Jing, Yingbin Zheng

, Yuan Wu, Cheng Jin:
ETR: An Efficient Transformer for Re-ranking in Visual Place Recognition. 5654-5663 - Yintong Wang, Lili Chen, Jiamao Li, Xiaolin Zhang:

HandGCNFormer: A Novel Topology-Aware Transformer Network for 3D Hand Pose Estimation. 5664-5673 - Guowei Li, Dongchen Zhu, Guanghui Zhang, Wenjun Shi, Tianyu Zhang, Xiaolin Zhang, Jiamao Li:

SD-Pose: Structural Discrepancy Aware Category-Level 6D Object Pose Estimation. 5674-5683 - Qi Feng, Kun He, He Wen, Cem Keskin, Yuting Ye

:
Rethinking the Data Annotation Process for Multiview 3D Pose Estimation with Active Learning and Self-Training. 5684-5693 - Bruce R. Muller, William A. P. Smith:

Self-supervised Relative Pose with Homography Model-fitting in the Loop. 5694-5703 - Shih-Po Lee, Niraj Prakash Kini, Wen-Hsiao Peng, Ching-Wen Ma, Jenq-Neng Hwang:

HuPR: A Benchmark for Human Pose Estimation Using Millimeter Wave Radar. 5704-5713 - Kyung-Min Jin, Byoung-Sung Lim, Gun-Hee Lee, Tae-Kyung Kang, Seong-Whan Lee:

Kinematic-aware Hierarchical Attention Network for Human Pose Estimation in Videos. 5714-5723 - Rong Wang, Wei Mao, Hongdong Li:

Interacting Hand-Object Pose Estimation via Dense Mutual Attention. 5724-5734 - Pedro Castro, Tae-Kyun Kim

:
CRT-6D: Fast 6D Object Pose Estimation with Cascaded Refinement Transformers. 5735-5744 - Huan Liu, Zhixiang Chi, Yuanhao Yu, Yang Wang, Jun Chen, Jin Tang:

Meta-Auxiliary Learning for Future Depth Prediction in Videos. 5745-5754 - Haoyi Zhu:

X-NeRF: Explicit Neural Radiance Field for Multi-Scene 360° Insufficient RGB-D Views. 5755-5764 - Xingyu Chen, Ruonan Zhang, Ji Jiang, Yan Wang, Ge Li, Thomas H. Li:

Self-Supervised Monocular Depth Estimation: Solving the Edge-Fattening Problem. 5765-5775 - Runkai Zhao, Heng Wang

, Chaoyi Zhang, Weidong Cai
:
PointNeuron: 3D Neuron Reconstruction via Geometry and Topology Learning of Point Clouds. 5776-5786 - Ukcheol Shin

, Kwanyong Park, Byeong-Uk Lee, Kyunghyun Lee
, In So Kweon:
Self-supervised Monocular Depth Estimation from Thermal Images via Adversarial Multi-spectral Adaptation. 5787-5796 - Xingyu Chen, Thomas H. Li, Ruonan Zhang, Ge Li:

Frequency-Aware Self-Supervised Monocular Depth Estimation. 5797-5806 - Avinash Nittur Ramesh, Fabio Giovanneschi, María A. González-Huici:

SIUNet: Sparsity Invariant U-Net for Edge-Aware Depth Completion. 5807-5816 - Nitin Bansal, Pan Ji, Junsong Yuan, Yi Xu:

Semantics-Depth-Symbiosis: Deeply Coupled Semi-Supervised Learning of Semantics and Depth. 5817-5828 - Andrea Pilzer, Yuxin Hou, Niki Andreas Lopi, Arno Solin

, Juho Kannala:
Expansion of Visual Hints for Improved Generalization in Stereo Matching. 5829-5838 - Jamie Watson, Sara Vicente, Oisin Mac Aodha, Clément Godard, Gabriel J. Brostow, Michael Firman:

Heightfields for Efficient Scene Reconstruction for AR. 5839-5849 - Ashutosh Agarwal, Chetan Arora:

Attention Attention Everywhere: Monocular Depth Prediction with Skip Attention. 5850-5859 - Andrea Conti, Matteo Poggi

, Stefano Mattoccia
:
Sparsity Agnostic Depth Completion. 5860-5869 - Nan Qiao, Yuyin Sun, Chong Liu, Lu Xia, Jiajia Luo, Ke Zhang, Cheng-Hao Kuo:

Human-in-the-Loop Video Semantic Segmentation Auto-Annotation. 5870-5880 - Georgy Ponimatkin, Nermin Samet, Yang Xiao, Yuming Du, Renaud Marlet, Vincent Lepetit:

A Simple and Powerful Global Optimization for Unsupervised Video Object Segmentation. 5881-5892 - Sharat Agarwal, Saket Anand, Chetan Arora:

Reducing Annotation Effort by Identifying and Labeling Contextually Diverse Classes for Semantic Segmentation Under Domain Shift. 5893-5902 - Heejo Kong, Gun-Hee Lee, Suneung Kim, Seong-Whan Lee:

Pruning-Guided Curriculum Learning for Semi-Supervised Semantic Segmentation. 5903-5912 - Minhyeok Lee, Suhwan Cho

, Seunghoon Lee, Chaewon Park, Sangyoun Lee:
Unsupervised Video Object Segmentation via Prototype Memory Network. 5913-5923 - Lang Peng, Zhirong Chen, Zhangjie Fu, Pengpeng Liang, Erkang Cheng:

BEVSegFormer: Bird's Eye View Semantic Segmentation From Arbitrary Camera Rigs. 5924-5932 - Yating Zhou, Wenjing Li, Ge Yang:

SCTS: Instance Segmentation of Single Cells Using a Transformer-Based Semantic-Aware Model and Space-Filling Augmentation. 5933-5942 - Peri Akiva, Kristin J. Dana:

Single Stage Weakly Supervised Semantic Segmentation of Complex Scenes. 5943-5954 - Aneesh Rangnekar, Christopher Kanan, Matthew J. Hoffman:

Semantic Segmentation with Active Semi-Supervised Learning. 5955-5966 - Anurag Das, Yongqin Xian, Yang He, Zeynep Akata, Bernt Schiele

:
Urban Scene Semantic Segmentation with Low-Cost Coarse Annotation. 5967-5976 - Chihiro Noguchi, Toshihiro Tanizawa:

Ego-Vehicle Action Recognition based on Semi-Supervised Contrastive Learning. 5977-5987 - Lin Sui, Chen-Lin Zhang, Lixin Gu, Feng Han:

A Simple and Efficient Pipeline to Build an End-to-End Spatial-Temporal Action Detector. 5988-5997 - Gurkirt Singh, Vasileios Choutas, Suman Saha, Fisher Yu, Luc Van Gool:

Spatio-Temporal Action Detection Under Large Motion. 5998-6007 - Jing Yang, Jie Shen, Yiming Lin, Yordan Hristov, Maja Pantic:

FAN-Trans: Online Knowledge Distillation for Facial Action Unit Detection. 6008-6016 - Jianxiong Zhou

, Ying Wu:
Temporal Feature Enhancement Dilated Convolution Network for Weakly-supervised Temporal Action Localization. 6017-6026 - Anqi Zhu, Qiuhong Ke, Mingming Gong, James Bailey:

Adaptive Local-Component-aware Graph Convolutional Network for One-shot Skeleton-based Action Recognition. 6027-6036 - Esteve Valls Mascaro, Hyemin Ahn

, Dongheui Lee:
Intention-Conditioned Long-Term Human Egocentric Action Anticipation. 6037-6046 - Tae-Kyung Kang, Gun-Hee Lee, Kyung-Min Jin, Seong-Whan Lee:

Action-aware Masking Network with Group-based Attention for Temporal Action Localization. 6047-6056 - Zeyun Zhong

, David Schneider
, Michael Voit, Rainer Stiefelhagen, Jürgen Beyerer:
Anticipative Feature Fusion Transformer for Multi-Modal Action Anticipation. 6057-6066 - Shruti S. Phutke, Subrahmanyam Murala:

Nested Deformable Multi-head Attention for Facial Image Inpainting. 6067-6076 - Nhat Le, Khanh Nguyen

, Quang D. Tran
, Erman Tjiputra, Bac Le, Anh Nguyen:
Uncertainty-aware Label Distribution Learning for Facial Expression Recognition. 6077-6086 - Chen-Hao Liao, Wen-Cheng Chen, Hsuan-Tung Liu, Yi-Ren Yeh, Min-Chun Hu, Chu-Song Chen:

Domain Invariant Vision Transformer Learning for Face Anti-spoofing. 6087-6096 - Aidan Boyd, Patrick Tinsley, Kevin W. Bowyer, Adam Czajka:

CYBORG: Blending Human Saliency Into the Loss Improves Deep Learning-Based Synthetic Face Detection. 6097-6106 - Yasheng Sun, Jiangke Lin, Hang Zhou, Zhiliang Xu, Dongliang He, Hideki Koike

:
ReEnFP: Detail-Preserving Face Reconstruction by Encoding Facial Priors. 6107-6117 - Mohammad Saeed Ebrahimi Saadabadi, Sahar Rahimi Malakshan, Ali Zafari

, Moktari Mostofa, Nasser M. Nasrabadi:
A Quality Aware Sample-to-Sample Comparison for Face Recognition. 6118-6127 - Chao-Han Huck Yang, I-Te Danny Hung, Yi-Chieh Liu, Pin-Yu Chen:

Treatment Learning Causal Transformer for Noisy Image Classification. 6128-6139 - Xiangcheng Du

, Zhao Zhou, Yingbin Zheng
, Tianlong Ma, Xingjiao Wu
, Cheng Jin:
Modeling Stroke Mask for End-to-End Text Erasing. 6140-6148 - Boon Peng Yap, Beng Koon Ng:

Cut-Paste Consistency Learning for Semi-Supervised Lesion Segmentation. 6149-6158 - Thomas Stegmüller, Behzad Bozorgtabar

, Antoine Spahr, Jean-Philippe Thiran:
ScoreNet: Learning Non-Uniform Attention and Augmentation for Transformer-Based Histopathological Image Classification. 6159-6168 - Gaurav Patel

, Jan P. Allebach, Qiang Qiu:
Seq-UPS: Sequential Uncertainty-aware Pseudo-label Selection for Semi-Supervised Text Recognition. 6169-6179 - Britty Baby, Daksh Thapar, Mustafa Chasmai, Tamajit Banerjee, Kunal Dargan, Ashish Suri, Subhashis Banerjee, Chetan Arora:

From Forks to Forceps: A New Framework for Instance Segmentation of Surgical Instruments. 6180-6190 - Moein Heidari, Amirhossein Kazerouni

, Milad Soltany Kadarvish, Reza Azad, Ehsan Khodapanah Aghdam, Julien Cohen-Adad, Dorit Merhof:
HiFormer: Hierarchical Multi-scale Representations Using Transformers for Medical Image Segmentation. 6191-6201 - Shivasankaran V. P, Muhammad Yusuf Hassan

, Mayank Singh:
LineEX: Data Extraction from Scientific Line Charts. 6202-6210 - Md Mostafijur Rahman, Radu Marculescu:

Medical Image Segmentation via Cascaded Attention Decoding. 6211-6220 - Dehua Zheng, Xiaochen Zheng, Laurence T. Yang, Yuan Gao, Chenlu Zhu, Yiheng Ruan:

MFFN: Multi-view Feature Fusion Network for Camouflaged Object Detection. 6221-6231 - Sayak Nag, Orpaz Goldstein, Amit K. Roy-Chowdhury:

Semantics Guided Contrastive Learning of Transformers for Zero-shot Temporal Activity Detection. 6232-6242 - Junshi Xia, Naoto Yokoya

, Bruno Adriano, Clifford Broni-Bediako:
OpenEarthMap: A Benchmark Dataset for Global High-Resolution Land Cover Mapping. 6243-6253 - Yong Wu, Shekhor Chanda, Mehrdad Hosseinzadeh, Zhi Liu

, Yang Wang:
Few-Shot Learning of Compact Models via Task-Specific Meta Distillation. 6254-6263 - Zicheng Pan

, Xiaohan Yu
, Miaohua Zhang, Yongsheng Gao:
SSFE-Net: Self-Supervised Feature Enhancement for Ultra-Fine-Grained Few-Shot Class Incremental Learning. 6264-6273 - Vadim Sushko, Dan Zhang, Juergen Gall, Anna Khoreva:

One-Shot Synthesis of Images and Segmentation Masks. 6274-6283 - Debabrata Pal

, Shirsha Bose, Biplab Banerjee, Yogananda V. Jeppu:
MORGAN: Meta-Learning-based Few-Shot Open-Set Recognition via Generative Adversarial Network. 6284-6293 - Ashutosh Kulkarni, Subrahmanyam Murala:

Aerial Image Dehazing with Attentive Deformable Transformers. 6294-6303 - Zhiyuan You

, Kai Yang, Wenhan Luo
, Xin Lu, Lei Cui, Xinyi Le:
Few-shot Object Counting with Similarity-Aware Feature Enhancement. 6304-6313 - He-Yen Hsieh, Ding-Jie Chen, Cheng-Wei Chang, Tyng-Luh Liu:

Aggregating Bilateral Attention for Few-Shot Instance Localization. 6314-6323 - Nathan Elias:

Deep Learning Methodology for Early Detection and Outbreak Prediction of Invasive Species Growth. 6324-6332 - Alvaro Gómez, Gregory Randall, Gabriele Facciolo, Rafael Grompone von Gioi:

Improving the Pair Selection and the Model Fusion Steps of Satellite Multi-View Stereo Pipelines. 6333-6342 - Ankit Jha, Shirsha Bose, Biplab Banerjee:

GAF-Net: Improving the Performance of Remote Sensing Image Fusion using Novel Global Self and Cross Attention Learning. 6343-6352 - Marcelo Gennari Do Nascimento, Victor Adrian Prisacariu, Roger Fawcett, Martin Langhammer:

HyperBlock Floating Point: Generalised Quantization Scheme for Gradient and Inference Computation. 6353-6362 - Minseok Seo, Hakjin Lee, Yongjin Jeon, Junghoon Seo:

Self-Pair: Synthesizing Changes from Single Source for Object Change Detection in Remote Sensing Imagery. 6363-6372 - Ke Li, Dengxin Dai, Luc Van Gool:

Jointly Learning Band Selection and Filter Array Design for Hyperspectral Imaging. 6373-6383 - Evangelos Moschos, Alisa Kugusheva, Paul Coste, Alexandre Stegner:

Computer Vision for Ocean Eddy Detection in Infrared Imagery. 6384-6393 - JunKyu Lee

, Blesson Varghese, Hans Vandierendonck:
ROMA: Run-Time Object Detection To Maximize Real-Time Accuracy. 6394-6403 - Swati Bhugra, Vinay Kaushik, Amit Gupta, Brejesh Lall, Santanu Chaudhury:

AnoLeaf: Unsupervised Leaf Disease Segmentation via Structurally Robust Generative Inpainting. 6404-6413 - Sieger Falkena, Hadi Jamali Rad, Jan van Gemert:

LAB: Learnable Activation Binarizer for Binary Neural Networks. 6414-6423 - Cuong Pham, Tuan Hoang, Thanh-Toan Do:

Collaborative Multi-Teacher Knowledge Distillation for Learning Low Bit-width Deep Neural Networks. 6424-6432 - Saptarshi Sinha, Hiroki Ohashi:

Difficulty-Net: Learning to Predict Difficulty for Long-Tailed Recognition. 6433-6442 - Shwai He, Chenbo Jiang, Daize Dong, Liang Ding:

SD-Conv: Towards the Parameter-Efficiency of Dynamic Convolution. 6443-6452 - Hanyu Peng, Weiguo Pian, Mingming Sun, Ping Li:

Dynamic Re-weighting for Long-tailed Semi-supervised Learning. 6453-6463 - Hai Lan, Xihao Wang

, Hao Shen
, Peidong Liang, Xian Wei:
Couplformer: Rethinking Vision Transformer with Coupling Attention. 6464-6473 - Kuan-Ying Lee, Yuanyi Zhong, Yu-Xiong Wang:

Do Pre-trained Models Benefit Equally in Continual Learning? 6474-6482 - Amélie Gruel

, Jean Martinet, Bernabé Linares-Barranco, Teresa Serrano-Gotarredona:
Performance comparison of DVS data spatial downscaling methods using Spiking Neural Networks. 6483-6491 - Vinay Kumar Verma, Nikhil Mehta, Shijing Si, Ricardo Henao, Lawrence Carin:

Pushing the Efficiency Limit Using Structured Sparse Convolutions. 6492-6502 - Bo Zhao, Hakan Bilen

:
Dataset Condensation with Distribution Matching. 6503-6512 - Molly O'Brien, Brett Wolfinger, Julia V. Bukowski

, Mathias Unberath, Aria Pezeshk, Greg Hager:
Mapping DNN Embedding Manifolds for Network Generalization Prediction. 6513-6522 - Shreyansh Jain, Koteswar Rao Jerripothula

:
Federated Learning for Commercial Image Sources. 6523-6532

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














