default search action
WACV 2023: Waikoloa, HI, USA
- IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2023, Waikoloa, HI, USA, January 2-7, 2023. IEEE 2023, ISBN 978-1-6654-9346-8
- Vivek Trivedy, Longin Jan Latecki:
CNN2Graph: Building Graphs for Image Classification. 1-11 - Dmitrii Marin, Jen-Hao Rick Chang, Anurag Ranjan, Anish Prabhu, Mohammad Rastegari, Oncel Tuzel:
Token Pooling in Vision Transformers for Image Classification. 12-21 - Yuting Wang, Ricardo Guerrero, Vladimir Pavlovic:
D2F2WOD: Learning Object Proposals for Weakly-Supervised Object Detection via Progressive Domain Adaptation. 22-31 - Tal Ridnik, Gilad Sharir, Avi Ben-Cohen, Emanuel Ben Baruch, Asaf Noy:
ML-Decoder: Scalable and Versatile Classification Head. 32-41 - Andres Palechor, Annesha Bhoumik, Manuel Günther:
Large-Scale Open-Set Classification Protocols for ImageNet. 42-51 - George Adaimi, David Mizrahi, Alexandre Alahi:
Composite Relationship Fields with Transformers for Scene Graph Generation. 52-64 - Yutong Bai, Angtian Wang, Adam Kortylewski, Alan L. Yuille:
CoKe: Contrastive Learning for Robust Keypoint Detection. 65-74 - Quentin Bouniot, Angélique Loesch, Amaury Habrard, Romaric Audigier:
Towards Few-Annotation Learning for Object Detection: Are Transformer-based Models More Efficient? 75-84 - Tyler LaBonte, Yale Song, Xin Wang, Vibhav Vineet, Neel Joshi:
Scaling Novel Object Detection with Weakly Supervised Detection Transformers. 85-96 - Yung-Hsu Yang, Thomas E. Huang, Min Sun, Samuel Rota Bulò, Peter Kontschieder, Fisher Yu:
Dense Prediction with Attentive Feature Aggregation. 97-106 - Chull Hwan Song, Jooyoung Yoon, Shunghyun Choi, Yannis Avrithis:
Boosting vision transformers for image retrieval. 107-117 - Paul Albert, Eric Arazo, Tarun Krishna, Noel E. O'Connor, Kevin McGuinness:
Is your noise correction noisy? PLS: Robustness to label noise with two stage detection. 118-127 - Martin Engilberge, Haixin Shi, Zhiye Wang, Pascal Fua:
Two-level Data Augmentation for Calibrated Multi-view Detection. 128-136 - Soufiane Belharbi, Ismail Ben Ayed, Luke McCaffrey, Eric Granger:
TCAM: Temporal Class Activation Maps for Object Localization in Weakly-Labeled Unconstrained Videos. 137-146 - Islam Nassar, Munawar Hayat, Ehsan Abbasnejad, Hamid Rezatofighi, Mehrtash Harandi, Gholamreza Haffari:
LAVA:Label-efficient Visual Learning and Adaptation. 147-156 - Rishi Agarwal, Tirupati Saketh Chandra, Vaidehi Patil, Aniruddha Mahapatra, Kuldeep Kulkarni, Vishwa Vinay:
GEMS: Scene Expansion using Generative Models of Graphs. 157-166 - Mingjie Wang, Hao Cai, Yong Dai, Minglun Gong:
Dynamic Mixture of Counter Network for Location-Agnostic Crowd Counting. 167-177 - Teppei Kurita, Yuhi Kondo, Legong Sun, Yusuke Moriuchi:
Simultaneous Acquisition of High Quality RGB Image and Polarization Information using a Sparse Polarization Sensor. 178-188 - Bingchuan Li, Shaofei Cai, Wei Liu, Peng Zhang, Qian He, Miao Hua, Zili Yi:
DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editings. 189-197 - Zhihao Duan, Ming Lu, Zhan Ma, Fengqing Zhu:
Lossy Image Compression with Quantized Hierarchical VAEs. 198-207 - Jitesh Jain, Yuqian Zhou, Ning Yu, Humphrey Shi:
Keys to Better Image Inpainting: Structure and Texture Go Hand in Hand. 208-217 - Pedro Figueirêdo, Avinash Paliwal, Nima Khademi Kalantari:
Frame Interpolation for Dynamic Scenes with Implicit Flow Encoding. 218-228 - Jiwan Hur, Jae Young Lee, Jaehyun Choi, Junmo Kim:
I See-Through You: A Framework for Removing Foreground Occlusion in Both Sparse and Dense Light Field Images. 229-238 - B. H. Pawan Prasad, Green Rosh K. S, R. B. Lokesh, Kaushik Mitra:
Burst Reflection Removal using Reflection Motion Aggregation Cues. 239-248 - Tai-Yin Chiu, Danna Gurari:
Line Search-Based Feature Transformation for Fast, Stable, and Tunable Content-Style Control in Photorealistic Style Transfer. 249-258 - Liyun Zhang, Photchara Ratsamee, Bowen Wang, Zhaojie Luo, Yuki Uranishi, Manabu Higashida, Haruo Takemura:
Panoptic-aware Image-to-Image Translation. 259-268 - Abhishek Jha, Soroush Seifi, Tinne Tuytelaars:
SimGlim: Simplifying glimpse based active visual reconstruction. 269-278 - Lorenzo Luzi, Carlos Ortiz Marrero, Nile Wynar, Richard G. Baraniuk, Michael J. Henry:
Evaluating generative networks using Gaussian mixtures of image features. 279-288 - Xihui Liu, Dong Huk Park, Samaneh Azadi, Gong Zhang, Arman Chopikyan, Yuxiao Hu, Humphrey Shi, Anna Rohrbach, Trevor Darrell:
More Control for Free! Image Synthesis with Semantic Diffusion Guidance. 289-299 - James F. Mullen Jr., Divya Kothandaraman, Aniket Bera, Dinesh Manocha:
Placing Human Animations into 3D Scenes by Learning Interaction- and Geometry-Driven Keyframes. 300-310 - Takafumi Iwaguchi, Hiroshi Kawasaki:
Surface normal estimation from optimized and distributed light sources using DNN-based photometric stereo. 311-320 - David Hart, Michael Whitney, Bryan S. Morse:
Interpolated SelectionConv for Spherical Images and Surfaces. 321-330 - Yingnan Ma, Chenqiu Zhao, Anup Basu, Xudong Li:
RAST: Restorable Arbitrary Style Transfer via Multi-restoration. 331-340 - Cameron Gordon, Shin-Fang Ch'ng, Lachlan E. MacDonald, Simon Lucey:
On Quantizing Implicit Neural Representations. 341-350 - Lydia Lindner, Alexander Effland, Filip Ilic, Thomas Pock, Erich Kobler:
Lightweight Video Denoising using Aggregated Shifted Window Attention. 351-360 - Junming Chen, Meirui Jiang, Qi Dou, Qifeng Chen:
Federated Domain Generalization for Image Recognition via Cross-Client Style Transfer. 361-370 - Shahar Mahpod, Noam Gaash, Hay Hoffman, Gil Ben-Artzi:
CTrGAN: Cycle Transformers GAN for Gait Transfer. 371-381 - Divya Kothandaraman, Sumit Shekhar, Abhilasha Sancheti, Manoj Ghuhan, Tripti Shukla, Dinesh Manocha:
SALAD : Source-free Active Label-Agnostic Domain Adaptation for Classification, Segmentation and Detection. 382-391 - Thomas Westfechtel, Hao-Wei Yeh, Qier Meng, Yusuke Mukuta, Tatsuya Harada:
Backprop Induced Feature Weighting for Adversarial Domain Adaptation with Iterative Label Distribution Alignment. 392-401 - Md Mahmudur Rahman, Rameswar Panda, Mohammad Arif Ul Alam:
Semi-Supervised Domain Adaptation with Auto-Encoder via Simultaneous Learning. 402-411 - Haomiao Ni, Yihao Liu, Sharon X. Huang, Yuan Xue:
Cross-identity Video Motion Retargeting with Joint Transformation and Synthesis. 412-422 - Giulio Mattolin, Luca Zanella, Elisa Ricci, Yiming Wang:
ConfMix: Unsupervised Domain Adaptation for Object Detection via Confidence-based Mixing. 423-433 - Tejas Gokhale, Rushil Anirudh, Jayaraman J. Thiagarajan, Bhavya Kailkhura, Chitta Baral, Yezhou Yang:
Improving Diversity with Adversarially Learned Transformations for Domain Generalization. 434-443 - Donald Shenaj, Eros Fanì, Marco Toldo, Debora Caldarola, Antonio Tavera, Umberto Michieli, Marco Ciccone, Pietro Zanuttigh, Barbara Caputo:
Learning Across Domains and Devices: Style-Driven Source-Free Domain Adaptation in Clustered Federated Learning. 444-454 - Matthew R. Keaton, Ram J. Zaveri, Gianfranco Doretto:
CellTranspose: Few-shot Domain Adaptation for Cellular Instance Segmentation. 455-466 - Swati Jindal, Xin Eric Wang:
CUDA-GHR: Controllable Unsupervised Domain Adaptation for Gaze and Head Redirection. 467-477 - Vibashan VS, Poojan Oza, Vishal M. Patel:
Towards Online Domain Adaptive Object Detection. 478-488 - Kyusik Cho, Suhyeon Lee, Hongje Seong, Euntai Kim:
Domain Adaptive Video Semantic Segmentation via Cross-Domain Moving Object Mixing. 489-498 - Fabrizio J. Piva, Daan de Geus, Gijs Dubbelman:
Empirical Generalization Study: Unsupervised Domain Adaptation vs. Domain Generalization Methods for Semantic Segmentation in the Wild. 499-508 - Yumeng Li, Dan Zhang, Margret Keuper, Anna Khoreva:
Intra-Source Style Augmentation for Improved Domain Generalization. 509-519 - Jinyu Yang, Jingjing Liu, Ning Xu, Junzhou Huang:
TVT: Transferable Vision Transformer for Unsupervised Domain Adaptation. 520-530 - Sungsu Hur, Inkyu Shin, Kwanyong Park, Sanghyun Woo, In So Kweon:
Learning Classifiers of Prototypes and Reciprocal Points for Universal Domain Adaptation. 531-540 - Michael Essich, Markus Rehmann, Cristóbal Curio:
Auxiliary Task-Guided CycleGAN for Black-Box Model Domain Adaptation. 541-550 - Weiwei Sun, Daniel Rebain, Renjie Liao, Vladimir Tankovich, Soroosh Yazdani, Kwang Moo Yi, Andrea Tagliasacchi:
NeuralBF: Neural Bilateral Filtering for Top-down Instance Segmentation on Point Clouds. 551-560 - Brent Griffin:
Mobile Robot Manipulation using Pure Object Detection. 561-571 - Driton Salihu, Eckehard G. Steinbach:
SGPCR: Spherical Gaussian Point Cloud Representation and its Application to Object Registration and Retrieval. 572-581 - Min Seok Lee, Seok Woo Yang, Sung Won Han:
GaIA: Graphical Information Gain based Attention Network for Weakly Supervised Point Cloud Semantic Segmentation. 582-591 - Jaeyeon Kim, Binh-Son Hua, Duc Thanh Nguyen, Sai-Kit Yeung:
PointInverter: Point Cloud Reconstruction and Editing via a Generative Model with Shape Priors. 592-601 - Maximilian Pittner, Alexandru Condurache, Joel Janai:
3D-SpLineNet: 3D Traffic Line Detection using Parametric Spline Representations. 602-611 - Jinlong Li, Runsheng Xu, Jin Ma, Qin Zou, Jiaqi Ma, Hongkai Yu:
Domain Adaptive Object Detection for Autonomous Driving under Foggy Weather. 612-622 - Dusan Malic, Christian Fruhwirth-Reisinger, Horst Possegger, Horst Bischof:
SAILOR: Scaling Anchors via Insights into Latent Object Representation. 623-632 - Zhimin Chen, Longlong Jing, Liang Yang, Yingwei Li, Bing Li:
Class-Level Confidence Based 3D Semi-Supervised Learning. 633-642 - Minghan Zhu, Lingting Ge, Panqu Wang, Huei Peng:
MonoEdge: Monocular 3D Object Detection Using Local Perspectives. 643-652 - Minmin Yang, Jiajing Chen, Senem Velipasalar:
Cross-Modality Feature Fusion Network for Few-Shot 3D Point Cloud Classification. 653-662 - Anas Mahmoud, Jordan S. K. Hu, Steven L. Waslander:
Dense Voxel Fusion for 3D Object Detection. 663-672 - Nagma S. Khan, Kazumine Ogura, Eric Cosatto, Masayuki Ariyoshi:
Real-time Concealed Weapon Detection on 3D Radar Images for Walk-through Screening System. 673-681 - Daeun Lee, Jinkyu Kim:
Resolving Class Imbalance for LiDAR-based Object Detector by Dynamic Weight Average and Contextual Ground Truth Sampling. 682-691 - Shubham Gupta, Jeet Kanjani, Mengtian Li, Francesco Ferroni, James Hays, Deva Ramanan, Shu Kong:
Far3Det: Towards Far-Field 3D Detection. 692-701 - Dmitrii Torbunov, Yi Huang, Haiwang Yu, Jin Huang, Shinjae Yoo, Meifeng Lin, Brett Viren, Yihui Ren:
UVCGAN: UNet Vision Transformer cycle-consistent GAN for unpaired image-to-image translation. 702-712 - Simon Niklaus, Ping Hu, Jiawen Chen:
Splatting-based Synthesis for Video Frame Interpolation. 713-723 - Kyungmin Jo, Gyumin Shim, Sanghun Jung, Soyoung Yang, Jaegul Choo:
CG-NeRF: Conditional Generative Neural Radiance Fields for 3D-aware Image Synthesis. 724-733 - Nikola Popovic, Ritika Chakraborty, Danda Pani Paudel, Thomas Probst, Luc Van Gool:
Spatially Multi-conditional Image Generation. 734-743 - Min Woo Kim, Nam Ik Cho:
WHFL: Wavelet-Domain High Frequency Loss for Sketch-to-Image Translation. 744-754 - David Dadon, Ohad Fried, Yacov Hel-Or:
DDNeRF: Depth Distribution Neural Radiance Fields. 755-763 - Hanbit Lee, Youna Kim, Sang-Goo Lee:
Multi-scale Contrastive Learning for Complex Scene Generation. 764-774 - Pol Caselles, Eduard Ramon, Jaime García, Xavier Giró-i-Nieto, Francesc Moreno-Noguer, Gil Triginer:
SIRA: Relightable Avatars from a Single Image. 775-784 - Aditya Chattopadhyay, Xi Zhang, David Paul Wipf, Himanshu Arora, René Vidal:
Learning Graph Variational Autoencoders with Constraints and Structured Priors for Conditional Indoor 3D Scene Generation. 785-794 - Mingtong Zhang, Shuhong Zheng, Zhipeng Bao, Martial Hebert, Yu-Xiong Wang:
Beyond RGB: Scene-Property Synthesis with Neural Radiance Fields. 795-805 - Kai-En Lin, Yen-Chen Lin, Wei-Sheng Lai, Tsung-Yi Lin, Yi-Chang Shih, Ravi Ramamoorthi:
Vision Transformer for NeRF-Based View Synthesis from a Single Input Image. 806-815 - Luca De Luigi, Damiano Bolognini, Federico Domeniconi, Daniele De Gregorio, Matteo Poggi, Luigi Di Stefano:
ScanNeRF: a Scalable Benchmark for Neural Radiance Fields. 816-825 - Fariborz Taherkhani, Aashish Rai, Quankai Gao, Shaunak Srivastava, Xuanbai Chen, Fernando De la Torre, Steven Song, Aayush Prakash, Daeil Kim:
Controllable 3D Generative Adversarial Face Model via Disentangling Shape and Appearance. 826-836 - Inwoo Hwang, Junho Kim, Young Min Kim:
Ev-NeRF: Event Based Neural Radiance Field. 837-847 - Chaerin Kong, Dong Hyeon Jeon, Ohjoon Kwon, Nojun Kwak:
Leveraging Off-the-shelf Diffusion Model for Multi-attribute Fashion Image Manipulation. 848-857 - Samia Shafique, Bailey Kong, Shu Kong, Charless C. Fowlkes:
Creating a Forensic Database of Shoeprints from Online Shoe-Tread Photos. 858-868 - Safa C. Medin, Amir Weiss, Frédo Durand, William T. Freeman, Gregory W. Wornell:
Can Shadows Reveal Biometric Informationƒ. 869-879 - Qiaomu Miao, Minh Hoai, Dimitris Samaras:
Patch-level Gaze Distribution Prediction for Gaze Following. 880-889 - Vikrant Nagpure, Kenji Okuma:
Searching Efficient Neural Architecture with Multi-resolution Fusion Transformer for Appearance-based Gaze Estimation. 890-899 - Siamul Karim Khan, Patrick Tinsley, Adam Czajka:
DeformIrisNet: An Identity-Preserving Model of Iris Texture Deformation. 900-908 - Haidong Zhu, Zhaoheng Zheng, Ram Nevatia:
Gait Recognition Using 3-D Human Body Shape Inference. 909-918 - Wes Robbins, Steven Zhou, Aman Bhatta, Chad Mello, Vítor Albiero, Kevin W. Bowyer, Terrance E. Boult:
CAST: Conditional Attribute Subsampling Toolkit for Fine-grained Evaluation. 919-929 - Ziyuan Huang, Zhengping Zhou, Yung-Yu Chuang, Jiajun Wu, C. Karen Liu:
Physically Plausible Animation of Human Upper Body from a Single Image. 930-939 - Manh Huynh, Gita Alaghband:
Online Adaptive Temporal Memory with Certainty Estimation for Human Trajectory Prediction. 940-949 - Igor Vozniak, Philipp Müller, Lorena Hell, Nils Lipp, Ahmed Abouelazm, Christian Müller:
Context-empowered Visual Attention Prediction in Pedestrian Scenarios. 950-960 - Akshay Agarwal, Nalini K. Ratha, Afzel Noore, Richa Singh, Mayank Vatsa:
Misclassifications of Contact Lens Iris PAD Algorithms: Is it Gender Bias or Environmental Conditions? 961-970 - André Brasil Vieira Wyzykowski, Anil K. Jain:
Synthetic Latent Fingerprint Generator. 971-980 - Andreas Specker, Mickael Cormier, Jürgen Beyerer:
UPAR: Unified Pedestrian Attribute Recognition and Person Retrieval. 981-990 - Takahiro Toizumi, Koichi Takahashi, Masato Tsukada:
Segmentation-free Direct Iris Localization Networks. 991-1000 - Ahmed Tawfik Aboukhadra, Jameel Malik, Ahmed Elhayek, Nadia Robertini, Didier Stricker:
THOR-Net: End-to-end Graformer-based Realistic Two Hands and Object Reconstruction with Self-supervision. 1001-1010 - Yuxin Tian, Shawn D. Newsam, Kofi Boakye:
Fashion Image Retrieval with Text Feedback by Additive Attention Compositional Learning. 1011-1021 - Xuri Ge, Fuhai Chen, Songpei Xu, Fuxiang Tao, Joemon M. Jose:
Cross-modal Semantic Enhanced Interaction for Image-Sentence Retrieval. 1022-1031 - Ruoyue Shen, Nakamasa Inoue, Koichi Shinoda:
Text-Guided Object Detector for Multi-modal Video Question Answering. 1032-1042 - Srikanth Malla, Chiho Choi, Isht Dwivedi, Joon Hee Choi, Jiachen Li:
DRAMA: Joint Risk Localization and Captioning in Driving. 1043-1052 - Ryugo Morita, Zhiqiang Zhang, Man M. Ho, Jinjia Zhou:
Interactive Image Manipulation with Complex Text Instructions. 1053-1062 - Konstantin Kobs, Michael Steininger, Andreas Hotho:
InDiReCT: Language-Guided Zero-Shot Deep Metric Learning for Images. 1063-1072 - Tzu-Jui Julius Wang, Jorma Laaksonen, Tomas Langer, Heikki Arponen, Tom E. Bishop:
Learning by Hallucinating: Vision-Language Pre-training with Weak Supervision. 1073-1083 - Abhishek Jha, Badri N. Patro, Luc Van Gool, Tinne Tuytelaars:
Barlow constrained optimization for Visual Question Answering. 1084-1093 - Jason Armitage, Leonardo Impett, Rico Sennrich:
A Priority Map for Vision-and-Language Navigation with Trajectory Plans and Feature-Location Cues. 1094-1103 - Chia-Wen Kuo, Chih-Yao Ma, Judy Hoffman, Zsolt Kira:
Structure-Encoding Auxiliary Tasks for Improved Visual Representation in Vision-and-Language Navigation. 1104-1113 - Jihyeon Lee, Woo-Young Kang, Eun-Sol Kim:
Dense but Efficient VideoQA for Intricate Compositional Reasoning. 1114-1123 - Ukyo Honda, Taro Watanabe, Yuji Matsumoto:
Switching to Discriminative Image Captioning by Relieving a Bottleneck of Reinforcement Learning. 1124-1134 - Bhavin Jawade, Deen Dayal Mohan, Naji Mohamed Ali, Srirangaraj Setlur, Venu Govindaraju:
NAPReg: Nouns As Proxies Regularization for Semantically Aware Cross-Modal Embeddings. 1135-1144 - Mark Hubenthal, Suren Kumar:
Image-Text Pre-Training for Logo Recognition. 1145-1154 - Sahithya Ravi, Aditya Chinchure, Leonid Sigal, Renjie Liao, Vered Shwartz:
VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge. 1155-1165 - Jonas Theiner, Ralph Ewerth:
TVCalib: Camera Calibration for Sports Field Registration in Soccer. 1166-1175 - Yue Qiu, Shintaro Yamamoto, Ryosuke Yamada, Ryota Suzuki, Hirokatsu Kataoka, Kenji Iwata, Yutaka Satoh:
3D Change Localization and Captioning from Dynamic Scans of Indoor Scenes. 1176-1185 - Donghao Qiao, Farhana H. Zulkernine:
Adaptive Feature Fusion for Cooperative Perception using LiDAR Point Clouds. 1186-1195 - Hanzhe Teng, Dimitrios Chatziparaschis, Xinyue Kan, Amit K. Roy-Chowdhury, Konstantinos Karydis:
Centroid Distance Keypoint Detector for Colored Point Clouds. 1196-1205 - Linh Trinh, Phuong Pham, Hoang Trinh, Nguyen Bach, Dung Nguyen, Giang Nguyen, Huy Nguyen:
PP4AV: A benchmarking Dataset for Privacy-preserving Autonomous Driving. 1206-1215 - Mohamed El Banani, Ignacio Rocco, David Novotný, Andrea Vedaldi, Natalia Neverova, Justin Johnson, Benjamin Graham:
Self-supervised Correspondence Estimation via Multiview Registration. 1216-1225 - Jeonghyun Kim, Kaichun Mo, Minhyuk Sung, Woontack Woo:
Seg&Struct: The Interplay Between Part Segmentation and Structure Inference for 3D Shape Parsing. 1226-1235 - Chenxi Lola Deng, Enzo Tartaglione:
Compressing Explicit Voxel Grid Representations: fast NeRFs become also small. 1236-1245