


WACV 2024: Waikoloa, HI, USA
- IEEE/CVF Winter Conference on Applications of Computer Vision, WACV 2024, Waikoloa, HI, USA, January 3-8, 2024. IEEE 2024, ISBN 979-8-3503-1892-0
- Piyush Arora, Pratik Mazumder:
Hybrid Sample Synthesis-based Debiasing of Classifier in Limited Data Setting. i-ix - Yining Ding, Andrew M. Wallace, Sen Wang:
Estimating Fog Parameters from an Image Sequence using Non-linear Optimisation. i-ix - Alon Shoshan, Ori Linial, Nadav Bhonker, Elad Hirsch, Lior Zamir, Igor Kviatkovsky, Gérard G. Medioni:
Asymmetric Image Retrieval with Cross Model Compatible Ensembles. 1-11 - Sai Aparna Aketi, Kaushik Roy:
Cross-feature Contrastive Loss for Decentralized Deep Learning on Heterogeneous Data. 12-21 - Suhas Srinath, Shankhanil Mitra, Shika Rao, Rajiv Soundararajan:
Learning Generalizable Perceptual Representations for Data-Efficient No-Reference Image Quality Assessment. 22-31 - Jayateja Kalla, Soma Biswas:
Robust Feature Learning and Global Variance-Driven Classifier Alignment for Long-Tail Class Incremental Learning. 32-41 - Amirhossein Dadashzadeh, Shuchao Duan, Alan L. Whone, Majid Mirmehdi:
PECoP: Parameter Efficient Continual Pretraining for Action Quality Assessment. 42-52 - Pierpaolo Morì, Lukas Frickenstein, Shambhavi Balamuthu Sampath, Moritz Thoma, Nael Fasfous, Manoj Rohit Vemparala, Alexander Frickenstein, Christian Unger, Walter Stechele, Daniel Mueller-Gritschneder, Claudio Passerone:
Wino Vidi Vici: Conquering Numerical Instability of 8-bit Winograd Convolution for Accurate Inference Acceleration on Edge. 53-62 - Sahil Singla, Atoosa Malemir Chegini, Mazda Moayeri, Soheil Feizi:
Data-Centric Debugging: mitigating model failures via targeted image retrieval. 63-74 - Jinfeng Wang, Sifan Song, Jionglong Su, S. Kevin Zhou:
Distortion-Disentangled Contrastive Learning. 75-85 - Xuwei Xu, Sen Wang, Yudong Chen, Yanping Zheng, Zhewei Wei, Jiajun Liu:
GTP-ViT: Efficient Vision Transformers via Graph-based Token Propagation. 86-95 - Khanh-Binh Nguyen:
SequenceMatch: Revisiting the design of weak-strong augmentations for Semi-supervised learning. 96-105 - Saurabh Kumar Jain, Sukhendu Das:
Stochastic Binary Network for Universal Domain Adaptation. 106-115 - Mathilde Caron, Neil Houlsby, Cordelia Schmid:
Location-Aware Self-Supervised Transformers for Semantic Segmentation. 116-126 - Kilian Batzner, Lars Heckler, Rebecca König:
EfficientAD: Accurate Visual Anomaly Detection at Millisecond-Level Latencies. 127-137 - Seungwook Kim, Juhong Min, Minsu Cho:
Efficient Semantic Matching with Hypercolumn Correlation. 138-147 - Jie Zhang, Masanori Suganuma, Takayuki Okatani:
Contextual Affinity Distillation for Image Anomaly Detection. 148-157 - Hojin Kim, Seunghun Lee, Hyeon Kang, Sunghoon Im:
Offline-to-Online Knowledge Distillation for Video Instance Segmentation. 158-167 - Yanda Li, Zilong Huang, Gang Yu, Ling Chen, Yunchao Wei, Jianbo Jiao:
Disentangled Pre-training for Image Matting. 168-177 - Ziqiang Shi, Rujie Liu:
Conditional Velocity Score Estimation for Image Restoration. 178-187 - Lorenzo Agnolucci, Leonardo Galteri, Marco Bertini, Alberto Del Bimbo:
ARNIQA: Learning Distortion Manifold for Image Quality Assessment. 188-197 - Bokyeung Lee, Kyungdeuk Ko, Jonghwan Hong, Hanseok Ko:
Hard Sample-aware Consistency for Low-resolution Facial Expression Recognition. 198-207 - Maxim Shugaev, Ilya Semenov, Kyle Ashley, Michael Klaczynski, Naresh Cuntoor, Mun Wai Lee, Nathan Jacobs:
ArcGeo: Localizing Limited Field-of-View Images using Cross-view Matching. 208-217 - Hiran Sarkar, Vishal M. Chudasama, Naoyuki Onoe, Pankaj Wasnik, Vineeth N. Balasubramanian:
Open-Set Object Detection By Aligning Known Class Representations. 218-227 - Inkyu Shin, Dahun Kim, Qihang Yu, Jun Xie, Hong-Seok Kim, Bradley Green, In So Kweon, Kuk-Jin Yoon, Liang-Chieh Chen:
Video-kMaX: A Simple Unified Approach for Online and Near-Online Video Panoptic Segmentation. 228-238 - Volodymyr Fedynyak, Yaroslav Romanus, Bohdan Hlovatskyi, Bohdan Sydor, Oles Dobosevych, Igor Babin, Roman Riazantsev:
DeVOS: Flow-Guided Deformable Transformer for Video Object Segmentation. 239-248 - Md Awsafur Rahman, Shaikh Anowarul Fattah:
Semi-Supervised Semantic Depth Estimation using Symbiotic Transformer and NearFarMix Augmentation. 249-258 - Vladan Stojnic, Zakaria Laskar, Giorgos Tolias:
Training Ensembles with Inliers and Outliers for Semi-supervised Active Learning. 259-268 - Samuel Black, Richard Souvenir:
Multi-view Classification Using Hybrid Fusion and Mutual Distillation. 269-279 - Jiayang Ao, Qiuhong Ke, Krista A. Ehinger:
Amodal Intra-class Instance Segmentation: Synthetic Datasets and Benchmark. 280-289 - Balamurali Murugesan, Rukhshanda Hussain, Rajarshi Bhattacharya, Ismail Ben Ayed, Jose Dolz:
Prompting classes: Exploring the Power of Prompt Class Learning in Weakly Supervised Semantic Segmentation. 290-301 - Jingwen Sun, Jing Wu, Ze Ji, Yu-Kun Lai:
RSMPNet: Relationship Guided Semantic Map Prediction. 302-311 - Rajeev Yasarla, Renliang Weng, Wongun Choi, Vishal M. Patel, Amir Sadeghian:
3SD: Self-Supervised Saliency Detection With No Labels. 312-321 - Zenglin Shi, Ying Sun, Mengmi Zhang:
Training-free Object Counting with Prompts. 322-330 - Souradeep Chakraborty, Shujon Naha, Muhammet Bastan, Amit Kumar K. C, Dimitris Samaras:
Unsupervised and semi-supervised co-salient object detection via segmentation frequency statistics. 331-341 - Zheng Xiong, Liangyu Chai, Wenxi Liu, Yongtuo Liu, Sucheng Ren, Shengfeng He:
Glance to Count: Learning to Rank with Anchors for Weakly-supervised Crowd Counting. 342-351 - Yahia Dalbah, Jean Lahoud, Hisham Cholakkal:
TransRadar: Adaptive-Directional Transformer for Real-Time Multi-View Radar Semantic Segmentation. 352-361 - Jinwoo Hwang, Philipp Benz, Pete Kim:
Booster-SHOT: Boosting Stacked Homography Transformations for Multiview Pedestrian Detection with Attention. 362-371 - Zhifeng Teng, Jiaming Zhang, Kailun Yang, Kunyu Peng, Hao Shi, Simon Reiß, Ke Cao, Rainer Stiefelhagen:
360BEV: Panoramic Semantic Mapping for Indoor Bird's-Eye View. 372-381 - Yasser Abdelaziz Dahou Djilali, Kevin McGuinness, Noel E. O'Connor:
Learning Saliency From Fixations. 382-392 - Qilei Li, Shaogang Gong:
Mitigate Domain Shift by Primary-Auxiliary Objectives Association for Generalizing Person ReID. 393-402 - Md. Motiur Rahman, Shiva Shokouhmand, Smriti Bhatt, Miad Faezipour:
MIST: Medical Image Segmentation Transformer with Convolutional Attention Mixing (CAM) Decoder. 403-412 - Cheolhyun Mun, Sanghuk Lee, Youngjung Uh, Junsuk Choe, Hyeran Byun:
Small Objects Matters in Weakly-supervised Semantic Segmentation. 413-422 - Qizhen Lan, Qing Tian:
Gradient-Guided Knowledge Distillation for Object Detectors. 423-432 - Beoungwoo Kang, Seunghun Moon, Yubin Cho, Hyunwoo Yu, Suk-Ju Kang:
MetaSeg: MetaFormer-based Global Contexts-aware Network for Efficient Semantic Segmentation. 433-442 - Cheng-Hsiu Chen, Jheng-Wei Su, Min-Chun Hu, Chih-Yuan Yao, Hung-Kuo Chu:
Panelformer: Sewing Pattern Reconstruction from 2D Garment Images. 443-452 - Ruxue Wen, Hangjie Yuan, Dong Ni, Wenbo Xiao, Yaoyao Wu:
From Denoising Training to Test-Time Adaptation: Enhancing Domain Generalization for Medical Image Segmentation. 453-463 - Tariq Berrada, Camille Couprie, Karteek Alahari, Jakob Verbeek:
Guided Distillation for Semi-Supervised Instance Segmentation. 464-472 - Gwanghan Lee, Saebyeol Shin, Taeyoung Na, Simon S. Woo:
Real-Time User-guided Adaptive Colorization with Vision Transformer. 473-482 - Bin Duan, Hao Tang, Changchang Sun, Ye Zhu, Yan Yan:
Mining and Unifying Heterogeneous Contrastive Relations for Weakly-Supervised Actor-Action Segmentation. 483-492 - Yessine Khanfir, Marwa Dhiaf, Emna Ghodhbani, Ahmed Cheikh Rouhou, Yousri Kessentini:
Graph Neural Networks for End-to-End Information Extraction from Handwritten Documents. 493-501 - Lei Li:
CPSeg: Finer-grained Image Semantic Segmentation via Chain-of-Thought Language Prompting. 502-511 - Xiaobo Yang, Xiaojin Gong:
Foundation Model Assisted Weakly Supervised Semantic Segmentation. 512-521 - Ahmed Ben Saad, Gabriele Facciolo, Axel Davy:
On the Importance of Large Objects in CNN Based Object Detection Algorithms. 522-531 - Yeti Ziya Gürbüz, Ogul Can, A. Aydin Alatan:
Deep Metric Learning with Chance Constraints. 532-542 - Tajamul Ashraf, Fuzayil Bin Afzal Mir, Iqra Altaf Gillani:
TransFed: A way to epitomize Focal Modulation using Transformer-based Federated Learning. 543-552 - Yangzheng Wu, Michael A. Greenspan:
Learning Better Keypoints for Multi-Object 6DoF Pose Estimation. 553-563 - Praful Mathur, Shashi Kumar Parwani, Mrinmoy Sen, Roopa Sheshadri, Aman Sharma:
Object Aware Contrastive Prior for Interactive Image Segmentation. 564-573 - Teodora Popordanoska, Aleksei Tiulpin, Matthew B. Blaschko:
Beyond Classification: Definition and Density-based Estimation of Calibration in Object Detection. 574-583 - Jianlong Yuan, Minh Hieu Phan, Liyang Liu, Yifan Liu:
FAKD: Feature Augmented Knowledge Distillation for Semantic Segmentation. 584-594 - Han Qiu, Gongjie Zhang, Jiaxing Huang, Peng Gao, Zhang Wei, Shijian Lu:
Efficient MAE towards Large-Scale Vision Transformers. 595-604 - Saad Himmi, Vincent Parret, Ajad Chhatkuli, Luc Van Gool:
MS-EVS: Multispectral event-based vision for deep learning based face detection. 605-614 - Hyuna Cho, Injun Choi, Suha Kwak, Won Hwa Kim:
Interactive Network Perturbation between Teacher and Students for Semi-Supervised Semantic Segmentation. 615-624 - Gengyuan Zhang, Yurui Zhang, Kerui Zhang, Volker Tresp:
Can Vision-Language Models be a Good Guesser? Exploring VLMs for Times and Location Reasoning. 625-634 - Ji-Ye Jeon, Xuan Truong Nguyen, Soojung Ryu, Hyuk-Jae Lee:
USDN: A Unified Sample-wise Dynamic Network with Mixed-Precision and Early-Exit. 635-643 - Qian Xie, Ta Ying Cheng, Jia-Xing Zhong, Kaichen Zhou, Andrew Markham, Niki Trigoni:
Beyond Fusion: Modality Hallucination-based Multispectral Fusion for Pedestrian Detection. 644-653 - Fangchen Yu, Yina Xie, Lei Wu, Yafei Wen, Guozhi Wang, Shuai Ren, Xiaoxin Chen, Jianfeng Mao, Wenye Li:
DocReal: Robust Document Dewarping of Real-Life Images via Attention-Enhanced Control Point Prediction. 654-663 - Hasib Zunair, A. Ben Hamza:
Learning to Recognize Occluded and Small Objects with Partial Inputs. 664-673 - Razieh Kaviani Baghbaderani, Yuanxin Li, Shuangquan Wang, Hairong Qi:
Temporally-Consistent Video Semantic Segmentation with Bidirectional Occlusion-guided Feature Propagation. 674-684 - Nikhil Reddy, Mahsa Baktashmotlagh, Chetan Arora:
Domain-Aware Knowledge Distillation for Continual Model Generalization. 685-696 - Kamalakar Vijay Thakare, Debi Prosad Dogra, Heeseung Choi, Haksub Kim, Ig-Jae Kim:
Let's Observe Them Over Time: An Improved Pedestrian Attribute Recognition Approach. 697-706 - Alloy Das, Sanket Biswas, Ayan Banerjee, Josep Lladós, Umapada Pal, Saumik Bhattacharya:
Harnessing the Power of Multi-Lingual Datasets for Pre-training: Towards Enhancing Text Spotting Performance. 707-717 - Tianyi Zhang, Kishore Kasichainula, Yaoxin Zhuo, Baoxin Li, Jae-Sun Seo, Yu Cao:
Patch-based Selection and Refinement for Early Object Detection. 718-727 - Cagri Gungor, Adriana Kovashka:
Boosting Weakly Supervised Object Detection using Fusion and Priors from Hallucinated Depth. 728-737 - Ashutosh Kulkarni, Shruti S. Phutke, Santosh Kumar Vipparthi, Subrahmanyam Murala:
C2AIR: Consolidated Compact Aerial Image Haze Removal. 738-747 - K. N. Ajay Shastry, K. Ravi Sri Teja, Aditya Nigam, Chetan Arora:
Favoring One Among Equals - Not a Good Idea: Many-to-one Matching for Robust Transformer based Pedestrian Detection. 748-757 - Cheng Yang, Rui Xu, Ye Guo, Peixiang Huang, Yiru Chen, Wenkui Ding, Zhongyuan Wang, Hong Zhou:
Improving Vision-and-Language Reasoning via Spatial Relations Modeling. 758-767 - Chau Pham, Truong Vu, Khoi Nguyen:
LP-OVOD: Open-Vocabulary Object Detection by Linear Probing. 768-777 - Barsegh Atanyan, Levon Khachatryan, Shant Navasardyan, Yunchao Wei, Humphrey Shi:
Continuous Adaptation for Interactive Segmentation Using Teacher-Student Architecture. 778-788 - Qiyang Wan, Ruiping Wang, Xilin Chen:
Interpretable Object Recognition by Semantic Prototype Analysis. 789-798 - Gregor Köhler, Tassilo Wald, Constantin Ulrich, David Zimmerer, Paul F. Jaeger, Jörg K. H. Franke, Simon Kohl, Fabian Isensee, Klaus H. Maier-Hein:
RecycleNet: Latent Feature Recycling Leads to Iterative Decision Refinement. 799-807 - Junehyoung Kwon, Eunju Lee, Yunsung Cho, YoungBin Kim:
Learning to Detour: Shortcut Mitigating Augmentation for Weakly Supervised Semantic Segmentation. 808-817 - Connor Anderson, Matthew Gwilliam, Evelyn Gaskin, Ryan Farrell:
Elusive Images: Beyond Coarse Analysis for Fine-Grained Recognition. 818-828 - Xiaoyu Dong, Naoto Yokoya:
Understanding Dark Scenes by Contrasting Multi-Modal Observations. 829-839 - Abdullah Rashwan, Jiageng Zhang, Ali Taalimi, Fan Yang, Xingyi Zhou, Chaochao Yan, Liang-Chieh Chen, Yeqing Li:
MaskConver: Revisiting Pure Convolution Model for Panoptic Segmentation. 840-850 - Fangwen Wu, Jingxuan He, Yufei Yin, Yanbin Hao, Gang Huang, Lechao Cheng:
Masked Collaborative Contrast for Weakly Supervised Semantic Segmentation. 851-860 - Zizheng Yan, Yushuang Wu, Yipeng Qin, Xiaoguang Han, Shuguang Cui, Guanbin Li:
Universal Semi-supervised Model Adaptation via Collaborative Consistency Training. 861-871 - Sergi Garcia-Bordils, Dimosthenis Karatzas, Marçal Rusiñol:
STEP - Towards Structured Scene-Text Spotting. 872-881 - Zhuoming Liu, Xuefeng Hu, Ram Nevatia:
Efficient Feature Distillation for Zero-shot Annotation Object Detection. 882-891 - Shangbang Long, Siyang Qin, Yasuhisa Fujii, Alessandro Bissacco, Michalis Raptis:
Hierarchical Text Spotter for Joint Text Spotting and Layout Analysis. 892-902 - Taotao Jing, Lichen Wang, Naji Khosravan, Zhiqiang Wan, Zachary Bessinger, Zhengming Ding, Sing Bing Kang:
iBARLE: imBalance-Aware Room Layout Estimation. 903-913 - Shuo Wang, Jing Li, Zibo Zhao, Dongze Lian, Binbin Huang, Xiaomei Wang, Zhengxin Li, Shenghua Gao:
TSP-Transformer: Task-Specific Prompts Boosted Transformer for Holistic Scene Understanding. 914-923 - Peter Naylor, Diego Di Carlo, Arianna Traviglia, Makoto Yamada, Marco Fiorucci:
Implicit neural representation for change detection. 924-934 - Hei Law, Jia Deng:
Label-Free Synthetic Pretraining of Object Detectors. 935-945 - Ximeng Sun, Rameswar Panda, Chun-Fu Richard Chen, Naigang Wang, Bowen Pan, Aude Oliva, Rogério Feris, Kate Saenko:
Improved Techniques for Quantizing Deep Networks with Adaptive Bit-Widths. 946-956 - Maximilian Bernhard, Roberto Amoroso, Yannic Kindermann, Lorenzo Baraldi, Rita Cucchiara, Volker Tresp, Matthias Schubert:
What's Outside the Intersection? Fine-grained Error Analysis for Semantic Segmentation Beyond IoU. 957-966 - Hao Chen, Yonghan Dong, Zheming Lu, Yunlong Yu, Jungong Han:
Pixel Matching Network for Cross-Domain Few-Shot Segmentation. 967-976 - Joonhyun Jeong, Beomyoung Kim, Joonsang Yu, Youngjoon Yoo:
EResFD: Rediscovery of the Effectiveness of Standard Convolution for Lightweight Face Detection. 977-987 - Mir Rayat Imtiaz Hossain, Leonid Sigal, James J. Little:
Framework-agnostic Semantically-aware Global Reasoning for Segmentation. 988-998 - Arvi Jonnarth, Yushan Zhang, Michael Felsberg:
High-fidelity Pseudo-labels for Boosting Weakly-Supervised Segmentation. 999-1008 - Harsh Maheshwari, Yen-Cheng Liu, Zsolt Kira:
Missing Modality Robustness in Semi-Supervised Multi-Modal Semantic Segmentation. 1009-1019 - Jialiang Zhu, Danqing Huang, Chunyu Wang, Mingxi Cheng, Ji Li, Han Hu, Xin Geng, Baining Guo:
Unsupervised Graphic Layout Grouping with Transformers. 1020-1029 - Vuong D. Nguyen, Khadija Khaldi, Dung Nguyen, Pranav Mantini, Shishir Shah:
Contrastive Viewpoint-aware Shape Learning for Long-term Person Re-Identification. 1030-1038 - Xuan Yang, Liangzhe Yuan, Kimberly Wilber, Astuti Sharma, Xiuye Gu, Siyuan Qiao, Stephanie Debats, Huisheng Wang, Hartwig Adam, Mikhail Sirotenko, Liang-Chieh Chen:
PolyMaX: General Dense Prediction with Mask Transformer. 1039-1050 - Liyang Liu, Zihan Wang, Minh Hieu Phan, Bowen Zhang, Jinchao Ge, Yifan Liu:
BPKD: Boundary Privileged Knowledge Distillation For Semantic Segmentation. 1051-1061 - Changkun Ye, Russell Tsuchida, Lars Petersson, Nick Barnes:
Label Shift Estimation for Class-Imbalance Problem: A Bayesian Approach. 1062-1071 - Aditay Tripathi, Anand Mishra, Anirban Chakraborty:
Query-guided Attention in Vision Transformers for Localizing Objects Using a Single Sketch. 1072-1081 - Yiting Li, Adam David Goodge, Fayao Liu, Chuan-Sheng Foo:
PromptAD: Zero-shot Anomaly Detection using Text Prompts. 1082-1091 - Xiaosong Wang, Ziyue Xu, Dong Yang, Leo K. Tam, Holger Roth, Daguang Xu:
Learning Quality Labels for Robust Image Classification. 1092-1101 - Haitian He, Sarah M. Erfani, Mingming Gong, Qiuhong Ke:
Learning Transferable Representations for Image Anomaly Localization Using Dense Pretraining. 1102-1111 - Xiangyong Lu, Masanori Suganuma, Takayuki Okatani:
SBCFormer: Lightweight Network Capable of Full-size ImageNet Classification at 1 FPS on Single Board Computers. 1112-1122 - Andrei-Timotei Ardelean, Tim Weyrich:
High-Fidelity Zero-Shot Texture Anomaly Localization Using Feature Correspondence Analysis. 1123-1133 - Jongwoo Park, Kumara Kahatapitiya, Donghyun Kim, Shivchander Sudalairaj, Quanfu Fan, Michael S. Ryoo:
Grafting Vision Transformers. 1134-1143 - Tao Liu, Chenshu Chen, Xi Yang, Wenming Tan:
Rethinking Knowledge Distillation with Raw Features for Semantic Segmentation. 1144-1153 - Soumya Roy, Vinay Kumar Verma, Deepak Gupta:
Efficient Expansion and Gradient Based Task Inference for Replay Free Incremental Learning. 1154-1164 - Hiroto Honda, Yusuke Uchida:
CLRerNet: Improving Confidence of Lane Detection with LaneIoU. 1165-1174 - Yuqi Hou, Zhongqun Zhang, Nora Horanyi, Jaewon Moon, Yihua Cheng, Hyung Jin Chang:
Multi-Modal Gaze Following in Conversational Scenarios. 1175-1184 - Sithu Aung, Haesol Park, Hyungjoo Jung, Junghyun Cho:
Enhancing Multi-view Pedestrian Detection Through Generalized 3D Feature Pulling. 1185-1194 - Zacharias Anastasakis, Dimitrios Mallis, Markos Diomataris, George Alexandridis, Stefanos Kollias, Vassilis Pitsikalis:
Self-Supervised Learning for Visual Relationship Detection through Masked Bounding Box Reconstruction. 1195-1204 - Sandra Kara, Hejer Ammar, Florian Chabot, Quoc-Cuong Pham:
The Background Also Matters: Background-Aware Motion-Guided Objects Discovery. 1205-1214 - Seonhoon Lee, Jong-Hwan Kim:
Semi-Supervised Scene Change Detection by Distillation from Feature-metric Alignment. 1215-1224