


CVPR 2021: Virtual Event
- IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2021, virtual, June 19-25, 2021. Computer Vision Foundation / IEEE 2021

Papers
- Tianyu Wang, Xiaowei Hu, Chi-Wing Fu, Pheng-Ann Heng:
Single-Stage Instance Shadow Detection With Bidirectional Relation Learning. 1-11
- Minghua Liu, Minhyuk Sung, Radomír Mech, Hao Su:
DeepMetaHandles: Learning Deformation Meta-Handles of 3D Meshes With Biharmonic Coordinates. 12-21
- Marie-Julie Rakotosaona, Paul Guerrero, Noam Aigerman, Niloy J. Mitra, Maks Ovsjanikov:
Learning Delaunay Surface Elements for Mesh Reconstruction. 22-31
- Bingbing Zhuang, Manmohan Chandraker:
Fusing the Old with the New: Learning Relative Camera Pose with Geometry-Guided Uncertainty. 32-42
- Ruoxi Shi, Zhengrong Xue, Yang You, Cewu Lu:
Skeleton Merger: An Unsupervised Aligned Keypoint Detector. 43-52
- Wenfei Yang, Tianzhu Zhang, Xiaoyuan Yu, Qi Tian, Yongdong Zhang, Feng Wu:
Uncertainty Guided Collaborative Training for Weakly Supervised Temporal Action Detection. 53-63
- Shugao Ma, Tomas Simon, Jason M. Saragih, Dawei Wang, Yuecheng Li, Fernando De la Torre, Yaser Sheikh:
Pixel Codec Avatars. 64-73
- Bumsoo Kim, Junhyun Lee, Jaewoo Kang, Eun-Sol Kim, Hyunwoo J. Kim:
HOTR: End-to-End Human-Object Interaction Detection With Transformers. 74-83
- Bo Sun, Junchi Yan, Xiao Zhou, Yinqiang Zheng:
Tuning IR-Cut Filter for Illumination-Aware Spectral Reconstruction From RGB. 84-93
- Valentin Wolf, Andreas Lugmayr, Martin Danelljan, Luc Van Gool, Radu Timofte:
DeFlow: Learning Complex Image Degradations From Unpaired Data With Conditional Flows. 94-103
- Peng Chen, Jing Liu, Bohan Zhuang, Mingkui Tan, Chunhua Shen:
AQD: Towards Accurate Quantized Object Detection. 104-113
- Wei Gao, Shangwei Guo, Tianwei Zhang, Han Qiu, Yonggang Wen, Yang Liu:
Privacy-Preserving Collaborative Learning With Automatic Transformation Search. 114-123
- Pei Wang, Yijun Li, Nuno Vasconcelos:
Rethinking and Improving the Robustness of Image Style Transfer. 124-133
- Jiaxin Cheng, Ayush Jaiswal, Yue Wu, Pradeep Natarajan, Prem Natarajan:
Style-Aware Normalized Loss for Improving Arbitrary Style Transfer. 134-143
- Youjiang Xu, Linchao Zhu, Lu Jiang, Yi Yang:
Faster Meta Update Strategy for Noise-Robust Deep Learning. 144-153
- Jindou Dai, Yuwei Wu, Zhi Gao, Yunde Jia:
A Hyperbolic-to-Hyperbolic Graph Convolutional Network. 154-163
- Jiangmiao Pang, Linlu Qiu, Xia Li, Haofeng Chen, Qi Li, Trevor Darrell, Fisher Yu:
Quasi-Dense Similarity Learning for Multiple Object Tracking. 164-173
- Chen Huang, Shuangfei Zhai, Pengsheng Guo, Josh M. Susskind:
MetricOpt: Learning To Optimize Black-Box Evaluation Metrics. 174-183
- Shipeng Wang, Xiaorong Li, Jian Sun, Zongben Xu:
Training Networks in Null Space of Feature Covariance for Continual Learning. 184-193
- Zhaowei Cai, Avinash Ravichandran, Subhransu Maji, Charless C. Fowlkes, Zhuowen Tu, Stefano Soatto:
Exponential Moving Average Normalization for Self-Supervised and Semi-Supervised Learning. 194-203
- Wei Yin, Jianming Zhang, Oliver Wang, Simon Niklaus, Long Mai, Simon Chen, Chunhua Shen:
Learning To Recover 3D Scene Shape From a Single Image. 204-213
- Yanwei Li, Hengshuang Zhao, Xiaojuan Qi, Liwei Wang, Zeming Li, Jian Sun, Jiaya Jia:
Fully Convolutional Networks for Panoptic Segmentation. 214-223
- Lei Li, Ke Gao, Juan Cao, Ziyao Huang, Yepeng Weng, Xiaoyue Mi, Zhengze Yu, Xiaoya Li, Boyang Xia:
Progressive Domain Expansion Network for Single Domain Generalization. 224-233
- Chaorui Deng, Shizhe Chen, Da Chen, Yuan He, Qi Wu:
Sketch, Ground, and Refine: Top-Down Dense Video Captioning. 234-243
- Chiho Choi, Joon Hee Choi, Jiachen Li, Srikanth Malla:
Shared Cross-Modal Trajectory Prediction for Autonomous Driving. 244-253
- Shenzhi Wang, Liwei Wu, Lei Cui, Yujun Shen:
Glancing at the Patch: Anomaly Localization With Global and Local Feature Comparison. 254-263
- Ji Liu, Dong Li, Rongzhang Zheng, Lu Tian, Yi Shan:
RankDetNet: Delving Into Ranking Constraints for Object Detection. 264-273
- Xingyuan Bu, Junran Peng, Junjie Yan, Tieniu Tan, Zhaoxiang Zhang:
GAIA: A Transfer Learning System of Object Detection That Fits Your Needs. 274-283
- Ruijie Yan, Liangrui Peng, Shanyu Xiao, Gang Yao:
Primitive Representation Learning for Scene Text Recognition. 284-293
- Lucas Tabelini Torres, Rodrigo Ferreira Berriel, Thiago M. Paixão, Claudine Badue, Alberto F. De Souza, Thiago Oliveira-Santos:
Keep Your Eyes on the Lane: Real-Time Attention-Guided Lane Detection. 294-302
- Zheng Ge, Songtao Liu, Zeming Li, Osamu Yoshie, Jian Sun:
OTA: Optimal Transport Assignment for Object Detection. 303-312
- Kai Fischer, Martin Simon, Florian Ölsner, Stefan Milz, Horst-Michael Gross, Patrick Mäder:
StickyPillars: Robust and Efficient Feature Matching on Point Clouds Using Graph Neural Networks. 313-323
- Yingjie Cai, Xuesong Chen, Chao Zhang, Kwan-Yee Lin, Xiaogang Wang, Hongsheng Li:
Semantic Scene Completion via Integrating Instances and Scene In-the-Loop. 324-333
- Zhenzhen Weng, Serena Yeung:
Holistic 3D Human and Scene Mesh Estimation From Single View Images. 334-343
- Ruihui Li, Xianzhi Li, Pheng-Ann Heng, Chi-Wing Fu:
Point Cloud Upsampling via Disentangled Refinement. 344-353
- Tong He, Chunhua Shen, Anton van den Hengel:
DyCo3D: Robust Instance Segmentation of 3D Point Clouds Through Dynamic Convolution. 354-363
- Ruibo Li, Guosheng Lin, Tong He, Fayao Liu, Chunhua Shen:
HCRF-Flow: Scene Flow From Point Clouds With Continuous High-Order CRFs and Position-Aware Flow Embedding. 364-373
- Yifan Wang, Shihao Wu, Cengiz Öztireli, Olga Sorkine-Hornung:
Iso-Points: Optimizing Neural Implicit Surfaces With Hybrid Representations. 374-383
- Gautam Pai, Jing Ren, Simone Melzi, Peter Wonka, Maks Ovsjanikov:
Fast Sinkhorn Filters: Using Matrix Scaling for Non-Rigid Shape Correspondence With Functional Maps. 384-393
- Yaqing Ding, Daniel Barath, Jian Yang, Hui Kong, Zuzana Kukelova:
Globally Optimal Relative Pose Estimation With Gravity Prior. 394-403
- Natalia Neverova, Artsiom Sanakoyeu, Patrick Labatut, David Novotný, Andrea Vedaldi:
Discovering Relationships Between Object Categories via Universal Canonical Maps. 404-413
- Hugo Germain, Vincent Lepetit, Guillaume Bourmaud:
Neural Reprojection Error: Merging Feature Learning and Camera Pose Estimation. 414-423
- Seong Hun Lee, Javier Civera:
Rotation-Only Bundle Adjustment. 424-433
- Chaoyang Wang, Simon Lucey:
PAUL: Procrustean Autoencoder for Unsupervised Lifting. 434-443
- Kun Qian, Shilin Zhu, Xinyu Zhang, Li Erran Li:
Robust Multimodal Vehicle Detection in Foggy Weather Using Complementary Lidar and Radar Signals. 444-453
- Li Wang, Liang Du, Xiaoqing Ye, Yanwei Fu, Guodong Guo, Xiangyang Xue, Jianfeng Feng, Li Zhang:
Depth-Conditioned Dynamic Message Propagation for Monocular 3D Object Detection. 454-463
- Junting Pan, Siyu Chen, Mike Zheng Shou, Yu Liu, Jing Shao, Hongsheng Li:
Actor-Context-Actor Relation Network for Spatio-Temporal Action Localization. 464-474
- Toby Perrett, Alessandro Masullo, Tilo Burghardt, Majid Mirmehdi, Dima Damen:
Temporal-Relational CrossTransformers for Few-Shot Action Recognition. 475-484
- Zhiwu Qing, Haisheng Su, Weihao Gan, Dongliang Wang, Wei Wu, Xiang Wang, Yu Qiao, Junjie Yan, Changxin Gao, Nong Sang:
Temporal Context Aggregation Network for Temporal Action Proposal Refinement. 485-494
- Zhi Hou, Baosheng Yu, Yu Qiao, Xiaojiang Peng, Dacheng Tao:
Affordance Transfer Learning for Human-Object Interaction Detection. 495-504
- Mathieu Serrurier, Franck Mamalet, Alberto González-Sanz, Thibaut Boissin, Jean-Michel Loubes, Eustasio del Barrio:
Achieving Robustness in Classification Using Optimal Transport With Hinge Regularization. 505-514
- Roi Pony, Itay Naeh, Shie Mannor:
Over-the-Air Adversarial Flickering Attacks Against Video Recognition Networks. 515-524
- Zhenguang Liu, Haoming Chen, Runyang Feng, Shuang Wu, Shouling Ji, Bailin Yang, Xun Wang:
Deep Dual Consecutive Network for Human Pose Estimation. 525-534
- Yang Hong, Juyong Zhang, Boyi Jiang, Yudong Guo, Ligang Liu, Hujun Bao:
StereoPIFu: Depth Aware Clothed Human Digitization via Stereo Vision. 535-545
- Jianfeng Zhang, Dongdong Yu, Jun Hao Liew, Xuecheng Nie, Jiashi Feng:
Body Meshes as Points. 546-556
- Qi Zhang, Wei Lin, Antoni B. Chan:
Cross-View Cross-Scene Multi-View Crowd Counting. 557-567
- Stefano d'Apolito, Danda Pani Paudel, Zhiwu Huang, Andrés Romero, Luc Van Gool:
GANmut: Learning Interpretable Conditional Space for Gamut of Emotions. 568-577
- Xingkun Xu, Yuge Huang, Pengcheng Shen, Shaoxin Li, Jilin Li, Feiyue Huang, Yong Li, Zhen Cui:
Consistent Instance False Positive Improves Fairness in Face Recognition. 578-586
- Yehansen Chen, Lin Wan, Zhihang Li, Qianyan Jing, Zongyuan Sun:
Neural Feature Search for RGB-Infrared Person Re-Identification. 587-597
- Anguo Zhang, Yueming Gao, Yuzhen Niu, Wenxi Liu, Yongcheng Zhou:
Coarse-To-Fine Person Re-Identification With Auxiliary-Domain Classification and Second-Order Information Bottleneck. 598-607
- Lin Wang, Yujeong Chae, Sung-Hoon Yoon, Tae-Kyun Kim, Kuk-Jin Yoon:
EvDistill: Asynchronous Events To End-Task Learning via Bidirectional Reconstruction-Guided Cross-Modal Knowledge Distillation. 608-619
- Shifeng Zhang, Chen Zhang, Ning Kang, Zhenguo Li:
iVPF: Numerical Invertible Volume Preserving Flow for Efficient Lossless Compression. 620-629
- Hongyi Zheng, Hongwei Yong, Lei Zhang:
Deep Convolutional Dictionary Learning for Image Denoising. 630-641
- Zongsheng Yue, Jianwen Xie, Qian Zhao, Deyu Meng:
Semi-Supervised Video Deraining With Dynamical Rain Generator. 642-652
- Jie Liang, Hui Zeng, Miaomiao Cui, Xuansong Xie, Lei Zhang:
PPR10K: A Large-Scale Portrait Photo Retouching Dataset With Human-Region Mask and Group-Level Consistency. 653-661
- Ruicheng Feng, Chongyi Li, Huaijin G. Chen, Shuai Li, Chen Change Loy, Jinwei Gu:
Removing Diffraction Image Artifacts in Under-Display Camera via Dynamic Skip Connection Network. 662-671
- Tao Yang, Peiran Ren, Xuansong Xie, Lei Zhang:
GAN Prior Embedded Network for Blind Face Restoration in the Wild. 672-681
- Yoshiki Fukao, Ryo Kawahara, Shohei Nobuhara, Ko Nishino:
Polarimetric Normal Stereo. 682-690
- Younghyun Jo, Seon Joo Kim:
Practical Single-Image Super-Resolution Using Look-Up Table. 691-700
- Bowen Liu, Yu Chen, Shiyu Liu, Hun-Seok Kim:
Deep Learning in Latent Space for Video Prediction and Compression. 701-710
- Peibei Cao, Zhangyang Wang, Kede Ma:
Debiased Subjective Assessment of Real-World Image Enhancement. 711-721
- Abhinanda R. Punnakkal, Arjun Chandrasekaran, Nikos Athanasiou, Alejandra Quiros-Ramirez, Michael J. Black:
BABEL: Bodies, Action and Behavior With English Labels. 722-731
- Dongyoon Han, Sangdoo Yun, Byeongho Heo, Youngjoon Yoo:
Rethinking Channel Dimensions for Efficient Model Design. 732-741
- Sangyun Oh, Hyeonuk Sim, Sugil Lee, Jongeun Lee:
Automated Log-Scale Quantization for Low-Cost Deep Neural Networks. 742-751
- Xinshao Wang, Yang Hua, Elyor Kodirov, David A. Clifton, Neil M. Robertson:
ProSelfLC: Progressive Self Label Correction for Training Robust Deep Neural Networks. 752-761
- Sharath Girish, Shishira R. Maiya, Kamal Gupta, Hao Chen, Larry S. Davis, Abhinav Shrivastava:
The Lottery Ticket Hypothesis for Object Recognition. 762-771
- Honggu Liu, Xiaodan Li, Wenbo Zhou, Yuefeng Chen, Yuan He, Hui Xue, Weiming Zhang, Nenghai Yu:
Spatial-Phase Shallow Learning: Rethinking Face Forgery Detection in Frequency Domain. 772-781
- Hila Chefer, Shir Gur, Lior Wolf:
Transformer Interpretability Beyond Attention Visualization. 782-791
- Aditya Golatkar, Alessandro Achille, Avinash Ravichandran, Marzia Polito, Stefano Soatto:
Mixed-Privacy Forgetting in Deep Networks. 792-801
- Seungmin Lee, Dongwan Kim, Bohyung Han:
CoSMo: Content-Style Modulation for Image Retrieval With Text Feedback. 802-812
- Furong Xu, Meng Wang, Wei Zhang, Yuan Cheng, Wei Chu:
Discrimination-Aware Mechanism for Fine-Grained Representation Learning. 813-822
- Gaurav Parmar, Dacheng Li, Kwonjoon Lee, Zhuowen Tu:
Dual Contradistinctive Generative Autoencoder. 823-832
- Han Zhang, Jing Yu Koh, Jason Baldridge, Honglak Lee, Yinfei Yang:
Cross-Modal Contrastive Learning for Text-to-Image Generation. 833-842
- Chia-Ni Lu, Ya-Chu Chang, Wei-Chen Chiu:
Bridging the Visual Gap: Wide-Range Image Blending. 843-851
- Hyunsu Kim, Yunjey Choi, Junho Kim, Sungjoo Yoo, Youngjung Uh:
Exploiting Spatial Dimensions of Latent in GAN for Real-Time Image Editing. 852-861
- Jie An, Siyu Huang, Yibing Song, Dejing Dou, Wei Liu, Jiebo Luo:
ArtFlow: Unbiased Image Style Transfer via Reversible Neural Flows. 862-871
- Haibo Chen, Lei Zhao, Zhizhong Wang, Huiming Zhang, Zhiwen Zuo, Ailin Li, Wei Xing, Dongming Lu:
DualAST: Dual Style-Learning Networks for Artistic Style Transfer. 872-881
- Oran Gafni, Oron Ashual, Lior Wolf:
Single-Shot Freestyle Dance Reenactment. 882-891
- Shuhan Tan, Kelvin Wong, Shenlong Wang, Sivabalan Manivasagam, Mengye Ren, Raquel Urtasun:
SceneGen: Learning To Generate Realistic Traffic Scenes. 892-901
- Xinzhu Bei, Yanchao Yang, Stefano Soatto:
Learning Semantic-Aware Dynamics for Video Prediction. 902-912
- Jie Hong, Pengfei Fang, Weihao Li, Tong Zhang, Christian Simon, Mehrtash Harandi, Lars Petersson:
Reinforced Attention for Few-Shot Learning and Beyond. 913-923
- Piotr Dollár, Mannat Singh, Ross B. Girshick:
Fast and Accurate Model Scaling. 924-932
- Elijah Cole, Oisin Mac Aodha, Titouan Lorieul, Pietro Perona, Dan Morris, Nebojsa Jojic:
Multi-Label Learning From Single Positive Labels. 933-942
- Peng Wang, Kai Han, Xiu-Shen Wei, Lei Zhang, Lei Wang:
Contrastive Learning Based Hybrid Networks for Long-Tailed Image Classification. 943-952
- Muhammad Ferjad Naeem, Yongqin Xian, Federico Tombari, Zeynep Akata:
Learning Graph Embeddings for Compositional Zero-Shot Learning. 953-962
- Heng Guo, Fumio Okura, Boxin Shi, Takuya Funatomi, Yasuhiro Mukaigawa, Yasuyuki Matsushita:
Multispectral Photometric Stereo for Spatially-Varying Spectral Reflectances: A Well Posed Problem? 963-971
- Zhijie Deng, Xiao Yang, Shizhen Xu, Hang Su, Jun Zhu:
LiBRe: A Practical Bayesian Approach to Adversarial Detection. 972-982
- Sian-Yao Huang, Wei-Ta Chu:
Searching by Generating: Flexible and Efficient One-Shot NAS With Architecture Generator. 983-992
- Naoya Takahashi, Yuki Mitsufuji:
Densely Connected Multi-Dilated Convolutional Networks for Dense Prediction Tasks. 993-1002
- Joy Hsu, Wah Chiu, Serena Yeung:
DARCNN: Domain Adaptive Region-Based Convolutional Neural Network for Unsupervised Instance Segmentation in Biomedical Images. 1003-1012
- Quande Liu, Cheng Chen, Jing Qin, Qi Dou, Pheng-Ann Heng:
FedDG: Federated Domain Generalization on Medical Image Segmentation via Episodic Learning in Continuous Frequency Space. 1013-1023
- Zikai Zhang, Bineng Zhong, Shengping Zhang, Zhenjun Tang, Xin Liu, Zhaoxiang Zhang:
Distractor-Aware Fast Tracking via Dynamic Convolutions and MOT Philosophy. 1024-1033
- Sangryul Jeon, Dongbo Min, Seungryong Kim, Kwanghoon Sohn:
Mining Better Samples for Contrastive Learning of Temporal Correspondence. 1034-1044
- Kunming Luo, Chuan Wang, Shuaicheng Liu, Haoqiang Fan, Jue Wang, Jian Sun:
UPFlow: Upsampling Pyramid for Unsupervised Optical Flow Learning. 1045-1054
- Chengyue Gong, Dilin Wang, Meng Li, Vikas Chandra, Qiang Liu:
KeepAugment: A Simple Information-Preserving Data Augmentation Approach. 1055-1064
- Shaobo Zhang, Wanqing Zhao, Ziyu Guan, Xianlin Peng, Jinye Peng:
Keypoint-Graph-Driven Learning Framework for Object Pose Estimation. 1065-1073
- Qianjiang Hu, Xiao Wang, Wei Hu, Guo-Jun Qi:
AdCo: Adversarial Contrast for Efficient Learning of Unsupervised Representations From Self-Trained Negative Adversaries. 1074-1083
- Yu Mitsuzumi, Go Irie, Daiki Ikami, Takashi Shibata:
Generalized Domain Adaptation. 1084-1093
- Jaemin Na, Heechul Jung, Hyung Jin Chang, Wonjun Hwang:
FixBi: Bridging Domain Spaces for Unsupervised Domain Adaptation. 1094-1103
- Bo Li, Yezhen Wang, Shanghang Zhang, Dongsheng Li, Kurt Keutzer, Trevor Darrell, Han Zhao:
Learning Invariant Representations and Risks for Semi-Supervised Domain Adaptation. 1104-1113
- Umberto Michieli, Pietro Zanuttigh:
Continual Semantic Segmentation via Repulsion-Attraction of Sparse and Disentangled Latent Representations. 1114-1124
- Ziyang Wu, Christina Baek, Chong You, Yi Ma:
Incremental Learning via Rate Reduction. 1125-1133
- Mouxing Yang, Yunfan Li, Zhenyu Huang, Zitao Liu, Peng Hu, Xi Peng:
Partially View-Aligned Representation Learning With Noise-Robust Contrastive Loss. 1134-1143
- Byungseok Roh, Wuhyun Shin, Ildoo Kim, Sungwoong Kim:
Spatially Consistent Representation Learning. 1144-1153
- Qihang Yu, Jianming Zhang, He Zhang, Yilin Wang, Zhe Lin, Ning Xu, Yutong Bai, Alan L. Yuille:
Mask Guided Matting via Progressive Refinement Network. 1154-1163
- Jamie Watson, Oisin Mac Aodha, Victor Prisacariu, Gabriel J. Brostow, Michael Firman:
The Temporal Opportunist: Self-Supervised Multi-Frame Monocular Depth. 1164-1174
- Jaedong Hwang, Seoung Wug Oh, Joon-Young Lee, Bohyung Han:
Exemplar-Based Open-Set Panoptic Segmentation Network. 1175-1184
- Dan Andrei Ganea, Bas Boom, Ronald Poppe:
Incremental Few-Shot Instance Segmentation. 1185-1194
- Jianpeng Zhang, Yutong Xie, Yong Xia, Chunhua Shen:
DoDNet: Learning To Segment Multi-Organ and Tumors From Multiple Partially Labeled Datasets. 1195-1204
- Xin Lai, Zhuotao Tian, Li Jiang, Shu Liu, Hengshuang Zhao, Liwei Wang, Jiaya Jia:
Semi-Supervised Semantic Segmentation With Directional Context-Aware Consistency. 1205-1214
- Yuang Liu, Wei Zhang, Jun Wang:
Source-Free Domain Adaptation for Semantic Segmentation. 1215-1224
- Lei Zhu, Qi She, Bin Zhang, Yanye Lu, Zhilin Lu, Duo Li, Jie Hu:
Learning the Superpixel in a Non-Iterative and Lifelong Manner. 1225-1234
- Xinyue Huo, Lingxi Xie, Jianzhong He, Zijie Yang, Wengang Zhou, Houqiang Li, Qi Tian:
ATSO: Asynchronous Teacher-Student Optimization for Semi-Supervised Image Segmentation. 1235-1244
- Bram Wallace, Ziyang Wu, Bharath Hariharan:
Can We Characterize Tasks Without Labels or Features? 1245-1254
- Daniel J. Trosten, Sigurd Løkse, Robert Jenssen, Michael Kampffmeyer:
Reconsidering Representation Alignment for Multi-View Clustering. 1255-1265
- Gengshan Yang, Deva Ramanan:
Learning To Segment Rigid Motions From Two Frames. 1266-1275
- Ziyuan Huang, Shiwei Zhang, Jianwen Jiang, Mingqian Tang, Rong Jin, Marcelo H. Ang:
Self-Supervised Motion Learning From Static Images. 1276-1285
- Haozhe Xie, Hongxun Yao, Shangchen Zhou, Shengping Zhang, Wenxiu Sun:
Efficient Regional Memory Network for Video Object Segmentation. 1286-1295
- Haochen Wang, Xiaolong Jiang, Haibing Ren, Yao Hu, Song Bai:
SwiftNet: Real-Time Video Object Segmentation. 1296-1305
- Jing Wang, Jinhui Tang, Mingkun Yang, Xiang Bai, Jiebo Luo:
Improving OCR-Based Image Captioning by Incorporating Geometrical Relationship. 1306-1315
- Hao Zhou, Wengang Zhou, Weizhen Qi, Junfu Pu, Houqiang Li:
Improving Sign Language Translation With Monolingual Data by Sign Back-Translation. 1316-1325
- Yu Wu, Yi Yang:
Exploring Heterogeneous Clues for Weakly-Supervised Audio-Visual Video Parsing. 1326-1335
- Jiyoung Lee, Soo-Whan Chung, Sunok Kim, Hong-Goo Kang, Kwanghoon Sohn:
Looking Into Your Speech: Learning Cross-Modal Affinity for Audio-Visual Speech Separation. 1336-1345
- Sijie Song, Xudong Lin, Jiaying Liu, Zongming Guo, Shih-Fu Chang:
Co-Grounding Networks With Semantic Attention for Referring Expression Comprehension in Videos. 1346-1355
- Yifeng Zhang, Ming Jiang, Qi Zhao:
Explicit Knowledge Incorporation for Visual Reasoning. 1356-1365
- Shuang Xu, Jiangshe Zhang, Zixiang Zhao, Kai Sun, Junmin Liu, Chunxia Zhang:
Deep Gradient Projection Networks for Pan-sharpening. 1366-1375
- Kailun Yang, Jiaming Zhang, Simon Reiß, Xinxin Hu, Rainer Stiefelhagen:
Capturing Omni-Range Context for Omnidirectional Segmentation. 1376-1386
- Pei Wang, Kabir Nagrecha, Nuno Vasconcelos:
Gradient-Based Algorithms for Machine Teaching. 1387-1396
- Brent A. Griffin, Jason J. Corso:
Depth From Camera Motion and Object Detection. 1397-1406
- Peng Sun, Wenhu Zhang, Huanyu Wang, Songyuan Li, Xi Li:
Deep RGB-D Saliency Detection With Depth-Sensitive Attention and Automatic Multi-Modal Fusion. 1407-1417
- Yuan-Ting Hu, Jiahong Wang, Raymond A. Yeh, Alexander G. Schwing:
SAIL-VOS 3D: A Synthetic Dataset and Baselines for Object Detection and 3D Mesh Reconstruction From Video Data. 1418-1428
- Zerong Zheng, Tao Yu, Qionghai Dai, Yebin Liu:
Deep Implicit Templates for 3D Shape Representation. 1429-1439
- Christoph Lassner, Michael Zollhöfer:
Pulsar: Efficient Sphere-Based Neural Rendering. 1440-1449
- Aljaz Bozic, Pablo R. Palafox, Michael Zollhöfer, Justus Thies, Angela Dai, Matthias Nießner:
Neural Deformation Graphs for Globally-Consistent Non-Rigid Reconstruction. 1450-1459
- Praveen Tirupattur, Kevin Duarte, Yogesh S. Rawat, Mubarak Shah:
Modeling Multi-Label Action Dependencies for Temporal Action Localization. 1460-1470
- Patrick Grady, Chengcheng Tang, Christopher D. Twigg, Minh Vo, Samarth Brahmbhatt, Charles C. Kemp:
ContactOpt: Optimizing Contact To Improve Grasps. 1471-1481
- Chen Li, Gim Hee Lee:
From Synthetic to Real: Unsupervised Domain Adaptation for Animal Pose Estimation. 1482-1491
- Xin Deng, Wenzhe Yang, Ren Yang, Mai Xu, Enpeng Liu, Qianhan Feng, Radu Timofte:
Deep Homography for Efficient Stereo Image Compression. 1492-1501
- Zhihao Hu, Guo Lu, Dong Xu:
FVC: A New Framework Towards Deep Video Compression in Feature Space. 1502-1511
- Yuang Liu, Wei Zhang, Jun Wang:
Zero-Shot Adversarial Quantization. 1512-1521
- Xudong Tian, Zhizhong Zhang, Shaohui Lin, Yanyun Qu, Yuan Xie, Lizhuang Ma:
Farewell to Mutual Information: Variational Distillation for Cross-Modal Person Re-Identification. 1522-1531
- Yujun Shen, Bolei Zhou:
Closed-Form Factorization of Latent Semantics in GANs. 1532-1540
- Moritz Kappel, Vladislav Golyanik, Mohamed Elgharib, Jann-Ole Henningson, Hans-Peter Seidel, Susana Castillo, Christian Theobalt, Marcus A. Magnor:
High-Fidelity Neural Human Motion Transfer From Monocular Video. 1541-1550
- Mark Collier, Basil Mustafa, Efi Kokiopoulou, Rodolphe Jenatton, Jesse Berent:
Correlated Input-Dependent Label Noise in Large-Scale Image Classification. 1551-1560
- Junfu Wang, Yunhong Wang, Zhen Yang, Liang Yang, Yuanfang Guo:
Bi-GCN: Binary Graph Convolutional Network. 1561-1570
- Ning Wang, Wengang Zhou, Jie Wang, Houqiang Li:
Transformer Meets Tracker: Exploiting Temporal Context for Robust Visual Tracking. 1571-1580
- Wei Chen, Xi Jia, Hyung Jin Chang, Jinming Duan, Linlin Shen, Ales Leonardis:
FS-Net: Fast Shape-Based Network for Category-Level 6D Object Pose Estimation With Decoupled Rotation Mechanism. 1581-1590
- Christian Simon, Piotr Koniusz, Mehrtash Harandi:
On Learning the Geodesic Path for Incremental Learning. 1591-1600
- Zhigang Dai, Bolun Cai, Yugeng Lin, Junying Chen:
UP-DETR: Unsupervised Pre-Training for Object Detection With Transformers. 1601-1610
- Johannes Kopf, Xuejian Rong, Jia-Bin Huang:
Robust Consistent Video Depth Estimation. 1611-1621
- Tianfei Zhou, Wenguan Wang, Si Liu, Yi Yang, Luc Van Gool:
Differentiable Multi-Granularity Human Representation Learning for Instance-Aware Human Semantic Parsing. 1622-1631
- Aleksandra Franz, Barbara Solenthaler, Nils Thuerey:
Global Transport for Fluid Reconstruction With Learned Self-Supervision. 1632-1642
- Yicong Hong, Qi Wu, Yuankai Qi, Cristian Rodriguez Opazo, Stephen Gould:
VLN BERT: A Recurrent Vision-and-Language BERT for Navigation. 1643-1653
- Yann Labbé, Justin Carpentier, Mathieu Aubry, Josef Sivic:
Single-View Robot Pose and Joint Angle Estimation via Render & Compare. 1654-1663
- Jiacheng Cheng, Nuno Vasconcelos:
Learning Deep Classifiers Consistent With Fine-Grained Novelty Detection. 1664-1673
- Noranart Vesdapunt, Baoyuan Wang:
CRFace: Confidence Ranker for Model-Agnostic Face Detection Refinement. 1674-1684
- Jingru Tan, Xin Lu, Gang Zhang, Changqing Yin, Quanquan Li:
Equalization Loss v2: A New Gradient Balance Approach for Long-Tailed Object Detection. 1685-1694
- Wei Feng, Fei Yin, Xu-Yao Zhang, Cheng-Lin Liu:
Semantic-Aware Video Text Detection. 1695-1705
- Mohamed Sayed, Gabriel J. Brostow:
Improved Handling of Motion Blur in Online Object Detection. 1706-1716
- Yuchen Ma, Songtao Liu, Zeming Li, Jian Sun:
IQDet: Instance-Wise Quality Distribution Sampling for Object Detection. 1717-1725
- Zhengzhe Liu, Xiaojuan Qi, Chi-Wing Fu:
One Thing One Click: A Self-Training Approach for Weakly Supervised 3D Semantic Segmentation. 1726-1736
- Filippos Kokkinos, Iasonas Kokkinos:
Learning Monocular 3D Reconstruction of Articulated Categories From Motion. 1737-1746
- Angela Dai, Yawar Siddiqui, Justus Thies, Julien Valentin, Matthias Nießner:
SPSG: Self-Supervised Photometric Scene Generation From RGB-D Scans. 1747-1756
- Shi Qiu, Saeed Anwar, Nick Barnes:
Semantic Segmentation for Real Point Cloud Scenes via Bilateral Augmentation and Adaptive Fusion. 1757-1767
- Junzhe Zhang, Xinyi Chen, Zhongang Cai, Liang Pan, Haiyu Zhao, Shuai Yi, Chai Kiat Yeo, Bo Dai, Chen Change Loy:
Unsupervised 3D Shape Completion Through GAN Inversion. 1768-1777
- Shengheng Deng, Xun Xu, Chaozheng Wu, Ke Chen, Kui Jia:
3D AffordanceNet: A Benchmark for Visual Object Affordance Understanding. 1778-1787
- Shi-Lin Liu, Hao-Xiang Guo, Hao Pan, Peng-Shuai Wang, Xin Tong, Yang Liu:
Deep Implicit Moving Least-Squares Functions for 3D Reconstruction. 1788-1797
- Stefan Stojanov, Anh Thai, James M. Rehg:
Using Shape To Categorize: Low-Shot Learning With an Explicit Shape Bias. 1798-1808
- Marcel Geppert, Viktor Larsson, Pablo Speciale, Johannes L. Schönberger, Marc Pollefeys:
Privacy Preserving Localization and Mapping From Uncalibrated Cameras. 1809-1819
- Feitong Tan, Danhang Tang, Mingsong Dou, Kaiwen Guo, Rohit Pandey, Cem Keskin, Ruofei Du, Deqing Sun, Sofien Bouaziz, Sean Ryan Fanello, Ping Tan, Yinda Zhang:
HumanGPS: Geodesic PreServing Feature for Dense Human Correspondences. 1820-1830
- Shitao Tang, Chengzhou Tang, Rui Huang, Siyu Zhu, Ping Tan:
Learning Camera Localization via Dense Scene Matching. 1831-1841
- Liu Liu, Hongdong Li, Haodong Yao, Ruyi Zha:
PluckerNet: Learn To Register 3D Line Reconstructions. 1842-1852
- Luca Magri, Filippo Leveni, Giacomo Boracchi:
MultiLink: Multi-Class Structure Recovery via Agglomerative Clustering and Model Selection. 1853-1862
- Zetong Yang, Yin Zhou, Zhifeng Chen, Jiquan Ngiam:
3D-MAN: 3D Multi-Frame Attention Network for Object Detection. 1863-1872
- Shichao Li, Zengqiang Yan, Hongyang Li, Kwang-Ting Cheng:
Exploring intermediate representation for monocular vehicle pose estimation. 1873-1883
- Chao-Yuan Wu, Philipp Krähenbühl:
Towards Long-Form Video Understanding. 1884-1894
- Limin Wang, Zhan Tong, Bin Ji, Gangshan Wu:
TDN: Temporal Difference Networks for Efficient Action Recognition. 1895-1904
- Xiang Wang, Shiwei Zhang, Zhiwu Qing, Yuanjie Shao, Changxin Gao, Nong Sang:
Self-Supervised Learning for Semi-Supervised Temporal Action Proposal. 1905-1914
- Mingfei Gao, Yingbo Zhou, Ran Xu, Richard Socher, Caiming Xiong:
WOAD: Weakly Supervised Online Action Detection in Untrimmed Videos. 1915-1923
- Xiaosen Wang, Kun He:
Enhancing the Transferability of Adversarial Attacks Through Variance Tuning. 1924-1933
- Yanru Xiao, Cong Wang:
You See What I Want You To See: Exploring Targeted Black-Box Transferability Attack for Hash-Based Image Retrieval Systems. 1934-1943
- Ke Li, Shijie Wang, Xiang Zhang, Yifan Xu, Weijian Xu, Zhuowen Tu:
Pose Recognition With Cascade Transformers. 1944-1953
- Kevin Lin, Lijuan Wang, Zicheng Liu:
End-to-End Human Pose and Mesh Reconstruction with Transformers. 1954-1963
- Hongsuk Choi, Gyeongsik Moon, Ju Yong Chang, Kyoung Mu Lee:
Beyond Static Features for Temporally Consistent 3D Human Pose and Shape From a Video. 1964-1973
- Jia Wan, Ziquan Liu, Antoni B. Chan:
A Generalized Loss Function for Crowd Counting and Localization. 1974-1983
- Rohit Saha, Brendan Duke, Florian Shkurti, Graham W. Taylor, Parham Aarabi:
LOHO: Latent Optimization of Hairstyles via Orthogonalization. 1984-1993
- Guoli Wang, Jiaqi Ma, Qian Zhang, Jiwen Lu, Jie Zhou:
Pseudo Facial Generation With Extreme Poses for Face Recognition. 1994-2003
- Hao Chen, Yaohui Wang, Benoit Lagadec, Antitza Dantcheva, François Brémond:
Joint Generative and Contrastive Learning for Unsupervised Person Re-Identification. 2004-2013
- Ruibing Hou, Hong Chang, Bingpeng Ma, Rui Huang, Shiguang Shan:
BiCnet-TKS: Learning Efficient Spatial-Temporal Representation for Video Person Re-Identification. 2014-2023
- Yunhao Zou, Yinqiang Zheng, Tsuyoshi Takatani, Ying Fu:
Learning To Reconstruct High Speed and High Dynamic Range Videos From Events. 2024-2033
- Junyong Lee, Hyeongseok Son, Jaesung Rim, Sunghyun Cho, Seungyong Lee:
Iterative Filter Adaptive Network for Single Image Defocus Deblurring. 2034-2042
- Tongyao Pang, Huan Zheng, Yuhui Quan, Hui Ji:
Recorrupted-to-Recorrupted: Unsupervised Deep Learning for Image Denoising. 2043-2052
- Yuntong Ye, Yi Chang, Hanyu Zhou, Luxin Yan:
Closing the Loop: Joint Rain Generation and Removal via Disentangled Image Translation. 2053-2062
- Zhihao Xia, Michaël Gharbi, Federico Perazzi, Kalyan Sunkavalli, Ayan Chakrabarti:
Deep Denoising of Flash and No-Flash Pairs for Photography in Low-Light Environments. 2063-2072
- Kinam Kwon, Eunhee Kang, Sangwon Lee, Su-Jin Lee, Hyong-Euk Lee, ByungIn Yoo, Jae-Joon Han:
Controllable Image Restoration for Under-Display Camera in Smartphones. 2073-2082
- Zhengjue Wang, Hao Zhang, Ziheng Cheng, Bo Chen, Xin Yuan:
MetaSCI: Scalable and Adaptive Reconstruction for Video Compressive Sensing. 2083-2092
- Zheng Hui, Jie Li, Xiumei Wang, Xinbo Gao:
Learning the Non-Differentiable Optimization for Blind Super-Resolution. 2093-2102
- Yuming Jiang, Kelvin C. K. Chan, Xintao Wang, Chen Change Loy, Ziwei Liu:
Robust Reference-Based Super-Resolution via C2-Matching. 2103-2112
- Zeyu Xiao, Xueyang Fu, Jie Huang, Zhen Cheng, Zhiwei Xiong:
Space-Time Distillation for Video Super-Resolution. 2113-2122
- Yan Bai, Jile Jiao, Ce Wang, Jun Liu, Yihang Lou, Xuetao Feng, Ling-Yu Duan:
Person30K: A Dual-Meta Generalization Network for Person Re-Identification. 2123-2132
- Steve Cruz, Will Hutchcroft, Yuguang Li, Naji Khosravan, Ivaylo Boyadzhiev, Sing Bing Kang:
Zillow Indoor Dataset: Annotated Floor Plans With 360deg Panoramas and 3D Room Layouts. 2133-2143 - Yawei Li

, Wen Li, Martin Danelljan, Kai Zhang
, Shuhang Gu, Luc Van Gool, Radu Timofte
:
The Heterogeneity Hypothesis: Finding Layer-Wise Differentiated Network Architectures. 2144-2153 - Jianyuan Guo

, Kai Han, Yunhe Wang, Han Wu, Xinghao Chen
, Chunjing Xu, Chang Xu
:
Distilling Object Detectors via Decoupled Features. 2154-2164 - Zhiqiang Shen, Zechun Liu, Jie Qin, Lei Huang, Kwang-Ting Cheng

, Marios Savvides:
S2-BNN: Bridging the Gap Between Self-Supervised Real and 1-Bit Neural Networks via Guided Distribution Calibration. 2165-2174 - Xiu Su, Shan You, Fei Wang, Chen Qian, Changshui Zhang, Chang Xu

:
BCNet: Searching for Network Width With Bilaterally Coupled Network. 2175-2184 - Hanqing Zhao, Wenbo Zhou, Dongdong Chen, Tianyi Wei, Weiming Zhang, Nenghai Yu:

Multi-Attentional Deepfake Detection. 2185-2194 - Yunhao Ge, Yao Xiao

, Zhi Xu, Meng Zheng, Srikrishna Karanam, Terrence Chen, Laurent Itti, Ziyan Wu:
A Peek Into the Reasoning of Neural Networks: Interpreting With Structural Visual Concepts. 2195-2204 - Jinyu Tian

, Jiantao Zhou, Jia Duan
:
Probabilistic Selective Encryption of Convolutional Neural Networks for Hierarchical Services. 2205-2214 - Yawen Zeng, Da Cao, Xiaochi Wei, Meng Liu, Zhou Zhao, Zheng Qin:

Multi-Modal Relational Graph for Cross-Modal Video Moment Retrieval. 2215-2224 - Jianan Zhao, Fengliang Qi, Guangyu Ren, Lin Xu:

PhD Learning: Learning With Pompeiu-Hausdorff Distances for Video-Based Vehicle Re-Identification. 2225-2235 - Linsen Song, Wayne Wu, Chaoyou Fu, Chen Qian, Chen Change Loy, Ran He:

Pareidolia Face Reenactment. 2236-2245 - Mengyao Zhai, Lei Chen, Greg Mori:

Hyper-LifelongGAN: Scalable Lifelong Learning for Image Conditioned Generation. 2246-2255 - Weihao Xia, Yujiu Yang, Jing-Hao Xue, Baoyuan Wu:
TediGAN: Text-Guided Diverse Face Image Generation and Manipulation. 2256-2265 - Yuqian Zhou, Connelly Barnes, Eli Shechtman, Sohrab Amirghodsi:

TransFill: Reference-Guided Image Inpainting by Merging Multiple Color and Spatial Transformations. 2266-2276 - Hao Su, Jianwei Niu, Xuefeng Liu, Qingfeng Li, Ji Wan, Mingliang Xu, Tao Ren:
ArtCoder: An End-to-End Method for Generating Scanning-Robust Stylized QR Codes. 2277-2286 - Elad Richardson, Yuval Alaluf, Or Patashnik, Yotam Nitzan, Yaniv Azar, Stav Shapiro, Daniel Cohen-Or:

Encoding in Style: A StyleGAN Encoder for Image-to-Image Translation. 2287-2296 - Zhichao Huang, Xintong Han, Jia Xu, Tong Zhang:

Few-Shot Human Motion Transfer by Personalized Geometry and Texture Modeling. 2297-2306 - Richard Strong Bowen, Huiwen Chang, Charles Herrmann, Piotr Teterwak, Ce Liu, Ramin Zabih:

OCONet: Image Extrapolation by Object Completion. 2307-2317 - Bohan Wu, Suraj Nair, Roberto Martín-Martín, Li Fei-Fei, Chelsea Finn:
Greedy Hierarchical Variational Autoencoders for Large-Scale Video Prediction. 2318-2328 - Shixiang Tang, Dapeng Chen, Lei Bai, Kaijian Liu, Yixiao Ge, Wanli Ouyang:
Mutual CRF-GNN for Few-Shot Learning. 2329-2339 - Sangdoo Yun, Seong Joon Oh, Byeongho Heo, Dongyoon Han, Junsuk Choe, Sanghyuk Chun:
Re-Labeling ImageNet: From Single to Multi-Labels, From Global to Localized Labels. 2340-2350 - Jean-Baptiste Cordonnier, Aravindh Mahendran, Alexey Dosovitskiy, Dirk Weissenborn, Jakob Uszkoreit, Thomas Unterthiner:
Differentiable Patch Selection for Image Recognition. 2351-2360 - Songyang Zhang, Zeming Li, Shipeng Yan, Xuming He, Jian Sun:
Distribution Alignment: A Unified Framework for Long-Tail Visual Recognition. 2361-2370 - Zongyan Han, Zhenyong Fu, Shuo Chen, Jian Yang:
Contrastive Embedding for Generalized Zero-Shot Learning. 2371-2381 - Xu Cao, Boxin Shi, Fumio Okura, Yasuyuki Matsushita:
Normal Integration via Inverse Plane Fitting With Minimum Point-to-Plane Distance. 2382-2391 - Yufei Cui, Ziquan Liu, Qiao Li, Antoni B. Chan, Chun Jason Xue:
Bayesian Nested Neural Networks for Uncertainty Calibration and Adaptive Compression. 2392-2401 - Tien-Ju Yang, Yi-Lun Liao, Vivienne Sze:

NetAdaptV2: Efficient Neural Architecture Search With Fast Super-Network Training and Architecture Optimization. 2402-2411 - Baptiste Angles, Yuhe Jin, Simon Kornblith, Andrea Tagliasacchi, Kwang Moo Yi:

MIST: Multiple Instance Spatial Transformer. 2412-2422 - Pengfei Guo, Puyang Wang, Jinyuan Zhou, Shanshan Jiang, Vishal M. Patel:

Multi-Institutional Collaborations for Improving Deep Learning-Based Magnetic Resonance Image Reconstruction Using Federated Learning. 2423-2432 - Zhanyu Wang, Luping Zhou, Lei Wang, Xiu Li:
A Self-Boosting Framework for Automated Radiographic Report Generation. 2433-2442 - Peng Dai, Renliang Weng, Wongun Choi, Changshui Zhang, Zhangping He, Wei Ding:

Learning a Proposal Classifier for Multiple Object Tracking. 2443-2452 - Linyu Zheng, Ming Tang, Yingying Chen, Guibo Zhu, Jinqiao Wang, Hanqing Lu:

Improving Multiple Object Tracking With Single Object Tracking. 2453-2462 - Cheng Chi, Qingjie Wang, Tianyu Hao, Peng Guo, Xin Yang:

Feature-Level Collaboration: Joint Unsupervised Learning of Optical Flow, Stereo Depth and Camera Motion. 2463-2473 - Chengyue Gong, Tongzheng Ren, Mao Ye, Qiang Liu:

MaxUp: Lightweight Adversarial Training With Data Augmentation Improves Neural Network Training. 2474-2483 - Luca Schmidtke, Athanasios Vlontzos, Simon Ellershaw, Anna Lukens, Tomoki Arichi, Bernhard Kainz:
Unsupervised Human Pose Estimation Through Transforming Shape Templates. 2484-2494 - Feng Wang, Huaping Liu:

Understanding the Behaviour of Contrastive Loss. 2495-2504 - Jichang Li, Guanbin Li, Yemin Shi, Yizhou Yu:

Cross-Domain Adaptive Clustering for Semi-Supervised Domain Adaptation. 2505-2514 - Qing Yu, Atsushi Hashimoto, Yoshitaka Ushiku:

Divergence Optimization for Noisy Universal Domain Adaptation. 2515-2524 - Collin Burns, Jacob Steinhardt:

Limitations of Post-Hoc Feature Alignment for Robustness. 2525-2533 - Ali Cheraghian, Shafin Rahman, Pengfei Fang, Soumava Kumar Roy, Lars Petersson, Mehrtash Harandi:
Semantic-Aware Knowledge Distillation for Few-Shot Class-Incremental Learning. 2534-2543 - Yaoyao Liu, Bernt Schiele, Qianru Sun:
Adaptive Aggregation Networks for Class-Incremental Learning. 2544-2553 - Fengmao Lv, Xiang Chen, Yanyong Huang, Lixin Duan, Guosheng Lin:

Progressive Modality Reinforcement for Human Multimodal Emotion Recognition From Unaligned Multimodal Sequences. 2554-2562 - Guangting Wang, Yizhou Zhou, Chong Luo, Wenxuan Xie, Wenjun Zeng, Zhiwei Xiong:
Unsupervised Visual Representation Learning by Tracking Patches in Video. 2563-2572 - Cheng Sun, Min Sun, Hwann-Tzong Chen:

HoHoNet: 360 Indoor Holistic Understanding With Latent Horizontal Features. 2573-2582 - Saif Muhammad Imran, Xiaoming Liu, Daniel Morris:
Depth Completion With Twin Surface Extrapolation at Occlusion Boundaries. 2583-2592 - Ye Zheng, Jiahong Wu, Yongqiang Qin, Faen Zhang, Li Cui:

Zero-Shot Instance Segmentation. 2593-2602 - Zhenzhen Weng, Mehmet Giray Ogut, Shai Limonchik, Serena Yeung:
Unsupervised Discovery of the Long-Tail in Instance Segmentation Using Hierarchical Self-Supervision. 2603-2612 - Xiaokang Chen, Yuhui Yuan, Gang Zeng, Jingdong Wang:

Semi-Supervised Semantic Segmentation With Cross Pseudo Supervision. 2613-2622 - Yazhou Yao, Tao Chen, Guo-Sen Xie, Chuanyi Zhang, Fumin Shen, Qi Wu, Zhenmin Tang, Jian Zhang:
Non-Salient Region Object Mining for Weakly Supervised Semantic Segmentation. 2623-2632 - Qiang Zhang, Shenlu Zhao, Yongjiang Luo, Dingwen Zhang, Nianchang Huang, Jungong Han:
ABMDRNet: Adaptive-Weighted Bi-Directional Modality Difference Reduction Network for RGB-T Semantic Segmentation. 2633-2642 - Jungbeom Lee, Jihun Yi, Chaehun Shin, Sungroh Yoon:

BBAM: Bounding Box Attribution Map for Weakly Supervised Semantic and Instance Segmentation. 2643-2652 - Jianyuan Guo, Kai Han, Han Wu, Chao Zhang, Xinghao Chen, Chunjing Xu, Chang Xu, Yunhe Wang:
Positive-Unlabeled Data Purification in the Wild for Object Detection. 2653-2662 - Yandong Li, Xuhui Jia, Ruoxin Sang, Yukun Zhu, Bradley Green, Liqiang Wang, Boqing Gong:

Ranking Neural Checkpoints. 2663-2673 - Colorado J. Reed, Sean Metzger, Aravind Srinivas, Trevor Darrell, Kurt Keutzer:

SelfAugment: Automatic Augmentation Policies for Self-Supervised Learning. 2674-2683 - Junhwa Hur, Stefan Roth:

Self-Supervised Multi-Frame Monocular Scene Flow. 2684-2694 - Amirhossein Habibian, Davide Abati, Taco S. Cohen, Babak Ehteshami Bejnordi:

Skip-Convolutions for Efficient Video Processing. 2695-2704 - Sanghyun Woo, Dahun Kim, Joon-Young Lee, In So Kweon:

Learning To Associate Every Segment for Video Panoptic Segmentation. 2705-2714 - Zhihao Chen, Liang Wan, Lei Zhu, Jia Shen, Huazhu Fu, Wennan Liu, Jing Qin:
Triple-Cooperative Video Shadow Detection. 2715-2724 - Mehrdad Hosseinzadeh, Yang Wang:

Image Change Captioning by Learning From an Auxiliary Task. 2725-2734 - Amanda Cardoso Duarte, Shruti Palaskar, Lucas Ventura, Deepti Ghadiyaram, Kenneth DeHaan, Florian Metze, Jordi Torres, Xavier Giró-i-Nieto:
How2Sign: A Large-Scale Multimodal Dataset for Continuous American Sign Language. 2735-2744 - Yapeng Tian, Di Hu, Chenliang Xu:

Cyclic Co-Learning of Sounding Object Visual Grounding and Sound Separation. 2745-2754 - Avisek Lahiri, Vivek Kwatra, Christian Früh, John Lewis, Chris Bregler:

LipSync3D: Data-Efficient Learning of Personalized 3D Talking Faces From Video Using Pose and Lighting Normalization. 2755-2764 - Guoshun Nan, Rui Qiao, Yao Xiao, Jun Liu, Sicong Leng, Hao Zhang, Wei Lu:
Interventional Video Grounding With Dual Contrastive Learning. 2765-2775 - Corentin Kervadec, Grigory Antipov, Moez Baccouche, Christian Wolf:

Roses Are Red, Violets Are Blue... but Should VQA Expect Them To? 2776-2785 - Jiaming Han, Jian Ding, Nan Xue, Gui-Song Xia:
ReDet: A Rotation-Equivariant Detector for Aerial Object Detection. 2786-2795 - Yiming Qian, Hao Zhang, Yasutaka Furukawa:

Roof-GAN: Learning To Generate Roof Geometry and Relations for Residential Houses. 2796-2805 - Tal Reiss, Niv Cohen, Liron Bergman, Yedid Hoshen:

PANDA: Adapting Pretrained Features for Anomaly Detection and Segmentation. 2806-2814 - Péter Karkus, Shaojun Cai, David Hsu:

Differentiable SLAM-Net: Learning Particle SLAM for Visual Navigation. 2815-2825 - Yanchao Yang, Brian Lai, Stefano Soatto:

DyStaB: Unsupervised Object Segmentation via Dynamic-Static Bootstrapping. 2826-2836 - Shitong Luo, Wei Hu:

Diffusion Probabilistic Models for 3D Point Cloud Generation. 2837-2845 - Matthew Tancik, Ben Mildenhall, Terrance Wang, Divi Schmidt, Pratul P. Srinivasan, Jonathan T. Barron, Ren Ng:

Learned Initializations for Optimizing Coordinate-Based Neural Representations. 2846-2855 - Julian Ost, Fahim Mannan, Nils Thuerey, Julian Knodt, Felix Heide:
Neural Scene Graphs for Dynamic Scenes. 2856-2865 - Ruwan B. Tennakoon, David Suter, Erchuan Zhang, Tat-Jun Chin, Alireza Bab-Hadiashar:
Consensus Maximisation Using Influences of Monotone Boolean Functions. 2866-2875 - Jennifer J. Sun, Ann Kennedy, Eric Zhan, David J. Anderson, Yisong Yue, Pietro Perona:

Task Programming: Learning Data Efficient Behavior Representations. 2876-2885 - Shunsuke Saito, Jinlong Yang, Qianli Ma, Michael J. Black:

SCANimate: Weakly Supervised Learning of Skinned Clothed Avatar Networks. 2886-2897 - Yulin Li, Jianfeng He, Tianzhu Zhang, Xiang Liu, Yongdong Zhang, Feng Wu:

Diverse Part Discovery: Occluded Person Re-Identification With Part-Aware Transformer. 2898-2907 - Yuval Bahat, Tomer Michaeli:

What's in the Image? Explorable Decoding of Compressed Images. 2908-2917 - Golnaz Ghiasi, Yin Cui, Aravind Srinivas, Rui Qian, Tsung-Yi Lin, Ekin D. Cubuk, Quoc V. Le, Barret Zoph:

Simple Copy-Paste Is a Strong Data Augmentation Method for Instance Segmentation. 2918-2928 - Xiangyu Zhu, Hao Wang, Hongyan Fei, Zhen Lei, Stan Z. Li:
Face Forgery Detection by 3D Decomposition. 2929-2939 - Juhong Min, Minsu Cho:

Convolutional Hough Matching Networks. 2940-2950 - Guoxing Yang, Nanyi Fei, Mingyu Ding, Guangzhen Liu, Zhiwu Lu, Tao Xiang:

L2M-GAN: Learning To Manipulate Latent Space Semantics for Facial Attribute Editing. 2951-2960 - Zilong Zheng, Jianwen Xie, Ping Li:

Patchwise Generative ConvNet: Training Energy-Based Models From a Single Natural Image for Internal Learning. 2961-2970 - Radek Mackowiak, Lynton Ardizzone, Ullrich Köthe, Carsten Rother:

Generative Classifiers as a Basis for Trustworthy Image Classification. 2971-2981 - Mingyu Ding, Xiaochen Lian, Linjie Yang, Peng Wang, Xiaojie Jin, Zhiwu Lu, Ping Luo:

HR-NAS: Searching Efficient High-Resolution Neural Architectures With Lightweight Transformers. 2982-2992 - Qiangqiang Wu, Jia Wan, Antoni B. Chan:
Progressive Unsupervised Learning for Visual Object Tracking. 2993-3002 - Yisheng He, Haibin Huang, Haoqiang Fan, Qifeng Chen, Jian Sun:

FFB6D: A Full Flow Bidirectional Fusion Network for 6D Pose Estimation. 3003-3013 - Shipeng Yan, Jiangwei Xie, Xuming He:

DER: Dynamically Expandable Representation for Class Incremental Learning. 3014-3023 - Xinlong Wang, Rufeng Zhang, Chunhua Shen, Tao Kong, Lei Li:

Dense Contrastive Learning for Self-Supervised Visual Pre-Training. 3024-3033 - Xiaotian Chen, Yuwang Wang, Xuejin Chen, Wenjun Zeng:
S2R-DepthNet: Learning a Generalizable Depth-Specific Structural Representation. 3034-3043 - Haiyang Mei, Bo Dong, Wen Dong, Pieter Peers, Xin Yang, Qiang Zhang, Xiaopeng Wei:
Depth-Aware Mirror Segmentation. 3044-3053 - Sangmin Lee, Hak Gu Kim, Dae Hwi Choi, Hyung-Il Kim, Yong Man Ro:
Video Prediction Recalling Long-Term Motion Context via Memory Alignment Learning. 3054-3063 - Chen Gao, Jinyu Chen, Si Liu, Luting Wang, Qiong Zhang, Qi Wu:
Room-and-Object Aware Knowledge Reasoning for Remote Embodied Referring Expression. 3064-3073 - Cheol-Hui Min, Jinseok Bae, Junho Lee, Young Min Kim:

GATSBI: Generative Agent-Centric Spatio-Temporal Object Interaction. 3074-3083 - Peleg Harel, Ohad Ben-Shahar:

Crossing Cuts Polygonal Puzzles: Models and Solvers. 3084-3093 - Aoxue Li, Zhenguo Li:

Transformation Invariant Few-Shot Object Detection. 3094-3102 - Tong Wang, Yousong Zhu, Chaoyang Zhao, Wei Zeng, Jinqiao Wang, Ming Tang:
Adaptive Class Suppression Loss for Long-Tail Object Detection. 3103-3112 - Jeonghun Baek, Yusuke Matsui, Kiyoharu Aizawa:
What if We Only Use Real Datasets for Scene Text Recognition? Toward Scene Text Recognition With Fewer Labels. 3113-3122 - Yiqin Zhu, Jianyong Chen, Lingyu Liang, Zhanghui Kuang, Lianwen Jin, Wayne Zhang:
Fourier Contour Embedding for Arbitrary-Shaped Text Detection. 3123-3131 - Yihe Tang, Weifeng Chen, Yijun Luo, Yuting Zhang:

Humble Teachers Teach Better Students for Semi-Supervised Object Detection. 3132-3141 - Longlong Jing, Elahe Vahdani, Jiaxing Tan, Yingli Tian:
Cross-Modal Center Loss for 3D Cross-Modal Retrieval. 3142-3151 - Shuo Yang, Min Xu, Haozhe Xie, Stuart W. Perry, Jiahao Xia:
Single-View 3D Object Reconstruction From Shape Priors in Memory. 3152-3161 - Silvan Weder, Johannes L. Schönberger, Marc Pollefeys, Martin R. Oswald:
NeuralFusion: Online Depth Fusion in Latent Space. 3162-3172 - Mutian Xu, Runyu Ding, Hengshuang Zhao, Xiaojuan Qi:

PAConv: Position Adaptive Convolution With Dynamic Kernel Assembling on Point Clouds. 3173-3182 - Chenxu Luo, Xiaodong Yang, Alan L. Yuille:

Self-Supervised Pillar Motion Learning for Autonomous Driving. 3183-3192 - Dave Zhenyu Chen, Ali Gholami, Matthias Nießner, Angel X. Chang:

Scan2Cap: Context-Aware Dense Captioning in RGB-D Scans. 3193-3203 - Despoina Paschalidou, Angelos Katharopoulos, Andreas Geiger, Sanja Fidler:
Neural Parts: Learning Expressive 3D Shape Abstractions With Invertible Neural Networks. 3204-3215 - Arianna Rampini, Franco Pestarini, Luca Cosmo, Simone Melzi, Emanuele Rodolà:
Universal Spectral Adversarial Attacks for Deformable Shapes. 3216-3226 - Donghwan Lee, Soo-Hyun Ryu, Suyong Yeon, Yonghan Lee, Deokhwa Kim, Cheolho Han, Yohann Cabon, Philippe Weinzaepfel, Nicolas Guérin, Gabriela Csurka, Martin Humenberger:

Large-Scale Localization Datasets in Crowded Indoor Spaces. 3227-3236 - Yuan Liu, Lingjie Liu, Cheng Lin, Zhen Dong, Wenping Wang:

Learnable Motion Coherence for Correspondence Pruning. 3237-3246 - Paul-Edouard Sarlin, Ajaykumar Unagar, Måns Larsson, Hugo Germain, Carl Toft, Viktor Larsson, Marc Pollefeys, Vincent Lepetit, Lars Hammarstrand, Fredrik Kahl, Torsten Sattler:
Back to the Feature: Learning Robust Camera Localization From Pixels To Pose. 3247-3257 - Kefan Chen, Noah Snavely, Ameesh Makadia:

Wide-Baseline Relative Camera Pose Estimation With Directional Learning. 3258-3268 - Mingyue Yang, Yuxin Wen, Weikai Chen, Yongwei Chen, Kui Jia:
Deep Optimized Priors for 3D Shape Modeling and Reconstruction. 3269-3278 - Zhenwei Miao, Jikai Chen, Hongyu Pan, Ruiwen Zhang, Kaixuan Liu, Peihan Hao, Jun Zhu, Yang Wang, Xin Zhan:

PVGNet: A Bottom-Up One-Stage 3D Object Detector With Integrated Multi-Level Features. 3279-3288 - Yunpeng Zhang, Jiwen Lu, Jie Zhou:
Objects Are Different: Flexible Monocular 3D Object Detection. 3289-3298 - Christoph Feichtenhofer, Haoqi Fan, Bo Xiong, Ross B. Girshick, Kaiming He:

A Large-Scale Study on Unsupervised Spatiotemporal Representation Learning. 3299-3309 - Dong Li, Zhaofan Qiu, Yingwei Pan, Ting Yao, Houqiang Li, Tao Mei:
Representing Videos As Discriminative Sub-Graphs for Action Recognition. 3310-3319 - Chuming Lin, Chengming Xu, Donghao Luo, Yabiao Wang, Ying Tai, Chengjie Wang, Jilin Li, Feiyue Huang, Yanwei Fu:
Learning Salient Boundary Feature for Anchor-free Temporal Action Localization. 3320-3329 - Xiaodan Li, Jinfeng Li, Yuefeng Chen, Shaokai Ye, Yuan He, Shuhui Wang, Hang Su, Hui Xue:

QAIR: Practical Query-Efficient Black-Box Attacks for Image Retrieval. 3330-3339 - Karren Yang, Wan-Yi Lin, Manash Barman, Filipe Condessa, J. Zico Kolter:

Defending Multimodal Fusion Models Against Single-Source Adversaries. 3340-3349 - Chengchao Shen, Youtan Yin, Xinchao Wang, Xubin Li, Jie Song, Mingli Song:
Training Generative Adversarial Networks in One Stage. 3350-3360 - Mallikarjun B. R., Ayush Tewari, Hans-Peter Seidel, Mohamed Elgharib, Christian Theobalt:
Learning Complete 3D Morphable Face Models From Images and Videos. 3361-3371 - Yan Zhang, Michael J. Black, Siyu Tang:
We Are More Than Our Joints: Predicting How 3D Bodies Move. 3372-3382 - Jiefeng Li, Chao Xu, Zhicun Chen, Siyuan Bian, Lixin Yang, Cewu Lu:
HybrIK: A Hybrid Analytical-Neural Inverse Kinematics Solution for 3D Human Pose and Shape Estimation. 3383-3393 - Viresh Ranjan, Udbhav Sharma, Thu Nguyen, Minh Hoai:

Learning To Count Everything. 3394-3403 - Gege Gao, Huaibo Huang, Chaoyou Fu, Zhaoyang Li, Ran He:

Information Bottleneck Disentanglement for Identity Swapping. 3404-3413 - Sixue Gong, Xiaoming Liu, Anil K. Jain:

Mitigating Face Recognition Bias via Group Adaptive Classifier. 3414-3424 - Seokeon Choi, Taekyung Kim, Minki Jeong, Hyoungseob Park, Changick Kim:
Meta Batch-Instance Normalization for Generalizable Person Re-Identification. 3425-3435 - Xiao Zhang, Yixiao Ge, Yu Qiao, Hongsheng Li:
Refining Pseudo Labels With Clustering Consensus Over Generations for Unsupervised Object Re-Identification. 3436-3445 - Federico Paredes-Vallés, Guido C. H. E. de Croon:

Back to Event Basics: Self-Supervised Learning of Image Reconstruction for Event Cameras via Photometric Constancy. 3446-3455 - Denys Rozumnyi, Martin R. Oswald, Vittorio Ferrari, Jiri Matas, Marc Pollefeys:
DeFMO: Deblurring and Shape Recovery of Fast Moving Objects. 3456-3465 - Matteo Maggioni, Yibin Huang, Cheng Li, Shuai Xiao, Zhongqian Fu, Fenglong Song:

Efficient Multi-Stage Video Denoising With Recurrent Spatio-Temporal Fusion. 3466-3475 - Zheng Shi, Ethan Tseng, Mario Bijelic, Werner Ritter, Felix Heide:

ZeroScatter: Domain Transfer for Long Distance Imaging and Vision Through Scattering Media. 3476-3486 - Mohit Lamba, Kaushik Mitra:
Restoring Extremely Dark Images in Real Time. 3487-3497 - Jing Tan, Shan Zhao, Pengfei Xiong, Jiangyu Liu, Haoqiang Fan, Shuaicheng Liu:
Practical Wide-Angle Portraits Correction With Deep Structured Models. 3498-3506 - Wenzhu Xing, Karen O. Egiazarian:
End-to-End Learning for Joint Image Demosaicing, Denoising and Super-Resolution. 3507-3516 - Yiqun Mei, Yuchen Fan, Yuqian Zhou:

Image Super-Resolution With Non-Local Sparse Attention. 3517-3526 - Yan-Cheng Huang, Yi-Hsin Chen, Cheng-You Lu, Hui-Po Wang, Wen-Hsiao Peng, Ching-Chun Huang:
Video Rescaling Networks With Joint Optimization Strategies for Downscaling and Upscaling. 3527-3536 - Seunghwan Lee, Donghyeon Cho, Jiwon Kim, Tae Hyun Kim:

Restore From Restored: Video Restoration With Pseudo Clean Video. 3537-3546 - Brett D. Roads, Bradley C. Love:

Enriching ImageNet With Human Similarity Judgments and Psychological Embeddings. 3547-3557 - Soravit Changpinyo, Piyush Sharma, Nan Ding, Radu Soricut:

Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual Concepts. 3558-3568 - Le Yang, Haojun Jiang, Ruojin Cai, Yulin Wang, Shiji Song, Gao Huang, Qi Tian:
CondenseNet V2: Sparse Feature Reactivation for Deep Networks. 3569-3578 - Zhen Huang, Xu Shen, Jun Xing, Tongliang Liu, Xinmei Tian, Houqiang Li, Bing Deng, Jianqiang Huang, Xian-Sheng Hua:
Revisiting Knowledge Distillation: An Inheritance and Exploration Framework. 3579-3588 - Chong Yu:

Minimally Invasive Surgery for Sparse Neural Networks in Contrastive Manner. 3589-3598 - Xiao Zhou, Weizhong Zhang, Hang Xu, Tong Zhang:

Effective Sparsification of Neural Networks With Global Sparsity Constraint. 3599-3608 - Zekun Sun, Yujie Han, Zeyu Hua, Na Ruan, Weijia Jia:
Improving the Efficiency and Robustness of Deepfakes Detection Through Precise Geometric Features. 3609-3618 - Wolfgang Stammer, Patrick Schramowski, Kristian Kersting:

Right for the Right Concept: Revising Neuro-Symbolic Concepts by Interacting With Their Explanations. 3619-3629 - Ding Sheng Ong, Chee Seng Chan, Kam Woh Ng, Lixin Fan, Qiang Yang:

Protecting Intellectual Property of Generative Adversarial Networks From Ambiguity Attacks. 3630-3639 - Sijie Zhu, Taojiannan Yang, Chen Chen:

VIGOR: Cross-View Image Geo-Localization Beyond One-to-One Retrieval. 3640-3649 - Michael Wray, Hazel Doughty, Dima Damen:
On Semantic Similarity in Video Retrieval. 3650-3660 - Zhimeng Zhang, Lincheng Li, Yu Ding, Changjie Fan:

Flow-Guided One-Shot Talking Face Generation With a High-Resolution Audio-Visual Dataset. 3661-3670 - Anton Cherepkov, Andrey Voynov, Artem Babenko:

Navigating the GAN Parameter Space for Semantic Image Editing. 3671-3680 - Pei Wang, Yijun Li, Krishna Kumar Singh, Jingwan Lu, Nuno Vasconcelos:
IMAGINE: Image Synthesis by Image-Guided Model Inversion. 3681-3690 - Qiang Zhou, Shiyin Wang, Yitong Wang, Zilong Huang, Xinggang Wang:
Human De-Occlusion: Invisible Perception and Recovery for Humans. 3691-3701 - Xiao-Chang Liu, Yong-Liang Yang, Peter Hall:
Learning To Warp for Style Transfer. 3702-3711 - Moustafa Meshry, Yixuan Ren, Larry S. Davis, Abhinav Shrivastava:
StEP: Style-Based Encoder Pre-Training for Multi-Modal Image Synthesis. 3712-3721 - Amit Raj, Julian Tanke, James Hays, Minh Vo, Carsten Stoll, Christoph Lassner:
ANR: Articulated Neural Rendering for Virtual Avatars. 3722-3731 - Cheng-Fu Yang, Wan-Cyuan Fan, Fu-En Yang, Yu-Chiang Frank Wang:

LayoutTransformer: Scene Layout Generation With Conceptual and Spatial Diversity. 3732-3741 - Michael Dorkenwald, Timo Milbich, Andreas Blattmann, Robin Rombach, Konstantinos G. Derpanis, Björn Ommer:

Stochastic Image-to-Video Synthesis Using cINNs. 3742-3753 - Baoquan Zhang, Xutao Li, Yunming Ye, Zhichao Huang, Lisai Zhang:

Prototype Completion With Primitive Knowledge for Few-Shot Learning. 3754-3762 - Bi Li, Teng Xi, Gang Zhang, Haocheng Feng, Junyu Han, Jingtuo Liu, Errui Ding, Wenyu Liu:

Dynamic Class Queue for Large Scale Face Recognition in the Wild. 3763-3772 - Anadi Chaman, Ivan Dokmanic:
Truly Shift-Invariant Convolutional Neural Networks. 3773-3783 - Jianfeng Wang, Thomas Lukasiewicz, Xiaolin Hu, Jianfei Cai, Zhenghua Xu:

RSG: A Simple but Effective Module for Learning Imbalanced Datasets. 3784-3793 - Yang Liu, Lei Zhou, Xiao Bai, Yifei Huang, Lin Gu, Jun Zhou, Tatsuya Harada:
Goal-Oriented Gaze Estimation for Zero-Shot Learning. 3794-3803 - Berk Kaya, Suryansh Kumar, Carlos E. P. de Oliveira, Vittorio Ferrari, Luc Van Gool:
Uncalibrated Neural Inverse Rendering for Photometric Stereo of General Surfaces. 3804-3814 - Jiaru Zhang, Yang Hua, Zhengui Xue, Tao Song, Chengyu Zheng, Ruhui Ma, Haibing Guan:
Robust Bayesian Neural Networks by Spectral Expectation Bound Regularization. 3815-3824 - Yunyang Xiong, Hanxiao Liu, Suyog Gupta, Berkin Akin, Gabriel Bender, Yongzhe Wang, Pieter-Jan Kindermans, Mingxing Tan, Vikas Singh, Bo Chen:

MobileDets: Searching for Object Detection Architectures for Mobile Accelerators. 3825-3834 - Qian Li, Zhichao Wang, Gang Li, Jun Pang, Guandong Xu:
Hilbert Sinkhorn Divergence for Optimal Transport. 3835-3844 - Hamad Ahmed, Ronnie B. Wilbur, Hari M. Bharadwaj, Jeffrey Mark Siskind:
Object Classification From Randomized EEG Trials. 3845-3854 - Yuxing Tang, Zhenjie Cao, Yanbo Zhang, Zhicheng Yang, Zongcheng Ji, Yiwei Wang, Mei Han, Jie Ma, Jing Xiao, Peng Chang:

Leveraging Large-Scale Weakly Labeled Data for Semi-Supervised Mass Detection in Mammograms. 3855-3864 - Ramana Sundararaman, Cedric De Almeida Braga, Éric Marchand, Julien Pettré:

Tracking Pedestrian Heads in Dense Crowd. 3865-3875 - Qiang Wang, Yun Zheng, Pan Pan, Yinghui Xu:

Multiple Object Tracking With Correlation Learning. 3876-3886 - Austin Stone, Daniel Maurer, Alper Ayvaci, Anelia Angelova, Rico Jonschkowski:

SMURF: Self-Teaching Multi-Frame Unsupervised RAFT With Full-Image Warping. 3887-3896 - Marcus Valtonen Örnhag, José Pedro Iglesias, Carl Olsson:
Bilinear Parameterization for Non-Separable Singular Value Penalties. 3897-3906 - Zongxin Yang, Xin Yu, Yi Yang:

DSC-PoseNet: Learning 6DoF Object Pose Estimation via Dual-Scale Consistency. 3907-3916 - Yutong Zheng, Yu-Kai Huang, Ran Tao, Zhiqiang Shen, Marios Savvides:

Unsupervised Disentanglement of Linear-Encoded Facial Semantics. 3917-3926 - Xiaoqing Guo, Chen Yang, Baopu Li, Yixuan Yuan:
MetaCorrection: Domain-Aware Meta Loss Correction for Unsupervised Domain Adaptation in Semantic Segmentation. 3927-3936 - Zhekai Du, Jingjing Li, Hongzu Su, Lei Zhu, Ke Lu:
Cross-Domain Gradient Discrepancy Minimization for Unsupervised Domain Adaptation. 3937-3946 - Chengzhi Mao, Augustine Cha, Amogh Gupta, Hao Wang, Junfeng Yang, Carl Vondrick:

Generative Interventions for Causal Learning. 3947-3956 - Xinting Hu, Kaihua Tang, Chunyan Miao, Xian-Sheng Hua, Hanwang Zhang:
Distilling Causal Effect of Data in Class-Incremental Learning. 3957-3966 - Sungyeon Kim, Dongwon Kim, Minsu Cho, Suha Kwak:

Embedding Transfer With Label Relaxation for Improved Metric Learning. 3967-3976 - Minheng Ni, Haoyang Huang, Lin Su, Edward Cui, Taroon Bharti, Lijuan Wang, Dongdong Zhang, Nan Duan:
M3P: Learning Universal Representations via Multitask Multilingual Multimodal Pre-Training. 3977-3986 - Ceyuan Yang, Zhirong Wu, Bolei Zhou, Stephen Lin:

Instance Localization for Self-Supervised Detection Pretraining. 3987-3996 - Siyuan Qiao, Yukun Zhu, Hartwig Adam, Alan L. Yuille, Liang-Chieh Chen:

VIP-DeepLab: Learning Visual Perception With Depth-Aware Video Panoptic Segmentation. 3997-4008 - Shariq Farooq Bhat, Ibraheem Alhashim, Peter Wonka:
AdaBins: Depth Estimation Using Adaptive Bins. 4009-4018 - Lei Ke, Yu-Wing Tai, Chi-Keung Tang:
Deep Occlusion-Aware Instance Segmentation With Overlapping BiLayers. 4019-4028 - Pedro Savarese, Sunnie S. Y. Kim, Michael Maire, Greg Shakhnarovich, David McAllester:

Information-Theoretic Segmentation by Inpainting Error Maximization. 4029-4039 - Arthur Douillard, Yifu Chen, Arnaud Dapogny, Matthieu Cord:

PLOP: Learning Without Forgetting for Continual Semantic Segmentation. 4040-4050 - Haoyu Ma, Xiangru Lin, Zifeng Wu, Yizhou Yu:

Coarse-To-Fine Domain Adaptive Semantic Segmentation With Photometric Alignment and Category-Center Regularization. 4051-4060 - Yuval Nirkin, Lior Wolf, Tal Hassner:

HyperSeg: Patch-Wise Hypernetwork for Real-Time Semantic Segmentation. 4061-4070 - Jungbeom Lee, Eunji Kim, Sungroh Yoon:

Anti-Adversarially Manipulated Attributions for Weakly and Semi-Supervised Semantic Segmentation. 4071-4080 - Qiang Zhou, Chaohui Yu, Zhibin Wang, Qi Qian, Hao Li:

Instant-Teaching: An End-to-End Semi-Supervised Object Detection Framework. 4081-4090 - Jinhong Deng, Wen Li, Yuhua Chen, Lixin Duan:

Unbiased Mean Teacher for Cross-Domain Object Detection. 4091-4101 - Jennifer Jang, Heinrich Jiang:

MeanShift++: Extremely Fast Mode-Seeking With Applications to Segmentation and Object Tracking. 4102-4113 - Yair Kittenplon, Yonina C. Eldar, Dan Raviv:

FlowStep3D: Model Unrolling for Self-Supervised Scene Flow Estimation. 4114-4123 - A. J. Piergiovanni, Michael S. Ryoo:

Recognizing Actions in Videos From Unseen Viewpoints. 4124-4132 - Jiaxu Miao, Yunchao Wei, Yu Wu, Chen Liang, Guangrui Li, Yi Yang:
VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild. 4133-4143 - Li Hu, Peng Zhang, Bang Zhang, Pan Pan, Yinghui Xu, Rong Jin:

Learning Position and Target Consistency for Memory-Based Video Object Segmentation. 4144-4154 - Mingyang Zhou, Luowei Zhou, Shuohang Wang, Yu Cheng, Linjie Li, Zhou Yu, Jingjing Liu:

UC2: Universal Cross-Lingual Cross-Modal Vision-and-Language Pre-Training. 4155-4165 - Bowen Shi, Diane Brentari, Greg Shakhnarovich, Karen Livescu:
Fingerspelling Detection in American Sign Language. 4166-4175 - Hang Zhou, Yasheng Sun, Wayne Wu, Chen Change Loy, Xiaogang Wang, Ziwei Liu:

Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation. 4176-4186 - Tianrui Hui, Shaofei Huang, Si Liu, Zihan Ding, Guanbin Li, Wenguan Wang, Jizhong Han, Fei Wang:
Collaborative Spatial-Temporal Modeling for Language-Queried Video Actor Segmentation. 4187-4196 - Yang Zhao, Zhou Zhao, Zhu Zhang, Zhijie Lin:

Cascaded Prediction Network via Segment Tree for Temporal Video Grounding. 4197-4206 - Corentin Kervadec, Theo Jaunet, Grigory Antipov, Moez Baccouche, Romain Vuillemot, Christian Wolf:

How Transferable Are Reasoning Patterns in VQA? 4207-4216 - Xiangtai Li, Hao He, Xia Li, Duo Li, Guangliang Cheng, Jianping Shi, Lubin Weng, Yunhai Tong, Zhouchen Lin:
PointFlow: Flowing Semantics Through Points for Aerial Image Segmentation. 4217-4226 - Lu Mi, Hang Zhao, Charlie Nash, Xiaohan Jin, Jiyang Gao, Chen Sun, Cordelia Schmid, Nir Shavit, Yuning Chai, Dragomir Anguelov:

HDMapGen: A Hierarchical Graph Generative Model of High Definition Maps. 4227-4236 - Jingyuan Yang, Jie Li, Leida Li, Xiumei Wang, Xinbo Gao:
A Circular-Structured Representation for Visual Emotion Distribution Learning. 4237-4246 - Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Aneeshan Sain, Yongxin Yang, Tao Xiang, Yi-Zhe Song:
More Photos Are All You Need: Semi-Supervised Learning for Fine-Grained Sketch Based Image Retrieval. 4247-4256 - Yifan Xu, Weijian Xu, David Cheung, Zhuowen Tu:

Line Segment Detection Using Transformers Without Edges. 4257-4266 - Shengyu Huang, Zan Gojcic, Mikhail Usvyatsov, Andreas Wieser, Konrad Schindler:
Predator: Registration of 3D Point Clouds With Low Overlap. 4267-4276 - Cheng Lin, Changjian Li, Yuan Liu, Nenglun Chen, Yi-King Choi, Wenping Wang:

Point2Skeleton: Learning Skeletal Representations from Point Clouds. 4277-4286 - Petr Kellnhofer, Lars Jebe, Andrew Jones, Ryan Spicer, Kari Pulli, Gordon Wetzstein:
Neural Lumigraph Rendering. 4287-4297 - Álvaro Parra, Shin-Fang Ch'ng, Tat-Jun Chin, Anders P. Eriksson, Ian Reid:

Rotation Coordinate Descent for Fast Globally Optimal Rotation Averaging. 4298-4307 - Zhaoyang Lyu, Minghao Guo, Tong Wu, Guodong Xu, Kehuan Zhang, Dahua Lin:

Towards Evaluating and Training Verifiably Robust Neural Networks. 4308-4317 - Vladimir Guzov, Aymen Mir, Torsten Sattler, Gerard Pons-Moll:

Human POSEitioning System (HPS): 3D Human Pose Estimation and Self-Localization in Large Scenes From Body-Mounted Sensors. 4318-4329 - Qiong Wu, Pingyang Dai, Jie Chen, Chia-Wen Lin, Yongjian Wu, Feiyue Huang, Bineng Zhong, Rongrong Ji:
Discover Cross-Modality Nuances for Visible-Infrared Person Re-Identification. 4330-4339 - Liyuan Pan, Shah Chowdhury, Richard Hartley, Miaomiao Liu, Hongguang Zhang, Hongdong Li:
Dual Pixel Exploration: Simultaneous Depth Estimation and Image Restoration. 4340-4349 - Yuan-Hong Liao, Amlan Kar, Sanja Fidler:
Towards Good Practices for Efficiently Annotating Large-Scale Image Classification Datasets. 4350-4359 - Yinan He, Bei Gan, Siyu Chen, Yichun Zhou, Guojun Yin, Luchuan Song, Lu Sheng, Jing Shao, Ziwei Liu:
ForgeryNet: A Versatile Benchmark for Comprehensive Forgery Analysis. 4360-4369 - Jiawei Liu, Zheng-Jun Zha, Wei Wu, Kecheng Zheng, Qibin Sun:

Spatial-Temporal Correlation and Topology Learning for Person Re-Identification in Videos. 4370-4379 - Yichen Sheng, Jianming Zhang, Bedrich Benes:

SSN: Soft Shadow Network for Image Compositing. 4380-4390 - Tal Daniel, Aviv Tamar:

Soft-IntroVAE: Analyzing and Improving the Introspective Variational Autoencoder. 4391-4400 - Da-Wei Zhou, Han-Jia Ye, De-Chuan Zhan:

Learning Placeholders for Open-Set Recognition. 4401-4410 - Yixing Xu, Yunhe Wang, Kai Han, Yehui Tang, Shangling Jui, Chunjing Xu, Chang Xu:

ReNAS: Relativistic Evaluation of Neural Architecture Search. 4411-4420 - Siyuan Cheng, Bineng Zhong, Guorong Li, Xin Liu, Zhenjun Tang, Xianxian Li, Jing Wang:

Learning To Filter: Siamese Relation Network for Robust Tracking. 4421-4431 - Yinghao Xu, Yujun Shen, Jiapeng Zhu, Ceyuan Yang, Bolei Zhou:

Generative Hierarchical Features From Synthesizing Images. 4432-4442 - Riccardo Volpi, Diane Larlus, Grégory Rogez:

Continual Adaptation of Visual Representations via Domain Randomization and Meta-Learning. 4443-4453 - Miguel Jaques, Michael Burke, Timothy M. Hospedales:

NewtonianVAE: Proportional Control and Goal Identification From Pixels via Physical Latent Spaces. 4454-4463 - Zhengzhe Liu, Xiaojuan Qi, Chi-Wing Fu:

3D-to-2D Distillation for Indoor Scene Parsing. 4464-4474 - Nontawat Tritrong, Pitchaporn Rewatbowornwong, Supasorn Suwajanakorn:

Repurposing GANs for One-Shot Semantic Part Segmentation. 4475-4485 - Chuhan Zhang, Ankush Gupta, Andrew Zisserman:

Temporal Query Networks for Fine-Grained Video Understanding. 4486-4496 - Kiana Ehsani, Winson Han, Alvaro Herrasti, Eli VanderBilt, Luca Weihs, Eric Kolve, Aniruddha Kembhavi, Roozbeh Mottaghi:

ManipulaTHOR: A Framework for Visual Object Manipulation. 4497-4506 - Erika Lu, Forrester Cole, Tali Dekel, Andrew Zisserman, William T. Freeman, Michael Rubinstein:

Omnimatte: Associating Objects and Their Effects in Video. 4507-4515 - Vibashan VS, Vikram Gupta, Poojan Oza, Vishwanath A. Sindagi, Vishal M. Patel:

MeGA-CDA: Memory Guided Attention for Category-Aware Unsupervised Domain Adaptive Object Detection. 4516-4526 - Zhibo Fan, Yuchen Ma, Zeming Li, Jian Sun:

Generalized Few-Shot Object Detection Without Forgetting. 4527-4536 - Yuanyi Zhong, Jianfeng Wang, Lijuan Wang, Jian Peng, Yu-Xiong Wang, Lei Zhang:

DAP: Detection-Aware Pre-Training With Weak Supervision. 4537-4546 - Jing Huang, Guan Pang, Rama Kovvuri, Mandy Toh, Kevin J. Liang, Praveen Krishnan, Xi Yin, Tal Hassner:

A Multiplexed Network for End-to-End, Multilingual OCR. 4547-4557 - Hao Wang, Xiang Bai, Mingkun Yang, Shenggao Zhu, Jing Wang, Wenyu Liu:

Scene Text Retrieval via Joint Text Detection and Similarity Learning. 4558-4567 - Zhenyu Wang, Yali Li, Ye Guo, Lu Fang, Shengjin Wang:

Data-Uncertainty Guided Multi-Phase Learning for Semi-Supervised Object Detection. 4568-4577 - Alex Yu, Vickie Ye, Matthew Tancik, Angjoo Kanazawa:

pixelNeRF: Neural Radiance Fields From One or Few Images. 4578-4587 - Francis Engelmann, Konstantinos Rematas, Bastian Leibe, Vittorio Ferrari:

From Points to Multi-Object 3D Reconstruction. 4588-4597 - Weihang Liao, Art Subpa-Asa, Yinqiang Zheng, Imari Sato:

4D Hyperspectral Photoacoustic Data Restoration With Reliability Analysis. 4598-4607 - Yinyu Nie, Ji Hou, Xiaoguang Han, Matthias Nießner:

RfD-Net: Point Scene Understanding by Semantic Instance Reconstruction. 4608-4618 - Chulin Xie, Chuxin Wang, Bo Zhang, Hao Yang, Dong Chen, Fang Wen:

Style-Based Point Generator With Adversarial Rendering for Point Cloud Completion. 4619-4628 - Antonio Alliegro, Diego Valsesia, Giulia Fracastoro, Enrico Magli, Tatiana Tommasi:

Denoise and Contrast for Category Agnostic Shape Completion. 4629-4638 - Luca Morreale, Noam Aigerman, Vladimir G. Kim, Niloy J. Mitra:

Neural Surface Maps. 4639-4648 - Luyang Zhu, Arsalan Mousavian, Yu Xiang, Hammad Mazhar, Jozef van Eenbergen, Shoubhik Debnath, Dieter Fox:

RGB-D Local Implicit Function for Depth Completion of Transparent Objects. 4649-4658 - Alexander Vakhitov, Luis Ferraz, Antonio Agudo, Francesc Moreno-Noguer:

Uncertainty-Aware Camera Pose Estimation From Points and Lines. 4659-4668 - Qunjie Zhou, Torsten Sattler, Laura Leal-Taixé:

Patch2Pix: Epipolar-Guided Pixel-Level Correspondences. 4669-4678 - John Phillips, Julieta Martinez, Ioan Andrei Barsan, Sergio Casas, Abbas Sadat, Raquel Urtasun:

Deep Multi-Task Learning for Joint Localization, Perception, and Prediction. 4679-4689 - Qianqian Wang, Zhicheng Wang, Kyle Genova, Pratul P. Srinivasan, Howard Zhou, Jonathan T. Barron, Ricardo Martin-Brualla, Noah Snavely, Thomas A. Funkhouser:

IBRNet: Learning Multi-View Image-Based Rendering. 4690-4699 - Philipp Henzler, Jeremy Reizenstein, Patrick Labatut, Roman Shapovalov, Tobias Ritschel, Andrea Vedaldi, David Novotný:

Unsupervised Learning of 3D Object Categories From Videos in the Wild. 4700-4709 - Jin Fang, Xinxin Zuo, Dingfu Zhou, Shengze Jin, Sen Wang, Liangjun Zhang:

LiDAR-Aug: A General Rendering-Based Augmentation Framework for 3D Object Detection. 4710-4720 - Xinzhu Ma, Yinmin Zhang, Dan Xu, Dongzhan Zhou, Shuai Yi, Haojie Li, Wanli Ouyang:

Delving Into Localization Errors for Monocular 3D Object Detection. 4721-4730 - Mohsen Fayyaz, Emad Bahrami Rad, Ali Diba, Mehdi Noroozi, Ehsan Adeli, Luc Van Gool, Jürgen Gall:

3D CNNs With Adaptive Temporal Feature Resolutions. 4731-4740 - Linguo Li, Minsi Wang, Bingbing Ni, Hang Wang, Jiancheng Yang, Wenjun Zhang:

3D Human Action Representation Learning via Cross-View Consistency Pursuit. 4741-4750 - Zhihui Li, Lina Yao:

Three Birds with One Stone: Multi-Task Temporal Action Detection via Recycling Temporal Annotations. 4751-4760 - Wenxuan Wang, Bangjie Yin, Taiping Yao, Li Zhang, Yanwei Fu, Shouhong Ding, Jilin Li, Feiyue Huang, Xiangyang Xue:

Delving into Data: Effectively Substitute Training for Black-box Attack. 4761-4770 - Jean-Baptiste Truong, Pratyush Maini, Robert J. Walls, Nicolas Papernot:

Data-Free Model Extraction. 4771-4780 - Vasily Zadorozhnyy, Qiang Cheng, Qiang Ye:

Adaptive Weighted Discriminator for Training Generative Adversarial Networks. 4781-4790 - Mallikarjun B. R., Ayush Tewari, Tae-Hyun Oh, Tim Weyrich, Bernd Bickel, Hans-Peter Seidel, Hanspeter Pfister, Wojciech Matusik, Mohamed Elgharib, Christian Theobalt:

Monocular Reconstruction of Neural Face Reflectance Fields. 4791-4800 - Qiongjie Cui, Huaijiang Sun:

Towards Accurate 3D Human Motion Prediction From Incomplete Observations. 4801-4810 - Yuxiao Zhou, Marc Habermann, Ikhsanul Habibie, Ayush Tewari, Christian Theobalt, Feng Xu:

Monocular Real-Time Full Body Capture With Inter-Part Correlations. 4811-4822 - Lingbo Liu, Jiaqi Chen, Hefeng Wu, Guanbin Li, Chenglong Li, Liang Lin:

Cross-Modal Collaborative Representation Learning and a Large-Scale RGBT Benchmark for Crowd Counting. 4823-4833 - Yuhao Zhu, Qi Li, Jian Wang, Cheng-Zhong Xu, Zhenan Sun:

One Shot Face Swapping on Megapixels. 4834-4844 - Tengfei Song, Zijun Cui, Yuru Wang, Wenming Zheng, Qiang Ji:

Dynamic Probabilistic Graph Convolution for Facial Action Unit Intensity Estimation. 4845-4854 - Fengxiang Yang, Zhun Zhong, Zhiming Luo, Yuanzheng Cai, Yaojin Lin, Shaozi Li, Nicu Sebe:

Joint Noise-Tolerant Learning and Meta Camera Shift Adaptation for Unsupervised Person Re-Identification. 4855-4864 - Hanjae Kim, Sunghun Joung, Ig-Jae Kim, Kwanghoon Sohn:

Prototype-Guided Saliency Feature Learning for Person Search. 4865-4874 - K. Ram Prabhakar, Gowtham Senthil, Susmit Agrawal, R. Venkatesh Babu, Rama Krishna Sai Subrahmanyam Gorthi:

Labeled From Unlabeled: Exploiting Unlabeled Data for Few-Shot Deep HDR Deghosting. 4875-4885 - Jiangxin Dong, Stefan Roth, Bernt Schiele:

Learning Spatially-Variant MAP Models for Non-Blind Image Deblurring. 4886-4895 - Shen Cheng, Yuzhi Wang, Haibin Huang, Donghao Liu, Haoqiang Fan, Shuaicheng Liu:

NBNet: Noise Basis Learning for Image Denoising With Subspace Projection. 4896-4906 - Man Zhou, Jie Xiao, Yifan Chang, Xueyang Fu, Aiping Liu, Jinshan Pan, Zheng-Jun Zha:

Image De-Raining via Continual Learning. 4907-4916 - Longguang Wang, Xiaoyu Dong, Yingqian Wang, Xinyi Ying, Zaiping Lin, Wei An, Yulan Guo:

Exploring Sparsity in Image Super-Resolution for Efficient Inference. 4917-4926 - Zhihao Liu, Hui Yin, Xinyi Wu, Zhenyao Wu, Yang Mi, Song Wang:

From Shadow Generation To Shadow Removal. 4927-4936 - Daqi Liu, Álvaro Parra, Tat-Jun Chin:

Spatiotemporal Registration for Event-Based Visual Odometry. 4937-4946 - Kelvin C. K. Chan, Xintao Wang, Ke Yu, Chao Dong, Chen Change Loy:

BasicVSR: The Search for Essential Components in Video Super-Resolution and Beyond. 4947-4956 - Aupendu Kar, Prabir Kumar Biswas:

Fast Bayesian Uncertainty Estimation and Reduction of Batch Normalized Single Image Super-Resolution Network. 4957-4966 - Fan Zhang, Yu Li, Shaodi You, Ying Fu:

Learning Temporal Consistency for Low Light Video Enhancement From Single Images. 4967-4976 - Qingyong Hu, Bo Yang, Sheikh Khalid, Wen Xiao, Niki Trigoni, Andrew Markham:

Towards Semantic Segmentation of Urban-Scale 3D Point Clouds: A Dataset, Benchmarks and Challenges. 4977-4987 - Valentin Khrulkov, Artem Babenko:

Neural Side-by-Side: Predicting Human Preferences for No-Reference Super-Resolution Evaluation. 4988-4997 - Fei Yang, Luis Herranz, Yongmei Cheng, Mikhail G. Mozerov:

Slimmable Compressive Autoencoders for Practical Neural Image Compression. 4998-5007 - Pengguang Chen, Shu Liu, Hengshuang Zhao, Jiaya Jia:

Distilling Knowledge via Knowledge Review. 5008-5017 - Yehui Tang, Yunhe Wang, Yixing Xu, Yiping Deng, Chao Xu, Dacheng Tao, Chang Xu:

Manifold Regularized Dynamic Network Pruning. 5018-5028 - Kohei Yamamoto:

Learnable Companding Quantization for Accurate Low-Bit Neural Networks. 5029-5038 - Alexandros Haliassos, Konstantinos Vougioukas, Stavros Petridis, Maja Pantic:

Lips Don't Lie: A Generalisable and Robust Approach To Face Forgery Detection. 5039-5049 - Andrei Kapishnikov, Subhashini Venugopalan, Besim Avci, Ben Wedin, Michael Terry, Tolga Bolukbasi:

Guided Integrated Gradients: An Adaptive Path Method for Removing Noise. 5050-5058 - Zelun Luo, Daniel J. Wu, Ehsan Adeli, Li Fei-Fei:

Scalable Differential Privacy With Sparse Network Finetuning. 5059-5068 - Quankai Gao, Fudong Wang, Nan Xue, Jin-Gang Yu, Gui-Song Xia:

Deep Graph Matching Under Quadratic Constraint. 5069-5078 - Xiaohan Wang, Linchao Zhu, Yi Yang:

T2VLAD: Global-Local Sequence Alignment for Text-Video Retrieval. 5079-5088 - Jia Li, Zhaoyang Li, Jie Cao, Xingguang Song, Ran He:

FaceInpainter: High Fidelity Face Adaptation to Heterogeneous Domains. 5089-5098 - Mohammadreza Armandpour, Ali Sadeghian, Chunyuan Li, Mingyuan Zhou:

Partition-Guided GANs. 5099-5109 - Yifan Wang, Andrew Liu, Richard Tucker, Jiajun Wu, Brian L. Curless, Steven M. Seitz, Noah Snavely:

Repopulating Street Scenes. 5110-5119 - Tengfei Wang, Hao Ouyang, Qifeng Chen:

Image Inpainting With External-Internal Learning and Monochromic Bottleneck. 5120-5129 - Yangchen Xie, Xinyuan Chen, Li Sun, Yue Lu:

DG-Font: Deformable Generative Networks for Unsupervised Font Generation. 5130-5140 - Tianwei Lin, Zhuoqi Ma, Fu Li, Dongliang He, Xin Li, Errui Ding, Nannan Wang, Jie Li, Xinbo Gao:

Drafting and Revision: Laplacian Pyramid Network for Fast High-Quality Artistic Style Transfer. 5141-5150 - Artur Grigorev, Karim Iskakov, Anastasia Ianina, Renat Bashirov, Ilya Zakharkin, Alexander Vakhitov, Victor Lempitsky:

StylePeople: A Generative Model of Fullbody Human Avatars. 5151-5160 - Arghya Pal, Raphael C.-W. Phan, KokSheik Wong:

Synthesize-It-Classifier: Learning a Generative Classifier Through Recurrent Self-Analysis. 5161-5170 - Andreas Blattmann, Timo Milbich, Michael Dorkenwald, Björn Ommer:

Understanding Object Dynamics for Interactive Image-to-Video Synthesis. 5171-5181 - Chengming Xu, Yanwei Fu, Chen Liu, Chengjie Wang, Jilin Li, Feiyue Huang, Li Zhang, Xiangyang Xue:

Learning Dynamic Alignment via Meta-Filter for Few-Shot Learning. 5182-5191 - Yazhou Yao, Zeren Sun, Chuanyi Zhang, Fumin Shen, Qi Wu, Jian Zhang, Zhenmin Tang:

Jo-SRC: A Contrastive Approach for Combating Noisy Labels. 5192-5201 - Nontawat Charoenphakdee, Jayakorn Vongkulbhisal, Nuttapong Chairatanakul, Masashi Sugiyama:

On Focal Loss for Class-Posterior Probability Estimation: A Theoretical Perspective. 5202-5211 - Shuang Li, Kaixiong Gong, Chi Harold Liu, Yulin Wang, Feng Qiao, Xinjing Cheng:

MetaSAug: Meta Semantic Augmentation for Long-Tailed Visual Recognition. 5212-5221 - Massimiliano Mancini, Muhammad Ferjad Naeem, Yongqin Xian, Zeynep Akata:

Open World Compositional Zero-Shot Learning. 5222-5230 - Zhile Chen, Feng Li, Yuhui Quan, Yong Xu, Hui Ji:

Deep Texture Recognition via Exploiting Cross-Layer Statistical Self-Similarity. 5231-5240 - Runzhong Wang, Tianqi Zhang, Tianshu Yu, Junchi Yan, Xiaokang Yang:

Combinatorial Learning of Graph Edit Distance via Dynamic Embedding. 5241-5250 - Yawen Duan, Xin Chen, Hang Xu, Zewei Chen, Xiaodan Liang, Tong Zhang, Zhenguo Li:

TransNAS-Bench-101: Improving Transferability and Generalizability of Cross-Task Neural Architecture Search. 5251-5260 - Gregory P. Meyer:

An Alternative Probabilistic Interpretation of the Huber Loss. 5261-5269 - Yohan Jun, Hyungseob Shin, Taejoon Eo, Dosik Hwang:

Joint Deep Model-Based MR Image and Coil Sensitivity Reconstruction Network (Joint-ICNet) for Fast MRI. 5270-5279 - Fakai Wang, Kang Zheng, Le Lu, Jing Xiao, Min Wu, Shun Miao:

Automatic Vertebra Localization and Identification in CT by Spine Rectification and Anatomically-Constrained Optimization. 5280-5288 - Bin Yan, Xinyu Zhang, Dong Wang, Huchuan Lu, Xiaoyun Yang:

Alpha-Refine: Boosting Tracking Performance by Precise Bounding Box Estimation. 5289-5298 - Jiawei He, Zehao Huang, Naiyan Wang, Zhaoxiang Zhang:

Learnable Graph Matching: Incorporating Graph Partitioning With Deep Feature Learning for Multiple Object Tracking. 5299-5309 - Kecheng Zheng, Wu Liu, Lingxiao He, Tao Mei, Jiebo Luo, Zheng-Jun Zha:

Group-aware Label Transfer for Domain Adaptive Person Re-identification. 5310-5319 - Zhiqiang Fu, Yao Zhao, Dongxia Chang, Xingxing Zhang, Yiming Wang:

Double Low-Rank Representation With Projection Distance Penalty for Clustering. 5320-5329 - Tianning Yuan, Fang Wan, Mengying Fu, Jianzhuang Liu, Songcen Xu, Xiangyang Ji, Qixiang Ye:

Multiple Instance Active Learning for Object Detection. 5330-5339 - Boyan Jiang, Yinda Zhang, Xingkui Wei, Xiangyang Xue, Yanwei Fu:

Learning Compositional Representation for 4D Captures With Neural ODE. 5340-5350 - Subhankar Roy, Evgeny Krivosheev, Zhun Zhong, Nicu Sebe, Elisa Ricci:

Curriculum Graph Co-Teaching for Multi-Target Domain Adaptation. 5351-5360 - Astuti Sharma, Tarun Kalluri, Manmohan Chandraker:

Instance Level Affinity-Based Transfer for Unsupervised Domain Adaptation. 5361-5371 - Xingxuan Zhang, Peng Cui, Renzhe Xu, Linjun Zhou, Yue He, Zheyan Shen:

Deep Stable Learning for Out-of-Distribution Generalization. 5372-5382 - Liyuan Wang, Kuo Yang, Chongxuan Li, Lanqing Hong, Zhenguo Li, Jun Zhu:

ORDisCo: Effective and Efficient Usage of Incremental Unlabeled Data for Semi-Supervised Continual Learning. 5383-5392 - Yifan Sun, Yuke Zhu, Yuhan Zhang, Pengkun Zheng, Xi Qiu, Chi Zhang, Yichen Wei:

Dynamic Metric Learning: Towards a Scalable Metric Space To Accommodate Multiple Semantic Scales. 5393-5402 - Peng Hu, Xi Peng, Hongyuan Zhu, Liangli Zhen, Jie Lin:

Learning Cross-Modal Retrieval With Noisy Labels. 5403-5413 - Linus Ericsson, Henry Gouk, Timothy M. Hospedales:

How Well Do Self-Supervised Models Transfer? 5414-5423 - Yifan Liu, Hao Chen, Yu Chen, Wei Yin, Chunhua Shen:

Generic Perceptual Loss for Modeling Structured Output Dependencies. 5424-5432 - Songyan Zhang, Zhicheng Wang, Qiang Wang, Jinshuo Zhang, Gang Wei, Xiaowen Chu:

EDNet: Efficient Disparity Estimation With Cost Volume Combination and Attention-Based Spatial Residual. 5433-5442 - Zhi Tian, Chunhua Shen, Xinlong Wang, Hao Chen:

BoxInst: High-Performance Instance Segmentation With Box Annotations. 5443-5452 - Kai Zhang, Fujun Luan, Qianqian Wang, Kavita Bala, Noah Snavely:

PhySG: Inverse Rendering With Spherical Gaussians for Physics-Based Material Editing and Relighting. 5453-5462 - Huiyu Wang, Yukun Zhu, Hartwig Adam, Alan L. Yuille, Liang-Chieh Chen:

MaX-DeepLab: End-to-End Panoptic Segmentation With Mask Transformers. 5463-5474 - Guo-Sen Xie, Jie Liu, Huan Xiong, Ling Shao:

Scale-Aware Graph Neural Network for Few-Shot Semantic Segmentation. 5475-5484 - Daan de Geus, Panagiotis Meletis, Chenyang Lu, Xiaoxiao Wen, Gijs Dubbelman:

Part-Aware Panoptic Segmentation. 5485-5494 - Seungho Lee, Minhyun Lee, Jongwuk Lee, Hyunjung Shim:

Railroad Is Not a Train: Saliency As Pseudo-Pixel Supervision for Weakly Supervised Semantic Segmentation. 5495-5505 - Yi Liu, Xiaoyang Huo, Tianyi Chen, Xiangping Zeng, Si Wu, Zhiwen Yu, Hau-San Wong:

Mask-Embedded Discriminator With Region-Based Semantic Regularization for Semi-Supervised Class-Conditional Image Synthesis. 5506-5515 - Jiwoong Park, Junho Cho, Hyung Jin Chang, Jin Young Choi:

Unsupervised Hyperbolic Representation Learning via Message Passing Auto-Encoders. 5516-5526 - Mehmet Aygun, Aljosa Osep, Mark Weber, Maxim Maximov, Cyrill Stachniss, Jens Behley, Laura Leal-Taixé:

4D Panoptic LiDAR Segmentation. 5527-5537 - Yang Jiao, Trac D. Tran, Guangming Shi:

EffiScene: Efficient Per-Pixel Rigidity Inference for Unsupervised Joint Learning of Optical Flow, Depth, Camera Pose and Motion Segmentation. 5538-5547 - Sanjay Haresh, Sateesh Kumar, Huseyin Coskun, Shahram Najam Syed, Andrey Konin, M. Zeeshan Zia, Quoc-Huy Tran:

Learning by Aligning Videos in Time. 5548-5558 - Ho Kei Cheng, Yu-Wing Tai, Chi-Keung Tang:

Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion. 5559-5568 - Gunhee Nam, Miran Heo, Seoung Wug Oh, Joon-Young Lee, Seon Joo Kim:

Polygonal Point Set Tracking. 5569-5578 - Pengchuan Zhang, Xiujun Li, Xiaowei Hu, Jianwei Yang, Lei Zhang, Lijuan Wang, Yejin Choi, Jianfeng Gao:

VinVL: Revisiting Visual Representations in Vision-Language Models. 5579-5588 - Arka Sadhu, Tanmay Gupta, Mark Yatskar, Ram Nevatia, Aniruddha Kembhavi:

Visual Semantic Role Labeling for Video Understanding. 5589-5600 - Yapeng Tian, Chenliang Xu:

Can Audio-Visual Integration Strengthen Robustness Under Multimodal Attacks? 5601-5611 - Yongfei Liu, Bo Wan, Lin Ma, Xuming He:

Relation-aware Instance Refinement for Weakly Supervised Visual Grounding. 5612-5621 - Tao Tu, Qing Ping, Govindarajan Thattai, Gökhan Tür, Prem Natarajan:

Learning Better Visual Dialog Agents With Pretrained Visual-Linguistic Representation. 5622-5631 - Spencer Whitehead, Hui Wu, Heng Ji, Rogério Feris, Kate Saenko:

Separating Skills and Concepts for Novel Visual Question Answering. 5632-5641 - Lvmin Zhang, Xinrui Wang, Qingnan Fan, Yi Ji, Chunping Liu:

Generating Manga From Illustrations via Mimicking Manga Creation Workflow. 5642-5651 - Peizhao Li, Jiuxiang Gu, Jason Kuen, Vlad I. Morariu, Handong Zhao, Rajiv Jain, Varun Manjunatha, Hongfu Liu:

SelfDoc: Self-Supervised Document Representation Learning. 5652-5660 - Trisha Mittal, Puneet Mathur, Aniket Bera, Dinesh Manocha:

Affect2MM: Affective Analysis of Multimedia Content Using Emotion Causality. 5661-5671 - Ayan Kumar Bhunia, Pinaki Nath Chowdhury, Yongxin Yang, Timothy M. Hospedales, Tao Xiang, Yi-Zhe Song:

Vectorization and Rasterization: Self-Supervised Learning for Sketch and Handwriting. 5672-5681 - Sheng Xu, Junhe Zhao, Jinhu Lu, Baochang Zhang, Shumin Han, David S. Doermann:

Layer-Wise Searching for 1-Bit Detectors. 5682-5691 - Zan Gojcic, Or Litany, Andreas Wieser, Leonidas J. Guibas, Tolga Birdal:

Weakly Supervised Learning of Rigid 3D Scene Flow. 5692-5703 - Ziyan Wang, Timur M. Bagautdinov, Stephen Lombardi, Tomas Simon, Jason M. Saragih, Jessica K. Hodgins, Michael Zollhöfer:

Learning Compositional Radiance Fields of Dynamic Human Heads. 5704-5713 - Prune Truong, Martin Danelljan, Luc Van Gool, Radu Timofte:

Learning Accurate Dense Correspondences and When To Trust Them. 5714-5724 - Pei Sun, Weiyue Wang, Yuning Chai, Gamaleldin Elsayed, Alex Bewley, Xiao Zhang, Cristian Sminchisescu, Dragomir Anguelov:

RSN: Range Sparse Net for Efficient, Accurate LiDAR 3D Object Detection. 5725-5734 - Yunrui Yu, Xitong Gao, Cheng-Zhong Xu:

LAFEAT: Piercing Through Adversarial Defenses With Latent Features. 5735-5745 - Tao Yu, Zerong Zheng, Kaiwen Guo, Pengpeng Liu, Qionghai Dai, Yebin Liu:

Function4D: Real-Time Human Volumetric Capture From Very Sparse Consumer RGBD Sensors. 5746-5756 - Seung-Hwan Baek, Felix Heide:

Polka Lines: Learning Structured Illumination and Reconstruction for Active Stereo. 5757-5767 - Jaeseok Byun, Sungmin Cha, Taesup Moon:

FBI-Denoiser: Fast Blind Image Denoiser for Poisson-Gaussian Noise. 5768-5777 - Tianfei Zhou, Wenguan Wang, Zhiyuan Liang, Jianbing Shen:

Face Forensics in the Wild. 5778-5788 - Dongze Li, Wei Wang, Hongxing Fan, Jing Dong:

Exploring Adversarial Fake Images on Face Manifold. 5789-5798 - Eric R. Chan, Marco Monteiro, Petr Kellnhofer, Jiajun Wu, Gordon Wetzstein:

Pi-GAN: Periodic Implicit Generative Adversarial Networks for 3D-Aware Image Synthesis. 5799-5809 - Aleksander Holynski, Brian L. Curless, Steven M. Seitz, Richard Szeliski:

Animating Pictures With Eulerian Motion Fields. 5810-5819 - Seung Wook Kim, Jonah Philion, Antonio Torralba, Sanja Fidler:

DriveGAN: Towards a Controllable High-Quality Neural Simulation. 5820-5829 - K. J. Joseph, Salman H. Khan, Fahad Shahbaz Khan, Vineeth N. Balasubramanian:

Towards Open World Object Detection. 5830-5840 - Yufan He, Dong Yang, Holger Roth, Can Zhao, Daguang Xu:

DiNTS: Differentiable Neural Network Topology Search for 3D Medical Image Segmentation. 5841-5850 - Qi Feng, Vitaly Ablavsky, Qinxun Bai, Stan Sclaroff:

Siamese Natural Language Tracker: Tracking by Natural Language Descriptions With Siamese Trackers. 5851-5860 - Xinqi Zhu, Chang Xu, Dacheng Tao:

Where and What? Examining Interpretable Disentangled Representations. 5861-5870 - Fei Zhu, Xu-Yao Zhang, Chuang Wang, Fei Yin, Cheng-Lin Liu:

Prototype Augmentation and Self-Supervision for Incremental Learning. 5871-5880 - Yawen Huang, Feng Zheng, Danyang Wang, Weilin Huang, Matthew R. Scott, Ling Shao:

Brain Image Synthesis With Unsupervised Multivariate Canonical CSCl4Net. 5881-5890 - Nicolas Girard, Dmitriy Smirnov, Justin Solomon, Yuliya Tarabalka:

Polygonal Building Extraction by Frame Field Learning. 5891-5900 - Shubhankar Borse, Ying Wang, Yizhe Zhang, Fatih Porikli:

InverseForm: A Loss Function for Structured Boundary-Aware Segmentation. 5901-5911 - Brendan Duke, Abdalla Ahmed, Christian Wolf, Parham Aarabi, Graham W. Taylor:

SSTVOS: Sparse Spatiotemporal Transformers for Video Object Segmentation. 5912-5921 - Luca Weihs, Matt Deitke, Aniruddha Kembhavi, Roozbeh Mottaghi:

Visual Room Rearrangement. 5922-5931 - Mianlun Zheng, Yi Zhou, Duygu Ceylan, Jernej Barbic:

A Deep Emulator for Secondary Motion of 3D Characters. 5932-5940 - Qize Yang, Xihan Wei, Biao Wang, Xian-Sheng Hua, Lei Zhang:

Interactive Self-Training With Mean Teachers for Semi-Supervised Object Detection. 5941-5950 - Siddhesh Khandelwal, Raghav Goyal, Leonid Sigal:

UniT: Unified Knowledge Transfer for Any-Shot Object Detection and Segmentation. 5951-5961 - Hao Tian, Yuntao Chen, Jifeng Dai, Zhaoxiang Zhang, Xizhou Zhu:

Unsupervised Object Detection With LIDAR Clues. 5962-5972 - Tianwei Wang, Yuanzhi Zhu, Lianwen Jin, Dezhi Peng, Zhe Li, Mengchao He, Yongpan Wang, Canjie Luo:

Implicit Feature Alignment: Learn To Convert Text Recognizer to Text Spotter. 5973-5982 - Qi Wan, Haoqin Ji, Linlin Shen:

Self-Attention Based Text Knowledge Mining for Text Detection. 5983-5992 - Jun Wei, Qin Wang, Zhen Li, Sheng Wang, S. Kevin Zhou, Shuguang Cui:

Shallow Feature Matters for Weakly Supervised Object Localization. 5993-6001 - Tao Hu, Liwei Wang, Xiaogang Xu, Shu Liu, Jiaya Jia:

Self-Supervised 3D Mesh Reconstruction From Single Images. 6002-6011 

