


CVPR 2016: Las Vegas, NV, USA
- 2016 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2016, Las Vegas, NV, USA, June 27-30, 2016. IEEE Computer Society 2016, ISBN 978-1-4673-8851-1
Oral & Spotlight Session 1-1A
O1-1A: Image Captioning and Question Answering
- Lisa Anne Hendricks, Subhashini Venugopalan, Marcus Rohrbach, Raymond J. Mooney, Kate Saenko, Trevor Darrell:
Deep Compositional Captioning: Describing Novel Object Categories without Paired Training Data. 1-10
- Junhua Mao, Jonathan Huang, Alexander Toshev, Oana Camburu, Alan L. Yuille, Kevin Murphy:
Generation and Comprehension of Unambiguous Object Descriptions. 11-20
- Zichao Yang, Xiaodong He, Jianfeng Gao, Li Deng, Alexander J. Smola:
Stacked Attention Networks for Image Question Answering. 21-29
- Hyeonwoo Noh, Paul Hongsuck Seo, Bohyung Han:
Image Question Answering Using Convolutional Neural Network with Dynamic Parameter Prediction. 30-38
- Jacob Andreas, Marcus Rohrbach, Trevor Darrell, Dan Klein:
Neural Module Networks. 39-48
S1-1A: Language and Vision
- Scott E. Reed, Zeynep Akata, Honglak Lee, Bernt Schiele:
Learning Deep Representations of Fine-Grained Visual Descriptions. 49-58
- Zeynep Akata, Mateusz Malinowski, Mario Fritz, Bernt Schiele:
Multi-cue Zero-Shot Learning with Strong Supervision. 59-68
- Yongqin Xian, Zeynep Akata, Gaurav Sharma, Quynh Nguyen, Matthias Hein, Bernt Schiele:
Latent Embeddings for Zero-Shot Classification. 69-77
- Roland Kwitt, Sebastian Hegenbart, Marc Niethammer:
One-Shot Learning of Scene Locations via Feature Trajectory Transfer. 78-86
- Chuang Gan, Tianbao Yang, Boqing Gong:
Learning Attributes Equals Multi-Source Domain Generalization. 87-97
- Carl Vondrick, Hamed Pirsiavash, Antonio Torralba:
Anticipating Visual Representations from Unlabeled Video. 98-106
Oral & Spotlight Session 1-1B
O1-1B: Matching and Alignment
- Kwang Moo Yi, Yannick Verdie, Pascal Fua, Vincent Lepetit:
Learning to Assign Orientations to Feature Points. 107-116
- Tinghui Zhou, Philipp Krähenbühl, Mathieu Aubry, Qi-Xing Huang, Alexei A. Efros:
Learning Dense Correspondence via 3D-Guided Cycle Consistency. 117-126
- Shenlong Wang, Sean Ryan Fanello, Christoph Rhemann, Shahram Izadi, Pushmeet Kohli:
The Global Patch Collider. 127-135
- Seyed Hamid Rezatofighi, Anton Milan, Zhen Zhang, Qinfeng Shi, Anthony R. Dick, Ian D. Reid:
Joint Probabilistic Matching Using m-Best Solutions. 136-145
- Xiangyu Zhu, Zhen Lei, Xiaoming Liu, Hailin Shi, Stan Z. Li:
Face Alignment Across Large Poses: A 3D Solution. 146-155
S1-1B: Segmentation and Contour Detection
- Jie Feng, Brian L. Price, Scott Cohen, Shih-Fu Chang:
Interactive Segmentation on RGBD Images via Cue Selection. 156-164
- Chen Liu, Pushmeet Kohli, Yasutaka Furukawa:
Layered Scene Decomposition via the Occlusion-CRF. 165-173
- Michael Maire, Takuya Narihira, Stella X. Yu:
Affinity CNN: Learning Pixel-Centric Pairwise Relations for Figure/Ground Embedding. 174-182
- Anna Khoreva, Rodrigo Benenson, Mohamed Omran, Matthias Hein, Bernt Schiele:
Weakly Supervised Object Boundaries. 183-192
- Jimei Yang, Brian L. Price, Scott Cohen, Honglak Lee, Ming-Hsuan Yang:
Object Contour Detection with a Fully Convolutional Encoder-Decoder Network. 193-202
Poster Session P1-1
- Qi Wu, Chunhua Shen, Lingqiao Liu, Anthony R. Dick, Anton van den Hengel:
What Value Do Explicit High Level Concepts Have in Vision to Language Problems? 203-212
- Nati Ofir, Meirav Galun, Boaz Nadler, Ronen Basri:
Fast Detection of Curved Edges at Low SNR. 213-221
- Wei Shen, Kai Zhao, Yuan Jiang, Yan Wang, Zhijiang Zhang, Xiang Bai:
Object Skeleton Extraction in Natural Images by Fusing Scale-Associated Deep Side Outputs. 222-230
- Yu Liu, Michael S. Lew:
Learning Relaxed Deep Supervision for Better Edge Detection. 231-240
- Huan Fu, Chaohui Wang, Dacheng Tao, Michael J. Black:
Occlusion Boundary Detection via Deep Exploration of Context. 241-250
- Zizhao Zhang, Fuyong Xing, Xiaoshuang Shi, Lin Yang:
SemiContour: A Semi-Supervised Learning Approach for Contour Detection. 251-259
- Saurabh Singh, Derek Hoiem, David A. Forsyth:
Learning to Localize Little Landmarks. 260-269
- Lingxi Xie, Liang Zheng, Jingdong Wang, Alan L. Yuille, Qi Tian:
InterActive: Inter-Layer Activeness Propagation. 270-279
- Hao Yang, Joey Tianyi Zhou, Yu Zhang, Bin-Bin Gao, Jianxin Wu, Jianfei Cai:
Exploit Bounding Box Annotations for Multi-Label Object Recognition. 280-288
- Dmitry Laptev, Nikolay Savinov, Joachim M. Buhmann, Marc Pollefeys:
TI-POOLING: Transformation-Invariant Pooling for Feature Learning in Convolutional Neural Networks. 289-297
- Edgar Simo-Serra, Hiroshi Ishikawa:
Fashion Style in 128 Floats: Joint Ranking and Classification Using Weak Data for Feature Extraction. 298-307
- Yuhui Quan, Chenglong Bao, Hui Ji:
Equiangular Kernel Dictionary Learning with Applications to Dynamic Texture Analysis. 308-316
- Yang Gao, Oscar Beijbom, Ning Zhang, Trevor Darrell:
Compact Bilinear Pooling. 317-326
- Tsun-Yi Yang, Yen-Yu Lin, Yung-Yu Chuang:
Accumulated Stability Voting: A Robust Descriptor from Descriptors of Multiple Scales. 327-335
- Swarna Kamlam Ravindran, Anurag Mittal:
CoMaL: Good Features to Match on Object Boundaries. 336-345
- Yuan-Ting Hu, Yen-Yu Lin:
Progressive Feature Matching with Alternate Descriptor Selection and Correspondence Enrichment. 346-354
- Da Chen, Jean-Marie Mirebeau, Laurent D. Cohen:
A New Finsler Minimal Path Model with Curvature Penalization for Image Segmentation and Closed Contour Detection. 355-363
- Yuhua Chen, Dengxin Dai, Jordi Pont-Tuset, Luc Van Gool:
Scale-Aware Alignment of Hierarchical Image Segmentation. 364-372
- Ning Xu, Brian L. Price, Scott Cohen, Jimei Yang, Thomas S. Huang:
Deep Interactive Object Selection. 373-381
- Danna Gurari, Suyog Dutt Jain, Margrit Betke, Kristen Grauman:
Pull the Plug? Predicting If Computers or Humans Should Segment Images. 382-391
- Yuka Kihara, Matvey Soloviev, Tsuhan Chen:
In the Shadows, Shape Priors Shine: Using Occlusion to Improve Multi-region Segmentation. 392-401
- Loïc Alain Royer, David L. Richmond, Carsten Rother, Bjoern Andres, Dagmar Kainmueller:
Convexity Shape Constraints for Image Segmentation. 402-410
- Ertunc Erdil, Sinan Yildirim, Müjdat Çetin, Tolga Tasdizen:
MCMC Shape Sampling for Image Segmentation with Nonparametric Shape Priors. 411-419
- Fengyuan Zhu, Guangyong Chen, Pheng-Ann Heng:
From Noise Modeling to Blind Image Denoising. 420-429
- Jaesik Park, Yu-Wing Tai, Sudipta N. Sinha, In-So Kweon:
Efficient and Robust Color Consistency for Community Photo Collections. 430-438
- Or Lotan, Michal Irani:
Needle-Match: Reliable Patch Matching under High Uncertainty. 439-448
- Kuldeep Kulkarni, Suhas Lohit, Pavan K. Turaga, Ronan Kerviche, Amit Ashok:
ReconNet: Non-Iterative Reconstruction of Images from Compressively Sensed Measurements. 449-458
- Jin-shan Pan, Zhe Hu, Zhixun Su, Hsin-Ying Lee, Ming-Hsuan Yang:
Soft-Segmentation Guided Object Motion Deblurring. 459-468
- Dongliang Cheng, Abdelrahman Kamel, Brian L. Price, Scott Cohen, Michael S. Brown:
Two Illuminant Estimation and User Correction Preference. 469-477
- Guanbin Li, Yizhou Yu:
Deep Contrast Learning for Salient Object Detection. 478-487
- Seung-Hwan Baek, Inchang Choi, Min H. Kim:
Multiview Image Completion with Space Structure Propagation. 488-496
- Long Mai, Hailin Jin, Feng Liu:
Composition-Preserving Deep Photo Aesthetics Assessment. 497-506
- Jiansheng Chen, Gaocheng Bai, Shaoheng Liang, Zhengqin Li:
Automatic Image Cropping: A Computational Complexity Study. 507-515
- Neil D. B. Bruce, Christopher Catton, Sasa Janjic:
A Deeper Look at Saliency: Feature Contrast, Semantics, and Beyond. 516-524
- Calden Wloka, John K. Tsotsos:
Spatially Binned ROC: A Comprehensive Saliency Metric. 525-534
- Qiaosong Wang, Wen Zheng, Robinson Piramuthu:
GraB: Visual Saliency via Novel Graph Model and Background Priors. 535-543
- Anna Volokitin, Michael Gygli, Xavier Boix:
Predicting When Saliency Maps are Accurate and Eye Fixations Consistent. 544-552
- Oriel Frigo, Neus Sabater, Julie Delon, Pierre Hellier:
Split and Match: Example-Based Adaptive Patch Sampling for Unsupervised Style Transfer. 553-561
- Lilian Calvet, Pierre Gurdjos, Carsten Griwodz, Simone Gasparini:
Detection and Accurate Localization of Circular Fiducials under Highly Challenging Conditions. 562-570
- Luis Herranz, Shuqiang Jiang, Xiangyang Li:
Scene Recognition with CNNs: Objects, Scales and Dataset Bias. 571-579
- Nicholas Rhinehart, Kris Makoto Kitani:
Learning Action Maps of Large Environments via First-Person Vision. 580-588
- Yingying Zhang, Desen Zhou, Siqin Chen, Shenghua Gao, Yi Ma:
Single-Image Crowd Counting via Multi-Column Convolutional Neural Network. 589-597
- Junting Pan, Elisa Sayrol, Xavier Giró-i-Nieto, Kevin McGuinness, Noel E. O'Connor:
Shallow and Deep Convolutional Networks for Saliency Prediction. 598-606
- Mohammad Najafi, Sarah Taghavi Namin, Mathieu Salzmann, Lars Petersson:
Sample and Filter: Nonparametric Scene Parsing via Efficient Filtering. 607-615
- Saumitro Dasgupta, Kuan Fang, Kevin Chen, Silvio Savarese:
DeLay: Robust Spatial Layout Estimation for Cluttered Indoor Scenes. 616-624
- Siyu Zhu, Richard Zanibbi:
A Text Detection System for Natural Scenes with Convolutional Feature Learning and Cascaded Classification. 625-632
- Xiaodan Liang, Yunchao Wei, Xiaohui Shen, Zequn Jie, Jiashi Feng, Liang Lin, Shuicheng Yan:
Reversible Recursive Instance-Level Object Segmentation. 633-641
- Yao Lu, Xue Bai, Linda G. Shapiro, Jue Wang:
Coherent Parametric Contours for Interactive Video Object Segmentation. 642-650
- Yong-Jin Liu, Cheng-Chi Yu, Minjing Yu, Ying He:
Manifold SLIC: A Fast Method to Compute Content-Sensitive Superpixels. 651-659
- Gayoung Lee, Yu-Wing Tai, Junmo Kim:
Deep Saliency with Encoded Low Level Distance Map and High Level Features. 660-668
- Ziyu Zhang, Sanja Fidler, Raquel Urtasun:
Instance-Level Segmentation for Autonomous Driving with Deep Densely Connected MRFs. 669-677
- Nian Liu, Junwei Han:
DHSNet: Deep Hierarchical Saliency Network for Salient Object Detection. 678-686
- Rong Quan, Junwei Han, Dingwen Zhang, Feiping Nie:
Object Co-segmentation via Graph Optimized-Flexible Manifold Ranking. 687-695
- Won-Dong Jang, Chulwoo Lee, Chang-Su Kim:
Primary Object Segmentation in Videos via Alternate Convex Optimization of Foreground and Background Distributions. 696-704
- Renjiao Yi, Jue Wang, Ping Tan:
Automatic Fence Segmentation in Videos of Dynamic Scenes. 705-713
- Luca Del Pero, Susanna Ricco, Rahul Sukthankar, Vittorio Ferrari:
Discovering the Physical Parts of an Articulated Object Class from Multiple Videos. 714-723
- Federico Perazzi, Jordi Pont-Tuset, Brian McWilliams, Luc Van Gool, Markus H. Gross, Alexander Sorkine-Hornung:
A Benchmark Dataset and Evaluation Methodology for Video Object Segmentation. 724-732
- Mahmudul Hasan, Jonghyun Choi, Jan Neumann, Amit K. Roy-Chowdhury, Larry S. Davis:
Learning Temporal Regularity in Video Sequences. 733-742
- Nicolas Marki, Federico Perazzi, Oliver Wang, Alexander Sorkine-Hornung:
Bilateral Space Video Segmentation. 743-751
- Zhang Zhang, Kaiqi Huang, Tieniu Tan, Peipei Yang, Jun Li:
ReD-SFA: Relation Discovery Based Slow Feature Analysis for Trajectory Clustering. 752-760
Oral & Spotlight Session 1-2A
O1-2A: Object Recognition and Detection
- Abhinav Shrivastava, Abhinav Gupta, Ross B. Girshick:
Training Region-Based Object Detectors with Online Hard Example Mining. 761-769
- Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun:
Deep Residual Learning for Image Recognition. 770-778
- Joseph Redmon, Santosh Kumar Divvala, Ross B. Girshick, Ali Farhadi:
You Only Look Once: Unified, Real-Time Object Detection. 779-788
- Spyros Gidaris, Nikos Komodakis:
LocNet: Improving Localization Accuracy for Object Detection. 789-798
- Qian Yu, Feng Liu, Yi-Zhe Song, Tao Xiang, Timothy M. Hospedales, Chen Change Loy:
Sketch Me That Shoe. 799-807
S1-2A: Object Detection 1
- Shuran Song, Jianxiong Xiao:
Deep Sliding Shapes for Amodal 3D Object Detection in RGB-D Images. 808-816
- Kai Kang, Wanli Ouyang, Hongsheng Li, Xiaogang Wang:
Object Detection from Video Tubelets with Convolutional Neural Networks. 817-825
- Judy Hoffman, Saurabh Gupta, Trevor Darrell:
Learning with Side Information through Modality Hallucination. 826-834
- Neelima Chavali, Harsh Agrawal, Aroma Mahendru, Dhruv Batra:
Object-Proposal Evaluation Protocol is 'Gameable'. 835-844
- Tao Kong, Anbang Yao, Yurong Chen, Fuchun Sun:
HyperNet: Towards Accurate Region Proposal Generation and Joint Object Detection. 845-853
- Dim P. Papadopoulos, Jasper R. R. Uijlings, Frank Keller, Vittorio Ferrari:
We Don't Need No Bounding-Boxes: Training Object Class Detectors Using Only Human Verification. 854-863
- Wanli Ouyang, Xiaogang Wang, Cong Zhang, Xiaokang Yang:
Factors in Finetuning Deep Model for Object Detection with Long-Tail Distribution. 864-873
Oral & Spotlight Session 1-2B
O1-2B: Vision with Alternative Sensors
- Guy Rosman, Daniela Rus, John W. Fisher III:
Information-Driven Adaptive Structured-Light Scanners. 874-883
- Patrick Bardow, Andrew J. Davison, Stefan Leutenegger:
Simultaneous Optical Flow and Intensity Estimation from an Event Camera. 884-892
- Achuta Kadambi, Jamie Schiel, Ramesh Raskar:
Macroscopic Interferometry: Rethinking Depth Estimation with Frequency-Domain Time-of-Flight. 893-902
- Huaijin G. Chen, Suren Jayasuriya, Jiyue Yang, Judy Stephen, Sriram Sivaramakrishnan, Ashok Veeraraghavan, Alyosha C. Molnar:
ASP Vision: Optically Computing the First Layer of Convolutional Neural Networks Using Angle Sensitive Pixels. 903-912
- Katherine L. Bouman, Michael D. Johnson, Daniel Zoran, Vincent L. Fish, Sheperd S. Doeleman, William T. Freeman:
Computational Imaging for VLBI Image Reconstruction. 913-922
S1-2B: Video Analysis 1
- Chuang Gan, Ting Yao, Kuiyuan Yang, Yi Yang, Tao Mei:
You Lead, We Exceed: Labor-Free Video Concept Learning by Jointly Exploiting Web Videos and Images. 923-932
- Fanyi Xiao, Yong Jae Lee:
Track and Segment: An Iterative Unsupervised Approach for Video Object Proposals. 933-942
- Gao Zhu, Fatih Porikli, Hongdong Li:
Beyond Local Search: Tracking Objects Everywhere with Instance-Specific Proposals. 943-951
- Hongkai Yu, Youjie Zhou, Jeff P. Simmons, Craig P. Przybyla, Yuewei Lin, Xiaochuan Fan, Yang Mi, Song Wang:
Groupwise Tracking of Crowded Similar-Appearance Targets from Low-Continuity Image Sequences. 952-960
- Alexandre Alahi, Kratarth Goel, Vignesh Ramanathan, Alexandre Robicquet, Li Fei-Fei, Silvio Savarese:
Social LSTM: Human Trajectory Prediction in Crowded Spaces. 961-971
- Andrii Maksai, Xinchao Wang, Pascal Fua:
What Players do with the Ball: A Physically Constrained Interaction Modeling. 972-981
- Ting Yao, Tao Mei, Yong Rui:
Highlight Detection with Pairwise Deep Ranking for First-Person Video Summarization. 982-990
Poster Session P1-2
- Bugra Tekin, Artem Rozantsev, Vincent Lepetit, Pascal Fua:
Direct Prediction of 3D Body Poses from Motion Compensated Sequences. 991-1000
- Michael Gygli, Yale Song, Liangliang Cao:
Video2GIF: Automatic Generation of Animated GIFs from Video. 1001-1009
- Amir Shahroudy, Jun Liu, Tian-Tsong Ng, Gang Wang:
NTU RGB+D: A Large Scale Dataset for 3D Human Activity Analysis. 1010-1019
- Bingbing Ni, Xiaokang Yang, Shenghua Gao:
Progressively Parsing Interactional Objects for Fine Grained Action Detection. 1020-1028
- Pingbo Pan, Zhongwen Xu, Yi Yang, Fei Wu, Yueting Zhuang:
Hierarchical Recurrent Neural Encoder for Video Representation with Application to Captioning. 1029-1038
- Jingjing Meng, Hongxing Wang, Junsong Yuan, Yap-Peng Tan:
From Keyframes to Key Objects: Video Summarization by Representative Object Proposal Selection. 1039-1048
- Zheng Shou, Dongang Wang, Shih-Fu Chang:
Temporal Action Localization in Untrimmed Videos via Multi-stage CNNs. 1049-1058
- Ke Zhang, Wei-Lun Chao, Fei Sha, Kristen Grauman:
Summary Transfer: Exemplar-Based Subset Selection for Video Summarization. 1059-1067
- Yeong Jun Koh, Won-Dong Jang, Chang-Su Kim:
POD: Discovering Primary Objects in Videos Based on Evolutionary Refinement of Object Recurrence, Background, and Primary Object Models. 1068-1076
- Waqas Sultani, Mubarak Shah:
What If We Do Not have Multiple Videos of the Same Action? - Video Action Localization Using Web Images. 1077-1085
- Lu Zhang, Hayley Hung:
Beyond F-Formations: Determining Social Involvement in Free Standing Conversing Groups from Static Images. 1086-1095
- Ziwei Liu, Ping Luo, Shi Qiu, Xiaogang Wang, Xiaoou Tang:
DeepFashion: Powering Robust Clothes Recognition and Retrieval with Rich Annotations. 1096-1104
- Hua Zhang, Si Liu, Changqing Zhang, Wenqi Ren, Rui Wang, Xiaochun Cao:
SketchNet: Sketch Classification with Web Images. 1105-1113
- Xiaofan Zhang, Feng Zhou, Yuanqing Lin, Shaoting Zhang:
Embedding Label Structures for Fine-Grained Feature Representation. 1114-1123
- Feng Zhou, Yuanqing Lin:
Fine-Grained Image Classification by Exploring Bipartite-Graph Labels. 1124-1133
- Xiaopeng Zhang, Hongkai Xiong, Wengang Zhou, Weiyao Lin, Qi Tian:
Picking Deep Filter Responses for Fine-Grained Image Recognition. 1134-1142