CVPR 2017 Papers

Reposted from: https://blog.csdn.net/u010510350/article/details/77218879

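I have been reading CVPR 2017 papers recently, so I took the opportunity to organize the CVPR 2017 paper list and share it here. For more computer vision papers, see Computer Vision Foundation open access and CVPapers.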

Machine Learning 1

Spotlight 1-1A

Exclusivity-Consistency Regularized Multi-View Subspace Clustering 
Xiaojie Guo, Xiaobo Wang, Zhen Lei, Changqing Zhang, Stan Z. Li 
Borrowing Treasures From the Wealthy: Deep Transfer Learning Through Selective Joint Fine-Tuning 
Weifeng Ge, Yizhou Yu 
The More You Know: Using Knowledge Graphs for Image Classification 
Kenneth Marino, Ruslan Salakhutdinov, Abhinav Gupta 
Dynamic Edge-Conditioned Filters in Convolutional Neural Networks on Graphs 
Martin Simonovsky, Nikos Komodakis 
Convolutional Neural Network Architecture for Geometric Matching 
Ignacio Rocco, Relja Arandjelović, Josef Sivic 
Deep Affordance-Grounded Sensorimotor Object Recognition 
Spyridon Thermos, Georgios Th. Papadopoulos, Petros Daras, Gerasimos Potamianos 
Discovering Causal Signals in Images 
David Lopez-Paz, Robert Nishihara, Soumith Chintala, Bernhard Schölkopf, Léon Bottou 
On Compressing Deep Models by Low Rank and Sparse Decomposition 
Xiyu Yu, Tongliang Liu, Xinchao Wang, Dacheng Tao

Oral 1-1A

PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation 
Charles R. Qi, Hao Su, Kaichun Mo, Leonidas J. Guibas 
Universal Adversarial Perturbations 
Seyed-Mohsen Moosavi-Dezfooli, Alhussein Fawzi, Omar Fawzi, Pascal Frossard 
Unsupervised Pixel-Level Domain Adaptation With Generative Adversarial Networks 
Konstantinos Bousmalis, Nathan Silberman, David Dohan, Dumitru Erhan, Dilip Krishnan 
Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network
Christian Ledig, Lucas Theis, Ferenc Huszár, Jose Caballero, Andrew Cunningham, Alejandro Acosta, Andrew Aitken, Alykhan Tejani, Johannes Totz, Zehan Wang, Wenzhe Shi

3D Vision 1

Spotlight 1-1B

Context-Aware Captions From Context-Agnostic Supervision 
Ramakrishna Vedantam, Samy Bengio, Kevin Murphy, Devi Parikh, Gal Chechik 
Global Hypothesis Generation for 6D Object Pose Estimation
Frank Michel, Alexander Kirillov, Eric Brachmann, Alexander Krull, Stefan Gumhold, Bogdan Savchynskyy, Carsten Rother 
A Practical Method for Fully Automatic Intrinsic Camera Calibration Using Directionally Encoded Light 
Mahdi Abbaspour Tehrani, Thabo Beeler, Anselm Grundhöfer 
CATS: A Color and Thermal Stereo Benchmark 
Wayne Treible, Philip Saponaro, Scott Sorensen, Abhishek Kolagunda, Michael O’Neal, Brian Phelan, Kelly Sherbondy, Chandra Kambhamettu 
Elastic Shape-From-Template With Spatially Sparse Deforming Forces 
Abed Malti, Cédric Herzet 
Distinguishing the Indistinguishable: Exploring Structural Ambiguities via Geodesic Context 
Qingan Yan, Long Yang, Ling Zhang, Chunxia Xiao 
Multi-Scale Continuous CRFs as Sequential Deep Networks for Monocular Depth Estimation 
Dan Xu, Elisa Ricci, Wanli Ouyang, Xiaogang Wang, Nicu Sebe 
Dynamic Time-Of-Flight 
Michael Schober, Amit Adam, Omer Yair, Shai Mazor, Sebastian Nowozin

Oral 1-1B

Semantic Scene Completion From a Single Depth Image 
Shuran Song, Fisher Yu, Andy Zeng, Angel X. Chang, Manolis Savva, Thomas Funkhouser 
3DMatch: Learning Local Geometric Descriptors From RGB-D Reconstructions 
Andy Zeng, Shuran Song, Matthias Nießner, Matthew Fisher, Jianxiong Xiao, Thomas Funkhouser 
Multi-View Supervision for Single-View Reconstruction via Differentiable Ray Consistency
Shubham Tulsiani, Tinghui Zhou, Alexei A. Efros, Jitendra Malik 
On-The-Fly Adaptation of Regression Forests for Online Camera Relocalisation
Tommaso Cavallari, Stuart Golodetz, Nicholas A. Lord, Julien Valentin, Luigi Di Stefano, Philip H. S. Torr

Low- & Mid-Level Vision

Spotlight 1-1C

Designing Effective Inter-Pixel Information Flow for Natural Image Matting 
Yağiz Aksoy, Tunç Ozan Aydin, Marc Pollefeys
Deep Video Deblurring for Hand-Held Cameras 
Shuochen Su, Mauricio Delbracio, Jue Wang, Guillermo Sapiro, Wolfgang Heidrich, Oliver Wang 
Instance-Level Salient Object Segmentation 
Guanbin Li, Yuan Xie, Liang Lin, Yizhou Yu 
Deep Multi-Scale Convolutional Neural Network for Dynamic Scene Deblurring 
Seungjun Nah, Tae Hyun Kim, Kyoung Mu Lee 
Diversified Texture Synthesis With Feed-Forward Networks 
Yijun Li, Chen Fang, Jimei Yang, Zhaowen Wang, Xin Lu, Ming-Hsuan Yang 
Radiometric Calibration for Internet Photo Collections
Zhipeng Mo, Boxin Shi, Sai-Kit Yeung, Yasuyuki Matsushita 
Deeply Aggregated Alternating Minimization for Image Restoration 
Youngjung Kim, Hyungjoo Jung, Dongbo Min, Kwanghoon Sohn 
End-To-End Instance Segmentation With Recurrent Attention 
Mengye Ren, Richard S. Zemel

Oral 1-1C

SRN: Side-output Residual Network for Object Symmetry Detection in the Wild 
Wei Ke, Jie Chen, Jianbin Jiao, Guoying Zhao, Qixiang Ye 
Deep Image Matting
Ning Xu, Brian Price, Scott Cohen, Thomas Huang 
Wetness and Color From a Single Multispectral Image 
Mihoko Shimano, Hiroki Okawa, Yuta Asano, Ryoma Bise, Ko Nishino, Imari Sato 
FC4: Fully Convolutional Color Constancy With Confidence-Weighted Pooling 
Yuanming Hu, Baoyuan Wang, Stephen Lin

Poster 1-1

3D Computer Vision

Face Normals “In-The-Wild” Using Fully Convolutional Networks 
George Trigeorgis, Patrick Snape, Iasonas Kokkinos, Stefanos Zafeiriou 
A Non-Convex Variational Approach to Photometric Stereo Under Inaccurate Lighting 
Yvain Quéau, Tao Wu, François Lauze, Jean-Denis Durou, Daniel Cremers 
A Linear Extrinsic Calibration of Kaleidoscopic Imaging System From Single 3D Point 
Kosuke Takahashi, Akihiro Miyata, Shohei Nobuhara, Takashi Matsuyama 
Polarimetric Multi-View Stereo 
Zhaopeng Cui, Jinwei Gu, Boxin Shi, Ping Tan, Jan Kautz 
An Exact Penalty Method for Locally Convergent Maximum Consensus
Huu Le, Tat-Jun Chin, David Suter 
Deep Supervision With Shape Concepts for Occlusion-Aware 3D Object Parsing 
Chi Li, M. Zeeshan Zia, Quoc-Huy Tran, Xiang Yu, Gregory D. Hager, Manmohan Chandraker 
Amodal Detection of 3D Objects: Inferring 3D Bounding Boxes From 2D Ones in RGB-Depth Images 
Zhuo Deng, Longin Jan Latecki

Analyzing Humans in Images

Transition Forests: Learning Discriminative Temporal Transitions for Action Recognition and Detection 
Guillermo Garcia-Hernando, Tae-Kyun Kim 
Scene Flow to Action Map: A New Representation for RGB-D Based Action Recognition With Convolutional Neural Networks 
Pichao Wang, Wanqing Li, Zhimin Gao, Yuyao Zhang, Chang Tang, Philip Ogunbona 
Detecting Masked Faces in the Wild With LLE-CNNs 
Shiming Ge, Jia Li, Qiting Ye, Zhao Luo 
A Domain Based Approach to Social Relation Recognition 
Qianru Sun, Bernt Schiele, Mario Fritz 
Spatio-Temporal Naive-Bayes Nearest-Neighbor (ST-NBNN) for Skeleton-Based Action Recognition 
Junwu Weng, Chaoqun Weng, Junsong Yuan 
Personalizing Gesture Recognition Using Hierarchical Bayesian Neural Networks 
Ajjen Joshi, Soumya Ghosh, Margrit Betke, Stan Sclaroff, Hanspeter Pfister

Applications

Real-Time 3D Model Tracking in Color and Depth on a Single CPU Core 
Wadim Kehl, Federico Tombari, Slobodan Ilic, Nassir Navab 
Multi-Scale FCN With Cascaded Instance Aware Segmentation for Arbitrary Oriented Word Spotting in the Wild 
Dafang He, Xiao Yang, Chen Liang, Zihan Zhou, Alexander G. Ororbia II, Daniel Kifer, C. Lee Giles
Viraliency: Pooling Local Virality 
Xavier Alameda-Pineda, Andrea Pilzer, Dan Xu, Nicu Sebe, Elisa Ricci

Biomedical Image/Video Analysis

A Non-Local Low-Rank Framework for Ultrasound Speckle Reduction 
Lei Zhu, Chi-Wing Fu, Michael S. Brown, Pheng-Ann Heng

Image Motion & Tracking

Video Acceleration Magnification 
Silvia L. Pintea, Yichao Zhang, Jan C. van Gemert 
Superpixel-Based Tracking-By-Segmentation Using Markov Chains 
Donghun Yeo, Jeany Son, Bohyung Han, Joon Hee Han 
BranchOut: Regularization for Online Ensemble Tracking With Convolutional Neural Networks 
Bohyung Han, Jack Sim, Hartwig Adam 
Learning Motion Patterns in Videos 
Pavel Tokmakov, Karteek Alahari, Cordelia Schmid

Low- & Mid-Level Vision

Deep Level Sets for Salient Object Detection 
Ping Hu, Bing Shuai, Jun Liu, Gang Wang 
Binary Constraint Preserving Graph Matching 
Bo Jiang, Jin Tang, Chris Ding, Bin Luo 
From Local to Global: Edge Profiles to Camera Motion in Blurred Images 
Subeesh Vasu, A. N. Rajagopalan 
What Is the Space of Attenuation Coefficients in Underwater Computer Vision? 
Derya Akkaynak, Tali Treibitz, Tom Shlesinger, Yossi Loya, Raz Tamir, David Iluz 
Robust Energy Minimization for BRDF-Invariant Shape From Light Fields 
Zhengqin Li, Zexiang Xu, Ravi Ramamoorthi, Manmohan Chandraker 
Boundary-Aware Instance Segmentation 
Zeeshan Hayder, Xuming He, Mathieu Salzmann 
Spatially-Varying Blur Detection Based on Multiscale Fused and Sorted Transform Coefficients of Gradient Magnitudes 
S. Alireza Golestaneh, Lina J. Karam 
Model-Based Iterative Restoration for Binary Document Image Compression With Dictionary Learning 
Yandong Guo, Cheng Lu, Jan P. Allebach, Charles A. Bouman 
FCSS: Fully Convolutional Self-Similarity for Dense Semantic Correspondence 
Seungryong Kim, Dongbo Min, Bumsub Ham, Sangryul Jeon, Stephen Lin, Kwanghoon Sohn

Machine Learning

Learning by Association — A Versatile Semi-Supervised Training Method for Neural Networks 
Philip Haeusser, Alexander Mordvintsev, Daniel Cremers 
Dilated Residual Networks 
Fisher Yu, Vladlen Koltun, Thomas Funkhouser 
Split-Brain Autoencoders: Unsupervised Learning by Cross-Channel Prediction 
Richard Zhang, Phillip Isola, Alexei A. Efros 
Nonnegative Matrix Underapproximation for Robust Multiple Model Fitting 
Mariano Tepper, Guillermo Sapiro 
Truncated Max-Of-Convex Models 
Pankaj Pansari, M. Pawan Kumar 
Additive Component Analysis 
Calvin Murdock, Fernando De la Torre 
Subspace Clustering via Variance Regularized Ridge Regression 
Zhao Kang, Chong Peng, Qiang Cheng 
The Incremental Multiresolution Matrix Factorization Algorithm 
Vamsi K. Ithapu, Risi Kondor, Sterling C. Johnson, Vikas Singh 
Transformation-Grounded Image Generation Network for Novel 3D View Synthesis 
Eunbyung Park, Jimei Yang, Ersin Yumer, Duygu Ceylan, Alexander C. Berg 
Learning Dynamic Guidance for Depth Image Enhancement
Shuhang Gu, Wangmeng Zuo, Shi Guo, Yunjin Chen, Chongyu Chen, Lei Zhang 
A-Lamp: Adaptive Layout-Aware Multi-Patch Deep Convolutional Neural Network for Photo Aesthetic Assessment
Shuang Ma, Jing Liu, Chang Wen Chen 
Teaching Compositionality to CNNs 
Austin Stone, Huayan Wang, Michael Stark, Yi Liu, D. Scott Phoenix, Dileep George 
Using Ranking-CNN for Age Estimation 
Shixing Chen, Caojin Zhang, Ming Dong, Jialiang Le, Mike Rao 
Accurate Single Stage Detector Using Recurrent Rolling Convolution 
Jimmy Ren, Xiaohao Chen, Jianbo Liu, Wenxiu Sun, Jiahao Pang, Qiong Yan, Yu-Wing Tai, Li Xu 
A Compact DNN: Approaching GoogLeNet-Level Accuracy of Classification and Domain Adaptation 
Chunpeng Wu, Wei Wen, Tariq Afzal, Yongmei Zhang, Yiran Chen, Hai (Helen) Li 
The Impact of Typicality for Informative Representative Selection 
Jawadul H. Bappy, Sujoy Paul, Ertem Tuncel, Amit K. Roy-Chowdhury 
Infinite Variational Autoencoder for Semi-Supervised Learning 
M. Ehsan Abbasnejad, Anthony Dick, Anton van den Hengel 
SurfNet: Generating 3D Shape Surfaces Using Deep Residual Networks 
Ayan Sinha, Asim Unmesh, Qixing Huang, Karthik Ramani 
Intrinsic Grassmann Averages for Online Linear and Robust Subspace Learning 
Rudrasis Chakraborty, Søren Hauberg, Baba C. Vemuri 
Variational Bayesian Multiple Instance Learning With Gaussian Processes 
Manuel Haußmann, Fred A. Hamprecht, Melih Kandemir 
Temporal Attention-Gated Model for Robust Sequence Classification 
Wenjie Pei, Tadas Baltrušaitis, David M.J. Tax, Louis-Philippe Morency
Non-Uniform Subset Selection for Active Learning in Structured Data 
Sujoy Paul, Jawadul H. Bappy, Amit K. Roy-Chowdhury 
Colorization as a Proxy Task for Visual Understanding 
Gustav Larsson, Michael Maire, Gregory Shakhnarovich 
Shading Annotations in the Wild 
Balazs Kovacs, Sean Bell, Noah Snavely, Kavita Bala 
LCNN: Lookup-Based Convolutional Neural Network 
Hessam Bagherinezhad, Mohammad Rastegari, Ali Farhadi

Object Recognition & Scene Understanding

Physics Inspired Optimization on Semantic Transfer Features: An Alternative Method for Room Layout Estimation 
Hao Zhao, Ming Lu, Anbang Yao, Yiwen Guo, Yurong Chen, Li Zhang 
Pixelwise Instance Segmentation With a Dynamically Instantiated Network 
Anurag Arnab, Philip H. S. Torr 
Object Detection in Videos With Tubelet Proposal Networks 
Kai Kang, Hongsheng Li, Tong Xiao, Wanli Ouyang, Junjie Yan, Xihui Liu, Xiaogang Wang 
AMVH: Asymmetric Multi-Valued Hashing 
Cheng Da, Shibiao Xu, Kun Ding, Gaofeng Meng, Shiming Xiang, Chunhong Pan 
Spindle Net: Person Re-Identification With Human Body Region Guided Feature Decomposition and Fusion 
Haiyu Zhao, Maoqing Tian, Shuyang Sun, Jing Shao, Junjie Yan, Shuai Yi, Xiaogang Wang, Xiaoou Tang 
Deep Visual-Semantic Quantization for Efficient Image Retrieval 
Yue Cao, Mingsheng Long, Jianmin Wang, Shichen Liu 
Efficient Diffusion on Region Manifolds: Recovering Small Objects With Compact CNN Representations 
Ahmet Iscen, Giorgos Tolias, Yannis Avrithis, Teddy Furon, Ondřej Chum
Feature Pyramid Networks for Object Detection 
Tsung-Yi Lin, Piotr Dollár, Ross Girshick, Kaiming He, Bharath Hariharan, Serge Belongie 
Mind the Class Weight Bias: Weighted Maximum Mean Discrepancy for Unsupervised Domain Adaptation 
Hongliang Yan, Yukang Ding, Peihua Li, Qilong Wang, Yong Xu, Wangmeng Zuo 
StyleNet: Generating Attractive Visual Captions With Styles 
Chuang Gan, Zhe Gan, Xiaodong He, Jianfeng Gao, Li Deng 
Fine-Grained Recognition of Thousands of Object Categories With Single-Example Training 
Leonid Karlinsky, Joseph Shtok, Yochay Tzur, Asaf Tzadok 
Improving Interpretability of Deep Neural Networks With Semantic Information 
Yinpeng Dong, Hang Su, Jun Zhu, Bo Zhang 
Video Captioning With Transferred Semantic Attributes 
Yingwei Pan, Ting Yao, Houqiang Li, Tao Mei 
Fast Boosting Based Detection Using Scale Invariant Multimodal Multiresolution Filtered Features 
Arthur Daniel Costea, Robert Varga, Sergiu Nedevschi

Video Analytics

Temporal Convolutional Networks for Action Segmentation and Detection
Colin Lea, Michael D. Flynn, René Vidal, Austin Reiter, Gregory D. Hager 
Surveillance Video Parsing With Single Frame Supervision 
Si Liu, Changhu Wang, Ruihe Qian, Han Yu, Renda Bao, Yao Sun 
Weakly Supervised Actor-Action Segmentation via Robust Multi-Task Ranking 
Yan Yan, Chenliang Xu, Dawen Cai, Jason J. Corso 
Unsupervised Visual-Linguistic Reference Resolution in Instructional Videos 
De-An Huang, Joseph J. Lim, Li Fei-Fei, Juan Carlos Niebles 
Zero-Shot Action Recognition With Error-Correcting Output Codes 
Jie Qin, Li Liu, Ling Shao, Fumin Shen, Bingbing Ni, Jiaxin Chen, Yunhong Wang 
Enhancing Video Summarization via Vision-Language Embedding 
Bryan A. Plummer, Matthew Brown, Svetlana Lazebnik 
Synthesizing Dynamic Patterns by Spatial-Temporal Generative ConvNet 
Jianwen Xie, Song-Chun Zhu, Ying Nian Wu

Object Recognition & Scene Understanding - Computer Vision & Language

Discriminative Bimodal Networks for Visual Localization and Detection With Natural Language Queries 
Yuting Zhang, Luyao Yuan, Yijie Guo, Zhiyuan He, I-An Huang, Honglak Lee 
Automatic Understanding of Image and Video Advertisements 
Zaeem Hussain, Mingda Zhang, Xiaozhong Zhang, Keren Ye, Christopher Thomas, Zuha Agha, Nathan Ong, Adriana Kovashka 
Deep Sketch Hashing: Fast Free-Hand Sketch-Based Image Retrieval 
Li Liu, Fumin Shen, Yuming Shen, Xianglong Liu, Ling Shao 
Discover and Learn New Objects From Documentaries 
Kai Chen, Hang Song, Chen Change Loy, Dahua Lin 
Spatial-Semantic Image Search by Visual Feature Synthesis 
Long Mai, Hailin Jin, Zhe Lin, Chen Fang, Jonathan Brandt, Feng Liu 
Fully-Adaptive Feature Sharing in Multi-Task Networks With Applications in Person Attribute Classification 
Yongxi Lu, Abhishek Kumar, Shuangfei Zhai, Yu Cheng, Tara Javidi, Rogerio Feris 
Semantic Compositional Networks for Visual Captioning 
Zhe Gan, Chuang Gan, Xiaodong He, Yunchen Pu, Kenneth Tran, Jianfeng Gao, Lawrence Carin, Li Deng 
Training Object Class Detectors With Click Supervision 
Dim P. Papadopoulos, Jasper R. R. Uijlings, Frank Keller, Vittorio Ferrari

Oral 1-2A

Deep Reinforcement Learning-Based Image Captioning With Embedding Reward 
Zhou Ren, Xiaoyu Wang, Ning Zhang, Xutao Lv, Li-Jia Li 
From Red Wine to Red Tomato: Composition With Context 
Ishan Misra, Abhinav Gupta, Martial Hebert 
Captioning Images With Diverse Objects 
Subhashini Venugopalan, Lisa Anne Hendricks, Marcus Rohrbach, Raymond Mooney, Trevor Darrell, Kate Saenko 
Self-Critical Sequence Training for Image Captioning 
Steven J. Rennie, Etienne Marcheret, Youssef Mroueh, Jerret Ross, Vaibhava Goel

Analyzing Humans 1

Spotlight 1-2B

Crossing Nets: Combining GANs and VAEs With a Shared Latent Space for Hand Pose Estimation 
Chengde Wan, Thomas Probst, Luc Van Gool, Angela Yao 
Predicting Behaviors of Basketball Players From First Person Videos 
Shan Su, Jung Pyo Hong, Jianbo Shi, Hyun Soo Park 
LCR-Net: Localization-Classification-Regression for Human Pose 
Grégory Rogez, Philippe Weinzaepfel, Cordelia Schmid 
Learning Residual Images for Face Attribute Manipulation 
Wei Shen, Rujie Liu 
Seeing What Is Not There: Learning Context to Determine Where Objects Are Missing 
Jin Sun, David W. Jacobs 
Deep Learning on Lie Groups for Skeleton-Based Action Recognition 
Zhiwu Huang, Chengde Wan, Thomas Probst, Luc Van Gool 
Harvesting Multiple Views for Marker-Less 3D Human Pose Annotations 
Georgios Pavlakos, Xiaowei Zhou, Konstantinos G. Derpanis, Kostas Daniilidis 
Coarse-To-Fine Volumetric Prediction for Single-Image 3D Human Pose 
Georgios Pavlakos, Xiaowei Zhou, Konstantinos G. Derpanis, Kostas Daniilidis

Oral 1-2B

Weakly Supervised Action Learning With RNN Based Fine-To-Coarse Modeling 
Alexander Richard, Hilde Kuehne, Juergen Gall 
Disentangled Representation Learning GAN for Pose-Invariant Face Recognition 
Luan Tran, Xi Yin, Xiaoming Liu 
ArtTrack: Articulated Multi-Person Tracking in the Wild 
Eldar Insafutdinov, Mykhaylo Andriluka, Leonid Pishchulin, Siyu Tang, Evgeny Levinkov, Bjoern Andres, Bernt Schiele 
Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields
Zhe Cao, Tomas Simon, Shih-En Wei, Yaser Sheikh

Image Motion & Tracking; Video Analysis

Spotlight 1-2C

Template Matching With Deformable Diversity Similarity 
Itamar Talmi, Roey Mechrez, Lihi Zelnik-Manor 
Beyond Triplet Loss: A Deep Quadruplet Network for Person Re-Identification 
Weihua Chen, Xiaotang Chen, Jianguo Zhang, Kaiqi Huang 
Agent-Centric Risk Assessment: Accident Anticipation and Risky Region Localization 
Kuo-Hao Zeng, Shih-Han Chou, Fu-Hsiang Chan, Juan Carlos Niebles, Min Sun 
Bidirectional Multirate Reconstruction for Temporal Modeling in Videos 
Linchao Zhu, Zhongwen Xu, Yi Yang 
Action-Decision Networks for Visual Tracking With Deep Reinforcement Learning 
Sangdoo Yun, Jongwon Choi, Youngjoon Yoo, Kimin Yun, Jin Young Choi 
TGIF-QA: Toward Spatio-Temporal Reasoning in Visual Question Answering 
Yunseok Jang, Yale Song, Youngjae Yu, Youngjin Kim, Gunhee Kim 
Making 360° Video Watchable in 2D: Learning Videography for Click Free Viewing 
Yu-Chuan Su, Kristen Grauman 
Unsupervised Adaptive Re-Identification in Open World Dynamic Camera Networks 
Rameswar Panda, Amran Bhuiyan, Vittorio Murino, Amit K. Roy-Chowdhury

Oral 1-2C

Context-Aware Correlation Filter Tracking 
Matthias Mueller, Neil Smith, Bernard Ghanem 
Deep 360 Pilot: Learning a Deep Agent for Piloting Through 360° Sports Videos 
Hou-Ning Hu, Yen-Chen Lin, Ming-Yu Liu, Hsien-Tzu Cheng, Yung-Ju Chang, Min Sun 
Slow Flow: Exploiting High-Speed Cameras for Accurate and Diverse Optical Flow Reference Data 
Joel Janai, Fatma Güney, Jonas Wulff, Michael J. Black, Andreas Geiger 
CDC: Convolutional-De-Convolutional Networks for Precise Temporal Action Localization in Untrimmed Videos 
Zheng Shou, Jonathan Chan, Alireza Zareian, Kazuyuki Miyazawa, Shih-Fu Chang

Poster 1-2

3D Computer Vision

Exploiting 2D Floorplan for Building-Scale Panorama RGBD Alignment 
Erik Wijmans, Yasutaka Furukawa 
A Combinatorial Solution to Non-Rigid 3D Shape-To-Image Matching 
Florian Bernard, Frank R. Schmidt, Johan Thunberg, Daniel Cremers 
NID-SLAM: Robust Monocular SLAM Using Normalised Information Distance 
Geoffrey Pascoe, Will Maddern, Michael Tanner, Pedro Piniés, Paul Newman 
End-To-End Training of Hybrid CNN-CRF Models for Stereo 
Patrick Knöbelreiter, Christian Reinbacher, Alexander Shekhovtsov, Thomas Pock 
Learning Shape Abstractions by Assembling Volumetric Primitives
Shubham Tulsiani, Hao Su, Leonidas J. Guibas, Alexei A. Efros, Jitendra Malik 
Locality-Sensitive Deconvolution Networks With Gated Fusion for RGB-D Indoor Semantic Segmentation 
Yanhua Cheng, Rui Cai, Zhiwei Li, Xin Zhao, Kaiqi Huang 
Acquiring Axially-Symmetric Transparent Objects Using Single-View Transmission Imaging
Jaewon Kim, Ilya Reshetouski, Abhijeet Ghosh 
Regressing Robust and Discriminative 3D Morphable Models With a Very Deep Neural Network 
Anh Tuấn Trần, Tal Hassner, Iacopo Masi, Gérard Medioni 
End-To-End 3D Face Reconstruction With Deep Neural Networks 
Pengfei Dou, Shishir K. Shah, Ioannis A. Kakadiaris 
DUST: Dual Union of Spatio-Temporal Subspaces for Monocular Multiple Object 3D Reconstruction 
Antonio Agudo, Francesc Moreno-Noguer

Analyzing Humans in Images

Finding Tiny Faces 
Peiyun Hu, Deva Ramanan 
Dynamic Facial Analysis: From Bayesian Filtering to Recurrent Neural Network 
Jinwei Gu, Xiaodong Yang, Shalini De Mello, Jan Kautz 
Deep Temporal Linear Encoding Networks
Ali Diba, Vivek Sharma, Luc Van Gool 
Joint Registration and Representation Learning for Unconstrained Face Identification
Munawar Hayat, Salman H. Khan, Naoufel Werghi, Roland Goecke 
3D Human Pose Estimation From a Single Image via Distance Matrix Regression 
Francesc Moreno-Noguer 
One-Shot Metric Learning for Person Re-Identification 
Slawomir Bąk, Peter Carr
Generalized Rank Pooling for Activity Recognition 
Anoop Cherian, Basura Fernando, Mehrtash Harandi, Stephen Gould 
Deep Representation Learning for Human Motion Prediction and Classification 
Judith Bütepage, Michael J. Black, Danica Kragic, Hedvig Kjellström 
Interspecies Knowledge Transfer for Facial Keypoint Detection 
Maheen Rashid, Xiuye Gu, Yong Jae Lee 
Recurrent Convolutional Neural Networks for Continuous Sign Language Recognition by Staged Optimization 
Runpeng Cui, Hu Liu, Changshui Zhang

Applications

Modeling Sub-Event Dynamics in First-Person Action Recognition 
Hasan F. M. Zaki, Faisal Shafait, Ajmal Mian

Computational Photography

Turning an Urban Scene Video Into a Cinemagraph 
Hang Yan, Yebin Liu, Yasutaka Furukawa 
Light Field Reconstruction Using Deep Convolutional Network on EPI 
Gaochang Wu, Mandan Zhao, Liangyong Wang, Qionghai Dai, Tianyou Chai, Yebin Liu

Image Motion & Tracking

FlowNet 2.0: Evolution of Optical Flow Estimation With Deep Networks 
Eddy Ilg, Nikolaus Mayer, Tonmoy Saikia, Margret Keuper, Alexey Dosovitskiy, Thomas Brox

Low- & Mid-Level Vision

Attention-Aware Face Hallucination via Deep Reinforcement Learning 
Qingxing Cao, Liang Lin, Yukai Shi, Xiaodan Liang, Guanbin Li 
Simple Does It: Weakly Supervised Instance and Semantic Segmentation 
Anna Khoreva, Rodrigo Benenson, Jan Hosang, Matthias Hein, Bernt Schiele 
Anti-Glare: Tightly Constrained Optimization for Eyeglass Reflection Removal 
Tushar Sandhan, Jin Young Choi 
Deep Joint Rain Detection and Removal From a Single Image 
Wenhan Yang, Robby T. Tan, Jiashi Feng, Jiaying Liu, Zongming Guo, Shuicheng Yan 
Radiometric Calibration From Faces in Images 
Chen Li, Stephen Lin, Kun Zhou, Katsushi Ikeuchi 
Webly Supervised Semantic Segmentation 
Bin Jin, Maria V. Ortiz Segovia, Sabine Süsstrunk 
Removing Rain From Single Images via a Deep Detail Network 
Xueyang Fu, Jiabin Huang, Delu Zeng, Yue Huang, Xinghao Ding, John Paisley 
Deep Crisp Boundaries 
Yupei Wang, Xin Zhao, Kaiqi Huang 
Coarse-To-Fine Segmentation With Shape-Tailored Continuum Scale Spaces 
Naeemullah Khan, Byung-Woo Hong, Anthony Yezzi, Ganesh Sundaramoorthi 
Large Kernel Matters — Improve Semantic Segmentation by Global Convolutional Network 
Chao Peng, Xiangyu Zhang, Gang Yu, Guiming Luo, Jian Sun 
Single Image Reflection Suppression 
Nikolaos Arvanitopoulos, Radhakrishna Achanta, Sabine Süsstrunk 
CASENet: Deep Category-Aware Semantic Edge Detection 
Zhiding Yu, Chen Feng, Ming-Yu Liu, Srikumar Ramalingam 
Reflectance Adaptive Filtering Improves Intrinsic Image Estimation 
Thomas Nestmeyer, Peter V. Gehler

Machine Learning

Conditional Similarity Networks 
Andreas Veit, Serge Belongie, Theofanis Karaletsos 
Spatially Adaptive Computation Time for Residual Networks 
Michael Figurnov, Maxwell D. Collins, Yukun Zhu, Li Zhang, Jonathan Huang, Dmitry Vetrov, Ruslan Salakhutdinov 
Xception: Deep Learning With Depthwise Separable Convolutions 
François Chollet 
Feedback Networks 
Amir R. Zamir, Te-Lin Wu, Lin Sun, William B. Shen, Bertram E. Shi, Jitendra Malik, Silvio Savarese 
Online Summarization via Submodular and Convex Optimization 
Ehsan Elhamifar, M. Clara De Paolis Kaluza 
Deep MANTA: A Coarse-To-Fine Many-Task Network for Joint 2D and 3D Vehicle Analysis From Monocular Image 
Florian Chabot, Mohamed Chaouch, Jaonary Rabarisoa, Céline Teulière, Thierry Chateau 
Improving Pairwise Ranking for Multi-Label Image Classification 
Yuncheng Li, Yale Song, Jiebo Luo 
Active Convolution: Learning the Shape of Convolution for Image Classification 
Yunho Jeon, Junmo Kim 
Linking Image and Text With 2-Way Nets 
Aviv Eisenschtat, Lior Wolf 
Stacked Generative Adversarial Networks 
Xun Huang, Yixuan Li, Omid Poursaeed, John Hopcroft, Serge Belongie 
Image Splicing Detection via Camera Response Function Analysis 
Can Chen, Scott McCloskey, Jingyi Yu 
Building a Regular Decision Boundary With Deep Networks 
Edouard Oyallon 
More Is Less: A More Complicated Network With Less Inference Complexity 
Xuanyi Dong, Junshi Huang, Yi Yang, Shuicheng Yan 
Joint Graph Decomposition and Node Labeling: Problem, Algorithms, Applications 
Evgeny Levinkov, Jonas Uhrig, Siyu Tang, Mohamed Omran, Eldar Insafutdinov, Alexander Kirillov, Carsten Rother, Thomas Brox, Bernt Schiele, Bjoern Andres 
Scale-Aware Face Detection 
Zekun Hao, Yu Liu, Hongwei Qin, Junjie Yan, Xiu Li, Xiaolin Hu 
Deep Unsupervised Similarity Learning Using Partially Ordered Sets 
Miguel A. Bautista, Artsiom Sanakoyeu, Björn Ommer 
Generative Hierarchical Learning of Sparse FRAME Models 
Jianwen Xie, Yifei Xu, Erik Nijkamp, Ying Nian Wu, Song-Chun Zhu

Object Recognition & Scene Understanding

Generating Holistic 3D Scene Abstractions for Text-Based Image Retrieval 
Ang Li, Jin Sun, Joe Yue-Hei Ng, Ruichi Yu, Vlad I. Morariu, Larry S. Davis 
Perceptual Generative Adversarial Networks for Small Object Detection 
Jianan Li, Xiaodan Liang, Yunchao Wei, Tingfa Xu, Jiashi Feng, Shuicheng Yan
Emotion Recognition in Context
Ronak Kosti, Jose M. Alvarez, Adria Recasens, Agata Lapedriza 
Deep Learning of Human Visual Sensitivity in Image Quality Assessment Framework 
Jongyoo Kim, Sanghoon Lee 
Dense Captioning With Joint Inference and Visual Context 
Linjie Yang, Kevin Tang, Jianchao Yang, Li-Jia Li 
CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning 
Justin Johnson, Bharath Hariharan, Laurens van der Maaten, Li Fei-Fei, C. Lawrence Zitnick, Ross Girshick 
Cross-View Image Matching for Geo-Localization in Urban Environments 
Yicong Tian, Chen Chen, Mubarak Shah 
Matrix Tri-Factorization With Manifold Regularizations for Zero-Shot Learning 
Xing Xu, Fumin Shen, Yang Yang, Dongxiang Zhang, Heng Tao Shen, Jingkuan Song 
Self-Supervised Learning of Visual Features Through Embedding Images Into Text Topic Spaces 
Lluis Gomez, Yash Patel, Marçal Rusiñol, Dimosthenis Karatzas, C. V. Jawahar 
Learning Spatial Regularization With Image-Level Supervisions for Multi-Label Image Classification 
Feng Zhu, Hongsheng Li, Wanli Ouyang, Nenghai Yu, Xiaogang Wang 
Semantically Consistent Regularization for Zero-Shot Recognition 
Pedro Morgado, Nuno Vasconcelos 
Can Walking and Measuring Along Chord Bunches Better Describe Leaf Shapes? 
Bin Wang, Yongsheng Gao, Changming Sun, Michael Blumenstein, John La Salle

Video Analytics

Self-Learning Scene-Specific Pedestrian Detectors Using a Progressive Latent Model 
Qixiang Ye, Tianliang Zhang, Wei Ke, Qiang Qiu, Jie Chen, Guillermo Sapiro, Baochang Zhang 
Predictive-Corrective Networks for Action Detection
Achal Dave, Olga Russakovsky, Deva Ramanan 
Budget-Aware Deep Semantic Video Segmentation 
Behrooz Mahasseni, Sinisa Todorovic, Alan Fern 
Unified Embedding and Metric Learning for Zero-Exemplar Event Detection 
Noureldien Hussein, Efstratios Gavves, Arnold W.M. Smeulders 
Spatiotemporal Pyramid Network for Video Action Recognition 
Yunbo Wang, Mingsheng Long, Jianmin Wang, Philip S. Yu 
ER3: A Unified Framework for Event Retrieval, Recognition and Recounting 
Zhanning Gao, Gang Hua, Dongqing Zhang, Nebojsa Jojic, Le Wang, Jianru Xue, Nanning Zheng 
FusionSeg: Learning to Combine Motion and Appearance for Fully Automatic Segmentation of Generic Objects in Videos 
Suyog Dutt Jain, Bo Xiong, Kristen Grauman 
Query-Focused Video Summarization: Dataset, Evaluation, and a Memory Network Based Approach 
Aidean Sharghi, Jacob S. Laurel, Boqing Gong 
Flexible Spatio-Temporal Networks for Video Prediction 
Chaochao Lu, Michael Hirsch, Bernhard Schölkopf 
Temporal Action Co-Segmentation in 3D Motion Capture Data and Videos 
Konstantinos Papoutsakis, Costas Panagiotakis, Antonis A. Argyros

Machine Learning 2

Spotlight 2-1A

Dual Attention Networks for Multimodal Reasoning and Matching 
Hyeonseob Nam, Jung-Woo Ha, Jeonghee Kim 
DESIRE: Distant Future Prediction in Dynamic Scenes With Interacting Agents 
Namhoon Lee, Wongun Choi, Paul Vernaza, Christopher B. Choy, Philip H. S. Torr, Manmohan Chandraker 
Interpretable Structure-Evolving LSTM 
Xiaodan Liang, Liang Lin, Xiaohui Shen, Jiashi Feng, Shuicheng Yan, Eric P. Xing 
ShapeOdds: Variational Bayesian Learning of Generative Shape Models 
Shireen Elhabian, Ross Whitaker 
Fast Video Classification via Adaptive Cascading of Deep Models 
Haichen Shen, Seungyeop Han, Matthai Philipose, Arvind Krishnamurthy 
Deep Metric Learning via Facility Location 
Hyun Oh Song, Stefanie Jegelka, Vivek Rathod, Kevin Murphy 
Semi-Supervised Deep Learning for Monocular Depth Map Prediction 
Yevhen Kuznietsov, Jörg Stückler, Bastian Leibe 
Weakly Supervised Semantic Segmentation Using Web-Crawled Videos 
Seunghoon Hong, Donghun Yeo, Suha Kwak, Honglak Lee, Bohyung Han

Oral 2-1A

Making Deep Neural Networks Robust to Label Noise: A Loss Correction Approach 
Giorgio Patrini, Alessandro Rozza, Aditya Krishna Menon, Richard Nock, Lizhen Qu 
Learning From Simulated and Unsupervised Images Through Adversarial Training 
Ashish Shrivastava, Tomas Pfister, Oncel Tuzel, Joshua Susskind, Wenda Wang, Russell Webb 
Inverse Compositional Spatial Transformer Networks 
Chen-Hsuan Lin, Simon Lucey 
Densely Connected Convolutional Networks 
Gao Huang, Zhuang Liu, Laurens van der Maaten, Kilian Q. Weinberger

Computational Photography

Spotlight 2-1B

Visual Dialog 
Abhishek Das, Satwik Kottur, Khushi Gupta, Avi Singh, Deshraj Yadav, José M. F. Moura, Devi Parikh, Dhruv Batra 
Video Frame Interpolation via Adaptive Convolution 
Simon Niklaus, Long Mai, Feng Liu 
FastMask: Segment Multi-Scale Object Candidates in One Shot 
Hexiang Hu, Shiyi Lan, Yuning Jiang, Zhimin Cao, Fei Sha 
Reconstructing Transient Images From Single-Photon Sensors 
Matthew O’Toole, Felix Heide, David B. Lindell, Kai Zang, Steven Diamond, Gordon Wetzstein 
DeshadowNet: A Multi-Context Embedding Deep Network for Shadow Removal 
Liangqiong Qu, Jiandong Tian, Shengfeng He, Yandong Tang, Rynson W. H. Lau 
Illuminant-Camera Communication to Observe Moving Objects Under Strong External Light by Spread Spectrum Modulation 
Ryusuke Sagawa, Yutaka Satoh 
Photorealistic Facial Texture Inference Using Deep Neural Networks 
Shunsuke Saito, Lingyu Wei, Liwen Hu, Koki Nagano, Hao Li 
The Geometry of First-Returning Photons for Non-Line-Of-Sight Imaging 
Chia-Yin Tsai, Kiriakos N. Kutulakos, Srinivasa G. Narasimhan, Aswin C. Sankaranarayanan

Oral 2-1B

Unrolling the Shutter: CNN to Correct Motion Distortions 
Vijay Rengarajan, Yogesh Balaji, A. N. Rajagopalan 
Light Field Blind Motion Deblurring 
Pratul P. Srinivasan, Ren Ng, Ravi Ramamoorthi 
Computational Imaging on the Electric Grid 
Mark Sheinin, Yoav Y. Schechner, Kiriakos N. Kutulakos 
Deep Outdoor Illumination Estimation 
Yannick Hold-Geoffroy, Kalyan Sunkavalli, Sunil Hadap, Emiliano Gambaretto, Jean-François Lalonde

3D Vision 2

Spotlight 2-1C

Efficient Solvers for Minimal Problems by Syzygy-Based Reduction 
Viktor Larsson, Kalle Åström, Magnus Oskarsson 
HSfM: Hybrid Structure-from-Motion 
Hainan Cui, Xiang Gao, Shuhan Shen, Zhanyi Hu 
Efficient Global Point Cloud Alignment Using Bayesian Nonparametric Mixtures 
Julian Straub, Trevor Campbell, Jonathan P. How, John W. Fisher III 
A New Rank Constraint on Multi-View Fundamental Matrices, and Its Application to Camera Location Recovery 
Soumyadip Sengupta, Tal Amir, Meirav Galun, Tom Goldstein, David W. Jacobs, Amit Singer, Ronen Basri 
IM2CAD 
Hamid Izadinia, Qi Shan, Steven M. Seitz 
ScanNet: Richly-Annotated 3D Reconstructions of Indoor Scenes 
Angela Dai, Angel X. Chang, Manolis Savva, Maciej Halber, Thomas Funkhouser, Matthias Nießner 
Noise Robust Depth From Focus Using a Ring Difference Filter 
Jaeheung Surh, Hae-Gon Jeon, Yunwon Park, Sunghoon Im, Hyowon Ha, In So Kweon 
Group-Wise Point-Set Registration Based on Rényi’s Second Order Entropy 
Luis G. Sanchez Giraldo, Erion Hasanbelliu, Murali Rao, Jose C. Principe

Oral 2-1C

A Point Set Generation Network for 3D Object Reconstruction From a Single Image 
Haoqiang Fan, Hao Su, Leonidas J. Guibas 
3D Point Cloud Registration for Localization Using a Deep Neural Network Auto-Encoder 
Gil Elbaz, Tamar Avraham, Anath Fischer 
Flight Dynamics-Based Recovery of a UAV Trajectory Using Ground Cameras 
Artem Rozantsev, Sudipta N. Sinha, Debadeepta Dey, Pascal Fua 
DSAC - Differentiable RANSAC for Camera Localization 
Eric Brachmann, Alexander Krull, Sebastian Nowozin, Jamie Shotton, Frank Michel, Stefan Gumhold, Carsten Rother

Poster 2-1

3D Computer Vision

Scalable Surface Reconstruction From Point Clouds With Extreme Scale and Density Diversity 
Christian Mostegel, Rudolf Prettenthaler, Friedrich Fraundorfer, Horst Bischof 
Synthesizing 3D Shapes via Modeling Multi-View Depth Maps and Silhouettes With Deep Generative Networks 
Amir Arsalan Soltani, Haibin Huang, Jiajun Wu, Tejas D. Kulkarni, Joshua B. Tenenbaum 
General Models for Rational Cameras and the Case of Two-Slit Projections 
Matthew Trager, Bernd Sturmfels, John Canny, Martial Hebert, Jean Ponce 
Accurate Depth and Normal Maps From Occlusion-Aware Focal Stack Symmetry 
Michael Strecke, Anna Alperovich, Bastian Goldluecke 
A Multi-View Stereo Benchmark With High-Resolution Images and Multi-Camera Videos 
Thomas Schöps, Johannes L. Schönberger, Silvano Galliani, Torsten Sattler, Konrad Schindler, Marc Pollefeys, Andreas Geiger 
Non-Contact Full Field Vibration Measurement Based on Phase-Shifting 
Hiroyuki Kayaba, Yuji Kokumai 
A Minimal Solution for Two-View Focal-Length Estimation Using Two Affine Correspondences 
Daniel Barath, Tekla Toth, Levente Hajder 
PoseAgent: Budget-Constrained 6D Object Pose Estimation via Reinforcement Learning 
Alexander Krull, Eric Brachmann, Sebastian Nowozin, Frank Michel, Jamie Shotton, Carsten Rother 
An Efficient Background Term for 3D Reconstruction and Tracking With Smooth Surface Models 
Mariano Jaimez, Thomas J. Cashman, Andrew Fitzgibbon, Javier Gonzalez-Jimenez, Daniel Cremers

Analyzing Humans in Images

Reliable Crowdsourcing and Deep Locality-Preserving Learning for Expression Recognition in the Wild 
Shan Li, Weihong Deng, JunPing Du 
Procedural Generation of Videos to Train Deep Action Recognition Networks 
César Roberto de Souza, Adrien Gaidon, Yohann Cabon, Antonio Manuel López 
BigHand2.2M Benchmark: Hand Pose Dataset and State of the Art Analysis 
Shanxin Yuan, Qi Ye, Björn Stenger, Siddhant Jain, Tae-Kyun Kim 
DenseReg: Fully Convolutional Dense Shape Regression In-The-Wild 
Rıza Alp Güler, George Trigeorgis, Epameinondas Antonakos, Patrick Snape, Stefanos Zafeiriou, Iasonas Kokkinos 
Adaptive Class Preserving Representation for Image Classification 
Jian-Xun Mi, Qiankun Fu, Weisheng Li

Applications

Generalized Semantic Preserving Hashing for N-Label Cross-Modal Retrieval 
Devraj Mandal, Kunal N. Chaudhury, Soma Biswas 
EAST: An Efficient and Accurate Scene Text Detector 
Xinyu Zhou, Cong Yao, He Wen, Yuzhi Wang, Shuchang Zhou, Weiran He, Jiajun Liang 
VidLoc: A Deep Spatio-Temporal Model for 6-DoF Video-Clip Relocalization 
Ronald Clark, Sen Wang, Andrew Markham, Niki Trigoni, Hongkai Wen

Biomedical Image/Video Analysis

Improving RANSAC-Based Segmentation Through CNN Encapsulation 
Dustin Morley, Hassan Foroosh

Computational Photography

Position Tracking for Virtual Reality Using Commodity WiFi 
Manikanta Kotaru, Sachin Katti 
Designing Illuminant Spectral Power Distributions for Surface Classification 
Henryk Blasinski, Joyce Farrell, Brian Wandell 
One-Shot Hyperspectral Imaging Using Faced Reflectors 
Tsuyoshi Takatani, Takahito Aoto, Yasuhiro Mukaigawa

Image Motion & Tracking

Direct Photometric Alignment by Mesh Deformation 
Kaimo Lin, Nianjuan Jiang, Shuaicheng Liu, Loong-Fah Cheong, Minh Do, Jiangbo Lu 
CNN-Based Patch Matching for Optical Flow With Thresholded Hinge Embedding Loss 
Christian Bailer, Kiran Varanasi, Didier Stricker 
Optical Flow Estimation Using a Spatial Pyramid Network 
Anurag Ranjan, Michael J. Black 
Deep Network Flow for Multi-Object Tracking 
Manmohan Chandraker, Paul Vernaza, Wongun Choi, Samuel Schulter

Low- & Mid-Level Vision

Material Classification Using Frequency- and Depth-Dependent Time-Of-Flight Distortion 
Kenichiro Tanaka, Yasuhiro Mukaigawa, Takuya Funatomi, Hiroyuki Kubo, Yasuyuki Matsushita, Yasushi Yagi 
Benchmarking Denoising Algorithms With Real Photographs 
Tobias Plötz, Stefan Roth 
A Unified Approach of Multi-Scale Deep and Hand-Crafted Features for Defocus Estimation 
Jinsun Park, Yu-Wing Tai, Donghyeon Cho, In So Kweon 
StyleBank: An Explicit Representation for Neural Image Style Transfer 
Dongdong Chen, Lu Yuan, Jing Liao, Nenghai Yu, Gang Hua 
Specular Highlight Removal in Facial Images 
Chen Li, Stephen Lin, Kun Zhou, Katsushi Ikeuchi 
Image Super-Resolution via Deep Recursive Residual Network 
Ying Tai, Jian Yang, Xiaoming Liu 
Deep Image Harmonization 
Yi-Hsuan Tsai, Xiaohui Shen, Zhe Lin, Kalyan Sunkavalli, Xin Lu, Ming-Hsuan Yang 
Learning Deep CNN Denoiser Prior for Image Restoration 
Kai Zhang, Wangmeng Zuo, Shuhang Gu, Lei Zhang 
A Novel Tensor-Based Video Rain Streaks Removal Approach via Utilizing Discriminatively Intrinsic Priors 
Tai-Xiang Jiang, Ting-Zhu Huang, Xi-Le Zhao, Liang-Jian Deng, Yao Wang 
GMS: Grid-based Motion Statistics for Fast, Ultra-Robust Feature Correspondence 
JiaWang Bian, Wen-Yan Lin, Yasuyuki Matsushita, Sai-Kit Yeung, Tan-Dat Nguyen, Ming-Ming Cheng 
Video Desnowing and Deraining Based on Matrix Decomposition 
Weihong Ren, Jiandong Tian, Zhi Han, Antoni Chan, Yandong Tang 
Real-Time Video Super-Resolution With Spatio-Temporal Networks and Motion Compensation 
Jose Caballero, Christian Ledig, Andrew Aitken, Alejandro Acosta, Johannes Totz, Zehan Wang, Wenzhe Shi 
Deep Watershed Transform for Instance Segmentation 
Min Bai, Raquel Urtasun 
AnchorNet: A Weakly Supervised Network to Learn Geometry-Sensitive Features for Semantic Matching 
David Novotny, Diane Larlus, Andrea Vedaldi 
Learning Diverse Image Colorization 
Aditya Deshpande, Jiajun Lu, Mao-Chuang Yeh, Min Jin Chong, David Forsyth 
Awesome Typography: Statistics-Based Text Effects Transfer 
Shuai Yang, Jiaying Liu, Zhouhui Lian, Zongming Guo

Machine Learning

Unsupervised Video Summarization With Adversarial LSTM Networks 
Behrooz Mahasseni, Michael Lam, Sinisa Todorovic 
Deep TEN: Texture Encoding Network 
Hang Zhang, Jia Xue, Kristin Dana 
Order-Preserving Wasserstein Distance for Sequence Matching 
Bing Su, Gang Hua 
A Dual Ascent Framework for Lagrangean Decomposition of Combinatorial Problems 
Paul Swoboda, Jan Kuske, Bogdan Savchynskyy 
Attend in Groups: A Weakly-Supervised Deep Learning Framework for Learning From Web Data 
Bohan Zhuang, Lingqiao Liu, Yao Li, Chunhua Shen, Ian Reid 
Hierarchical Multimodal Metric Learning for Multimodal Classification 
Heng Zhang, Vishal M. Patel, Rama Chellappa 
Efficient Linear Programming for Dense CRFs 
Thalaiyasingam Ajanthan, Alban Desmaison, Rudy Bunel, Mathieu Salzmann, Philip H. S. Torr, M. Pawan Kumar 
Variational Autoencoded Regression: High Dimensional Regression of Visual Data on Complex Manifold 
YoungJoon Yoo, Sangdoo Yun, Hyung Jin Chang, Yiannis Demiris, Jin Young Choi 
Learning Random-Walk Label Propagation for Weakly-Supervised Semantic Segmentation 
Paul Vernaza, Manmohan Chandraker 
Low-Rank-Sparse Subspace Representation for Robust Regression 
Yongqiang Zhang, Daming Shi, Junbin Gao, Dansong Cheng

Object Recognition & Scene Understanding

Generating the Future With Adversarial Transformers 
Carl Vondrick, Antonio Torralba 
Semantic Amodal Segmentation 
Yan Zhu, Yuandong Tian, Dimitris Metaxas, Piotr Dollár 
Learning a Deep Embedding Model for Zero-Shot Learning 
Li Zhang, Tao Xiang, Shaogang Gong 
BIND: Binary Integrated Net Descriptors for Texture-Less Object Recognition 
Jacob Chan, Jimmy Addison Lee, Qian Kemao 
Growing a Brain: Fine-Tuning by Increasing Model Capacity 
Yu-Xiong Wang, Deva Ramanan, Martial Hebert 
A-Fast-RCNN: Hard Positive Generation via Adversary for Object Detection 
Xiaolong Wang, Abhinav Shrivastava, Abhinav Gupta 
Multiple Instance Detection Network With Online Instance Classifier Refinement 
Peng Tang, Xinggang Wang, Xiang Bai, Wenyu Liu 
Kernel Pooling for Convolutional Neural Networks 
Yin Cui, Feng Zhou, Jiang Wang, Xiao Liu, Yuanqing Lin, Serge Belongie 
Learning Cross-Modal Embeddings for Cooking Recipes and Food Images 
Amaia Salvador, Nicholas Hynes, Yusuf Aytar, Javier Marin, Ferda Ofli, Ingmar Weber, Antonio Torralba 
Zero-Shot Learning - the Good, the Bad and the Ugly 
Yongqin Xian, Bernt Schiele, Zeynep Akata 
DeepNav: Learning to Navigate Large Cities 
Samarth Brahmbhatt, James Hays 
Scene Graph Generation by Iterative Message Passing 
Danfei Xu, Yuke Zhu, Christopher B. Choy, Li Fei-Fei 
Visual Translation Embedding Network for Visual Relation Detection 
Hanwang Zhang, Zawlin Kyaw, Shih-Fu Chang, Tat-Seng Chua 
Unsupervised Part Learning for Visual Recognition 
Ronan Sicre, Yannis Avrithis, Ewa Kijak, Frédéric Jurie 
Comprehension-Guided Referring Expressions 
Ruotian Luo, Gregory Shakhnarovich 
Top-Down Visual Saliency Guided by Captions 
Vasili Ramanishka, Abir Das, Jianming Zhang, Kate Saenko

Theory

Grassmannian Manifold Optimization Assisted Sparse Spectral Clustering 
Junbin Gao, Qiong Wang, Hong Li

Video Analytics

Video Propagation Networks 
Varun Jampani, Raghudeep Gadde, Peter V. Gehler 
ActionVLAD: Learning Spatio-Temporal Aggregation for Action Classification 
Rohit Girdhar, Deva Ramanan, Abhinav Gupta, Josef Sivic, Bryan Russell 
SCC: Semantic Context Cascade for Efficient Action Detection 
Fabian Caba Heilbron, Wayner Barrios, Victor Escorcia, Bernard Ghanem 
Hierarchical Boundary-Aware Neural Encoder for Video Captioning 
Lorenzo Baraldi, Costantino Grana, Rita Cucchiara 
HOPE: Hierarchical Object Prototype Encoding for Efficient Object Instance Search in Videos 
Tan Yu, Yuwei Wu, Junsong Yuan 
Spatio-Temporal Vector of Locally Max Pooled Features for Action Recognition in Videos 
Ionut Cosmin Duta, Bogdan Ionescu, Kiyoharu Aizawa, Nicu Sebe 
Temporal Action Localization by Structured Maximal Sums 
Zehuan Yuan, Jonathan C. Stroud, Tong Lu, Jia Deng 
Predicting Salient Face in Multiple-Face Videos 
Yufan Liu, Songyang Zhang, Mai Xu, Xuming He

Object Recognition & Scene Understanding 1

Spotlight 2-2A

Graph-Structured Representations for Visual Question Answering 
Damien Teney, Lingqiao Liu, Anton van den Hengel 
Knowing When to Look: Adaptive Attention via a Visual Sentinel for Image Captioning 
Jiasen Lu, Caiming Xiong, Devi Parikh, Richard Socher 
Learned Contextual Feature Reweighting for Image Geo-Localization 
Hyo Jin Kim, Enrique Dunn, Jan-Michael Frahm 
End-To-End Concept Word Detection for Video Captioning, Retrieval, and Question Answering 
Youngjae Yu, Hyungjin Ko, Jongwook Choi, Gunhee Kim 
Deep Cross-Modal Hashing 
Qing-Yuan Jiang, Wu-Jun Li 
Unambiguous Text Localization and Retrieval for Cluttered Scenes 
Xuejian Rong, Chucai Yi, Yingli Tian 
Bayesian Supervised Hashing 
Zihao Hu, Junxuan Chen, Hongtao Lu, Tongzhen Zhang 
Speed/Accuracy Trade-Offs for Modern Convolutional Object Detectors 
Jonathan Huang, Vivek Rathod, Chen Sun, Menglong Zhu, Anoop Korattikara, Alireza Fathi, Ian Fischer, Zbigniew Wojna, Yang Song, Sergio Guadarrama, Kevin Murphy

Oral 2-2A

Detecting Visual Relationships With Deep Relational Networks 
Bo Dai, Yuqi Zhang, Dahua Lin 
Full-Resolution Residual Networks for Semantic Segmentation in Street Scenes 
Tobias Pohlen, Alexander Hermans, Markus Mathias, Bastian Leibe 
Network Dissection: Quantifying Interpretability of Deep Visual Representations 
David Bau, Bolei Zhou, Aditya Khosla, Aude Oliva, Antonio Torralba 
AGA: Attribute-Guided Augmentation 
Mandar Dixit, Roland Kwitt, Marc Niethammer, Nuno Vasconcelos

Analyzing Humans 2

Spotlight 2-2B

A Hierarchical Approach for Generating Descriptive Image Paragraphs 
Jonathan Krause, Justin Johnson, Ranjay Krishna, Li Fei-Fei 
Person Re-Identification in the Wild 
Liang Zheng, Hengheng Zhang, Shaoyan Sun, Manmohan Chandraker, Yi Yang, Qi Tian 
Scalable Person Re-Identification on Supervised Smoothed Manifold 
Song Bai, Xiang Bai, Qi Tian 
Binge Watching: Scaling Affordance Learning From Sitcoms 
Xiaolong Wang, Rohit Girdhar, Abhinav Gupta 
Joint Detection and Identification Feature Learning for Person Search 
Tong Xiao, Shuang Li, Bochao Wang, Liang Lin, Xiaogang Wang 
Synthesizing Normalized Faces From Facial Identity Features 
Forrester Cole, David Belanger, Dilip Krishnan, Aaron Sarna, Inbar Mosseri, William T. Freeman 
Consistent-Aware Deep Learning for Person Re-Identification in a Camera Network 
Ji Lin, Liangliang Ren, Jiwen Lu, Jianjiang Feng, Jie Zhou 
Level Playing Field for Million Scale Face Recognition 
Aaron Nech, Ira Kemelmacher-Shlizerman

Oral 2-2B

Re-Sign: Re-Aligned End-To-End Sequence Modelling With Deep Recurrent CNN-HMMs 
Oscar Koller, Sepehr Zargaran, Hermann Ney 
Social Scene Understanding: End-To-End Multi-Person Action Localization and Collective Activity Recognition 
Timur Bagautdinov, Alexandre Alahi, François Fleuret, Pascal Fua, Silvio Savarese 
Detangling People: Individuating Multiple Close People and Their Body Parts via Region Assembly 
Hao Jiang, Kristen Grauman 
Lip Reading Sentences in the Wild 
Joon Son Chung, Andrew Senior, Oriol Vinyals, Andrew Zisserman

Applications

Spotlight 2-2C

Deep Matching Prior Network: Toward Tighter Multi-Oriented Text Detection 
Lianwen Jin, Yuliang Liu 
ChestX-ray8: Hospital-Scale Chest X-Ray Database and Benchmarks on Weakly-Supervised Classification and Localization of Common Thorax Diseases 
Xiaosong Wang, Yifan Peng, Le Lu, Zhiyong Lu, Mohammadhadi Bagheri, Ronald M. Summers 
Attentional Push: A Deep Convolutional Network for Augmenting Image Salience With Shared Attention Modeling in Social Scenes 
Siavash Gorji, James J. Clark 
Detecting Oriented Text in Natural Images by Linking Segments 
Baoguang Shi, Xiang Bai, Serge Belongie 
Learning Video Object Segmentation From Static Images 
Federico Perazzi, Anna Khoreva, Rodrigo Benenson, Bernt Schiele, Alexander Sorkine-Hornung 
Seeing Invisible Poses: Estimating 3D Body Pose From Egocentric Video 
Hao Jiang, Kristen Grauman 
Plug & Play Generative Networks: Conditional Iterative Generation of Images in Latent Space 
Anh Nguyen, Jeff Clune, Yoshua Bengio, Alexey Dosovitskiy, Jason Yosinski 
A Joint Speaker-Listener-Reinforcer Model for Referring Expressions 
Licheng Yu, Hao Tan, Mohit Bansal, Tamara L. Berg

Oral 2-2C

End-To-End Learning of Driving Models From Large-Scale Video Datasets 
Huazhe Xu, Yang Gao, Fisher Yu, Trevor Darrell 
Deep Future Gaze: Gaze Anticipation on Egocentric Videos Using Adversarial Networks 
Mengmi Zhang, Keng Teck Ma, Joo Hwee Lim, Qi Zhao, Jiashi Feng 
MDNet: A Semantically and Visually Interpretable Medical Image Diagnosis Network 
Zizhao Zhang, Yuanpu Xie, Fuyong Xing, Mason McGough, Lin Yang

Poster 2-2

3D Computer Vision

Surface Motion Capture Transfer With Gaussian Process Regression 
Adnane Boukhayma, Jean-Sébastien Franco, Edmond Boyer 
Visual-Inertial-Semantic Scene Representation for 3D Object Detection 
Jingming Dong, Xiaohan Fei, Stefano Soatto 
Template-Based Monocular 3D Recovery of Elastic Shapes Using Lagrangian Multipliers 
Nazim Haouchine, Stephane Cotin 
Learning Category-Specific 3D Shape Models From Weakly Labeled 2D Images 
Dingwen Zhang, Junwei Han, Yang Yang, Dong Huang 
Simultaneous Geometric and Radiometric Calibration of a Projector-Camera Pair 
Marjan Shahpaski, Luis Ricardo Sapaico, Gaspard Chevassus, Sabine Süsstrunk 
Learning Barycentric Representations of 3D Shapes for Sketch-Based 3D Shape Retrieval 
Jin Xie, Guoxian Dai, Fan Zhu, Yi Fang 
Geodesic Distance Descriptors 
Gil Shamai, Ron Kimmel

Analyzing Humans in Images

Modeling Temporal Dynamics and Spatial Configurations of Actions Using Two-Stream Recurrent Neural Networks 
Hongsong Wang, Liang Wang 
Forecasting Human Dynamics From Static Images 
Yu-Wei Chao, Jimei Yang, Brian Price, Scott Cohen, Jia Deng 
Re-Ranking Person Re-Identification With k-Reciprocal Encoding 
Zhun Zhong, Liang Zheng, Donglin Cao, Shaozi Li 
Deep Sequential Context Networks for Action Prediction 
Yu Kong, Zhiqiang Tao, Yun Fu 
Global Context-Aware Attention LSTM Networks for 3D Action Recognition 
Jun Liu, Gang Wang, Ping Hu, Ling-Yu Duan, Alex C. Kot 
Dynamic Attention-Controlled Cascaded Shape Regression Exploiting Training Data Augmentation and Fuzzy-Set Sample Weighting 
Zhen-Hua Feng, Josef Kittler, William Christmas, Patrik Huber, Xiao-Jun Wu 
A Deep Regression Architecture With Two-Stage Re-Initialization for High Performance Facial Landmark Detection 
Jiangjing Lv, Xiaohu Shao, Junliang Xing, Cheng Cheng, Xi Zhou 
Multiple People Tracking by Lifted Multicut and Person Re-Identification 
Siyu Tang, Mykhaylo Andriluka, Bjoern Andres, Bernt Schiele 
Towards Accurate Multi-Person Pose Estimation in the Wild 
George Papandreou, Tyler Zhu, Nori Kanazawa, Alexander Toshev, Jonathan Tompson, Chris Bregler, Kevin Murphy

Applications

Towards a Quality Metric for Dense Light Fields 
Vamsi Kiran Adhikarla, Marek Vinkler, Denis Sumin, Rafał K. Mantiuk, Karol Myszkowski, Hans-Peter Seidel, Piotr Didyk 
Controlling Perceptual Factors in Neural Style Transfer 
Leon A. Gatys, Alexander S. Ecker, Matthias Bethge, Aaron Hertzmann, Eli Shechtman

Biomedical Image/Video Analysis

Joint Sequence Learning and Cross-Modality Convolution for 3D Biomedical Segmentation 
Kuan-Lun Tseng, Yen-Liang Lin, Winston Hsu, Chung-Yang Huang 
LSTM Self-Supervision for Detailed Behavior Analysis 
Biagio Brattoli, Uta Büchler, Anna-Sophia Wahl, Martin E. Schwab, Björn Ommer

Computational Photography

A Wide-Field-Of-View Monocentric Light Field Camera 
Donald G. Dansereau, Glenn Schuster, Joseph Ford, Gordon Wetzstein

Image Motion & Tracking

S2F: Slow-To-Fast Interpolator Flow 
Yanchao Yang, Stefano Soatto 
CLKN: Cascaded Lucas-Kanade Networks for Image Alignment 
Che-Han Chang, Chun-Nan Chou, Edward Y. Chang 
Multi-Object Tracking With Quadruplet Convolutional Neural Networks 
Mooyeol Baek, Jeany Son, Minsu Cho, Bohyung Han

Low- & Mid-Level Vision

Learning to Detect Salient Objects With Image-Level Supervision 
Lijun Wang, Huchuan Lu, Yifan Wang, Mengyang Feng, Dong Wang, Baocai Yin, Xiang Ruan 
From Motion Blur to Motion Flow: A Deep Learning Solution for Removing Heterogeneous Motion Blur 
Dong Gong, Jie Yang, Lingqiao Liu, Yanning Zhang, Ian Reid, Chunhua Shen, Anton van den Hengel, Qinfeng Shi 
Co-Occurrence Filter 
Roy J. Jevnisek, Shai Avidan 
Fractal Dimension Invariant Filtering and Its CNN-Based Implementation 
Hongteng Xu, Junchi Yan, Nils Persson, Weiyao Lin, Hongyuan Zha 
Noise-Blind Image Deblurring 
Meiguang Jin, Stefan Roth, Paolo Favaro 
Simultaneous Visual Data Completion and Denoising Based on Tensor Rank and Total Variation Minimization and Its Primal-Dual Splitting Algorithm 
Tatsuya Yokota, Hidekata Hontani 
HPatches: A Benchmark and Evaluation of Handcrafted and Learned Local Descriptors 
Vassileios Balntas, Karel Lenc, Andrea Vedaldi, Krystian Mikolajczyk 
Hyperspectral Image Super-Resolution via Non-Local Sparse Tensor Factorization 
Renwei Dian, Leyuan Fang, Shutao Li 
Reflection Removal Using Low-Rank Matrix Completion 
Byeong-Ju Han, Jae-Young Sim 
Object Co-Skeletonization With Co-Segmentation 
Koteswar Rao Jerripothula, Jianfei Cai, Jiangbo Lu, Junsong Yuan

Machine Learning

Mining Object Parts From CNNs via Active Question-Answering 
Quanshi Zhang, Ruiming Cao, Ying Nian Wu, Song-Chun Zhu 
PolyNet: A Pursuit of Structural Diversity in Very Deep Networks 
Xingcheng Zhang, Zhizhong Li, Chen Change Loy, Dahua Lin 
The VQA-Machine: Learning How to Use Existing Vision Algorithms to Answer New Questions 
Peng Wang, Qi Wu, Chunhua Shen, Anton van den Hengel 
Joint Discriminative Bayesian Dictionary and Classifier Learning 
Naveed Akhtar, Ajmal Mian, Fatih Porikli 
A Study of Lagrangean Decompositions and Dual Ascent Solvers for Graph Matching 
Paul Swoboda, Carsten Rother, Hassan Abu Alhaija, Dagmar Kainmüller, Bogdan Savchynskyy 
Quad-Networks: Unsupervised Learning to Rank for Interest Point Detection 
Nikolay Savinov, Akihito Seki, Ľubor Ladický, Torsten Sattler, Marc Pollefeys 
Outlier-Robust Tensor PCA 
Pan Zhou, Jiashi Feng 
Learning Adaptive Receptive Fields for Deep Image Parsing Network 
Zhen Wei, Yao Sun, Jinqiao Wang, Hanjiang Lai, Si Liu 
Learning an Invariant Hilbert Space for Domain Adaptation 
Samitha Herath, Mehrtash Harandi, Fatih Porikli 
Fixed-Point Factorized Networks 
Peisong Wang, Jian Cheng 
Discriminative Optimization: Theory and Applications to Point Cloud Registration 
Jayakorn Vongkulbhisal, Fernando De la Torre, João P. Costeira 
Online Asymmetric Similarity Learning for Cross-Modal Retrieval 
Yiling Wu, Shuhui Wang, Qingming Huang 
Improving Training of Deep Neural Networks via Singular Value Bounding 
Kui Jia, Dacheng Tao, Shenghua Gao, Xiangmin Xu 
S3Pool: Pooling With Stochastic Spatial Sampling 
Shuangfei Zhai, Hui Wu, Abhishek Kumar, Yu Cheng, Yongxi Lu, Zhongfei Zhang, Rogerio Feris 
Sports Field Localization via Deep Structured Models 
Namdar Homayounfar, Sanja Fidler, Raquel Urtasun 
Noisy Softmax: Improving the Generalization Ability of DCNN via Postponing the Early Softmax Saturation 
Binghui Chen, Weihong Deng, Junping Du 
Switching Convolutional Neural Network for Crowd Counting 
Deepak Babu Sam*, Shiv Surya*, R. Venkatesh Babu (* equal contributors; Video Analytics Lab, Indian Institute of Science) 
Network Sketching: Exploiting Binary Structure in Deep CNNs 
Yiwen Guo, Anbang Yao, Hao Zhao, Yurong Chen 
Multi-Task Clustering of Human Actions by Sharing Information 
Shizhe Hu, Xiaoqiang Yan, Yangdong Ye 
Soft-Margin Mixture of Regressions 
Dong Huang, Longfei Han, Fernando De la Torre 
Multigrid Neural Architectures 
Tsung-Wei Ke, Michael Maire, Stella X. Yu 
High-Resolution Image Inpainting Using Multi-Scale Neural Patch Synthesis 
Chao Yang, Xin Lu, Zhe Lin, Eli Shechtman, Oliver Wang, Hao Li 
Deep Quantization: Encoding Convolutional Activations With Deep Generative Model 
Zhaofan Qiu, Ting Yao, Tao Mei 
DOPE: Distributed Optimization for Pairwise Energies 
Jose Dolz, Ismail Ben Ayed, Christian Desrosiers 
Improved Texture Networks: Maximizing Quality and Diversity in Feed-Forward Stylization and Texture Synthesis 
Dmitry Ulyanov, Andrea Vedaldi, Victor Lempitsky

Object Recognition & Scene Understanding

Polyhedral Conic Classifiers for Visual Object Detection and Classification 
Hakan Cevikalp, Bill Triggs 
Incremental Kernel Null Space Discriminant Analysis for Novelty Detection 
Juncheng Liu, Zhouhui Lian, Yi Wang, Jianguo Xiao 
Predicting Ground-Level Scene Layout From Aerial Imagery 
Menghua Zhai, Zachary Bessinger, Scott Workman, Nathan Jacobs 
Deep Feature Flow for Video Recognition 
Xizhou Zhu, Yuwen Xiong, Jifeng Dai, Lu Yuan, Yichen Wei 
Object-Aware Dense Semantic Correspondence 
Fan Yang, Xin Li, Hong Cheng, Jianping Li, Leiting Chen 
Semantic Regularisation for Recurrent Image Annotation 
Feng Liu, Tao Xiang, Timothy M. Hospedales, Wankou Yang, Changyin Sun 
Video2Shop: Exact Matching Clothes in Videos to Online Shopping Images 
Zhi-Qi Cheng, Xiao Wu, Yang Liu, Xian-Sheng Hua 
Fast-At: Fast Automatic Thumbnail Generation Using Deep Neural Networks 
Seyed A. Esmaeili, Bharat Singh, Larry S. Davis 
Multi-Level Attention Networks for Visual Question Answering 
Dongfei Yu, Jianlong Fu, Tao Mei, Yong Rui 
Generating Descriptions With Grounded and Co-Referenced People 
Anna Rohrbach, Marcus Rohrbach, Siyu Tang, Seong Joon Oh, Bernt Schiele 
Straight to Shapes: Real-Time Detection of Encoded Shapes 
Saumya Jetley, Michael Sapienza, Stuart Golodetz, Philip H. S. Torr 
Simultaneous Feature Aggregating and Hashing for Large-Scale Image Search 
Thanh-Toan Do, Dang-Khoa Le Tan, Trung T. Pham, Ngai-Man Cheung 
Improving Facial Attribute Prediction Using Semantic Segmentation 
Mahdi M. Kalayeh, Boqing Gong, Mubarak Shah

Video Analytics

Learning Cross-Modal Deep Representations for Robust Pedestrian Detection 
Dan Xu, Wanli Ouyang, Elisa Ricci, Xiaogang Wang, Nicu Sebe 
Spatio-Temporal Self-Organizing Map Deep Network for Dynamic Object Detection From Videos 
Yang Du, Chunfeng Yuan, Bing Li, Weiming Hu, Stephen Maybank 
CERN: Confidence-Energy Recurrent Network for Group Activity Recognition 
Tianmin Shu, Sinisa Todorovic, Song-Chun Zhu 
Understanding Traffic Density From Large-Scale Web Camera Data 
Shanghang Zhang, Guanhang Wu, João P. Costeira, José M. F. Moura 
Collaborative Summarization of Topic-Related Videos 
Rameswar Panda, Amit K. Roy-Chowdhury

Machine Learning 3

Spotlight 3-1A

Local Binary Convolutional Neural Networks 
Felix Juefei-Xu, Vishnu Naresh Boddeti, Marios Savvides 
Deep Self-Taught Learning for Weakly Supervised Object Localization 
Zequn Jie, Yunchao Wei, Xiaojie Jin, Jiashi Feng, Wei Liu 
Multi-Modal Mean-Fields via Cardinality-Based Clamping 
Pierre Baqué, François Fleuret, Pascal Fua 
Probabilistic Temporal Subspace Clustering 
Vladimir Pavlovic, Behnam Gholami 
Provable Self-Representation Based Outlier Detection in a Union of Subspaces 
Chong You, Daniel P. Robinson, René Vidal 
Latent Multi-View Subspace Clustering 
Changqing Zhang, Qinghua Hu, Huazhu Fu, Pengfei Zhu, Xiaochun Cao 
Learning to Extract Semantic Structure From Documents Using Multimodal Fully Convolutional Neural Networks 
Xiao Yang, Ersin Yumer, Paul Asente, Mike Kraley, Daniel Kifer, C. Lee Giles 
Age Progression/Regression by Conditional Adversarial Autoencoder 
Zhifei Zhang, Yang Song, Hairong Qi

Oral 3-1A

Compact Matrix Factorization With Dependent Subspaces 
Viktor Larsson, Carl Olsson 
FFTLasso: Large-Scale LASSO in the Fourier Domain 
Adel Bibi, Hani Itani, Bernard Ghanem 
On the Global Geometry of Sphere-Constrained Sparse Blind Deconvolution 
Yuqian Zhang, Yenson Lau, Han-wen Kuo, Sky Cheung, Abhay Pasupathy, John Wright 
Global Optimality in Neural Network Training 
Benjamin D. Haeffele, René Vidal

Object Recognition & Scene Understanding 2

Spotlight 3-1B

What Is and What Is Not a Salient Object? Learning Salient Object Detector by Ensembling Linear Exemplar Regressors 
Changqun Xia, Jia Li, Xiaowu Chen, Anlin Zheng, Yu Zhang 
Deep Variation-Structured Reinforcement Learning for Visual Relationship and Attribute Detection 
Xiaodan Liang, Lisa Lee, Eric P. Xing 
Modeling Relationships in Referential Expressions With Compositional Modular Networks 
Ronghang Hu, Marcus Rohrbach, Jacob Andreas, Trevor Darrell, Kate Saenko 
Counting Everyday Objects in Everyday Scenes 
Prithvijit Chattopadhyay, Ramakrishna Vedantam, Ramprasaath R. Selvaraju, Dhruv Batra, Devi Parikh 
Fully Convolutional Instance-Aware Semantic Segmentation 
Yi Li, Haozhi Qi, Jifeng Dai, Xiangyang Ji, Yichen Wei 
Semantic Autoencoder for Zero-Shot Learning 
Elyor Kodirov, Tao Xiang, Shaogang Gong 
CityPersons: A Diverse Dataset for Pedestrian Detection 
Shanshan Zhang, Rodrigo Benenson, Bernt Schiele 
GuessWhat?! Visual Object Discovery Through Multi-Modal Dialogue 
Harm de Vries, Florian Strub, Sarath Chandar, Olivier Pietquin, Hugo Larochelle, Aaron Courville

Oral 3-1B

Look Closer to See Better: Recurrent Attention Convolutional Neural Network for Fine-Grained Image Recognition 
Jianlong Fu, Heliang Zheng, Tao Mei 
Annotating Object Instances With a Polygon-RNN 
Lluís Castrejón, Kaustav Kundu, Raquel Urtasun, Sanja Fidler 
Connecting Look and Feel: Associating the Visual and Tactile Properties of Physical Materials 
Wenzhen Yuan, Shaoxiong Wang, Siyuan Dong, Edward Adelson 
Deep Learning Human Mind for Automated Visual Classification 
Concetto Spampinato, Simone Palazzo, Isaak Kavasidis, Daniela Giordano, Nasim Souly, Mubarak Shah

Poster 3-1

3D Computer Vision

Self-Calibration-Based Approach to Critical Motion Sequences of Rolling-Shutter Structure From Motion 
Eisuke Ito, Takayuki Okatani 
Semi-Calibrated Near Field Photometric Stereo 
Fotios Logothetis, Roberto Mecca, Roberto Cipolla 
Semantic Multi-View Stereo: Jointly Estimating Objects and Voxels 
Ali Osman Ulusoy, Michael J. Black, Andreas Geiger 
Learning to Predict Stereo Reliability Enforcing Local Consistency of Confidence Maps 
Matteo Poggi, Stefano Mattoccia 
The Misty Three Point Algorithm for Relative Pose 
Tobias Palmér, Kalle Åström, Jan-Michael Frahm 
The Surfacing of Multiview 3D Drawings via Lofting and Occlusion Reasoning 
Anil Usumezbas, Ricardo Fabbri, Benjamin B. Kimia 
A New Representation of Skeleton Sequences for 3D Action Recognition 
Qiuhong Ke, Mohammed Bennamoun, Senjian An, Ferdous Sohel, Farid Boussaid 
A General Framework for Curve and Surface Comparison and Registration With Oriented Varifolds 
Irène Kaltenmark, Benjamin Charlier, Nicolas Charon 
Learning to Align Semantic Segmentation and 2.5D Maps for Geolocalization 
Anil Armagan, Martin Hirzer, Peter M. Roth, Vincent Lepetit 
A Generative Model for Depth-Based Robust 3D Facial Pose Tracking 
Lu Sheng, Jianfei Cai, Tat-Jen Cham, Vladimir Pavlovic, King Ngi Ngan 
Fast 3D Reconstruction of Faces With Glasses 
Fabio Maninchedda, Martin R. Oswald, Marc Pollefeys 
An Efficient Algebraic Solution to the Perspective-Three-Point Problem 
Tong Ke, Stergios I. Roumeliotis

Analyzing Humans in Images

Learning From Synthetic Humans 
Gül Varol, Javier Romero, Xavier Martin, Naureen Mahmood, Michael J. Black, Ivan Laptev, Cordelia Schmid 
Forecasting Interactive Dynamics of Pedestrians With Fictitious Play 
Wei-Chiu Ma, De-An Huang, Namhoon Lee, Kris M. Kitani 
Hand Keypoint Detection in Single Images Using Multiview Bootstrapping 
Tomas Simon, Hanbyul Joo, Iain Matthews, Yaser Sheikh 
PoseTrack: Joint Multi-Person Pose Estimation and Tracking 
Umar Iqbal, Anton Milan, Juergen Gall 
Expecting the Unexpected: Training Detectors for Unusual Pedestrians With Adversarial Imposters 
Shiyu Huang, Deva Ramanan 
On Human Motion Prediction Using Recurrent Neural Networks 
Julieta Martinez, Michael J. Black, Javier Romero 
Learning and Refining of Privileged Information-Based RNNs for Action Recognition From Depth Sequences 
Zhiyuan Shi, Tae-Kyun Kim 
Quality Aware Network for Set to Set Recognition 
Yu Liu, Junjie Yan, Wanli Ouyang 
Unite the People: Closing the Loop Between 3D and 2D Human Representations 
Christoph Lassner, Javier Romero, Martin Kiefel, Federica Bogo, Michael J. Black, Peter V. Gehler 
Deep Multitask Architecture for Integrated 2D and 3D Human Sensing 
Alin-Ionut Popa, Mihai Zanfir, Cristian Sminchisescu 
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset 
João Carreira, Andrew Zisserman

Applications

Identifying First-Person Camera Wearers in Third-Person Videos 
Chenyou Fan, Jangwon Lee, Mingze Xu, Krishna Kumar Singh, Yong Jae Lee, David J. Crandall, Michael S. Ryoo

Biomedical Image/Video Analysis

Parsing Images of Overlapping Organisms With Deep Singling-Out Networks 
Victor Yurchenko, Victor Lempitsky 
Fine-Tuning Convolutional Neural Networks for Biomedical Image Analysis: Actively and Incrementally 
Zongwei Zhou, Jae Shin, Lei Zhang, Suryakanth Gurudu, Michael Gotway, Jianming Liang

Computational Photography

Depth From Defocus in the Wild 
Huixuan Tang, Scott Cohen, Brian Price, Stephen Schiller, Kiriakos N. Kutulakos 
Matting and Depth Recovery of Thin Structures Using a Focal Stack 
Chao Liu, Srinivasa G. Narasimhan, Artur W. Dubrawski

Image Motion & Tracking

Robust Interpolation of Correspondences for Large Displacement Optical Flow 
Yinlin Hu, Yunsong Li, Rui Song 
Large Margin Object Tracking With Circulant Feature Maps 
Mengmeng Wang, Yong Liu, Zeyi Huang 
Minimum Delay Moving Object Detection 
Dong Lao, Ganesh Sundaramoorthi 
Multi-Task Correlation Particle Filter for Robust Object Tracking 
Tianzhu Zhang, Changsheng Xu, Ming-Hsuan Yang 
Attentional Correlation Filter Network for Adaptive Visual Tracking 
Jongwon Choi, Hyung Jin Chang, Sangdoo Yun, Tobias Fischer, Yiannis Demiris, Jin Young Choi 
The World of Fast Moving Objects 
Denys Rozumnyi, Jan Kotera, Filip Šroubek, Lukáš Novotný, Jiří Matas 
Discriminative Correlation Filter With Channel and Spatial Reliability 
Alan Lukežič, Tomáš Vojíř, Luka Čehovin Zajc, Jiří Matas, Matej Kristan

Low- & Mid-Level Vision

Learning Deep Binary Descriptor With Multi-Quantization 
Yueqi Duan, Jiwen Lu, Ziwei Wang, Jianjiang Feng, Jie Zhou 
One-To-Many Network for Visually Pleasing Compression Artifacts Reduction 
Jun Guo, Hongyang Chao 
Gated Feedback Refinement Network for Dense Image Labeling 
Md Amirul Islam, Mrigank Rochan, Neil D. B. Bruce, Yang Wang 
BRISKS: Binary Features for Spherical Images on a Geodesic Grid 
Hao Guan, William A. P. Smith 
Superpixels and Polygons Using Simple Non-Iterative Clustering 
Radhakrishna Achanta, Sabine Süsstrunk 
Hardware-Efficient Guided Image Filtering for Multi-Label Problem 
Longquan Dai, Mengke Yuan, Zechao Li, Xiaopeng Zhang, Jinhui Tang 
Alternating Direction Graph Matching 
D. Khuê Lê-Huu, Nikos Paragios 
Learning Discriminative and Transformation Covariant Local Feature Detectors 
Xu Zhang, Felix X. Yu, Svebor Karaman, Shih-Fu Chang

Machine Learning

Correlational Gaussian Processes for Cross-Domain Visual Recognition 
Chengjiang Long, Gang Hua 
DeLiGAN: Generative Adversarial Networks for Diverse and Limited Data 
Swaminathan Gurumurthy (CMU), Ravi Kiran Sarvadevabhatla (Video Analytics Lab, Indian Institute of Science), R. Venkatesh Babu 
Oriented Response Networks 
Yanzhao Zhou, Qixiang Ye, Qiang Qiu, Jianbin Jiao 
Missing Modalities Imputation via Cascaded Residual Autoencoder 
Luan Tran, Xiaoming Liu, Jiayu Zhou, Rong Jin 
Efficient Optimization for Hierarchically-structured Interacting Segments (HINTS) 
Hossam Isack, Olga Veksler, Ipek Oguz, Milan Sonka, Yuri Boykov 
A Message Passing Algorithm for the Minimum Cost Multicut Problem 
Paul Swoboda, Bjoern Andres 
End-To-End Representation Learning for Correlation Filter Based Tracking 
Jack Valmadre, Luca Bertinetto, João Henriques, Andrea Vedaldi, Philip H. S. Torr 
Filter Flow Made Practical: Massively Parallel and Lock-Free 
Sathya N. Ravi, Yunyang Xiong, Lopamudra Mukherjee, Vikas Singh 
Online Graph Completion: Multivariate Signal Recovery in Computer Vision 
Won Hwa Kim, Mona Jalal, Seongjae Hwang, Sterling C. Johnson, Vikas Singh 
Point to Set Similarity Based Deep Feature Learning for Person Re-Identification 
Sanping Zhou, Jinjun Wang, Jiayun Wang, Yihong Gong, Nanning Zheng 
Exploiting Saliency for Object Segmentation From Image Level Labels 
Seong Joon Oh, Rodrigo Benenson, Anna Khoreva, Zeynep Akata, Mario Fritz, Bernt Schiele 
Consensus Maximization With Linear Matrix Inequality Constraints 
Pablo Speciale, Danda Pani Paudel, Martin R. Oswald, Till Kroeger, Luc Van Gool, Marc Pollefeys 
Physically-Based Rendering for Indoor Scene Understanding Using Convolutional Neural Networks 
Yinda Zhang, Shuran Song, Ersin Yumer, Manolis Savva, Joon-Young Lee, Hailin Jin, Thomas Funkhouser 
Deep Multimodal Representation Learning From Temporal Data 
Xitong Yang, Palghat Ramesh, Radha Chitta, Sriganesh Madhvanath, Edgar A. Bernal, Jiebo Luo 
All You Need Is Beyond a Good Init: Exploring Better Solution for Training Extremely Deep Convolutional Neural Networks With Orthonormality and Modulation 
Di Xie, Jiang Xiong, Shiliang Pu 
Hard Mixtures of Experts for Large Scale Weakly Supervised Vision 
Sam Gross, Marc'Aurelio Ranzato, Arthur Szlam 
A Reinforcement Learning Approach to the View Planning Problem 
Mustafa Devrim Kaba, Mustafa Gokhan Uzunbas, Ser Nam Lim 
Zero-Shot Classification With Discriminative Semantic Representation Learning 
Meng Ye, Yuhong Guo 
Adversarial Discriminative Domain Adaptation 
Eric Tzeng, Judy Hoffman, Kate Saenko, Trevor Darrell

None of the above

Learning to Rank Retargeted Images 
Yang Chen, Yong-Jin Liu, Yu-Kun Lai

Object Recognition & Scene Understanding

Automatic Discovery, Association Estimation and Learning of Semantic Attributes for a Thousand Categories 
Ziad Al-Halah, Rainer Stiefelhagen 
Scene Parsing Through ADE20K Dataset 
Bolei Zhou, Hang Zhao, Xavier Puig, Sanja Fidler, Adela Barriuso, Antonio Torralba 
Weakly Supervised Cascaded Convolutional Networks 
Ali Diba, Vivek Sharma, Ali Pazandeh, Hamed Pirsiavash, Luc Van Gool 
Discretely Coding Semantic Rank Orders for Supervised Image Hashing 
Li Liu, Ling Shao, Fumin Shen, Mengyang Yu 
Joint Geometrical and Statistical Alignment for Visual Domain Adaptation 
Jing Zhang, Wanqing Li, Philip Ogunbona 
Weakly Supervised Dense Video Captioning 
Zhiqiang Shen, Jianguo Li, Zhou Su, Minjun Li, Yurong Chen, Yu-Gang Jiang, Xiangyang Xue 
RefineNet: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation 
Guosheng Lin, Anton Milan, Chunhua Shen, Ian Reid 
Semantic Segmentation via Structured Patch Prediction, Context CRF and Guidance CRF 
Falong Shen, Rui Gan, Shuicheng Yan, Gang Zeng 
Person Search With Natural Language Description 
Shuang Li, Tong Xiao, Hongsheng Li, Bolei Zhou, Dayu Yue, Xiaogang Wang 
Weakly Supervised Affordance Detection 
Johann Sawatzky, Abhilash Srikantha, Juergen Gall 
Zero-Shot Recognition Using Dual Visual-Semantic Mapping Paths 
Yanan Li, Donghui Wang, Huanhang Hu, Yuetan Lin, Yueting Zhuang 
Neural Aggregation Network for Video Face Recognition 
Jiaolong Yang (Microsoft Research), Peiran Ren (Microsoft Research), Dongqing Zhang (Microsoft Research), Dong Chen (Microsoft Research), Fang Wen (Microsoft Research), Hongdong Li (ANU), Gang Hua (Microsoft Research) 
Relationship Proposal Networks 
Ji Zhang, Mohamed Elhoseiny, Scott Cohen, Walter Chang, Ahmed Elgammal 
Learning Object Interactions and Descriptions for Semantic Image Segmentation 
Guangrun Wang, Ping Luo, Liang Lin, Xiaogang Wang 
RON: Reverse Connection With Objectness Prior Networks for Object Detection 
Tao Kong, Fuchun Sun, Anbang Yao, Huaping Liu, Ming Lu, Yurong Chen 
Weakly-Supervised Visual Grounding of Phrases With Linguistic Structures 
Fanyi Xiao, Leonid Sigal, Yong Jae Lee 
Incorporating Copying Mechanism in Image Captioning for Learning Novel Objects 
Ting Yao, Yingwei Pan, Yehao Li, Tao Mei 
Beyond Instance-Level Image Retrieval: Leveraging Captions to Learn a Global Visual Representation for Semantic Retrieval 
Diane Larlus, Albert Gordo 
MuCaLe-Net: Multi Categorical-Level Networks to Generate More Discriminating Features 
Youssef Tamaazousti, Hervé Le Borgne, Céline Hudelot 
Zero Shot Learning via Multi-Scale Manifold Regularization 
Shay Deutsch, Soheil Kolouri, Kyungnam Kim, Yuri Owechko, Stefano Soatto

Theory

Deeply Supervised Salient Object Detection With Short Connections 
Qibin Hou, Ming-Ming Cheng, Xiaowei Hu, Ali Borji, Zhuowen Tu, Philip H. S. Torr 
A Matrix Splitting Method for Composite Function Minimization 
Ganzhao Yuan, Wei-Shi Zheng, Bernard Ghanem

Video Analytics

One-Shot Video Object Segmentation 
Sergi Caelles, Kevis-Kokitsi Maninis, Jordi Pont-Tuset, Laura Leal-Taixé, Daniel Cremers, Luc Van Gool 
Fast Person Re-Identification via Cross-Camera Semantic Binary Transformation 
Jiaxin Chen, Yunhong Wang, Jie Qin, Li Liu, Ling Shao 
SPFTN: A Self-Paced Fine-Tuning Network for Segmenting Objects in Weakly Labelled Videos 
Dingwen Zhang, Le Yang, Deyu Meng, Dong Xu, Junwei Han

Machine Learning 4

Spotlight 4-1A

Hidden Layers in Perceptual Learning 
Gad Cohen, Daphna Weinshall 
Few-Shot Object Recognition From Machine-Labeled Web Images 
Zhongwen Xu, Linchao Zhu, Yi Yang 
Hallucinating Very Low-Resolution Unaligned and Noisy Face Images by Transformative Discriminative Autoencoders 
Xin Yu, Fatih Porikli 
Are You Smarter Than a Sixth Grader? Textbook Question Answering for Multimodal Machine Comprehension 
Aniruddha Kembhavi, Minjoon Seo, Dustin Schwenk, Jonghyun Choi, Ali Farhadi, Hannaneh Hajishirzi 
Deep Hashing Network for Unsupervised Domain Adaptation 
Hemanth Venkateswara, Jose Eusebio, Shayok Chakraborty, Sethuraman Panchanathan 
Generalized Deep Image to Image Regression 
Venkataraman Santhanam, Vlad I. Morariu, Larry S. Davis 
Deep Learning With Low Precision by Half-Wave Gaussian Quantization 
Zhaowei Cai, Xiaodong He, Jian Sun, Nuno Vasconcelos 
Creativity: Generating Diverse Questions Using Variational Autoencoders 
Unnat Jain, Ziyu Zhang, Alexander G. Schwing

Oral 4-1A

Geometric Deep Learning on Graphs and Manifolds Using Mixture Model CNNs 
Federico Monti, Davide Boscaini, Jonathan Masci, Emanuele Rodolà, Jan Svoboda, Michael M. Bronstein 
Full Resolution Image Compression With Recurrent Neural Networks 
George Toderici, Damien Vincent, Nick Johnston, Sung Jin Hwang, David Minnen, Joel Shor, Michele Covell 
Neural Face Editing With Intrinsic Image Disentangling 
Zhixin Shu, Ersin Yumer, Sunil Hadap, Kalyan Sunkavalli, Eli Shechtman, Dimitris Samaras 
Ubernet: Training a Universal Convolutional Neural Network for Low-, Mid-, and High-Level Vision Using Diverse Datasets and Limited Memory 
Iasonas Kokkinos

Analyzing Humans with 3D Vision

Spotlight 4-1B

3D Face Morphable Models "In-The-Wild" 
James Booth, Epameinondas Antonakos, Stylianos Ploumpis, George Trigeorgis, Yannis Panagakis, Stefanos Zafeiriou 
KillingFusion: Non-Rigid 3D Reconstruction Without Correspondences 
Miroslava Slavcheva, Maximilian Baust, Daniel Cremers, Slobodan Ilic 
Detailed, Accurate, Human Shape Estimation From Clothed 3D Scan Sequences 
Chao Zhang, Sergi Pujades, Michael J. Black, Gerard Pons-Moll 
POSEidon: Face-From-Depth for Driver Pose Estimation 
Guido Borghi, Marco Venturelli, Roberto Vezzani, Rita Cucchiara 
Human Shape From Silhouettes Using Generative HKS Descriptors and Cross-Modal Neural Networks 
Endri Dibra, Himanshu Jain, Cengiz Öztireli, Remo Ziegler, Markus Gross 
Parametric T-Spline Face Morphable Model for Detailed Fitting in Shape Subspace 
Weilong Peng, Zhiyong Feng, Chao Xu, Yong Su 
3D Menagerie: Modeling the 3D Shape and Pose of Animals 
Silvia Zuffi, Angjoo Kanazawa, David W. Jacobs, Michael J. Black 
iCaRL: Incremental Classifier and Representation Learning 
Sylvestre-Alvise Rebuffi, Alexander Kolesnikov, Georg Sperl, Christoph H. Lampert

Oral 4-1B

Recurrent 3D Pose Sequence Machines 
Mude Lin, Liang Lin, Xiaodan Liang, Keze Wang, Hui Cheng 
Learning Detailed Face Reconstruction From a Single Image 
Elad Richardson, Matan Sela, Roy Or-El, Ron Kimmel 
Thin-Slicing Network: A Deep Structured Model for Pose Estimation in Videos 
Jie Song, Limin Wang, Luc Van Gool, Otmar Hilliges 
Dynamic FAUST: Registering Human Bodies in Motion 
Federica Bogo, Javier Romero, Gerard Pons-Moll, Michael J. Black

Poster 4-1

3D Computer Vision

Semantically Coherent Co-Segmentation and Reconstruction of Dynamic Scenes 
Armin Mustafa, Adrian Hilton 
On the Two-View Geometry of Unsynchronized Cameras 
Cenek Albl, Zuzana Kukelova, Andrew Fitzgibbon, Jan Heller, Matej Smid, Tomas Pajdla 
Using Locally Corresponding CAD Models for Dense 3D Reconstructions From a Single Image 
Chen Kong, Chen-Hsuan Lin, Simon Lucey 
A Clever Elimination Strategy for Efficient Minimal Solvers 
Zuzana Kukelova, Joe Kileel, Bernd Sturmfels, Tomas Pajdla 
Convex Global 3D Registration With Lagrangian Duality 
Jesus Briales, Javier Gonzalez-Jimenez 
DeMoN: Depth and Motion Network for Learning Monocular Stereo 
Benjamin Ummenhofer, Huizhong Zhou, Jonas Uhrig, Nikolaus Mayer, Eddy Ilg, Alexey Dosovitskiy, Thomas Brox 
3D Bounding Box Estimation Using Deep Learning and Geometry 
Arsalan Mousavian, Dragomir Anguelov, John Flynn, Jana Košecká 
A Dataset for Benchmarking Image-Based Localization 
Xun Sun, Yuanfan Xie, Pei Luo, Liang Wang

Analyzing Humans in Images

Asynchronous Temporal Fields for Action Recognition 
Gunnar A. Sigurdsson, Santosh Divvala, Ali Farhadi, Abhinav Gupta 
Sequential Person Recognition in Photo Albums With a Recurrent Network 
Yao Li, Guosheng Lin, Bohan Zhuang, Lingqiao Liu, Chunhua Shen, Anton van den Hengel 
Multi-Context Attention for Human Pose Estimation 
Xiao Chu, Wei Yang, Wanli Ouyang, Cheng Ma, Alan L. Yuille, Xiaogang Wang 
3D Convolutional Neural Networks for Efficient and Robust Hand Pose Estimation From Single Depth Images 
Liuhao Ge, Hui Liang, Junsong Yuan, Daniel Thalmann 
Lifting From the Deep: Convolutional 3D Pose Estimation From a Single Image 
Denis Tome, Chris Russell, Lourdes Agapito 
AdaScan: Adaptive Scan Pooling in Deep Convolutional Neural Networks for Human Action Recognition in Videos 
Amlan Kar, Nishant Rai, Karan Sikka, Gaurav Sharma 
Deep Structured Learning for Facial Action Unit Intensity Estimation 
Robert Walecki, Ognjen (Oggi) Rudovic, Vladimir Pavlovic, Björn Schuller, Maja Pantic 
Simultaneous Facial Landmark Detection, Pose and Deformation Estimation Under Facial Occlusion 
Yue Wu, Chao Gou, Qiang Ji 
Self-Supervised Video Representation Learning With Odd-One-Out Networks 
Basura Fernando, Hakan Bilen, Efstratios Gavves, Stephen Gould 
Robust Joint and Individual Variance Explained 
Christos Sagonas, Yannis Panagakis, Alina Leidinger, Stefanos Zafeiriou 
Discriminative Covariance Oriented Representation Learning for Face Recognition With Image Sets 
Wen Wang, Ruiping Wang, Shiguang Shan, Xilin Chen 
3D Human Pose Estimation = 2D Pose Estimation + Matching 
Ching-Hang Chen, Deva Ramanan

Applications

Joint Gap Detection and Inpainting of Line Drawings 
Kazuma Sasaki, Satoshi Iizuka, Edgar Simo-Serra, Hiroshi Ishikawa

Biomedical Image/Video Analysis

Riemannian Nonlinear Mixed Effects Models: Analyzing Longitudinal Deformations in Neuroimaging 
Hyunwoo J. Kim, Nagesh Adluru, Heemanshu Suri, Baba C. Vemuri, Sterling C. Johnson, Vikas Singh 
Simultaneous Super-Resolution and Cross-Modality Synthesis of 3D Medical Images Using Weakly-Supervised Joint Convolutional Sparse Coding 
Yawen Huang, Ling Shao, Alejandro F. Frangi

Computational Photography

Multiple-Scattering Microphysics Tomography 
Aviad Levis, Yoav Y. Schechner, Anthony B. Davis

Image Motion & Tracking

Accurate Optical Flow via Direct Cost Volume Processing 
Jia Xu, René Ranftl, Vladlen Koltun 
Event-Based Visual Inertial Odometry 
Alex Zihao Zhu, Nikolay Atanasov, Kostas Daniilidis 
Robust Visual Tracking Using Oblique Random Forests 
Le Zhang, Jagannadan Varadarajan, Ponnuthurai Nagaratnam Suganthan, Narendra Ahuja, Pierre Moulin

Low- & Mid-Level Vision

Deep Laplacian Pyramid Networks for Fast and Accurate Super-Resolution 
Wei-Sheng Lai, Jia-Bin Huang, Narendra Ahuja, Ming-Hsuan Yang 
Learning Non-Lambertian Object Intrinsics Across ShapeNet Categories 
Jian Shi, Yue Dong, Hao Su, Stella X. Yu 
MCMLSD: A Dynamic Programming Approach to Line Segment Detection 
Emilio J. Almazàn, Ron Tal, Yiming Qian, James H. Elder 
Contour-Constrained Superpixels for Image and Video Processing 
Se-Ho Lee, Won-Dong Jang, Chang-Su Kim 
Richer Convolutional Features for Edge Detection 
Yun Liu, Ming-Ming Cheng, Xiaowei Hu, Kai Wang, Xiang Bai 
Non-Local Color Image Denoising With Convolutional Neural Networks 
Stamatios Lefkimmiatis 
Generative Face Completion 
Yijun Li, Sifei Liu, Jimei Yang, Ming-Hsuan Yang 
Hyper-Laplacian Regularized Unidirectional Low-Rank Tensor Recovery for Multispectral Image Denoising 
Yi Chang, Luxin Yan, Sheng Zhong 
Unsupervised Semantic Scene Labeling for Streaming Data 
Maggie Wigness, John G. Rogers III 
Why You Should Forget Luminance Conversion and Do Something Better 
Rang M. H. Nguyen, Michael S. Brown 
Deep Semantic Feature Matching 
Nikolai Ufer, Björn Ommer

Machine Learning

Revisiting the Variable Projection Method for Separable Nonlinear Least Squares Problems 
Je Hyeong Hong, Christopher Zach, Andrew Fitzgibbon 
Efficient Multiple Instance Metric Learning Using Weakly Supervised Data 
Marc T. Law, Yaoliang Yu, Raquel Urtasun, Richard S. Zemel, Eric P. Xing 
WILDCAT: Weakly Supervised Learning of Deep ConvNets for Image Classification, Pointwise Localization and Segmentation 
Thibaut Durand, Taylor Mordan, Nicolas Thome, Matthieu Cord 
Image-To-Image Translation With Conditional Adversarial Networks 
Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, Alexei A. Efros 
Deep Roots: Improving CNN Efficiency With Hierarchical Filter Groups 
Yani Ioannou, Duncan Robertson, Roberto Cipolla, Antonio Criminisi 
Aggregated Residual Transformations for Deep Neural Networks 
Saining Xie, Ross Girshick, Piotr Dollár, Zhuowen Tu, Kaiming He 
MIML-FCN+: Multi-Instance Multi-Label Learning via Fully Convolutional Networks With Privileged Information 
Hao Yang, Joey Tianyi Zhou, Jianfei Cai, Yew Soon Ong 
Low-Rank Embedded Ensemble Semantic Dictionary for Zero-Shot Learning 
Zhengming Ding, Ming Shao, Yun Fu 
Factorized Variational Autoencoders for Modeling Audience Reactions to Movies 
Zhiwei Deng, Rajitha Navarathna, Peter Carr, Stephan Mandt, Yisong Yue, Iain Matthews, Greg Mori 
Learning Features by Watching Objects Move 
Deepak Pathak, Ross Girshick, Piotr Dollár, Trevor Darrell, Bharath Hariharan 
What Can Help Pedestrian Detection? 
Jiayuan Mao, Tete Xiao, Yuning Jiang, Zhimin Cao 
DeepPermNet: Visual Permutation Learning 
Rodrigo Santa Cruz, Basura Fernando, Anoop Cherian, Stephen Gould 
Learning the Multilinear Structure of Visual Data 
Mengjiao Wang, Yannis Panagakis, Patrick Snape, Stefanos Zafeiriou 
Adaptive and Move Making Auxiliary Cuts for Binary Pairwise Energies 
Lena Gorelick, Yuri Boykov, Olga Veksler 
Designing Energy-Efficient Convolutional Neural Networks Using Energy-Aware Pruning 
Tien-Ju Yang, Yu-Hsin Chen, Vivienne Sze 
Joint Multi-Person Pose Estimation and Semantic Part Segmentation 
Fangting Xia, Peng Wang, Xianjie Chen, Alan L. Yuille 
Deep Feature Interpolation for Image Content Changes 
Paul Upchurch, Jacob Gardner, Geoff Pleiss, Robert Pless, Noah Snavely, Kavita Bala, Kilian Weinberger 
FASON: First and Second Order Information Fusion Network for Texture Recognition 
Xiyang Dai, Joe Yue-Hei Ng, Larry S. Davis 
Lean Crowdsourcing: Combining Humans and Machines in an Online System 
Steve Branson, Grant Van Horn, Pietro Perona

Object Recognition & Scene Understanding

Supervising Neural Attention Models for Video Captioning by Human Gaze Data 
Youngjae Yu, Jongwook Choi, Yeonhwa Kim, Kyung Yoo, Sang-Hun Lee, Gunhee Kim 
L2-Net: Deep Learning of Discriminative Patch Descriptor in Euclidean Space 
Yurun Tian, Bin Fan, Fuchao Wu 
Convolutional Random Walk Networks for Semantic Image Segmentation 
Gedas Bertasius, Lorenzo Torresani, Stella X. Yu, Jianbo Shi 
Knowledge Acquisition for Visual Question Answering via Iterative Querying 
Yuke Zhu, Joseph J. Lim, Li Fei-Fei 
Memory-Augmented Attribute Manipulation Networks for Interactive Fashion Search 
Bo Zhao, Jiashi Feng, Xiao Wu, Shuicheng Yan 
From Zero-Shot Learning to Conventional Supervised Classification: Unseen Visual Data Synthesis 
Yang Long, Li Liu, Ling Shao, Fumin Shen, Guiguang Ding, Jungong Han 
Are Large-Scale 3D Models Really Necessary for Accurate Visual Localization? 
Torsten Sattler, Akihiko Torii, Josef Sivic, Marc Pollefeys, Hajime Taira, Masatoshi Okutomi, Tomas Pajdla 
Asymmetric Feature Maps With Application to Sketch Based Retrieval 
Giorgos Tolias, Ondřej Chum 
Diverse Image Annotation 
Baoyuan Wu, Fan Jia, Wei Liu, Bernard Ghanem 
AMC: Attention guided Multi-modal Correlation Learning for Image Search 
Kan Chen, Trung Bui, Chen Fang, Zhaowen Wang, Ram Nevatia 
Multi-Attention Network for One Shot Learning 
Peng Wang, Lingqiao Liu, Chunhua Shen, Zi Huang, Anton van den Hengel, Heng Tao Shen 
Fried Binary Embedding for High-Dimensional Visual Features 
Weixiang Hong, Junsong Yuan, Sreyasee Das Bhattacharjee 
Pyramid Scene Parsing Network 
Hengshuang Zhao, Jianping Shi, Xiaojuan Qi, Xiaogang Wang, Jiaya Jia 
Learning Deep Match Kernels for Image-Set Classification 
Haoliang Sun, Xiantong Zhen, Yuanjie Zheng, Gongping Yang, Yilong Yin, Shuo Li 
Task-Driven Dynamic Fusion: Reducing Ambiguity in Video Description 
Xishan Zhang, Ke Gao, Yongdong Zhang, Dongming Zhang, Jintao Li, Qi Tian 
Learning Multifunctional Binary Codes for Both Category and Attribute Oriented Retrieval Tasks 
Haomiao Liu, Ruiping Wang, Shiguang Shan, Xilin Chen 
Indoor Scene Parsing With Instance Segmentation, Semantic Labeling and Support Relationship Inference 
Wei Zhuo, Mathieu Salzmann, Xuming He, Miaomiao Liu 
Episodic CAMN: Contextual Attention-Based Memory Networks With Iterative Feedback for Scene Labeling 
Abrar H. Abdulnabi, Bing Shuai, Stefan Winkler, Gang Wang 
Link the Head to the "Beak": Zero Shot Learning From Noisy Text Description at Part Precision 
Mohamed Elhoseiny, Yizhe Zhu, Han Zhang, Ahmed Elgammal 
SCA-CNN: Spatial and Channel-Wise Attention in Convolutional Networks for Image Captioning 
Long Chen, Hanwang Zhang, Jun Xiao, Liqiang Nie, Jian Shao, Wei Liu, Tat-Seng Chua 
Deep Pyramidal Residual Networks 
Dongyoon Han, Jiwhan Kim, Junmo Kim 
Product Split Trees 
Artem Babenko, Victor Lempitsky 
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering 
Yash Goyal, Tejas Khot, Douglas Summers-Stay, Dhruv Batra, Devi Parikh 
Commonly Uncommon: Semantic Sparsity in Situation Recognition 
Mark Yatskar, Vicente Ordonez, Luke Zettlemoyer, Ali Farhadi 
Cross-Modality Binary Code Learning via Fusion Similarity Hashing 
Hong Liu, Rongrong Ji, Yongjian Wu, Feiyue Huang, Baochang Zhang

Theory

Saliency Revisited: Analysis of Mouse Movements Versus Fixations 
Hamed R. Tavakoli, Fawad Ahmed, Ali Borji, Jorma Laaksonen 
InterpoNet, a Brain Inspired Neural Network for Optical Flow Dense Interpolation 
Shay Zweig, Lior Wolf

Video Analytics

SST: Single-Stream Temporal Action Proposals 
Shyamal Buch, Victor Escorcia, Chuanqi Shen, Bernard Ghanem, Juan Carlos Niebles 
Video Segmentation via Multiple Granularity Analysis 
Rui Yang, Bingbing Ni, Chao Ma, Yi Xu, Xiaokang Yang 
Spatio-Temporal Alignment of Non-Overlapping Sequences From Independently Panning Cameras 
Seyed Morteza Safdarnejad, Xiaoming Liu 
UntrimmedNets for Weakly Supervised Action Recognition and Detection 
Limin Wang, Yuanjun Xiong, Dahua Lin, Luc Van Gool

Object Recognition & Scene Understanding 3

Spotlight 4-2A

Gaze Embeddings for Zero-Shot Image Classification 
Nour Karessli, Zeynep Akata, Bernt Schiele, Andreas Bulling 
What's in a Question: Using Visual Questions as a Form of Supervision 
Siddha Ganju, Olga Russakovsky, Abhinav Gupta 
Attend to You: Personalized Image Captioning With Context Sequence Memory Networks 
Cesc Chunseong Park, Byeongchang Kim, Gunhee Kim 
Adversarially Tuned Scene Generation 
VSR Veeravasarapu, Constantin Rothkopf, Ramesh Visvanathan 
Residual Attention Network for Image Classification 
Fei Wang, Mengqing Jiang, Chen Qian, Shuo Yang, Cheng Li, Honggang Zhang, Xiaogang Wang, Xiaoou Tang 
Not All Pixels Are Equal: Difficulty-Aware Semantic Segmentation via Deep Layer Cascade 
Xiaoxiao Li, Ziwei Liu, Ping Luo, Chen Change Loy, Xiaoou Tang 
Learning Non-Maximum Suppression 
Jan Hosang, Rodrigo Benenson, Bernt Schiele 
The Amazing Mysteries of the Gutter: Drawing Inferences Between Panels in Comic Book Narratives 
Mohit Iyyer, Varun Manjunatha, Anupam Guha, Yogarshi Vyas, Jordan Boyd-Graber, Hal Daumé III, Larry S. Davis

Oral 4-2A

Object Region Mining With Adversarial Erasing: A Simple Classification to Semantic Segmentation Approach 
Yunchao Wei, Jiashi Feng, Xiaodan Liang, Ming-Ming Cheng, Yao Zhao, Shuicheng Yan 
Fine-Grained Recognition as HSnet Search for Informative Image Parts 
Michael Lam, Behrooz Mahasseni, Sinisa Todorovic 
G2DeNet: Global Gaussian Distribution Embedding Network and Its Application to Visual Recognition 
Qilong Wang, Peihua Li, Lei Zhang 
YOLO9000: Better, Faster, Stronger 
Joseph Redmon, Ali Farhadi

Machine Learning for 3D Vision

Spotlight 4-2B

Multi-View 3D Object Detection Network for Autonomous Driving 
Xiaozhi Chen, Huimin Ma, Ji Wan, Bo Li, Tian Xia 
UltraStereo: Efficient Learning-Based Matching for Active Stereo Systems 
Sean Ryan Fanello, Julien Valentin, Christoph Rhemann, Adarsh Kowdle, Vladimir Tankovich, Philip Davidson, Shahram Izadi 
Shape Completion Using 3D-Encoder-Predictor CNNs and Shape Synthesis 
Angela Dai, Charles Ruizhongtai Qi, Matthias Nießner 
Geometric Loss Functions for Camera Pose Regression With Deep Learning 
Alex Kendall, Roberto Cipolla 
CNN-SLAM: Real-Time Dense Monocular SLAM With Learned Depth Prediction 
Keisuke Tateno, Federico Tombari, Iro Laina, Nassir Navab 
Learning From Noisy Large-Scale Datasets With Minimal Supervision 
Andreas Veit, Neil Alldrin, Gal Chechik, Ivan Krasin, Abhinav Gupta, Serge Belongie 
SyncSpecCNN: Synchronized Spectral CNN for 3D Shape Segmentation 
Li Yi, Hao Su, Xingwen Guo, Leonidas J. Guibas 
Non-Local Deep Features for Salient Object Detection 
Zhiming Luo, Akshaya Mishra, Andrew Achkar, Justin Eichel, Shaozi Li, Pierre-Marc Jodoin

Oral 4-2B

Unsupervised Monocular Depth Estimation With Left-Right Consistency 
Clément Godard, Oisin Mac Aodha, Gabriel J. Brostow 
Unsupervised Learning of Depth and Ego-Motion From Video 
Tinghui Zhou, Matthew Brown, Noah Snavely, David G. Lowe 
OctNet: Learning Deep 3D Representations at High Resolutions 
Gernot Riegler, Ali Osman Ulusoy, Andreas Geiger 
3D Shape Segmentation With Projective Convolutional Networks 
Evangelos Kalogerakis, Melinos Averkiou, Subhransu Maji, Siddhartha Chaudhuri

Poster 4-2

3D Computer Vision

SGM-Nets: Semi-Global Matching With Neural Networks 
Akihito Seki, Marc Pollefeys 
Stereo-Based 3D Reconstruction of Dynamic Fluid Surfaces by Global Optimization 
Yiming Qian, Minglun Gong, Yee-Hong Yang 
Fine-To-Coarse Global Registration of RGB-D Scans 
Maciej Halber, Thomas Funkhouser 
Analyzing Computer Vision Data - The Good, the Bad and the Ugly 
Oliver Zendel, Katrin Honauer, Markus Murschitz, Martin Humenberger, Gustavo Fernández Domínguez 
Product Manifold Filter: Non-Rigid Shape Correspondence via Kernel Density Estimation in the Product Space 
Matthias Vestner, Roee Litman, Emanuele Rodolà, Alex Bronstein, Daniel Cremers 
Unsupervised Vanishing Point Detection and Camera Calibration From a Single Manhattan Image With Radial Distortion 
Michel Antunes, João P. Barreto, Djamila Aouada, Björn Ottersten 
Toroidal Constraints for Two-Point Localization Under High Outlier Ratios 
Federico Camposeco, Torsten Sattler, Andrea Cohen, Andreas Geiger, Marc Pollefeys 
4D Light Field Superpixel and Segmentation 
Hao Zhu, Qi Zhang, Qing Wang 
Exploiting Symmetry and/or Manhattan Properties for 3D Object Structure Estimation From Single and Multiple Images 
Yuan Gao, Alan L. Yuille

Analyzing Humans in Images

Binary Coding for Partial Action Analysis With Limited Observation Ratios 
Jie Qin, Li Liu, Ling Shao, Bingbing Ni, Chen Chen, Fumin Shen, Yunhong Wang 
SphereFace: Deep Hypersphere Embedding for Face Recognition 
Weiyang Liu, Yandong Wen, Zhiding Yu, Ming Li, Bhiksha Raj, Le Song 
IRINA: Iris Recognition (Even) in Inaccurately Segmented Data 
Hugo Proença, João C. Neves 
Look Into Person: Self-Supervised Structure-Sensitive Learning and a New Benchmark for Human Parsing 
Ke Gong, Xiaodan Liang, Dongyu Zhang, Xiaohui Shen, Liang Lin 
Action Unit Detection With Region Adaptation, Multi-Labeling Learning and Optimal Temporal Fusing 
Wei Li, Farnaz Abtahi, Zhigang Zhu 
See the Forest for the Trees: Joint Spatial and Temporal Recurrent Neural Networks for Video-Based Person Re-Identification 
Zhen Zhou, Yan Huang, Wei Wang, Liang Wang, Tieniu Tan 
Joint Intensity and Spatial Metric Learning for Robust Gait Recognition 
Yasushi Makihara, Atsuyuki Suzuki, Daigo Muramatsu, Xiang Li, Yasushi Yagi 
Pose-Aware Person Recognition 
Vijay Kumar, Anoop Namboodiri, Manohar Paluri, C. V. Jawahar 
Not Afraid of the Dark: NIR-VIS Face Recognition via Cross-Spectral Hallucination and Low-Rank Embedding 
José Lezama, Qiang Qiu, Guillermo Sapiro

Applications

Jointly Learning Energy Expenditures and Activities Using Egocentric Multimodal Signals 
Katsuyuki Nakamura, Serena Yeung, Alexandre Alahi, Li Fei-Fei 
Binarized Mode Seeking for Scalable Visual Pattern Discovery 
Wei Zhang, Xiaochun Cao, Rui Wang, Yuanfang Guo, Zhineng Chen 
Scribbler: Controlling Deep Image Synthesis With Sketch and Color 
Patsorn Sangkloy, Jingwan Lu, Chen Fang, Fisher Yu, James Hays

Biomedical Image/Video Analysis

Multi-Way Multi-Level Kernel Modeling for Neuroimaging Classification 
Lifang He, Chun-Ta Lu, Hao Ding, Shen Wang, Linlin Shen, Philip S. Yu, Ann B. Ragin 
WSISA: Making Survival Prediction From Whole Slide Histopathological Images 
Xinliang Zhu, Jiawen Yao, Feiyun Zhu, Junzhou Huang

Computational Photography

On the Effectiveness of Visible Watermarks 
Tali Dekel, Michael Rubinstein, Ce Liu, William T. Freeman 
Snapshot Hyperspectral Light Field Imaging 
Zhiwei Xiong, Lizhi Wang, Huiqun Li, Dong Liu, Feng Wu 
Semantic Image Inpainting With Deep Generative Models 
Raymond A. Yeh, Chen Chen, Teck Yian Lim, Alexander G. Schwing, Mark Hasegawa-Johnson, Minh N. Do

Image Motion & Tracking

Fast Multi-Frame Stereo Scene Flow With Motion Segmentation 
Tatsunori Taniai, Sudipta N. Sinha, Yoichi Sato 
Improved Stereo Matching With Constant Highway Networks and Reflective Confidence Learning 
Amit Shaked, Lior Wolf 
Optical Flow in Mostly Rigid Scenes 
Jonas Wulff, Laura Sevilla-Lara, Michael J. Black 
Optical Flow Requires Multiple Strategies (but Only One Network) 
Tal Schuster, Lior Wolf, David Gadot 
ECO: Efficient Convolution Operators for Tracking 
Martin Danelljan, Goutam Bhat, Fahad Shahbaz Khan, Michael Felsberg

Low- & Mid-Level Vision

Differential Angular Imaging for Material Recognition 
Jia Xue, Hang Zhang, Kristin Dana, Ko Nishino 
Fast Fourier Color Constancy 
Jonathan T. Barron, Yun-Ta Tsai 
Comparative Evaluation of Hand-Crafted and Learned Local Features 
Johannes L. Schönberger, Hans Hardmeier, Torsten Sattler, Marc Pollefeys 
Learning Fully Convolutional Networks for Iterative Non-Blind Deconvolution 
Jiawei Zhang, Jinshan Pan, Wei-Sheng Lai, Rynson W. H. Lau, Ming-Hsuan Yang 
Image Deblurring via Extreme Channels Prior 
Yanyang Yan, Wenqi Ren, Yuanfang Guo, Rui Wang, Xiaochun Cao 
Simultaneous Stereo Video Deblurring and Scene Flow Estimation 
Liyuan Pan, Yuchao Dai, Miaomiao Liu, Fatih Porikli 
Deep Photo Style Transfer 
Fujun Luan, Sylvain Paris, Eli Shechtman, Kavita Bala 
Generative Attribute Controller With Conditional Filtered Generative Adversarial Networks 
Takuhiro Kaneko, Kaoru Hiramatsu, Kunio Kashino 
Fast Haze Removal for Nighttime Image Using Maximum Reflectance Prior 
Jing Zhang, Yang Cao, Shuai Fang, Yu Kang, Chang Wen Chen

Machine Learning

Low-Rank Bilinear Pooling for Fine-Grained Classification 
Shu Kong, Charless Fowlkes 
Neural Scene De-Rendering 
Jiajun Wu, Joshua B. Tenenbaum, Pushmeet Kohli 
Real-Time Neural Style Transfer for Videos 
Haozhi Huang, Hao Wang, Wenhan Luo, Lin Ma, Wenhao Jiang, Xiaolong Zhu, Zhifeng Li, Wei Liu 
A Graph Regularized Deep Neural Network for Unsupervised Image Representation Learning 
Shijie Yang, Liang Li, Shuhui Wang, Weigang Zhang, Qingming Huang 
Collaborative Deep Reinforcement Learning for Joint Object Search 
Xiangyu Kong, Bo Xin, Yizhou Wang, Gang Hua 
Loss Max-Pooling for Semantic Image Segmentation 
Samuel Rota Bulò, Gerhard Neuhold, Peter Kontschieder 
Deep View Morphing 
Dinghuang Ji, Junghyun Kwon, Max McFarland, Silvio Savarese 
Unsupervised Learning of Long-Term Motion Dynamics for Videos 
Zelun Luo, Boya Peng, De-An Huang, Alexandre Alahi, Li Fei-Fei 
Revisiting Metric Learning for SPD Matrix Based Visual Representation 
Luping Zhou, Lei Wang, Jianjia Zhang, Yinghuan Shi, Yang Gao 
Expert Gate: Lifelong Learning With a Network of Experts 
Rahaf Aljundi, Punarjay Chakravarty, Tinne Tuytelaars 
A Gift From Knowledge Distillation: Fast Optimization, Network Minimization and Transfer Learning 
Junho Yim, Donggyu Joo, Jihoon Bae, Junmo Kim 
Domain Adaptation by Mixture of Alignments of Second- or Higher-Order Scatter Tensors 
Piotr Koniusz, Yusuf Tas, Fatih Porikli 
Deep Mixture of Linear Inverse Regressions Applied to Head-Pose Estimation 
Stéphane Lathuilière, Rémi Juge, Pablo Mesejo, Rafael Muñoz-Salinas, Radu Horaud 
STD2P: RGBD Semantic Segmentation Using Spatio-Temporal Data-Driven Pooling 
Yang He, Wei-Chen Chiu, Margret Keuper, Mario Fritz 
Harmonic Networks: Deep Translation and Rotation Equivariance 
Daniel E. Worrall, Stephan J. Garbin, Daniyar Turmukhambetov, Gabriel J. Brostow 
Multimodal Transfer: A Hierarchical Deep Convolutional Neural Network for Fast Artistic Style Transfer 
Xin Wang, Geoffrey Oxholm, Da Zhang, Yuan-Fang Wang 
Detect, Replace, Refine: Deep Structured Prediction for Pixel Wise Labeling 
Spyros Gidaris, Nikos Komodakis 
Weighted-Entropy-Based Quantization for Deep Neural Networks 
Eunhyeok Park, Junwhan Ahn, Sungjoo Yoo 
Residual Expansion Algorithm: Fast and Effective Optimization for Nonconvex Least Squares Problems 
Daiki Ikami, Toshihiko Yamasaki, Kiyoharu Aizawa 
Bidirectional Beam Search: Forward-Backward Inference in Neural Sequence Models for Fill-In-The-Blank Image Captioning 
Qing Sun, Stefan Lee, Dhruv Batra 
Newton-Type Methods for Inference in Higher-Order Markov Random Fields 
Hariprasad Kannan, Nikos Komodakis, Nikos Paragios 
Adaptive Relaxed ADMM: Convergence Theory and Practical Implementation 
Zheng Xu, Mário A. T. Figueiredo, Xiaoming Yuan, Christoph Studer, Tom Goldstein

Object Recognition & Scene Understanding

ViP-CNN: Visual Phrase Guided Convolutional Neural Network 
Yikang Li, Wanli Ouyang, Xiaogang Wang, Xiao’ou Tang 
Instance-Aware Image and Sentence Matching With Selective Multimodal LSTM 
Yan Huang, Wei Wang, Liang Wang 
Kernel Square-Loss Exemplar Machines for Image Retrieval 
Rafael S. Rezende, Joaquin Zepeda, Jean Ponce, Francis Bach, Patrick Pérez 
Cognitive Mapping and Planning for Visual Navigation 
Saurabh Gupta, James Davidson, Sergey Levine, Rahul Sukthankar, Jitendra Malik 
Combining Bottom-Up, Top-Down, and Smoothness Cues for Weakly Supervised Image Segmentation 
Anirban Roy, Sinisa Todorovic 
Seeing Into Darkness: Scotopic Visual Recognition 
Bo Chen, Pietro Perona 
Deep Co-Occurrence Feature Learning for Visual Object Recognition 
Ya-Fang Shih, Yang-Ming Yeh, Yen-Yu Lin, Ming-Fang Weng, Yi-Chang Lu, Yung-Yu Chuang 
An Empirical Evaluation of Visual Question Answering for Novel Objects 
Santhosh K. Ramakrishnan, Ambar Pal, Gaurav Sharma, Anurag Mittal 
InstanceCut: From Edges to Instances With MultiCut 
Alexander Kirillov, Evgeny Levinkov, Bjoern Andres, Bogdan Savchynskyy, Carsten Rother 
Fine-Grained Image Classification via Combining Vision and Language 
Xiangteng He, Yuxin Peng 
Mimicking Very Efficient Network for Object Detection 
Quanquan Li, Shengying Jin, Junjie Yan 
Tracking by Natural Language Specification 
Zhenyang Li, Ran Tao, Efstratios Gavves, Cees G. M. Snoek, Arnold W. M. Smeulders
A Dataset and Exploration of Models for Understanding Video Data Through Fill-In-The-Blank Question-Answering 
Tegan Maharaj, Nicolas Ballas, Anna Rohrbach, Aaron Courville, Christopher Pal 
Learning Detection With Diverse Proposals 
Samaneh Azadi, Jiashi Feng, Trevor Darrell 
Skeleton Key: Image Captioning by Skeleton-Attribute Decomposition 
Yufei Wang, Zhe Lin, Xiaohui Shen, Scott Cohen, Garrison W. Cottrell

Theory

A Low Power, Fully Event-Based Gesture Recognition System 
Arnon Amir, Brian Taba, David Berg, Timothy Melano, Jeffrey McKinstry, Carmelo Di Nolfo, Tapan Nayak, Alexander Andreopoulos, Guillaume Garreau, Marcela Mendoza, Jeff Kusnitz, Michael Debole, Steve Esser, Tobi Delbruck, Myron Flickner, Dharmendra Modha

Video Analytics

Learning Deep Context-Aware Features Over Body and Latent Parts for Person Re-Identification 
Dangwei Li, Xiaotang Chen, Zhang Zhang, Kaiqi Huang 
Recurrent Modeling of Interaction Context for Collective Activity Recognition 
Minsi Wang, Bingbing Ni, Xiaokang Yang 
Primary Object Segmentation in Videos Based on Region Augmentation and Reduction 
Yeong Jun Koh, Chang-Su Kim 
ROAM: A Rich Object Appearance Model With Application to Rotoscoping 
Ondrej Miksik, Juan-Manuel Pérez-Rúa, Philip H. S. Torr, Patrick Pérez 
Temporal Residual Networks for Dynamic Scene Recognition 
Christoph Feichtenhofer, Axel Pinz, Richard P. Wildes 
Spatiotemporal Multiplier Networks for Video Action Recognition 
Christoph Feichtenhofer, Axel Pinz, Richard P. Wildes 
Learning to Learn From Noisy Web Videos 
Serena Yeung, Vignesh Ramanathan, Olga Russakovsky, Liyue Shen, Greg Mori, Li Fei-Fei 
YouTube-BoundingBoxes: A Large High-Precision Human-Annotated Data Set for Object Detection in Video 
Esteban Real, Jonathon Shlens, Stefano Mazzocchi, Xin Pan, Vincent Vanhoucke 
Online Video Object Segmentation via Convolutional Trident Network 
Won-Dong Jang, Chang-Su Kim

Reposted from blog.csdn.net/hxg2006/article/details/80375468