


default search action
26th ACM Multimedia 2018: Seoul, Republic of Korea
- Susanne Boll, Kyoung Mu Lee, Jiebo Luo, Wenwu Zhu, Hyeran Byun, Chang Wen Chen, Rainer Lienhart, Tao Mei:

2018 ACM Multimedia Conference on Multimedia Conference, MM 2018, Seoul, Republic of Korea, October 22-26, 2018. ACM 2018, ISBN 978-1-4503-5665-7
FF-1
- Max Mühlhäuser:

Session details: FF-1. - Chuan-Xiang Li, Zhen-Duo Chen, Peng-Fei Zhang, Xin Luo

, Liqiang Nie, Wei Zhang, Xin-Shun Xu:
SCRATCH: A Scalable Discrete Matrix Factorization Hashing for Cross-Modal Retrieval. 1-9 - Ana Garcia del Molino, Joo-Hwee Lim, Ah-Hwee Tan

:
Predicting Visual Context for Unsupervised Event Segmentation in Continuous Photo-streams. 10-17 - Xingxing Wei, Jun Zhu, Sitong Feng, Hang Su:

Video-to-Video Translation with Global Temporal Consistency. 18-25 - Jinxing Li, Bob Zhang

, Guangming Lu, David Zhang:
Shared Linear Encoder-based Gaussian Process Latent Variable Model for Visual Classification. 26-34 - Jia-Xing Zhong, Nannan Li, Weijie Kong, Tao Zhang, Thomas H. Li, Ge Li:

Step-by-step Erasion, One-by-one Collection: A Weakly Supervised Temporal Action Detector. 35-44 - Jianshu Li, Jian Zhao, Yunpeng Chen

, Sujoy Roy, Shuicheng Yan, Jiashi Feng, Terence Sim:
Multi-Human Parsing Machines. 45-53 - Xuanyi Dong, Linchao Zhu

, De Zhang, Yi Yang, Fei Wu:
Fast Parameter Adaptation for Few-shot Image Captioning and Visual Question Answering. 54-62 - Junbo Wang, Wei Wang, Yan Huang, Liang Wang, Tieniu Tan:

Hierarchical Memory Modelling for Video Captioning. 63-71 - Zheng Wang, Xiang Bai

, Mang Ye
, Shin'ichi Satoh:
Incremental Deep Hidden Attribute Learning. 72-80 - Huarong Chen, Bin Wang

, Tianxiang Pan, Liwang Zhou, Hua Zeng:
CropNet: Real-Time Thumbnailing. 81-89 - Zhi-Qi Cheng

, Xiao Wu, Siyu Huang
, Jun-Xiu Li, Alexander G. Hauptmann, Qiang Peng:
Learning to Transfer: Generalizable Attribute Learning with Multitask Neural Model Search. 90-98 - Yingying Zhu, Jiong Wang, Lingxi Xie, Liang Zheng

:
Attention-based Pyramid Aggregation Network for Visual Place Recognition. 99-107 - Changde Du, Changying Du, Hao Wang, Jinpeng Li

, Wei-Long Zheng, Bao-Liang Lu, Huiguang He
:
Semi-supervised Deep Generative Modelling of Incomplete Multi-Modality Emotional Data. 108-116 - Yuxiao Chen, Jianbo Yuan, Quanzeng You, Jiebo Luo

:
Twitter Sentiment Analysis via Bi-sense Emoji Embedding and Attention-based LSTM. 117-125 - Feifei Zhang, Tianzhu Zhang, Qirong Mao, Lingyu Duan, Changsheng Xu:

Facial Expression Recognition in the Wild: A Cycle-Consistent Adversarial Attention Transfer Approach. 126-135 - Runnan Li, Zhiyong Wu, Jia Jia, Jingbei Li, Wei Chen, Helen Meng:

Inferring User Emotive State Changes in Realistic Human-Computer Conversational Dialogs. 136-144 - Zhengzhe Liu, Xiaojuan Qi, Lei Pang:

Self-boosted Gesture Interactive System with ST-Net. 145-153 - Felix Kosmalla, Christian Murlowski

, Florian Daiber
, Antonio Krüger
:
Slackliner - An Interactive Slackline Training Assistant. 154-162 - Yaoyu Li, Tianzhu Zhang, Lingyu Duan, Changsheng Xu:

A Unified Generative Adversarial Framework for Image Generation and Person Re-identification. 163-172 - Anahita Mahzari, Afshin Taghavi Nasrabadi, Aliehsan Samiei, Ravi Prakash

:
FoV-Aware Edge Caching for Adaptive 360° Video Streaming. 173-181
Keynote 1
- Susanne Boll:

Session details: Keynote 1. - Marianna Obrist:

Don't just Look - Smell, Taste, and Feel the Interaction. 182
FF-2
- Peng Cui:

Session details: FF-2. - Rui Zhang, Sheng Tang, Yu Li, Junbo Guo, Yongdong Zhang, Jintao Li, Shuicheng Yan:

Style Separation and Synthesis via Generative Adversarial Networks. 183-191 - Hao Xiao

, Weiyao Lin
, Bin Sheng, Ke Lu, Junchi Yan, Jingdong Wang
, Errui Ding, Yihao Zhang, Hongkai Xiong
:
Group Re-Identification: Leveraging and Integrating Multi-Grain Information. 192-200 - Xu Gao, Tingting Jiang

:
OSMO: Online Specific Models for Occlusion in Multiple Object Tracking under Surveillance Scene. 201-210 - Yuke Li:

Video Forecasting with Forward-Backward-Net: Delving Deeper into Spatiotemporal Consistency. 211-219 - Rui Shao, Xiangyuan Lan, Pong C. Yuen:

Feature Constrained by Pixel: Hierarchical Adversarial Deep Domain Adaptation. 220-228 - Zhixing Chen, Di Huang, Yunhong Wang, Liming Chen:

Fast and Light Manifold CNN based 3D Facial Expression Recognition across Pose Variations. 229-238 - Xiaomeng Song, Yucheng Shi

, Xin Chen, Yahong Han:
Explore Multi-Step Reasoning in Video Question Answering. 239-247 - Shancheng Fang, Hongtao Xie, Zheng-Jun Zha

, Nannan Sun, Jianlong Tan, Yongdong Zhang:
Attention and Language Ensemble for Scene Text Recognition with Convolutional Sequence Modeling. 248-256 - Zhaoyang Zhang, Zhanghui Kuang, Ping Luo, Litong Feng, Wei Zhang

:
Temporal Sequence Distillation: Towards Few-Frame Action Recognition in Videos. 257-264 - Zhihang Fu, Zhongming Jin, Guo-Jun Qi

, Chen Shen, Rongxin Jiang, Yaowu Chen, Xian-Sheng Hua:
Previewer for Multi-Scale Object Detector. 265-273 - Guanshuo Wang, Yufeng Yuan, Xiong Chen, Jiwei Li, Xi Zhou:

Learning Discriminative Features with Multiple Granularities for Person Re-Identification. 274-282 - Guoxiang Qu, Wenwei Zhang, Zhe Wang, Xing Dai, Jianping Shi, Junjun He, Fei Li, Xiulan Zhang, Yu Qiao

:
StripNet: Towards Topology Consistent Strip Structure Segmentation. 283-291 - Samuel Albanie, Arsha Nagrani, Andrea Vedaldi, Andrew Zisserman:

Emotion Recognition in Speech using Cross-Modal Transfer in the Wild. 292-301 - Can Wang, Shangfei Wang:

Personalized Multiple Facial Action Unit Recognition through Generative Adversarial Recognition Network. 302-310 - Cigdem Beyan

, Muhammad Shahid, Vittorio Murino
:
Investigation of Small Group Social Interactions Using Deep Visual Activity-Based Nonverbal Features. 311-319 - Eugene Yujun Fu

, Michael Xuelin Huang, Hong Va Leong, Grace Ngai
:
Cross-Species Learning: A Low-Cost Approach to Learning Human Fight from Animal Fight. 320-327 - Qianli Xu, Vigneshwaran Subbaraju

, Chee How Cheong, Aijing Wang, Kathleen Kang, Munirah Bashir, Yanhong Dong
, Liyuan Li, Joo-Hwee Lim:
Personalized Serious Games for Cognitive Intervention with Lifelog Visual Analytics. 328-336 - Wendy Bolier, Wolfgang Hürst, Guido van Bommel, Joost Bosman, Harriët Bosman:

Drawing in a Virtual 3D Space - Introducing VR Drawing in Elementary School Art Education. 337-345 - Luca Lovagnini, Wenxiao Zhang, Farshid Hassani Bijarbooneh, Pan Hui:

CIRCE: Real-Time Caching for Instance Recognition on Cloud Environments and Multi-Core Architectures. 346-354 - Wenxiao Zhang, Bo Han, Pan Hui:

Jaguar: Low Latency Mobile Augmented Reality with Flexible Tracking. 355-363
Keynote 2
- Tao Mei:

Session details: Keynote 2. - Xian-Sheng Hua:

Challenges and Practices of Large Scale Visual Intelligence in the Real-World. 364
Deep-1 (Image Translation)
- Nicu Sebe:

Session details: Deep-1 (Image Translation). - Yuheng Zhi

, Huawei Wei, Bingbing Ni:
Structure Guided Photorealistic Style Transfer. 365-373 - Xuewen Yang

, Dongliang Xie, Xin Wang:
Crossing-Domain Generative Adversarial Networks for Unsupervised Multi-Domain Image-to-Image Translation. 374-382 - Bo Zhao, Xiao Wu, Zhi-Qi Cheng

, Hao Liu, Zequn Jie, Jiashi Feng:
Multi-View Image Generation from a Single-View. 383-391 - Jichao Zhang, Yezhi Shu, Songhua Xu, Gongze Cao, Fan Zhong, Meng Liu, Xueying Qin:

Sparsely Grouped Multi-Task Generative Adversarial Networks for Facial Attribute Manipulation. 392-401
Vision-1 (Machine Learning)
- Jingkuan Song:

Session details: Vision-1 (Machine Learning). - Jindong Wang

, Wenjie Feng
, Yiqiang Chen
, Han Yu
, Meiyu Huang, Philip S. Yu:
Visual Domain Adaptation with Manifold Embedded Distribution Alignment. 402-410 - Zheyan Shen, Peng Cui, Kun Kuang, Bo Li, Peixuan Chen:

Causally Regularized Learning with Agnostic Data Selection Bias. 411-419 - Yanjie Liang, Qiangqiang Wu, Yi Liu, Yan Yan, Hanzi Wang:

Robust Correlation Filter Tracking with Shepherded Instance-Aware Proposals. 420-428 - Fan Qi, Xiaoshan Yang, Changsheng Xu:

A Unified Framework for Multimodal Domain Adaptation. 429-437
Multimedia-1 (Multimedia Recommendation & Discovery)
- Mark Liao:

Session details: Multimedia-1 (Multimedia Recommendation & Discovery). - Shintami Chusnul Hidayati, Cheng-Chun Hsu, Yu-Ting Chang, Kai-Lung Hua

, Jianlong Fu, Wen-Huang Cheng:
What Dress Fits Me Best?: Fashion Recommendation on the Clothing Style for Personal Body Shape. 438-446 - Xiaowen Huang, Shengsheng Qian, Quan Fang, Jitao Sang, Changsheng Xu:

CSAN: Contextual Self-Attention Network for User Sequential Recommendation. 447-455 - Jun Hu, Shengsheng Qian, Quan Fang, Changsheng Xu:

Attentive Interactive Convolutional Matching for Community Question Answering in Social Multimedia. 456-464 - Francesco Gelli, Tiberio Uricchio

, Xiangnan He, Alberto Del Bimbo, Tat-Seng Chua:
Beyond the Product: Discovering Image Posts for Brands in Social Media. 465-473
Vision-2 (Object & Scene Understanding)
- Zheng-Jun Zha:

Session details: Vision-2 (Object & Scene Understanding). - Lishi Zhang, Chenghan Fu, Jia Li:

Collaborative Annotation of Semantic Objects in Images with Multi-granularity Supervisions. 474-482 - Mengyang Pu, Yaping Huang, Qingji Guan, Qi Zou:

GraphNet: Learning Image Pseudo Annotations for Weakly-Supervised Semantic Segmentation. 483-491 - Hengcan Shi, Hongliang Li

, Qingbo Wu, Fanman Meng, King N. Ngan:
Boosting Scene Parsing Performance via Reliable Scale Prediction. 492-500 - Fan Zhu, Li Liu, Jin Xie, Fumin Shen, Ling Shao

, Yi Fang:
Learning to Synthesize 3D Indoor Scenes from Monocular Images. 501-509
Multimodal-1 (Multimodal Reasoning)
- Xian-Sheng Hua:

Session details: Multimodal-1 (Multimodal Reasoning). - Chaojun Han, Fumin Shen, Li Liu, Yang Yang, Heng Tao Shen:

Visual Spatial Attention Network for Relationship Detection. 510-518 - Chenfei Wu

, Jinlai Liu, Xiaojie Wang, Xuan Dong:
Object-Difference Attention: A Simple Relational Attention for Visual Question Answering. 519-527 - Jinwei Qi, Yuxin Peng, Yunkan Zhuo:

Life-long Cross-media Correlation Learning. 528-536 - Yue Gu

, Xinyu Li
, Kaixiang Huang, Shiyu Fu, Kangning Yang
, Shuhong Chen, Moliang Zhou, Ivan Marsic:
Human Conversation Analysis Using Attentive Multimodal Networks with Hierarchical Encoder-Decoder. 537-545
System-1 (Video Analysis & Streaming)
- Xin Yang:

Session details: System-1 (Video Analysis & Streaming). - Wentao Liu

, Zhengfang Duanmu, Zhou Wang
:
End-to-End Blind Quality Assessment of Compressed Videos Using Deep Neural Networks. 546-554 - Ibrahim Ben Mustafa, Tamer Nadeem

, Emir Halepovic:
FlexStream: Towards Flexible Adaptive Video Streaming on End Devices using Extreme SDN. 555-563 - Lan Xie, Xinggong Zhang, Zongming Guo:

CLS: A Cross-user Learning based System for Improving QoE in 360-degree Video Adaptive Streaming. 564-572 - Abdelhak Bentaleb, Ali C. Begen

, Saad Harous
, Roger Zimmermann:
A Distributed Approach for Bitrate Selection in HTTP Adaptive Streaming. 573-581
FF-3
- Zhu Li:

Session details: FF-3. - Qing Zhang, Ganzhao Yuan, Chunxia Xiao, Lei Zhu, Wei-Shi Zheng:

High-Quality Exposure Correction of Underexposed Photos. 582-590 - Qianqian Xu, Jiechao Xiong, Xinwei Sun

, Zhiyong Yang
, Xiaochun Cao, Qingming Huang, Yuan Yao:
A Margin-based MLE for Crowdsourced Partial Ranking. 591-599 - Ana Garcia del Molino, Michael Gygli:

PHD-GIFs: Personalized Highlight Detection for Automatic GIF Creation. 600-608 - Lu Pang, Yaowei Wang, Yi-Zhe Song

, Tiejun Huang, Yonghong Tian:
Cross-Domain Adversarial Feature Learning for Sketch Re-identification. 609-617 - Quan Chen, Tiezheng Ge, Yanyu Xu

, Zhiqiang Zhang, Xinxin Yang, Kun Gai:
Semantic Human Matting. 618-626 - Lingxiao Song, Zhihe Lu

, Ran He, Zhenan Sun, Tieniu Tan:
Geometry Guided Adversarial Facial Expression Synthesis. 627-635 - Siqi Wang, Yijie Zeng, Qiang Liu, Chengzhang Zhu, En Zhu, Jianping Yin:

Detecting Abnormality without Knowing Normality: A Two-stage Approach for Unsupervised Video Abnormal Event Detection. 636-644 - Tingting Li, Ruihe Qian, Chao Dong, Si Liu, Qiong Yan, Wenwu Zhu, Liang Lin:

BeautyGAN: Instance-level Facial Makeup Transfer with Deep Generative Adversarial Network. 645-653 - Xianghui Luo, Zhuo Su, Jiaming Guo, Gengwei Zhang

, Xiangjian He
:
Trusted Guidance Pyramid Network for Human Parsing. 654-662 - Jingjing Li, Lei Zhu

, Zi Huang
, Ke Lu, Jidong Zhao:
I read, I saw, I tell: Texts Assisted Fine-Grained Visual Classification. 663-671 - Ziwei Wang

, Yadan Luo
, Yang Li, Zi Huang
, Hongzhi Yin
:
Look Deeper See Richer: Depth-aware Image Paragraph Captioning. 672-680 - Huaiwen Zhang, Quan Fang, Shengsheng Qian, Changsheng Xu:

Learning Multimodal Taxonomy via Variational Deep Graph Embedding and Clustering. 681-689 - Junyu Gao, Tianzhu Zhang, Changsheng Xu:

Watch, Think and Attend: End-to-End Video Classification via Dynamic Knowledge Evolution Modeling. 690-699 - Yongcheng Liu, Lu Sheng

, Jing Shao, Junjie Yan, Shiming Xiang, Chunhong Pan:
Multi-Label Image Classification via Knowledge Distillation from Weakly-Supervised Detection. 700-708 - Jiayu Wang, Wengang Zhou, Jinhui Tang

, Zhongqian Fu, Qi Tian, Houqiang Li:
Unregularized Auto-Encoder with Generative Adversarial Networks for Image Generation. 709-717 - Yangbangyan Jiang, Zhiyong Yang

, Qianqian Xu, Xiaochun Cao, Qingming Huang:
When to Learn What: Deep Cognitive Subspace Clustering. 718-726 - Wendong Zhang

, Feng Gao, Bingbing Ni, Lingyu Duan, Yichao Yan, Jingwei Xu, Xiaokang Yang:
Depth Structure Preserving Scene Image Generation. 727-736 - Jiawei Liu, Zheng-Jun Zha

, Hongtao Xie, Zhiwei Xiong, Yongdong Zhang:
CA3Net: Contextual-Attentional Attribute-Appearance Network for Person Re-Identification. 737-745 - Gusi Te, Wei Hu, Amin Zheng, Zongming Guo:

RGCNN: Regularized Graph CNN for Point Cloud Segmentation. 746-754 - Bin Liu

, Yue Cao, Mingsheng Long
, Jianmin Wang
, Jingdong Wang:
Deep Triplet Quantization. 755-763
Keynote 3
- Jiebo Luo:

Session details: Keynote 3. - Ernest A. Edmonds:

What has Art Got to do With It? 773
Best Paper Session
- Rainer Lienhart, Tao Mei:

Session details: Best Paper Session. - Hao Tang, Wei Wang, Dan Xu

, Yan Yan, Nicu Sebe
:
GestureGAN for Hand Gesture-to-Gesture Translation in the Wild. 774-782 - Bei Liu, Jianlong Fu, Makoto P. Kato, Masatoshi Yoshikawa:

Beyond Narrative Description: Generating Poetry from Images by Multi-Adversarial Training. 783-791 - Jian Zhao, Jianshu Li, Yu Cheng, Terence Sim, Shuicheng Yan, Jiashi Feng:

Understanding Humans in Crowded Scenes: Deep Nested Adversarial Learning and A New Benchmark for Multi-Human Parsing. 792-800 - Lizi Liao

, Yunshan Ma
, Xiangnan He, Richang Hong, Tat-Seng Chua:
Knowledge-aware Multimodal Dialogue Systems. 801-809
Doctoral Symposium
- Meng Wang:

Session details: Doctoral Symposium. - Na Zhao

:
End2End Semantic Segmentation for 3D Indoor Scenes. 810-814 - Sabrina Kletz:

On Reducing Effort in Evaluating Laparoscopic Skills. 815-819 - Tianran Hu:

Decode Human Life from Social Media. 820-824
FF-4
- Wen-Huang Cheng:

Session details: FF-4. - Yiling Wu, Shuhui Wang, Qingming Huang:

Learning Semantic Structure-preserved Embeddings for Cross-modal Retrieval. 825-833 - Zhendong Mao, Quan Wang, Yongdong Zhang, Bin Wang:

Post Tuned Hashing: A New Approach to Indexing High-dimensional Data. 834-842 - Meng Liu, Xiang Wang, Liqiang Nie, Qi Tian, Baoquan Chen, Tat-Seng Chua:

Cross-modal Moment Localization in Videos. 843-851 - Zhaoda Ye, Yuxin Peng:

Multi-Scale Correlation for Sequential Cross-modal Hashing Learning. 852-860 - Litao Yu, Yongsheng Gao

, Jun Zhou:
Generative Adversarial Product Quantisation. 861-869 - Yubin Deng, Chen Change Loy, Xiaoou Tang:

Aesthetic-Driven Image Enhancement by Adversarial Learning. 870-878 - Kekai Sheng, Weiming Dong, Chongyang Ma, Xing Mei, Feiyue Huang, Bao-Gang Hu

:
Attention-based Multi-Patch Aggregation for Image Aesthetic Assessment. 879-886 - Zheqi He

, Yafeng Zhou, Yongtao Wang, Siwei Wang, Xiaoqing Lu, Zhi Tang, Ling Cai:
An End-to-End Quadrilateral Regression Network for Comic Panel Extraction. 887-895 - Xin Yang, Jinyu Chen, Zhiwei Wang

, Qiaozhe Zhang
, Wenyu Liu
, Chunyuan Liao, Kwang-Ting Cheng
:
Monocular Camera Based Real-Time Dense Mapping Using Generative Adversarial Network. 896-904 - Xiaojing Ma, Changming Liu

, Sixing Cao, Bin Zhu:
JPEG Decompression in the Homomorphic Encryption Domain. 905-913 - Mengbai Xiao, Shuoqian Wang, Chao Zhou, Li Liu, Zhenhua Li, Yao Liu, Songqing Chen:

MiniView Layout for Bandwidth-Efficient 360-Degree Video. 914-922 - Guoxian Song, Jianfei Cai

, Tat-Jen Cham
, Jianmin Zheng, Juyong Zhang, Henry Fuchs:
Real-time 3D Face-Eye Performance Capture of a Person Wearing VR Headset. 923-931 - Chen Li, Mai Xu, Xinzhe Du, Zulin Wang:

Bridge the Gap Between VQA and Human Behavior on Omnidirectional Video: A Large-Scale Dataset and a Deep Learning Model. 932-940 - Zongpu Zhang, Yang Hua

, Tao Song
, Zhengui Xue, Ruhui Ma, Neil Martin Robertson, Haibing Guan:
Tracking-assisted Weakly Supervised Online Visual Object Segmentation in Unconstrained Videos. 941-949 - Praveen Tirupattur

, Yogesh Singh Rawat, Concetto Spampinato, Mubarak Shah:
ThoughtViz: Visualizing Human Thoughts Using Generative Adversarial Network. 950-958 - Xiaoju Zheng, Zheng-Jun Zha

, Liansheng Zhuang:
A Feature-Adaptive Semi-Supervised Framework for Co-saliency Detection. 959-966 - Jogendra Nath Kundu, Aditya Ganeshan

, Rahul M. V., Aditya Prakash, Venkatesh Babu R.
:
iSPA-Net: Iterative Semantic Pose Alignment Network. 967-975 - Litong Feng, Ziyin Li, Zhanghui Kuang, Wei Zhang

:
Extractive Video Summarizer with Memory Augmented Neural Networks. 976-983 - Jing Zhang

, Yang Cao, Yang Wang, Chenglin Wen, Chang Wen Chen
:
Fully Point-wise Convolutional Neural Network for Modeling Statistical Regularities in Natural Images. 984-992 - Jingjia Huang, Nannan Li, Jia-Xing Zhong, Thomas H. Li, Ge Li:

Online Action Tube Detection via Resolving the Spatio-temporal Context Pattern. 993-1001 - Zhiwei Fang, Jing Liu, Yanyuan Qiao

, Qu Tang, Yong Li, Hanqing Lu:
Enhancing Visual Question Answering Using Dropout. 1002-1010 - Shota Horiguchi, Naoyuki Kanda, Kenji Nagamatsu:

Face-Voice Matching using Cross-modal Embeddings. 1011-1019 - Jingjing Chen

, Chong-Wah Ngo, Fuli Feng, Tat-Seng Chua:
Deep Understanding of Cooking Procedure for Cross-modal Recipe Retrieval. 1020-1028 - Yu Wu

, Linchao Zhu
, Lu Jiang, Yi Yang:
Decoupled Novel Object Captioner. 1029-1037 - David Semedo

, João Magalhães:
Temporal Cross-Media Retrieval with Soft-Smoothing. 1038-1046 - Yu Song, Fan Tang, Weiming Dong, Xiaopeng Zhang, Oliver Deussen, Tong-Yee Lee:

Photo Squarization by Deep Multi-Operator Retargeting. 1047-1055 - Guanbin Li, Xiang He, Wei Zhang, Huiyou Chang, Le Dong, Liang Lin:

Non-locally Enhanced Encoder-Decoder Network for Single Image De-raining. 1056-1064 - Pu Zhao

, Sijia Liu, Yanzhi Wang, Xue Lin:
An ADMM-Based Universal Framework for Adversarial Attacks on Deep Neural Networks. 1065-1073 - Jiwei Yang, Xu Shen, Xinmei Tian, Houqiang Li, Jianqiang Huang, Xian-Sheng Hua:

Local Convolutional Neural Networks for Person Re-Identification. 1074-1082 - Zhihe Lu

, Tanhao Hu, Lingxiao Song, Zhaoxiang Zhang, Ran He:
Conditional Expression Synthesis with Face Parsing Transformation. 1083-1091 - Liang Li

, Shuhui Wang, Shuqiang Jiang, Qingming Huang:
Attentive Recurrent Neural Network for Weak-supervised Multi-label Image Classification. 1092-1100 - Jatin Garg, Skand Vishwanath Peri, Himanshu Tolani, Narayanan C. Krishnan

:
Deep Cross Modal Learning for Caricature Verification and Identification (CaVINet). 1101-1109 - Nakamasa Inoue, Koichi Shinoda:

Few-Shot Adaptation for Multimedia Semantic Indexing. 1110-1118 - Zhengzhong Zhou, Xiu Di, Wei Zhou, Liqing Zhang:

Fashion Sensitive Clothing Recommendation Using Hierarchical Collocation Model. 1119-1127 - Yihang Lou, Yan Bai, Shiqi Wang, Ling-Yu Duan:

Multi-Scale Context Attention Network for Image Retrieval. 1128-1136 - Yibing Zhan, Jun Yu, Zhou Yu

, Rong Zhang, Dacheng Tao
, Qi Tian:
Comprehensive Distance-Preserving Autoencoders for Cross-Modal Retrieval. 1137-1145 - Xusong Chen, Dong Liu, Zheng-Jun Zha

, Wengang Zhou, Zhiwei Xiong, Yan Li:
Temporal Hierarchical Attention at Category- and Item-Level for Micro-Video Click-Through Prediction. 1146-1153 - Jufeng Yang, Liyi Chen, Le Zhang

, Xiaoxiao Sun
, Dongyu She, Shao-Ping Lu, Ming-Ming Cheng
:
Historical Context-based Style Classification of Painting Images via Label Distribution Learning. 1154-1162 - Hao Wu, Zhengxing Sun, Weihang Yuan:

Direction-aware Neural Style Transfer. 1163-1171 - Bin He, Feng Gao, Daiqian Ma, Boxin Shi

, Ling-Yu Duan:
ChipGAN: A Generative Adversarial Network for Chinese Ink Wash Painting Style Transfer. 1172-1180 - Teemu Kämäräinen

, Matti Siekkinen
, Jukka Eerikäinen, Antti Ylä-Jääski
:
CloudVR: Cloud Accelerated Interactive Mobile Virtual Reality. 1181-1189 - Anh Nguyen, Zhisheng Yan

, Klara Nahrstedt:
Your Attention is Unique: Detecting 360-Degree Video Saliency in Head-Mounted Display for Head Movement Prediction. 1190-1198 - Yiting Shao

, Qi Zhang
, Ge Li, Zhu Li, Li Li:
Hybrid Point Cloud Attribute Compression Using Slice-based Layered Structure and Block-based Intra Prediction. 1199-1207 - Tianchi Huang, Rui-Xiao Zhang, Chao Zhou, Lifeng Sun:

QARC: Video Quality Aware Rate Control for Real-Time Video Streaming based on Deep Reinforcement Learning. 1208-1216 - Haitian Pang, Cong Zhang, Fangxin Wang, Han Hu, Zhi Wang, Jiangchuan Liu, Lifeng Sun:

Optimizing Personalized Interaction Experience in Crowd-Interactive Livecast: A Cloud-Edge Approach. 1217-1225
Demo + Video + Makers' Program
- Kwanghoon Sohn, Yong Man Ro:

Session details: Demo + Video + Makers' Program. - Songyou Peng, Le Zhang

, Stefan Winkler, Marianne Winslett:
Give Me One Portrait Image, I Will Tell You Your Emotion and Personality. 1226-1227 - Yang Liu, Yang Yang, Weidong Fang, Wuxiong Zhang:

Demo: Phase-based Acoustic Localization and Motion Tracking for Mobile Interaction. 1228-1230 - Cunjun Zhang, Kehua Lei, Jia Jia, Yihui Ma, Zhiyuan Hu:

AI Painting: An Aesthetic Painting Generation System. 1231-1233 - Aleksandr Farseev, Kirill Lepikhin, Hendrik Schwartz, Eu Khoon Ang, Kenny Powar:

SoMin.ai: Social Multimedia Influencer Discovery Marketplace. 1234-1236 - Taoran Tang, Hanyang Mao, Jia Jia:

AniDance: Real-Time Dance Motion Synthesize to the Song. 1237-1239 - Gjorgji Strezoski, Inske Groenen

, Jurriaan Besenbruch, Marcel Worring
:
ArtSight: An Artistic Data Exploration Engine. 1240-1241 - Yoonjung Park, Yoonsik Yang, Hyocheol Ro, Junghyun Byun, Seougho Chae, Tack-Don Han:

Meet AR-bot: Meeting Anywhere, Anytime with Movable Spatial AR Robot. 1242-1243 - Ryosuke Tanno, Daichi Horita, Wataru Shimoda, Keiji Yanai

:
Magical Rice Bowl: A Real-time Food Category Changer. 1244-1246 - Haolin Ren, Benjamin Renoust, Guy Melançon

, Marie-Luce Viaud, Shin'ichi Satoh:
Exploring Temporal Communities in Mass Media Archives. 1247-1249 - Matthias Zeppelzauer

, Alexis Ringot
, Florian Taurer
:
SoniControl - A Mobile Ultrasonic Firewall. 1250-1252 - Mohammed Habibullah Baig, Jibin Rajan Varghese

, Zhangyang Wang:
MusicMapp: A Deep Learning Based Solution for Music Exploration and Visual Interaction. 1253-1255 - Paula Gómez Duran

, Eva Mohedano, Kevin McGuinness
, Xavier Giró-i-Nieto
, Noel E. O'Connor
:
Demonstration of an Open Source Framework for Qualitative Evaluation of CBIR Systems. 1256-1257 - Yun-Gyung Cheong, Woo-Hyun Park, Hye-Yeon Yu:

A Demonstration of an Intelligent Storytelling System. 1258-1259 - Yaohua Bu, Jia Jia, Xiang Li, Suping Zhou, Xiaobo Lu:

IcooBook: When the Picture Book for Children Encounters Aesthetics of Interaction. 1260-1262 - Thomas Forgione, Axel Carlier, Géraldine Morin, Wei Tsang Ooi

, Vincent Charvillat, Praveen Kumar Yadav:
An Implementation of a DASH Client for Browsing Networked Virtual Environment. 1263-1264 - Lizi Liao

, You Zhou, Yunshan Ma
, Richang Hong, Tat-Seng Chua:
Knowledge-aware Multimodal Fashion Chatbot. 1265-1266 - Alex Lee, Chang-Uk Kwak, Jeong-Woo Son, Sun-Joong Kim:

SVIAS: Scene-segmented Video Information Annotation System. 1267-1269 - Chang-Uk Kwak, Minho Han, Sun-Joong Kim, Gyeong-June Hahm

:
Interactive Story Maker: Tagged Video Retrieval System for Video Re-creation Service. 1270-1271 - Xingyu Liu, Jingfan Guo, Tongwei Ren, Yahong Han, Lei Huang, Gangshan Wu:

HeterStyle: A Heterogeneous Video Style Transfer Application. 1272-1273 - Hyocheol Ro, Inhwan Kim, Junghyun Byun, Yoonsik Yang, Yoonjung Park, Seungho Chae, Tack-Don Han:

PAMI: Projection Augmented Meeting Interface for Video Conferencing. 1274-1277 - Yoonjung Park, Yoonsik Yang, Hyocheol Ro, Jinwon Cha, Kyuri Kim, Tack-Don Han:

ChildAR-bot: Educational Playing Projection-based AR Robot for Children. 1278-1282
Deep-2 (Recognition)
- Qin Jin:

Session details: Deep-2 (Recognition). - Yansong Tang, Zian Wang, Peiyang Li, Jiwen Lu

, Ming Yang
, Jie Zhou:
Mining Semantics-Preserving Attention for Group Activity Recognition. 1283-1291 - Rui Yan, Jinhui Tang

, Xiangbo Shu, Zechao Li, Qi Tian:
Participation-Contributed Temporal Dynamic Model for Group Activity Recognition. 1292-1300 - Peiqin Zhuang, Yali Wang, Yu Qiao:

WildFish: A Large Benchmark for Fish Recognition in the Wild. 1301-1309 - Haoxuan You

, Yifan Feng
, Rongrong Ji, Yue Gao:
PVNet: A Joint Convolutional Network of Point Cloud and Multi-View for 3D Shape Recognition. 1310-1318
Multimedia-2 (Socical & Emotional Multimedia)
- Rongrong Ji:

Session details: Multimedia-2 (Socical & Emotional Multimedia). - Sicheng Zhao, Xin Zhao, Guiguang Ding, Kurt Keutzer:

EmotionGAN: Unsupervised Domain Adaptation for Learning Discrete Probability Distributions of Image Emotions. 1319-1327 - Pei Lv, Meng Wang, Yongbo Xu, Ze Peng, Junyi Sun, Shi-Mei Su, Bing Zhou, Mingliang Xu:

USAR: An Interactive User-specific Aesthetic Ranking Framework for Images. 1328-1336 - Ekraam Sabir, Wael AbdAlmageed, Yue Wu

, Prem Natarajan:
Deep Multimodal Image-Repurposing Detection. 1337-1345 - Bowen Pan, Shangfei Wang:

Facial Expression Recognition Enhanced by Thermal Images through Adversarial Learning. 1346-1353
Panel-1
- Jun Jitao, Yu Sang:

Session details: Panel-1. - Jitao Sang, Jun Yu, Ramesh C. Jain, Rainer Lienhart, Peng Cui, Jiashi Feng:

Deep Learning for Multimedia: Science or Technology? 1354-1355
Open Source Software Competition
- Min-Chun Hu:

Session details: Open Source Software Competition. - Kuan-Ting Lai

, Chia-Chih Lin, Chun-Yao Kang, Mei-Enn Liao, Ming-Syan Chen:
VIVID: Virtual Environment for Visual Deep Learning. 1356-1359 - Tsung-Wei Huang, Chun-Xun Lin, Guannan Guo, Martin D. F. Wong

:
A General-purpose Distributed Programming System using Data-parallel Streams. 1360-1363 - Konstantinos Zampogiannis, Cornelia Fermüller, Yiannis Aloimonos:

cilantro: A Lean, Versatile, and Efficient Library for Point Cloud Data Processing. 1364-1367 - Matthieu Pizenberg, Axel Carlier, Emmanuel Faure, Vincent Charvillat:

Web-Based Configurable Image Annotations. 1368-1371
Vision-3 (Applications in Multimedia)
- Zheng-Jun Zha:

Session details: Vision-3 (Applications in Multimedia). - Xiangteng He, Yuxin Peng:

Only Learn One Sample: Fine-Grained Visual Categorization with One Sample Training. 1372-1380 - Kecheng Zheng, Zheng-Jun Zha

, Yang Cao, Xuejin Chen, Feng Wu:
LA-Net: Layout-Aware Dense Network for Monocular Depth Estimation. 1381-1388 - Ziqing Huang, Shiguang Liu:

Robustness and Discrimination Oriented Hashing Combining Texture and Invariant Vector Distance. 1389-1397 - Shuhui Wang, Yangyu Chen, Junbao Zhuo, Qingming Huang, Qi Tian:

Joint Global and Co-Attentive Representation Learning for Image-Sentence Retrieval. 1398-1406
Multimodal-2 (Cross-Modal Translation)
- Xian-Sheng Hua:

Session details: Multimodal-2 (Cross-Modal Translation). - Mingkuan Yuan, Yuxin Peng:

Text-to-image Synthesis via Symmetrical Distillation Networks. 1407-1415 - Daqing Liu, Zheng-Jun Zha

, Hanwang Zhang
, Yongdong Zhang, Feng Wu:
Context-Aware Visual Policy Network for Sequence-Level Image Captioning. 1416-1424 - Sheng Liu, Zhou Ren, Junsong Yuan

:
SibNet: Sibling Convolutional Encoder for Video Captioning. 1425-1434 - Wenbin Che, Xiaopeng Fan

, Ruiqin Xiong, Debin Zhao:
Paragraph Generation Network with Visual Relationship Detection. 1435-1443
Panel-2
- Jiaying Liu, Wen-Huang Cheng:

Session details: Panel-2. - Wen-Huang Cheng, Jiaying Liu, Mohan S. Kankanhalli, Abdulmotaleb El-Saddik, Benoit Huet:

AI + Multimedia Make Better Life? 1455-1456
FF-5
- Zhu Li:

Session details: FF-5. - Na Jiang, Sichen Bai, Yue Xu, Chang Xing, Zhong Zhou

, Wei Wu:
Online Inter-Camera Trajectory Association Exploiting Person Re-Identification and Camera Topology. 1457-1465 - Jing Zhu, Yi Fang:

Learning Local Descriptors with Adversarial Enhancer from Volumetric Geometry Patches. 1466-1474 - Zhen Cui, Chunyan Xu, Wenming Zheng

, Jian Yang:
Context-Dependent Diffusion Network for Visual Relationship Detection. 1475-1482 - Shuo Wang, Dan Guo

, Wengang Zhou, Zheng-Jun Zha
, Meng Wang:
Connectionist Temporal Fusion for Sign Language Translation. 1483-1491 - Kai Li, Zhengming Ding, Kunpeng Li, Yulun Zhang

, Yun Fu:
Support Neighbor Loss for Person Re-Identification. 1492-1500 - Bing Li, Chia-Wen Lin

, Shan Liu
, Tiejun Huang, Wen Gao, C.-C. Jay Kuo
:
Perceptual Temporal Incoherence Aware Stereo Video Retargeting. 1501-1509 - Yanli Ji, Feixiang Xu, Yang Yang, Fumin Shen, Heng Tao Shen, Wei-Shi Zheng:

A Large-scale RGB-D Database for Arbitrary-view Human Action Recognition. 1510-1518 - Huiyun Wang, Youjiang Xu, Yahong Han:

Spotting and Aggregating Salient Regions for Video Captioning. 1519-1526 - Qixian Zhou, Xiaodan Liang, Ke Gong, Liang Lin:

Adaptive Temporal Encoding Network for Video Instance-level Human Parsing. 1527-1535 - Yuanzheng Ci

, Xinzhu Ma, Zhihui Wang, Haojie Li, Zhongxuan Luo:
User-Guided Deep Anime Line Art Colorization with Conditional Adversarial Networks. 1536-1544 - Tianli Zhao, Xiangyu He

, Jian Cheng, Jing Hu:
BitStream: Efficient Computing Architecture for Real-Time Low-Power Inference of Binary Neural Networks on CPUs. 1545-1552 - Lingbo Liu, Ruimao Zhang

, Jiefeng Peng, Guanbin Li, Bowen Du, Liang Lin:
Attentive Crowd Flow Machines. 1553-1561 - Deqiang Ouyang

, Jie Shao, Yonghui Zhang, Yang Yang, Heng Tao Shen:
Video-based Person Re-identification via Self-Paced Learning and Deep Reinforcement Learning Framework. 1562-1570 - Lizi Liao

, Xiangnan He, Bo Zhao, Chong-Wah Ngo, Tat-Seng Chua:
Interpretable Multimodal Retrieval for Fashion Products. 1571-1579 - Chieh-Yu Chen, Wenze Lai, Hsin-Ying Hsieh, Wen-Hao Zheng, Yu-Shuen Wang, Jung-Hong Chuang:

Generating Defensive Plays in Basketball Games. 1580-1588 - Hong Liu, Mingbao Lin, Shengchuan Zhang, Yongjian Wu, Feiyue Huang, Rongrong Ji

:
Dense Auto-Encoder Hashing for Robust Cross-Modality Retrieval. 1589-1597 - Taoran Tang, Jia Jia, Hanyang Mao:

Dance with Melody: An LSTM-autoencoder Approach to Music-oriented Dance Synthesis. 1598-1606 - Gong Chen

, Yan Liu, Sheng-hua Zhong, Xiang Zhang
:
Musicality-Novelty Generative Adversarial Nets for Algorithmic Composition. 1607-1615 - Divyashri Bhat

, Rajvardhan Somraj Deshmukh
, Michael Zink
:
Improving QoE of ABR Streaming Sessions through QUIC Retransmissions. 1616-1624 - Ziqian Chen, Shiqi Wang, Dapeng Oliver Wu

, Tiejun Huang, Ling-Yu Duan:
From Data to Knowledge: Deep Learning Model Compression, Transmission and Communication. 1625-1633
Keynote 4
- Kyoung Mu Lee:

Session details: Keynote 4. - Gary Geunbae Lee:

Living with AI in Connected Devices for valuable Experience. 1634
Multimedia -3 (Multimedia Search)
- Jitao Sang:

Session details: Multimedia -3 (Multimedia Search). - Mingbao Lin, Rongrong Ji

, Hong Liu, Yongjian Wu:
Supervised Online Hashing via Hadamard Codebook Learning. 1635-1643 - Yuanqiang Fang, Wengang Zhou, Yijuan Lu, Jinhui Tang

, Qi Tian, Houqiang Li:
Cascaded Feature Augmentation with Diffusion for Image Retrieval. 1644-1652 - Zhangjie Cao, Ziping Sun, Mingsheng Long

, Jianmin Wang
, Philip S. Yu:
Deep Priority Hashing. 1653-1661 - Xingbo Liu, Xiushan Nie, Wenjun Zeng

, Chaoran Cui, Lei Zhu
, Yilong Yin:
Fast Discrete Cross-modal Hashing With Regressing From Semantic Labels. 1662-1669
Experience-1 (Multimedia Entertainment and Experience)
- Zhisheng Yan:

Session details: Experience-1 (Multimedia Entertainment and Experience). - Shuai Zheng, Fan Yang, M. Hadi Kiapour, Robinson Piramuthu:

ModaNet: A Large-scale Street Fashion Dataset with Polygon Annotations. 1670-1678 - Dania Murad, Riwu Wang, Douglas Turnbull, Ye Wang

:
SLIONS: A Karaoke Application to Enhance Foreign Language Learning. 1679-1687 - Shuai Yang

, Jiaying Liu
, Wenhan Yang, Zongming Guo:
Context-Aware Unsupervised Text Stylization. 1688-1696 - Jun Kato

, Masa Ogata, Takahiro Inoue, Masataka Goto
:
Songle Sync: A Large-Scale Web-based Platform for Controlling Various Devices in Synchronization with Music. 1697-1705
System-2 (Smart Multimedia Systems)
- Yijuan Lu:

Session details: System-2 (Smart Multimedia Systems). - Weidong Geng, Feilin Han, Jiangke Lin, Liuyi Zhu, Jieming Bai, Suzhen Wang

, Lin He, Qiang Xiao, Zhangjiong Lai:
Fine-Grained Grocery Product Recognition by One-Shot Learning. 1706-1714 - Yusuke Matsui, Ryota Hinami, Shin'ichi Satoh:

Reconfigurable Inverted Index. 1715-1723 - Hiroshi Sankoh, Sei Naito, Keisuke Nonaka, Houari Sabirin, Jun Chen:

Robust Billboard-based, Free-viewpoint Video Synthesis Algorithm to Overcome Occlusions under Challenging Outdoor Sport Scenes. 1724-1732 - Wei Cheng, Lan Xu

, Lei Han, Yuanfang Guo, Lu Fang:
iHuman3D: Intelligent Human Body 3D Reconstruction using a Single Flying Camera. 1733-1741
FF-6
- Benoit Huet:

Session details: FF-6. - Lianli Gao, Pengpeng Zeng

, Jingkuan Song, Xianglong Liu, Heng Tao Shen:
Examine before You Answer: Multi-task Learning with Adaptive-attentions for Multiple-choice VQA. 1742-1750 - Zhiwen Fan, Huafeng Wu, Xueyang Fu

, Yue Huang, Xinghao Ding:
Residual-Guide Network for Single Image Deraining. 1751-1759 - Zhengyu Zhao, Martha A. Larson:

From Volcano to Toyshop: Adaptive Discriminative Region Discovery for Scene Recognition. 1760-1768 - Joshua Sowerby, Yang Zhang, Dimitris Agrafiotis:

The Effect of Foveation on High Dynamic Range Video Perception. 1769-1776 - Wenxue Cui, Feng Jiang, Xinwei Gao, Shengping Zhang, Debin Zhao:

An Efficient Deep Quantized Compressed Sensing Coding Framework of Natural Images. 1777-1785 - Diep Thi Ngoc Nguyen, Hideki Nakayama

, Naoaki Okazaki, Tatsuya Sakaeda:
PoB: Toward Reasoning Patterns of Beauty in Image Data. 1786-1793 - Nan Xu, Yanqing Guo, Xin Zheng

, Qianyu Wang, Xiangyang Luo:
Partial Multi-view Subspace Clustering. 1794-1801 - Teng Long

, Xing Xu, Youyou Li, Fumin Shen, Jingkuan Song, Heng Tao Shen:
Pseudo Transfer with Marginalized Corrupted Attribute for Zero-shot Learning. 1802-1810 - Guangxing Han, Xuan Zhang, Chongrong Li:

Semi-Supervised DFF: Decoupling Detection and Feature Flow for Video Object Detectors. 1811-1819 - Lingjing Wang, Cheng Qian, Jifei Wang, Yi Fang:

Unsupervised Learning of 3D Model Reconstruction from Hand-Drawn Sketches. 1820-1828 - Sibo Song, Ngai-Man Cheung, Vijay Chandrasekhar, Bappaditya Mandal

:
Deep Adaptive Temporal Pooling for Activity Recognition. 1829-1837 - Mingyong Zeng, Chang Tian, Zemin Wu:

Person Re-identification with Hierarchical Deep Learning Feature and efficient XQDA Metric. 1838-1846 - Jingkuan Song, Zhilong Zhou, Lianli Gao, Xing Xu, Heng Tao Shen:

Cumulative Nets for Edge Detection. 1847-1855 - Niluthpol Chowdhury Mithun

, Rameswar Panda, Evangelos E. Papalexakis
, Amit K. Roy-Chowdhury:
Webly Supervised Joint Embedding for Cross-Modal Image-Text Retrieval. 1856-1864 - Yangyang Guo, Zhiyong Cheng, Liqiang Nie, Xin-Shun Xu, Mohan S. Kankanhalli

:
Multi-modal Preference Modeling for Product Search. 1865-1873 - Feiran Huang, Xiaoming Zhang, Zhoujun Li

:
Learning Joint Multimodal Representation with Adversarial Attention Networks. 1874-1882 - Binbing Liao, Jingqing Zhang, Ming Cai, Siliang Tang

, Yifan Gao, Chao Wu, Shengwen Yang, Wenwu Zhu, Yike Guo
, Fei Wu:
Dest-ResNet: A Deep Spatiotemporal Residual Network for Hotspot Traffic Speed Prediction. 1883-1891 - Yifang Yin

, Rajiv Ratn Shah
, Roger Zimmermann:
Learning and Fusing Multimodal Deep Features for Acoustic Scene Categorization. 1892-1900 - Zhenyu Tang

, Nicolás Morales, Dinesh Manocha:
Dynamic Sound Field Synthesis for Speech and Music Optimization. 1901-1909 - Thomas Forgione, Axel Carlier, Géraldine Morin, Wei Tsang Ooi

, Vincent Charvillat, Praveen Kumar Yadav:
DASH for 3D Networked Virtual Environment. 1910-1918
Keynote 5
- Wenwu Zhu:

Session details: Keynote 5. - Bowen Zhou:

Transforming Retailing Experiences with Artificial Intelligence. 1919-1920
Deep-3 (Image Processing-Inpainting, Super-Resolution, Deblurring)
- Shuqiang Jiang:

Session details: Deep-3 (Image Processing-Inpainting, Super-Resolution, Deblurring). - Risheng Liu

, Yi He, Shichao Cheng, Xin Fan, Zhongxuan Luo:
Learning Collaborative Generation Correction Modules for Blind Image Deblurring and Beyond. 1921-1929 - Minghao Yin

, Yongbing Zhang, Xiu Li, Shiqi Wang:
When Deep Fool Meets Deep Prior: Adversarial Attack on Super-Resolution Network. 1930-1938 - Haoran Zhang, Zhenzhen Hu, Changzhi Luo, Wangmeng Zuo, Meng Wang:

Semantic Image Inpainting with Progressive Generative Networks. 1939-1947 - Huy V. Vo, Ngoc Q. K. Duong, Patrick Pérez:

Structural inpainting. 1948-1956
Brand New Ideas
- Kiyoharu Aizawa:

Session details: Brand New Ideas. - Mykhaylo Andriluka, Jasper R. R. Uijlings, Vittorio Ferrari:

Fluid Annotation: A Human-Machine Collaboration Interface for Full Image Annotation. 1957-1966 - Lixin Liu, Xiaojun Wan, Zongming Guo:

Images2Poem: Generating Chinese Poetry from Image Streams. 1967-1975 - Yaman Kumar

, Mayank Aggarwal, Pratham Nawal, Shin'ichi Satoh, Rajiv Ratn Shah
, Roger Zimmermann:
Harnessing AI for Speech Reconstruction using Multi-view Silent Video Feed. 1976-1983 - Kanchan Bahirat, Umang Shah, Alvaro A. Cárdenas, Balakrishnan Prabhakaran

:
ALERT: Adding a Secure Layer in Decision Support for Advanced Driver Assistance System (ADAS). 1984-1992 - Nitish Nag

, Vaibhav Pandey, Preston J. Putzel, Hari Bhimaraju
, Srikanth Krishnan, Ramesh C. Jain:
Cross-Modal Health State Estimation. 1993-2002
Grand Challenge-1
- Shuqiang Jiang:

Session details: Grand Challenge-1. - Liuwu Li, Sihong Huang, Ziliang He, Wenyin Liu:

An Effective Text-based Characterization Combined with Numerical Features for Social Media Headline Prediction. 2003-2007 - Chih-Chung Hsu

, Chia-Yen Lee
, Ting-Xuan Liao, Jun-Yi Lee, Tsai-Yne Hou, Ying-Chu Kuo, Jing-Wen Lin, Ching-Yi Hsueh, Zhong-Xuan Zhang, Hsiang-Chin Chien:
An Iterative Refinement Approach for Social Media Headline Prediction. 2008-2012 - Feitao Huang, Junhong Chen, Zehang Lin

, Peipei Kang, Zhenguo Yang:
Random Forest Exploiting Post-related and User-related Features for Social Media Popularity Prediction. 2013-2017 - Xusong Chen, Rui Zhao, Shengjie Ma, Dong Liu, Zheng-Jun Zha

:
Content-Based Video Relevance Prediction with Second-Order Relevance and Attention Modeling. 2018-2022
Vision-4 (Representation Learning)
- Marcel Worring:

Session details: Vision-4 (Representation Learning). - Tianshui Chen, Wenxi Wu, Yuefang Gao, Le Dong, Xiaonan Luo, Liang Lin:

Fine-Grained Representation Learning and Recognition by Exploiting Hierarchical Semantic Embedding. 2023-2031 - Gang Yang, Jinlu Liu, Jieping Xu

, Xirong Li
:
Dissimilarity Representation Learning for Generalized Zero-Shot Recognition. 2032-2039 - Kai Han

, Jianyuan Guo
, Chao Zhang, Mingjian Zhu:
Attribute-Aware Attention Model for Fine-grained Representation Learning. 2040-2048 - Siyu Huang

, Xi Li
, Zhiqi Cheng
, Zhongfei Zhang, Alexander G. Hauptmann:
GNAS: A Greedy Neural Architecture Search Method for Multi-Attribute Learning. 2049-2057
Grand Challenge-2
- Shuqiang Jiang:

Session details: Grand Challenge-2. - Jianfeng Dong, Xirong Li

, Chaoxi Xu, Gang Yang, Xun Wang:
Feature Re-Learning with Data Augmentation for Content-based Video Recommendation. 2058-2062 - Qi Wang, Jingxiang Lai, Kai Xu, Wenyin Liu, Liang Lei:

Beauty Product Image Retrieval Based on Multi-Feature Fusion and Feature Aggregation. 2063-2067 - Jian Han Lim

, Nurul Japar
, Chun Chet Ng, Chee Seng Chan:
Unprecedented Usage of Pre-trained CNNs on Beauty Product. 2068-2072 - Zehang Lin

, Zhenguo Yang, Feitao Huang, Junhong Chen:
Regional Maximum Activations of Convolutions with Attention for Cross-domain Beauty and Personal Care Product Retrieval. 2073-2077
Interactive Art
- Hyunjung Shim:

Session details: Interactive Art. - Lyn Chao-ling Chen

, He-Lin Luo:
Shadow Calligraphy of Dance: An Image-Based Interactive Installation for Capturing Flowing Human Figures. 2078-2080 - Anis Haron, Soon Xuan Yong, Chee-Onn Wong:

Cellular Music: An Interactive Game of Life Sequencer. 2081-2083 - Soon Xuan Yong, Chee-Onn Wong, Kong Cheng Tan, Anis Haron:

TAGapp Visualization: An Application Based Visual Art Installation. 2084-2086
Tutorials
- Jan Sedmidubský

, Pavel Zezula:
Similarity-Based Processing of Motion Capture Data. 2087-2089 - Yunchao Wei, Xiaodan Liang, Si Liu, Liang Lin:

Structured Deep Learning for Pixel-level Understanding. 2090-2092 - Jungseock Joo, Zachary C. Steinert-Threlkeld

, Jiebo Luo
:
Social and Political Event Analysis based on Rich Media. 2093-2095 - Joseph P. Robinson

, Ming Shao, Yun Fu:
To Recognize Families In the Wild: A Machine Vision Tutorial. 2096-2097 - Jitao Sang:

Deep Learning Interpretation. 2098-2100 - Klaus Schoeffmann, Werner Bailer, Cathal Gurrin

, George Awad, Jakub Lokoc:
Interactive Video Search: Where is the User in the Age of Deep Learning? 2101-2103 - Ting Yao, Jingen Liu:

Human Behavior Understanding: From Action Recognition to Complex Event Detection. 2104-2105 - Michael Riegler, Pål Halvorsen, Bernd Münzer, Klaus Schoeffmann:

The Importance of Medical Multimedia. 2106-2108
Workshop Summaries
- Teresa Chambel

, Francesca De Simone, Rene Kaiser
, Nimesha Ranasinghe, Wendy Van den Broeck
:
AltMM 2018 - 3rd International Workshop on Multimedia Alternate Realities. 2109-2110 - Fabien Ringeval, Björn W. Schuller

, Michel F. Valstar, Roddy Cowie, Maja Pantic:
Summary for AVEC 2018: Bipolar Disorder and Cross-Cultural Affect Recognition. 2111-2112 - Kwanghoon Sohn, Ming-Hsuan Yang

, Hyeran Byun, Jongwoo Lim, Jison Hsu, Stephen Lin, Euntai Kim, Seungryong Kim:
CoVieW'18: The 1st Workshop and Challenge on Comprehensive Video Understanding in the Wild. 2113-2115 - Jochen Meyer, Susanne Boll, Noel E. O'Connor

, Ramesh C. Jain, Troy McDaniel
:
HealthMedia 2018: Third International Workshop on Multimedia for Personal Health and Health Care. 2116-2117 - Xueliang Liu, Rui Min, Benoit Huet, Jia Jia:

MAHCI 2018: The 1st Workshop on Multimedia for Accessible Human Computer Interface. 2118-2119 - Dong-Yan Huang, Sicheng Zhao, Björn W. Schuller

, Hongxun Yao, Jianhua Tao, Min Xu
, Lei Xie, Qingming Huang, Jie Yang:
ASMMC-MMAC 2018: The Joint Workshop of 4th the Workshop on Affective Social Multimedia Computing and first Multi-Modal Affective Computing of Large-Scale Multimedia Data Workshop. 2120-2121 - Adrian Hilton, Hong-Goo Kang, Hansung Kim

, Kwanghoon Sohn:
AVSU: Workshop on Audio-Visual Scene Understanding for Immersive Multimedia. 2122-2124 - Rainer Lienhart, Thomas B. Moeslund

, Hideo Saito:
1st ACM International Workshop on Multimedia Content Analysis in Sports. 2125-2126 - Xavier Alameda-Pineda, Miriam Redi, Nicu Sebe

, Shih-Fu Chang, Jiebo Luo
:
EE-USAD: ACM MM 2018Workshop on UnderstandingSubjective Attributes of Data focus on Evoked Emotions. 2127-2128

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














