


default search action
26th ACM Multimedia 2018: Seoul, Republic of Korea
- Susanne Boll, Kyoung Mu Lee, Jiebo Luo, Wenwu Zhu, Hyeran Byun, Chang Wen Chen, Rainer Lienhart, Tao Mei:
2018 ACM Multimedia Conference on Multimedia Conference, MM 2018, Seoul, Republic of Korea, October 22-26, 2018. ACM 2018, ISBN 978-1-4503-5665-7
FF-1
- Max Mühlhäuser:
Session details: FF-1. - Chuan-Xiang Li, Zhen-Duo Chen, Peng-Fei Zhang, Xin Luo, Liqiang Nie, Wei Zhang, Xin-Shun Xu:
SCRATCH: A Scalable Discrete Matrix Factorization Hashing for Cross-Modal Retrieval. 1-9 - Ana Garcia del Molino, Joo-Hwee Lim, Ah-Hwee Tan
:
Predicting Visual Context for Unsupervised Event Segmentation in Continuous Photo-streams. 10-17 - Xingxing Wei, Jun Zhu, Sitong Feng, Hang Su:
Video-to-Video Translation with Global Temporal Consistency. 18-25 - Jinxing Li, Bob Zhang
, Guangming Lu, David Zhang:
Shared Linear Encoder-based Gaussian Process Latent Variable Model for Visual Classification. 26-34 - Jia-Xing Zhong, Nannan Li, Weijie Kong, Tao Zhang, Thomas H. Li, Ge Li:
Step-by-step Erasion, One-by-one Collection: A Weakly Supervised Temporal Action Detector. 35-44 - Jianshu Li, Jian Zhao, Yunpeng Chen
, Sujoy Roy, Shuicheng Yan, Jiashi Feng, Terence Sim:
Multi-Human Parsing Machines. 45-53 - Xuanyi Dong, Linchao Zhu
, De Zhang, Yi Yang, Fei Wu:
Fast Parameter Adaptation for Few-shot Image Captioning and Visual Question Answering. 54-62 - Junbo Wang, Wei Wang, Yan Huang, Liang Wang, Tieniu Tan:
Hierarchical Memory Modelling for Video Captioning. 63-71 - Zheng Wang, Xiang Bai, Mang Ye
, Shin'ichi Satoh:
Incremental Deep Hidden Attribute Learning. 72-80 - Huarong Chen, Bin Wang
, Tianxiang Pan, Liwang Zhou, Hua Zeng:
CropNet: Real-Time Thumbnailing. 81-89 - Zhi-Qi Cheng
, Xiao Wu, Siyu Huang
, Jun-Xiu Li, Alexander G. Hauptmann, Qiang Peng:
Learning to Transfer: Generalizable Attribute Learning with Multitask Neural Model Search. 90-98 - Yingying Zhu, Jiong Wang, Lingxi Xie, Liang Zheng
:
Attention-based Pyramid Aggregation Network for Visual Place Recognition. 99-107 - Changde Du, Changying Du, Hao Wang, Jinpeng Li
, Wei-Long Zheng, Bao-Liang Lu, Huiguang He
:
Semi-supervised Deep Generative Modelling of Incomplete Multi-Modality Emotional Data. 108-116 - Yuxiao Chen, Jianbo Yuan, Quanzeng You, Jiebo Luo
:
Twitter Sentiment Analysis via Bi-sense Emoji Embedding and Attention-based LSTM. 117-125 - Feifei Zhang, Tianzhu Zhang, Qirong Mao, Lingyu Duan, Changsheng Xu:
Facial Expression Recognition in the Wild: A Cycle-Consistent Adversarial Attention Transfer Approach. 126-135 - Runnan Li, Zhiyong Wu, Jia Jia, Jingbei Li, Wei Chen, Helen Meng:
Inferring User Emotive State Changes in Realistic Human-Computer Conversational Dialogs. 136-144 - Zhengzhe Liu, Xiaojuan Qi, Lei Pang:
Self-boosted Gesture Interactive System with ST-Net. 145-153 - Felix Kosmalla, Christian Murlowski, Florian Daiber
, Antonio Krüger
:
Slackliner - An Interactive Slackline Training Assistant. 154-162 - Yaoyu Li, Tianzhu Zhang, Lingyu Duan, Changsheng Xu:
A Unified Generative Adversarial Framework for Image Generation and Person Re-identification. 163-172 - Anahita Mahzari, Afshin Taghavi Nasrabadi, Aliehsan Samiei, Ravi Prakash
:
FoV-Aware Edge Caching for Adaptive 360° Video Streaming. 173-181
Keynote 1
- Susanne Boll:
Session details: Keynote 1. - Marianna Obrist:
Don't just Look - Smell, Taste, and Feel the Interaction. 182
FF-2
- Peng Cui:
Session details: FF-2. - Rui Zhang, Sheng Tang, Yu Li, Junbo Guo, Yongdong Zhang, Jintao Li, Shuicheng Yan:
Style Separation and Synthesis via Generative Adversarial Networks. 183-191 - Hao Xiao
, Weiyao Lin
, Bin Sheng, Ke Lu, Junchi Yan, Jingdong Wang
, Errui Ding, Yihao Zhang, Hongkai Xiong
:
Group Re-Identification: Leveraging and Integrating Multi-Grain Information. 192-200 - Xu Gao, Tingting Jiang
:
OSMO: Online Specific Models for Occlusion in Multiple Object Tracking under Surveillance Scene. 201-210 - Yuke Li:
Video Forecasting with Forward-Backward-Net: Delving Deeper into Spatiotemporal Consistency. 211-219 - Rui Shao, Xiangyuan Lan, Pong C. Yuen:
Feature Constrained by Pixel: Hierarchical Adversarial Deep Domain Adaptation. 220-228 - Zhixing Chen, Di Huang, Yunhong Wang, Liming Chen:
Fast and Light Manifold CNN based 3D Facial Expression Recognition across Pose Variations. 229-238 - Xiaomeng Song, Yucheng Shi
, Xin Chen, Yahong Han:
Explore Multi-Step Reasoning in Video Question Answering. 239-247 - Shancheng Fang, Hongtao Xie, Zheng-Jun Zha
, Nannan Sun, Jianlong Tan, Yongdong Zhang:
Attention and Language Ensemble for Scene Text Recognition with Convolutional Sequence Modeling. 248-256 - Zhaoyang Zhang, Zhanghui Kuang, Ping Luo, Litong Feng, Wei Zhang
:
Temporal Sequence Distillation: Towards Few-Frame Action Recognition in Videos. 257-264 - Zhihang Fu, Zhongming Jin, Guo-Jun Qi
, Chen Shen, Rongxin Jiang, Yaowu Chen, Xian-Sheng Hua:
Previewer for Multi-Scale Object Detector. 265-273 - Guanshuo Wang, Yufeng Yuan, Xiong Chen, Jiwei Li, Xi Zhou:
Learning Discriminative Features with Multiple Granularities for Person Re-Identification. 274-282 - Guoxiang Qu, Wenwei Zhang, Zhe Wang, Xing Dai, Jianping Shi, Junjun He, Fei Li, Xiulan Zhang, Yu Qiao
:
StripNet: Towards Topology Consistent Strip Structure Segmentation. 283-291 - Samuel Albanie, Arsha Nagrani, Andrea Vedaldi, Andrew Zisserman:
Emotion Recognition in Speech using Cross-Modal Transfer in the Wild. 292-301 - Can Wang, Shangfei Wang:
Personalized Multiple Facial Action Unit Recognition through Generative Adversarial Recognition Network. 302-310 - Cigdem Beyan
, Muhammad Shahid, Vittorio Murino
:
Investigation of Small Group Social Interactions Using Deep Visual Activity-Based Nonverbal Features. 311-319 - Eugene Yujun Fu
, Michael Xuelin Huang, Hong Va Leong, Grace Ngai
:
Cross-Species Learning: A Low-Cost Approach to Learning Human Fight from Animal Fight. 320-327 - Qianli Xu, Vigneshwaran Subbaraju
, Chee How Cheong, Aijing Wang, Kathleen Kang, Munirah Bashir, Yanhong Dong
, Liyuan Li, Joo-Hwee Lim:
Personalized Serious Games for Cognitive Intervention with Lifelog Visual Analytics. 328-336 - Wendy Bolier, Wolfgang Hürst, Guido van Bommel, Joost Bosman, Harriët Bosman:
Drawing in a Virtual 3D Space - Introducing VR Drawing in Elementary School Art Education. 337-345 - Luca Lovagnini, Wenxiao Zhang, Farshid Hassani Bijarbooneh, Pan Hui:
CIRCE: Real-Time Caching for Instance Recognition on Cloud Environments and Multi-Core Architectures. 346-354 - Wenxiao Zhang, Bo Han, Pan Hui:
Jaguar: Low Latency Mobile Augmented Reality with Flexible Tracking. 355-363
Keynote 2
- Tao Mei:
Session details: Keynote 2. - Xian-Sheng Hua:
Challenges and Practices of Large Scale Visual Intelligence in the Real-World. 364
Deep-1 (Image Translation)
- Nicu Sebe:
Session details: Deep-1 (Image Translation). - Yuheng Zhi
, Huawei Wei, Bingbing Ni:
Structure Guided Photorealistic Style Transfer. 365-373 - Xuewen Yang
, Dongliang Xie, Xin Wang:
Crossing-Domain Generative Adversarial Networks for Unsupervised Multi-Domain Image-to-Image Translation. 374-382 - Bo Zhao, Xiao Wu, Zhi-Qi Cheng
, Hao Liu, Zequn Jie, Jiashi Feng:
Multi-View Image Generation from a Single-View. 383-391 - Jichao Zhang, Yezhi Shu, Songhua Xu, Gongze Cao, Fan Zhong, Meng Liu, Xueying Qin:
Sparsely Grouped Multi-Task Generative Adversarial Networks for Facial Attribute Manipulation. 392-401
Vision-1 (Machine Learning)
- Jingkuan Song:
Session details: Vision-1 (Machine Learning). - Jindong Wang
, Wenjie Feng
, Yiqiang Chen, Han Yu
, Meiyu Huang, Philip S. Yu:
Visual Domain Adaptation with Manifold Embedded Distribution Alignment. 402-410 - Zheyan Shen, Peng Cui, Kun Kuang, Bo Li, Peixuan Chen:
Causally Regularized Learning with Agnostic Data Selection Bias. 411-419 - Yanjie Liang, Qiangqiang Wu, Yi Liu, Yan Yan, Hanzi Wang:
Robust Correlation Filter Tracking with Shepherded Instance-Aware Proposals. 420-428 - Fan Qi, Xiaoshan Yang, Changsheng Xu:
A Unified Framework for Multimodal Domain Adaptation. 429-437
Multimedia-1 (Multimedia Recommendation & Discovery)
- Mark Liao:
Session details: Multimedia-1 (Multimedia Recommendation & Discovery). - Shintami Chusnul Hidayati, Cheng-Chun Hsu, Yu-Ting Chang, Kai-Lung Hua
, Jianlong Fu, Wen-Huang Cheng:
What Dress Fits Me Best?: Fashion Recommendation on the Clothing Style for Personal Body Shape. 438-446 - Xiaowen Huang, Shengsheng Qian, Quan Fang, Jitao Sang, Changsheng Xu:
CSAN: Contextual Self-Attention Network for User Sequential Recommendation. 447-455 - Jun Hu, Shengsheng Qian, Quan Fang, Changsheng Xu:
Attentive Interactive Convolutional Matching for Community Question Answering in Social Multimedia. 456-464 - Francesco Gelli, Tiberio Uricchio
, Xiangnan He, Alberto Del Bimbo, Tat-Seng Chua:
Beyond the Product: Discovering Image Posts for Brands in Social Media. 465-473
Vision-2 (Object & Scene Understanding)
- Zheng-Jun Zha:
Session details: Vision-2 (Object & Scene Understanding). - Lishi Zhang, Chenghan Fu, Jia Li:
Collaborative Annotation of Semantic Objects in Images with Multi-granularity Supervisions. 474-482 - Mengyang Pu, Yaping Huang, Qingji Guan, Qi Zou:
GraphNet: Learning Image Pseudo Annotations for Weakly-Supervised Semantic Segmentation. 483-491 - Hengcan Shi, Hongliang Li
, Qingbo Wu, Fanman Meng, King N. Ngan:
Boosting Scene Parsing Performance via Reliable Scale Prediction. 492-500 - Fan Zhu, Li Liu, Jin Xie, Fumin Shen, Ling Shao
, Yi Fang:
Learning to Synthesize 3D Indoor Scenes from Monocular Images. 501-509
Multimodal-1 (Multimodal Reasoning)
- Xian-Sheng Hua:
Session details: Multimodal-1 (Multimodal Reasoning). - Chaojun Han, Fumin Shen, Li Liu, Yang Yang, Heng Tao Shen:
Visual Spatial Attention Network for Relationship Detection. 510-518 - Chenfei Wu
, Jinlai Liu, Xiaojie Wang, Xuan Dong:
Object-Difference Attention: A Simple Relational Attention for Visual Question Answering. 519-527 - Jinwei Qi, Yuxin Peng, Yunkan Zhuo:
Life-long Cross-media Correlation Learning. 528-536 - Yue Gu
, Xinyu Li
, Kaixiang Huang, Shiyu Fu, Kangning Yang
, Shuhong Chen, Moliang Zhou, Ivan Marsic:
Human Conversation Analysis Using Attentive Multimodal Networks with Hierarchical Encoder-Decoder. 537-545
System-1 (Video Analysis & Streaming)
- Xin Yang:
Session details: System-1 (Video Analysis & Streaming). - Wentao Liu
, Zhengfang Duanmu, Zhou Wang
:
End-to-End Blind Quality Assessment of Compressed Videos Using Deep Neural Networks. 546-554 - Ibrahim Ben Mustafa, Tamer Nadeem
, Emir Halepovic:
FlexStream: Towards Flexible Adaptive Video Streaming on End Devices using Extreme SDN. 555-563 - Lan Xie, Xinggong Zhang, Zongming Guo:
CLS: A Cross-user Learning based System for Improving QoE in 360-degree Video Adaptive Streaming. 564-572 - Abdelhak Bentaleb, Ali C. Begen
, Saad Harous
, Roger Zimmermann:
A Distributed Approach for Bitrate Selection in HTTP Adaptive Streaming. 573-581
FF-3
- Zhu Li:
Session details: FF-3. - Qing Zhang, Ganzhao Yuan, Chunxia Xiao, Lei Zhu, Wei-Shi Zheng:
High-Quality Exposure Correction of Underexposed Photos. 582-590 - Qianqian Xu, Jiechao Xiong, Xinwei Sun
, Zhiyong Yang
, Xiaochun Cao, Qingming Huang, Yuan Yao:
A Margin-based MLE for Crowdsourced Partial Ranking. 591-599 - Ana Garcia del Molino, Michael Gygli:
PHD-GIFs: Personalized Highlight Detection for Automatic GIF Creation. 600-608 - Lu Pang, Yaowei Wang, Yi-Zhe Song
, Tiejun Huang, Yonghong Tian:
Cross-Domain Adversarial Feature Learning for Sketch Re-identification. 609-617 - Quan Chen, Tiezheng Ge, Yanyu Xu
, Zhiqiang Zhang, Xinxin Yang, Kun Gai:
Semantic Human Matting. 618-626 - Lingxiao Song, Zhihe Lu, Ran He, Zhenan Sun, Tieniu Tan:
Geometry Guided Adversarial Facial Expression Synthesis. 627-635 - Siqi Wang, Yijie Zeng, Qiang Liu, Chengzhang Zhu, En Zhu, Jianping Yin:
Detecting Abnormality without Knowing Normality: A Two-stage Approach for Unsupervised Video Abnormal Event Detection. 636-644 - Tingting Li, Ruihe Qian, Chao Dong, Si Liu, Qiong Yan, Wenwu Zhu, Liang Lin:
BeautyGAN: Instance-level Facial Makeup Transfer with Deep Generative Adversarial Network. 645-653 - Xianghui Luo, Zhuo Su, Jiaming Guo, Gengwei Zhang
, Xiangjian He
:
Trusted Guidance Pyramid Network for Human Parsing. 654-662 - Jingjing Li, Lei Zhu
, Zi Huang
, Ke Lu, Jidong Zhao:
I read, I saw, I tell: Texts Assisted Fine-Grained Visual Classification. 663-671 - Ziwei Wang
, Yadan Luo
, Yang Li, Zi Huang
, Hongzhi Yin
:
Look Deeper See Richer: Depth-aware Image Paragraph Captioning. 672-680 - Huaiwen Zhang, Quan Fang, Shengsheng Qian, Changsheng Xu:
Learning Multimodal Taxonomy via Variational Deep Graph Embedding and Clustering. 681-689 - Junyu Gao, Tianzhu Zhang, Changsheng Xu:
Watch, Think and Attend: End-to-End Video Classification via Dynamic Knowledge Evolution Modeling. 690-699 - Yongcheng Liu, Lu Sheng
, Jing Shao, Junjie Yan, Shiming Xiang, Chunhong Pan:
Multi-Label Image Classification via Knowledge Distillation from Weakly-Supervised Detection. 700-708 - Jiayu Wang, Wengang Zhou, Jinhui Tang
, Zhongqian Fu, Qi Tian, Houqiang Li:
Unregularized Auto-Encoder with Generative Adversarial Networks for Image Generation. 709-717 - Yangbangyan Jiang, Zhiyong Yang
, Qianqian Xu, Xiaochun Cao, Qingming Huang:
When to Learn What: Deep Cognitive Subspace Clustering. 718-726 - Wendong Zhang
, Feng Gao, Bingbing Ni, Lingyu Duan, Yichao Yan, Jingwei Xu, Xiaokang Yang:
Depth Structure Preserving Scene Image Generation. 727-736 - Jiawei Liu, Zheng-Jun Zha
, Hongtao Xie, Zhiwei Xiong, Yongdong Zhang:
CA3Net: Contextual-Attentional Attribute-Appearance Network for Person Re-Identification. 737-745 - Gusi Te, Wei Hu, Amin Zheng, Zongming Guo:
RGCNN: Regularized Graph CNN for Point Cloud Segmentation. 746-754 - Bin Liu
, Yue Cao, Mingsheng Long
, Jianmin Wang
, Jingdong Wang:
Deep Triplet Quantization. 755-763
Keynote 3
- Jiebo Luo:
Session details: Keynote 3. - Ernest A. Edmonds:
What has Art Got to do With It? 773
Best Paper Session
- Rainer Lienhart, Tao Mei:
Session details: Best Paper Session. - Hao Tang, Wei Wang, Dan Xu, Yan Yan, Nicu Sebe
:
GestureGAN for Hand Gesture-to-Gesture Translation in the Wild. 774-782 - Bei Liu, Jianlong Fu, Makoto P. Kato, Masatoshi Yoshikawa:
Beyond Narrative Description: Generating Poetry from Images by Multi-Adversarial Training. 783-791 - Jian Zhao, Jianshu Li, Yu Cheng, Terence Sim, Shuicheng Yan, Jiashi Feng:
Understanding Humans in Crowded Scenes: Deep Nested Adversarial Learning and A New Benchmark for Multi-Human Parsing. 792-800 - Lizi Liao
, Yunshan Ma, Xiangnan He, Richang Hong, Tat-Seng Chua:
Knowledge-aware Multimodal Dialogue Systems. 801-809
Doctoral Symposium
- Meng Wang:
Session details: Doctoral Symposium. - Na Zhao:
End2End Semantic Segmentation for 3D Indoor Scenes. 810-814 - Sabrina Kletz:
On Reducing Effort in Evaluating Laparoscopic Skills. 815-819 - Tianran Hu:
Decode Human Life from Social Media. 820-824
FF-4
- Wen-Huang Cheng:
Session details: FF-4. - Yiling Wu, Shuhui Wang, Qingming Huang:
Learning Semantic Structure-preserved Embeddings for Cross-modal Retrieval. 825-833 - Zhendong Mao, Quan Wang, Yongdong Zhang, Bin Wang:
Post Tuned Hashing: A New Approach to Indexing High-dimensional Data. 834-842 - Meng Liu, Xiang Wang, Liqiang Nie, Qi Tian, Baoquan Chen, Tat-Seng Chua:
Cross-modal Moment Localization in Videos. 843-851 - Zhaoda Ye, Yuxin Peng:
Multi-Scale Correlation for Sequential Cross-modal Hashing Learning. 852-860 - Litao Yu, Yongsheng Gao
, Jun Zhou:
Generative Adversarial Product Quantisation. 861-869 - Yubin Deng, Chen Change Loy, Xiaoou Tang:
Aesthetic-Driven Image Enhancement by Adversarial Learning. 870-878 - Kekai Sheng, Weiming Dong, Chongyang Ma, Xing Mei, Feiyue Huang, Bao-Gang Hu
:
Attention-based Multi-Patch Aggregation for Image Aesthetic Assessment. 879-886 - Zheqi He
, Yafeng Zhou, Yongtao Wang, Siwei Wang, Xiaoqing Lu, Zhi Tang, Ling Cai:
An End-to-End Quadrilateral Regression Network for Comic Panel Extraction. 887-895 - Xin Yang, Jinyu Chen, Zhiwei Wang, Qiaozhe Zhang
, Wenyu Liu
, Chunyuan Liao, Kwang-Ting Cheng
:
Monocular Camera Based Real-Time Dense Mapping Using Generative Adversarial Network. 896-904 - Xiaojing Ma, Changming Liu
, Sixing Cao, Bin Zhu:
JPEG Decompression in the Homomorphic Encryption Domain. 905-913 - Mengbai Xiao, Shuoqian Wang, Chao Zhou, Li Liu, Zhenhua Li, Yao Liu, Songqing Chen:
MiniView Layout for Bandwidth-Efficient 360-Degree Video. 914-922 - Guoxian Song, Jianfei Cai
, Tat-Jen Cham
, Jianmin Zheng, Juyong Zhang, Henry Fuchs:
Real-time 3D Face-Eye Performance Capture of a Person Wearing VR Headset. 923-931 - Chen Li, Mai Xu, Xinzhe Du, Zulin Wang:
Bridge the Gap Between VQA and Human Behavior on Omnidirectional Video: A Large-Scale Dataset and a Deep Learning Model. 932-940 - Zongpu Zhang, Yang Hua
, Tao Song
, Zhengui Xue, Ruhui Ma, Neil Martin Robertson, Haibing Guan:
Tracking-assisted Weakly Supervised Online Visual Object Segmentation in Unconstrained Videos. 941-949 - Praveen Tirupattur
, Yogesh Singh Rawat, Concetto Spampinato, Mubarak Shah:
ThoughtViz: Visualizing Human Thoughts Using Generative Adversarial Network. 950-958 - Xiaoju Zheng, Zheng-Jun Zha
, Liansheng Zhuang:
A Feature-Adaptive Semi-Supervised Framework for Co-saliency Detection. 959-966 - Jogendra Nath Kundu, Aditya Ganeshan, Rahul M. V., Aditya Prakash, Venkatesh Babu R.
:
iSPA-Net: Iterative Semantic Pose Alignment Network. 967-975 - Litong Feng, Ziyin Li, Zhanghui Kuang, Wei Zhang
:
Extractive Video Summarizer with Memory Augmented Neural Networks. 976-983 - Jing Zhang
, Yang Cao, Yang Wang, Chenglin Wen, Chang Wen Chen
:
Fully Point-wise Convolutional Neural Network for Modeling Statistical Regularities in Natural Images. 984-992