default search action
25th ACM Multimedia 2017: Mountain View, CA, USA
- Qiong Liu, Rainer Lienhart, Haohong Wang, Sheng-Wei "Kuan-Ta" Chen, Susanne Boll, Yi-Ping Phoebe Chen, Gerald Friedland, Jia Li, Shuicheng Yan:
Proceedings of the 2017 ACM on Multimedia Conference, MM 2017, Mountain View, CA, USA, October 23-27, 2017. ACM 2017, ISBN 978-1-4503-4906-2
Fast Forward 1
- Junnan Li, Yongkang Wong, Qi Zhao, Mohan S. Kankanhalli:
Attention Transfer from Web Images for Video Recognition. 1-9 - Ravi Kiran Sarvadevabhatla, Isht Dwivedi, Abhijat Biswas, Sahil Manocha, Venkatesh Babu R.:
SketchParse: Towards Rich Descriptions for Poorly Drawn Sketches using Multi-Task Hierarchical Deep Networks. 10-18 - Xiaobai Liu, Qi Chen, Lei Zhu, Yuanlu Xu, Liang Lin:
Place-centric Visual Urban Perception with Deep Multi-instance Regression. 19-27 - Spencer Cappallo, Cees G. M. Snoek:
Future-Supervised Retrieval of Unseen Queries for Live Video. 28-36 - Yi-Ling Chen, Jan Klopp, Min Sun, Shao-Yi Chien, Kwan-Liu Ma:
Learning to Compose with Professional Photographs on the Web. 37-45 - Fuhai Chen, Rongrong Ji, Jinsong Su, Yongjian Wu, Yunsheng Wu:
StructCap: Structured Semantic Embedding for Image Captioning. 46-54 - Chih-Fan Hsu, Anthony Chen, Cheng-Hsin Hsu, Chun-Ying Huang, Chin-Laung Lei, Kuan-Ta Chen:
Is Foveated Rendering Perceivable in Virtual Reality?: Exploring the Efficiency and Consistency of Quality Assessment Methods. 55-63 - Fuwen Tan, Chi-Wing Fu, Teng Deng, Jianfei Cai, Tat-Jen Cham:
FaceCollage: A Rapidly Deployable System for Real-time Head Reconstruction for On-The-Go 3D Telepresence. 64-72 - Bo Yan, Shu Shi, Yong Liu, Weizhe Yuan, Haoqin He, Rittwik Jana, Yang Xu, H. Jonathan Chao:
LiveJack: Integrating CDNs and Edge Clouds for Live Content Broadcasting. 73-81 - Si Liu, Yao Sun, Defa Zhu, Renda Bao, Wei Wang, Xiangbo Shu, Shuicheng Yan:
Face Aging with Contextual Generative Adversarial Nets. 82-90 - Yu-Ting Chang, Wen-Huang Cheng, Bo Wu, Kai-Lung Hua:
Fashion World Map: Understanding Cities Through Streetwear Fashion. 91-99 - Behnam Maneshgar, Leila Sujir, Sudhir P. Mudur, Charalambos Poullis:
Automatic Adjustment of Stereoscopic Content for Long-Range Projections in Outdoor Areas. 100-108 - Zhenguang Liu, Li Cheng, Anan Liu, Luming Zhang, Xiangnan He, Roger Zimmermann:
Multiview and Multimodal Pervasive Indoor Localization. 109-117 - Zhaoyang Zeng, Jianlong Fu, Hongyang Chao, Tao Mei:
Searching Personal Photos on the Phone with Instant Visual Query Suggestion and Joint Text-Image Hashing. 118-126 - Junyu Gao, Tianzhu Zhang, Changsheng Xu:
A Unified Personalized Video Recommendation via Dynamic Recurrent Neural Networks. 127-135
Keynote Address 1
- Achin Bhowmik:
Enhancing and Augmenting Human Perception with Artificial Intelligence Technologies. 136
Best Paper Presentation
- Yuan Tian, Suraj Raghuraman, Thiru Annaswamy, Aleksander Borresen, Klara Nahrstedt, Balakrishnan Prabhakaran:
H-TIME: Haptic-enabled Tele-Immersive Musculoskeletal Examination. 137-145 - Ziwei Yang, Yahong Han, Zheng Wang:
Catching the Temporal Regions-of-Interest for Video Captioning. 146-153 - Bokun Wang, Yang Yang, Xing Xu, Alan Hanjalic, Heng Tao Shen:
Adversarial Cross-Modal Retrieval. 154-162 - Shuhui Jiang, Zhengming Ding, Yun Fu:
Deep Low-rank Sparse Collective Factorization for Cross-Domain Recommendation. 163-171
Fast Forward 2
- Sijie Yan, Ziwei Liu, Ping Luo, Shi Qiu, Xiaogang Wang, Xiaoou Tang:
Unconstrained Fashion Landmark Detection via Hierarchical Recurrent Transformer Networks. 172-180 - Jianan Li, Yunchao Wei, Xiaodan Liang, Fang Zhao, Jianshu Li, Tingfa Xu, Jiashi Feng:
Deep Attribute-preserving Metric Learning for Natural Language Object Retrieval. 181-189 - Xiaoling Gu, Yongkang Wong, Pai Peng, Lidan Shou, Gang Chen, Mohan S. Kankanhalli:
Understanding Fashion Trends from Street Photos via Neighbor-Constrained Embedding Learning. 190-198 - Yichao Yan, Jingwei Xu, Bingbing Ni, Wendong Zhang, Xiaokang Yang:
Skeleton-Aided Articulated Motion Generation. 199-207 - Jiale Bai, Bingbing Ni, Minsi Wang, Yang Shen, Hanjiang Lai, Chongyang Zhang, Lin Mei, Chuanping Hu, Chen Yao:
Deep Progressive Hashing for Image Retrieval. 208-216 - Shaojing Fan, Ming Jiang, Zhiqi Shen, Bryan L. Koenig, Mohan S. Kankanhalli, Qi Zhao:
The Role of Visual Attention in Sentiment Prediction. 217-225 - Mengdan Zhang, Jiashi Feng, Weiming Hu:
Robust Visual Object Tracking with Top-down Reasoning. 226-234 - Yuke Li:
Pedestrian Path Forecasting in Crowd: A Deep Spatio-Temporal Perspective. 235-243 - Yiru Zhao, Bing Deng, Jianqiang Huang, Hongtao Lu, Xian-Sheng Hua:
Stylized Adversarial AutoEncoder for Image Generation. 244-251 - Chenglong Li, Xiaohao Wu, Zhimin Bao, Jin Tang:
ReGLe: Spatially Regularized Graph Learning for Visual Tracking. 252-260 - Junbao Zhuo, Shuhui Wang, Weigang Zhang, Qingming Huang:
Deep Unsupervised Convolutional Domain Adaptation. 261-269 - Tongtao Zhang, Spencer Whitehead, Hanwang Zhang, Hongzhi Li, Joseph G. Ellis, Lifu Huang, Wei Liu, Heng Ji, Shih-Fu Chang:
Improving Event Extraction via Multimodal Integration. 270-278 - Xuanyi Dong, Deyu Meng, Fan Ma, Yi Yang:
A Dual-Network Progressive Approach to Weakly Supervised Object Detection. 279-287 - Dihong Gong, Daisy Zhe Wang, Yang Peng:
Multimodal Learning for Web Information Extraction. 288-296 - Bingke Zhu, Yingying Chen, Jinqiao Wang, Si Liu, Bo Zhang, Ming Tang:
Fast Deep Matting for Portrait Animation on Mobile Phone. 297-305 - Stefano Petrangeli, Viswanathan Swaminathan, Mohammad Hosseini, Filip De Turck:
An HTTP/2-Based Adaptive Streaming Framework for 360° Virtual Reality Videos. 306-314 - Lan Xie, Zhimin Xu, Yixuan Ban, Xinggong Zhang, Zongming Guo:
360ProbDASH: Improving QoE of 360 Video Streaming Using Tile-based HTTP Adaptive Streaming. 315-323 - Wei Zhang, Xiaofei Liao, Peng Li, Hai Jin, Li Lin:
ShareRender: Bypassing GPU Virtualization to Enable Fine-grained Resource Sharing for Cloud Gaming. 324-332 - Ke Xia, Yuqing Ma, Xianglong Liu, Yadong Mu, Li Liu:
Temporal Binary Coding for Large-Scale Video Search. 333-341 - Hantao Yao, Shiliang Zhang, Yongdong Zhang, Jintao Li, Qi Tian:
One-Shot Fine-Grained Instance Retrieval. 342-350 - Jun Chen, Chaokun Wang, Jianmin Wang:
Modeling the Intransitive Pairwise Image Preference from Multiple Angles. 351-359 - Florian Alt, Lukas Ziegler:
PD-Survey: Supporting Audience-Centric Research through Surveys on Pervasive Display Networks. 360-368 - Sicheng Zhao, Guiguang Ding, Yue Gao, Jungong Han:
Learning Visual Emotion Distributions via Multi-Modal Features Fusion. 369-377 - Dingquan Li, Tingting Jiang, Ming Jiang:
Exploiting High-Level Semantics for No-Reference Image Quality Assessment of Realistic Blur Images. 378-386 - Yue Zhang, Felix Weninger, Boqing Liu, Maximilian Schmitt, Florian Eyben, Björn W. Schuller:
A Paralinguistic Approach To Speaker Diarisation: Using Age, Gender, Voice Likability and Personality Traits. 387-392 - Xiao-Yu Du, Jinhui Tang, Zechao Li, Zhiguang Qin:
Wheel: Accelerating CNNs with Distributed GPUs via Hybrid Parallelism and Alternate Strategy. 393-401 - Weiqing Min, Shuqiang Jiang, Shuhui Wang, Jitao Sang, Shuhuan Mei:
A Delicious Recipe Analysis Framework for Exploring Multi-Modal Recipes with Various Attributes. 402-410 - Fudong Nian, Bing-Kun Bao, Teng Li, Changsheng Xu:
Multi-Modal Knowledge Representation Learning via Webly-Supervised Relationships Mining. 411-419 - Longhui Wei, Shiliang Zhang, Hantao Yao, Wen Gao, Qi Tian:
GLAD: Global-Local-Alignment Descriptor for Pedestrian Retrieval. 420-428
Understanding 1 -- Deep Learning for MM (1)
- Ting Liu, Yunchao Wei, Yao Zhao, Si Liu, Shikui Wei:
Magic-wall: Visualizing Room Decoration. 429-437 - Xin Li, Fan Yang, Hong Cheng, Junyu Chen, Yuxiao Guo, Leiting Chen:
Multi-Scale Cascade Network for Salient Object Detection. 439-447 - Jun-Yan He, Xiao Wu, Yu-Gang Jiang, Bo Zhao, Qiang Peng:
Sketch Recognition with Deep Visual-Sequential Fusion Model. 448-456
Panel 1
- Yung-Hsiang Lu, Andrea Cavallaro, Catherine Crump, Gerald Friedland, Keith Winstein:
Privacy Protection in Online Multimedia. 457-459
Experience 1 -- Social and Affective Multimedia
- Cristina Segalin, Fabio Celli, Luca Polonio, Michal Kosinski, David Stillwell, Nicu Sebe, Marco Cristani, Bruno Lepri:
What your Facebook Profile Picture Reveals about your Personality. 460-468 - Jiajia Yang, Shangfei Wang:
Capturing Spatial and Temporal Patterns for Distinguishing between Posed and Spontaneous Expressions. 469-477 - Nicholas Cummins, Shahin Amiriparian, Gerhard Hagerer, Anton Batliner, Stefan Steidl, Björn W. Schuller:
An Image-based Deep Spectrum Feature Representation for the Recognition of Emotional Speech. 478-484 - Lorenzo Gatti, Gözde Özbal, Oliviero Stock, Carlo Strapparava:
Automatic Generation of Lyrics Parodies. 485-491
Systems 1 -- Systems and Applications
- Yusen Li, Yunhua Deng, Xueyan Tang, Wentong Cai, Xiaoguang Liu, Gang Wang:
On Server Provisioning for Cloud Gaming. 492-500 - Zhenguang Liu, Zepeng Wang, Luming Zhang, Rajiv Ratn Shah, Yingjie Xia, Yi Yang, Xuelong Li:
FastShrinkage: Perceptually-aware Retargeting Toward Mobile Platforms. 501-509 - Tangli Xue, Hongcheng Luo, Danpeng Cheng, Zikang Yuan, Xin Yang:
Real-time Monocular Dense Mapping for Augmented Reality. 510-518
Engagement 1 -- Multimedia Search and Recommendation
- Jen-Chun Lin, Wen-Li Wei, James Yang, Hsin-Min Wang, Hong-Yuan Mark Liao:
Automatic Music Video Generation Based on Simultaneous Soundtrack Recommendation and Video Editing. 519-527 - Ryota Hinami, Yusuke Matsui, Shin'ichi Satoh:
Region-Based Image Retrieval Revisited. 528-536 - Jun Xu, Ting Yao, Yongdong Zhang, Tao Mei:
Learning Multimodal Attention LSTM Networks for Video Captioning. 537-545
Business Idea Venture
- Fabio Celli, Pietro Zani Massani, Bruno Lepri:
Profilio: Psychometric Profiling to Boost Social Media Advertising. 546-550 - Alexis Joly, Pierre Bonnet, Antoine Affouard, Jean-Christophe Lombardo, Hervé Goëau:
Pl@ntNet - My Business. 551-555
Interactive Art
- James She, Kong Cheng Tan, Soon Xuan Yong:
Drag A Star 3.0: An Audience Participatory Interactive Art. 556-558 - JoAnn Kuchera-Morin, Lance Putnam, Luca Peliti, Dennis Adderton, Andrés Cabrera, Kon Hyong Kim, Gustavo A. Rincon, Joseph Tilbian, Hannah Wolfe, Tim Wood, Keehong Youn:
PROBABLY/POSSIBLY?: An Immersive Interactive Visual/Sonic Quantum Composition and Synthesizer. 559-561 - Sahar Sajadieh:
Touch Me Here: A Virtual Touch Cinema. 562-564 - Brianna Ondris:
Filters. 565-567 - Megan Hardy, Sumanto Pal:
Split Consideration for Foreground and Background Painting Using Artificial Neural Networks. 568-570 - Inhye Lee, Hyomin Kim:
Spatial Magnetic Field Visualization: Interactive Kinetic Art Installation Driven by the Invisible Forces of Magnetic Fields. 571-573 - Yagiz Mungan:
À Quatre Mains. 574-576 - Edouard Beau:
Las Barricadas Misteriosas. 577-579 - Jiayi Young, Weidong Yang, Shih-Wen Young, Qilian Yu:
Presently Untitled: Data Mapping of 2016 U.S. Presidential Election Twitter Activity, Phase III. 580-581
Fast Forward 3
- Arun Balajee Vasudevan, Michael Gygli, Anna Volokitin, Luc Van Gool:
Query-adaptive Video Summarization via Quality-aware Relevance Estimation. 582-590 - Andrea Zunino, Jacopo Cavazza, Atesh Koul, Andrea Cavallo, Cristina Becchio, Vittorio Murino:
Predicting Human Intentions from Motion Cues Only: A 2D+3D Fusion Approach. 591-599 - Xinhang Song, Chengpeng Chen, Shuqiang Jiang:
RGB-D Scene Recognition with Object-to-Object Relation. 600-608 - Lin Chen, Hua Yang, Shuang Wu, Zhiyong Gao:
Data Generation for Improving Person Re-identification. 609-617 - Youbao Tang, Xiangqian Wu:
Salient Object Detection with Chained Multi-Scale Fully Convolutional Network. 618-626 - Xiangteng He, Yuxin Peng, Junjie Zhao:
Fine-grained Discriminative Localization via Saliency-guided Faster R-CNN. 627-635 - Yang Long, Ling Shao:
Learning to Recognise Unseen Classes by A Few Similes. 636-644 - Zhichao Song, Bingbing Ni, Yichao Yan, Zhe Ren, Yi Xu, Xiaokang Yang:
Deep Cross-Modality Alignment for Multi-Shot Person Re-IDentification. 645-653 - Ning Zhang, Yu Cao, Benyuan Liu, Yan Luo:
Improved Multimodal Representation Learning with Skip Connections. 654-662 - Abhimanyu Dubey, Sumeet Agarwal:
Modeling Image Virality with Pairwise Spatial Transformer Networks. 663-671 - Guoxian Dai, Jin Xie, Yi Fang:
Metric-based Generative Adversarial Network. 672-680 - Yiyi Zhou, Rongrong Ji, Jinsong Su, Yongjian Wu, Yunsheng Wu:
More Than An Answer: Neural Pivot Network for Visual Qestion Answering. 681-689 - Zhongyang Zheng, Bo Wang, Yakun Wang, Shuang Yang, Zhongqian Dong, Tianyang Yi, Cyrus Choi, Emily J. Chang, Edward Y. Chang:
Aristo: An Augmented Reality Platform for Immersion and Interactivity. 690-698 - Kiana Calagari, Mohamed A. Elgharib, Shervin Shirmohammadi, Mohamed Hefeeda:
Sports VR Content Generation from Regular Camera Feeds. 699-707 - Mengbai Xiao, Chao Zhou, Yao Liu, Songqing Chen:
OpTile: Toward Optimal Tiling in 360-degree Video Streaming. 708-716 - Zhisheng Yan, Chang Wen Chen:
Too Many Pixels to Perceive: Subpixel Shutoff for Display Energy Reduction on OLED Smartphones. 717-725 - Lei Zhu, Zi Huang, Xiaojun Chang, Jingkuan Song, Heng Tao Shen:
Exploring Consistent Preferences: Discrete Hashing with Pair-Exemplar for Scalable Landmark Search. 726-734 - Chengju Zhou, Meiqing Wu, Siew-Kei Lam:
Fast and Accurate Pedestrian Detection using Dual-Stage Group Cost-Sensitive RealBoost with Vector Form Filters. 735-743 - Mengshi Qi, Yunhong Wang, Annan Li:
Online Cross-Modal Scene Retrieval by Binary Representation and Semantic Graph. 744-752 - Xuemeng Song, Fuli Feng, Jinhuan Liu, Zekun Li, Liqiang Nie, Jun Ma:
NeuroStylist: Neural Compatibility Modeling for Clothing Matching. 753-761 - Marc Van den Broeck, Fahim Kawsar, Johannes Schöning:
It's All Around You: Exploring 360° Video Viewing Experiences on Mobile Devices. 762-768 - Tanfang Chen, Yaxin Wang, Shangfei Wang, Shiyu Chen:
Exploring Domain Knowledge for Affective Video Content Analyses. 769-776 - Chun-Han Yao, Chia-Yang Chang, Shao-Yi Chien:
Occlusion-aware Video Temporal Consistency. 777-785 - Donghyeon Won, Zachary C. Steinert-Threlkeld, Jungseock Joo:
Protest Activity Detection and Perceived Violence Estimation from Social Media Images. 786-794 - Zhiwei Jin, Juan Cao, Han Guo, Yongdong Zhang, Jiebo Luo:
Multimodal Fusion with Recurrent Neural Networks for Rumor Detection on Microblogs. 795-816
Keynote Address 3
- Injong Rhee:
Building Multi-modal Interfaces for Smartphones. 817
SIGMMM Award Session
- Arnold W. M. Smeulders:
ACM SIGMM Award for Outstanding Technical Contributions to Multimedia Computing, Communications and Applications. 818 - Liangliang Cao:
ACM SIGMM Rising Star Award 2017. 819 - Chien-Nan (Shannon) Chen:
SIGMM Award for Outstanding Ph.D. Thesis in Multimedia Computing, Communications and Applications 2017. 820
Doctoral Symposium
- Jan Willem Kleinrouweler:
Using DASH Assisting Network Elements for Optimizing Video Streaming Quality. 821-825 - Gong Chen:
Who Composes the Music?: Musicality Evaluation for Algorithmic Composition via Electroencephalography. 826-830 - Jianfeng Dong:
Cross-media Relevance Computation for Multimedia Retrieval. 831-835 - Xiang Chen:
Towards Global Optimization in Display Advertising by Integrating Multimedia Metrics with Real-Time Bidding. 836-845
Fast Forward 4
- Fanghui Liu, Xiaolin Huang, Jie Yang:
Indefinite Kernel Logistic Regression. 846-853 - Jiaqi Zhang, Zhenzhen Wang, Junsong Yuan, Yap-Peng Tan:
Positive and Unlabeled Learning for Anomaly Detection with Multi-features. 854-862 - Bin Zhao, Xuelong Li, Xiaoqiang Lu:
Hierarchical Recurrent Neural Network for Video Summarization. 863-871 - Yuan Zong, Xiaohua Huang, Wenming Zheng, Zhen Cui, Guoying Zhao:
Learning a Target Sample Re-Generator for Cross-Database Micro-Expression Recognition. 872-880 - Hyungik Oh, Ramesh C. Jain:
From Multimedia Logs to Personal Chronicles. 881-889 - Jing Han, Zixing Zhang, Maximilian Schmitt, Maja Pantic, Björn W. Schuller:
From Hard to Soft: Towards more Human-like Emotion Recognition by Modelling the Perception Uncertainty. 890-897 - Jingjing Li, Ke Lu, Zi Huang, Heng Tao Shen:
Two Birds One Stone: On both Cold-Start and Long-Tail Recommendation. 898-906 - Liang Zhang, Bingpeng Ma, Guorong Li, Qingming Huang, Qi Tian:
Multi-Networks Joint Learning for Large-Scale Cross-Modal Retrieval. 907-915 - Pengpeng Zhao, Xiefeng Xu, Yanchi Liu, Victor S. Sheng, Kai Zheng, Hui Xiong:
Photo2Trip: Exploiting Visual Contents in Geo-tagged Photos for Personalized Tour Recommendation. 916-924 - Chao Wu, Wenwu Zhu, Qiushi Li, Yaoxue Zhang:
Rethinking HTTP Adaptive Streaming with the Mobile User Perception. 925-933 - Jonghoe Koo, Juheon Yi, Joongheon Kim, Mohammad Ashraful Hoque, Sunghyun Choi:
REQUEST: Seamless Dynamic Adaptive Streaming over HTTP for Multi-Homed Smartphone under Resource Constraints. 934-942 - Xavier Corbillon, Alisa Devlic, Gwendal Simon, Jacob Chakareski:
Optimal Set of 360-Degree Videos for Viewport-Adaptive Streaming. 943-951 - Wencang Zhao, Yu Kong, Zhengming Ding, Yun Fu:
Deep Active Learning Through Cognitive Information Parcels. 952-960 - Meng Wang, Lingjing Wang, Yi Fang:
3DensiNet: A Robust Neural Network Architecture towards 3D Volumetric Object Prediction from 2D Image. 961-969 - Meng Liu, Liqiang Nie, Meng Wang, Baoquan Chen:
Towards Micro-video Understanding by Joint Sequential-Sparse Modeling. 970-978 - Hua Zhang, Rui Wang, Changqing Zhang, Xiaochun Cao:
LEAF: Latent Extended Attribute Features Discovery for Visual Classification. 979-987