


19th ISM 2017: Taichung, Taiwan
- 19th IEEE International Symposium on Multimedia, ISM 2017, Taichung, Taiwan, December 11-13, 2017. IEEE Computer Society 2017, ISBN 978-1-5386-2937-6

Session 1: Diverse Topics
- Jiji Cai, Liang Chang, Hongbin Wang, Cheolkon Jung, Joongkyu Kim:
Boundary-Preserving Depth Upsampling Without Texture Copying Artifacts and Holes. 1-5
- Thomas Richter, Joachim Keinert, Antonin Descampe, Gaël Rouvroy, Alexandre Willeme:
Multi-generation-robust Coding with JPEG XS. 6-13
- Qiuxia Hou, Cheolkon Jung:
Occlusion Robust Light Field Depth Estimation Using Segmentation Guided Bilateral Filtering. 14-18
- Hamed Hamzeh, Mahdi Hemmati, Shervin Shirmohammadi:
Priced-Based Fair Bandwidth Allocation for Networked Multimedia. 19-24
- Xiaolu Liu, Shuang Liang, Wenlong Hang, Baiying Lei, Qiong Wang, Jing Qin, Kup-Sze Choi:
Performance Evaluation of Walking Imagery Training Based on Virtual Environment in Brain-Computer Interfaces. 25-30
- Mahdi Salarian, Mehdi Sharifzadeh, Rashid Ansari:
Image Based Localization Based on Feature Scale Consistency in BOF Vector. 31-37
Session 2: 360degree Video & Image
- Duc V. Nguyen, Huyen T. T. Tran, Anh T. Pham, Truong Cong Thang:
A New Adaptation Approach for Viewport-adaptive 360-degree Video Streaming. 38-44
- Cagri Ozcinar, Ana De Abreu, Sebastian Knorr, Aljosa Smolic:
Estimation of Optimal Encoding Ladders for Tiled 360° VR Video in Adaptive Streaming Systems. 45-52
- Falah Jabar, João Ascenso, Maria Paula Queluz:
Perceptual Analysis of Perspective Projection for Viewport Rendering in 360° Images. 53-60
Session 3: Learning
- Naifan Zhuang, Jun Ye, Kien A. Hua:
Convolutional DLSTM for Crowd Scene Understanding. 61-68
- Kento Masui, Akiyoshi Ochiai, Shintaro Yoshizawa, Hideki Nakayama:
Recurrent Visual Relationship Recognition with Triplet Unit. 69-76
- Zhengxue Cheng, Masaru Takeuchi, Jiro Katto:
A Pre-Saliency Map Based Blind Image Quality Assessment via Convolutional Neural Networks. 77-82
- Nathan Henderson, Ramazan Aygun:
Human Action Classification Using Temporal Slicing for Deep Convolutional Neural Networks. 83-90
- Alessandro Filini, João Ascenso, Riccardo Leonardi:
Rate-Accuracy Optimization of Deep Convolutional Neural Network Models. 91-98
- Wei-bang Chen, Yongjin Lu, James Li, Ben Zimmerman:
Automatic Classification of Microstructures in Thermal Barrier Coating Images. 99-106
Session 4: Visual Aspects
- Sho Ooi, Mutsuo Sano, Hajime Tabuchi, Fumie Saito, Satoshi Umeda:
Sustained Attention Function Evaluation During Cooking Based on Egocentric Vision. 107-113
- Hoang Le, Thong Doan, Carl S. Marshall, Selvakumar Panneer, Feng Liu:
Detecting Good Surface for Improvisatory Visual Projection. 114-121
- Shashank Mujumdar, Nitin Gupta, Abhinav Jain, Sameep Mehta:
Coherent Visual Description of Textual Instructions. 122-129
- Kevin Desai, Suraj Raghuraman, Rong Jin, Balakrishnan Prabhakaran:
QoE Studies on Interactive 3D Tele-Immersion. 130-137
- Uma Gopalakrishnan, P. Venkat Rangan, Ramkumar N, Balaji Hariharan:
Spatio-Temporal Compositing of Video Elements for Immersive eLearning Classrooms. 138-145
Session 5: Best Papers
- Nitin Gupta, Ankush Gupta, Vikas Joshi, L. Venkata Subramaniam, Sameep Mehta:
Deep Attribute Driven Image Similarity Learning Using Limited Data. 146-153
- Petr Elias, Jan Sedmidubský, Pavel Zezula:
A Real-Time Annotation of Motion Data Streams. 154-161
- Chengwu Liang, Enqing Chen, Lin Qi, Ling Guan:
Heterogeneous Features Fusion with Collaborative Representation Learning for 3D Action Recognition. 162-168
- Hashim Yasin:
Towards Efficient 3D Pose Retrieval and Reconstruction from 2D Landmarks. 169-176
- Yann Bayle, Ladislav Marsik, Martin Rusek, Matthias Robine, Pierre Hanna, Katerina Slaninová, Jan Martinovic, Jaroslav Pokorný:
Kara1k: A Karaoke Dataset for Cover Song Identification and Singing Voice Analysis. 177-184
- Jussi Tarvainen, Jorma Laaksonen, Tapio Takala:
Computational and Perceptual Determinants of Film Mood in Different Types of Scenes. 185-192
Session 6: Retrieval & Mining
- Ichiro Ide, Ye Zhang, Ryunosuke Tanishige, Keisuke Doman, Yasutomo Kawanishi, Daisuke Deguchi, Hiroshi Murase:
Summarization of News Videos Considering the Consistency of Auditory and Visual Contents. 193-199
- Yang Yang, Qian Kou, Shaoyi Du, Shuang Luo, Yuehu Liu, Bangyu Wu:
An Iterative Feature-Pair Updating Framework for Rigid Template Matching with Outliers. 200-207
- Martin Pichl, Eva Zangerle, Günther Specht, Markus Schedl:
Mining Culture-Specific Music Listening Behavior from Social Media Data. 208-215
- Bernd Münzer, Manfred Jürgen Primus, Sabrina Kletz, Stefan Petscharnig, Klaus Schoeffmann:
Static vs. Dynamic Content Descriptors for Video Retrieval in Laparoscopy. 216-223
- Parvaneh Pouladzadeh, Razib Iqbal, Shervin Shirmohammadi, Omid Fatemi:
A Cloud-Based Multi-threaded Implementation of View Synthesis System. 224-231
- Yinmiao Ma, Danlu Liu, Grant J. Scott, Jeffrey Uhlmann, Chi-Ren Shyu:
In-Memory Distributed Indexing for Large-Scale Media Data Retrieval. 232-239
Session 7-A: Retrieval, Recommendation, and Summarization
- Jan Sedmidubský, Petr Elias, Pavel Zezula:
Enhancing Effectiveness of Descriptors for Searching and Recognition in Motion Capture Data. 240-243
- Wei-Ta Chu, Ming-Chih Kao:
Blog Article Summarization with Image-Text Alignment Techniques. 244-247
- Fairouz Hussein, Massimo Piccardi:
Minimum-Risk Structured Learning of Video Summarization. 248-251
- Xueyu Mao, Saayan Mitra, Viswanathan Swaminathan:
Feature Selection for FM-Based Context-Aware Recommendation Systems. 252-255
- Shuo Yang, Somdeb Sarkhel, Saayan Mitra, Viswanathan Swaminathan:
Personalized Video Recommendations for Shared Accounts. 256-259
Session 7-B: Tracking & Matching
- Inyong Yun, Seokhoon Boo, Joongkyu Kim, Cheolkon Jung:
Moment-Based Dense Correspondence Matching Robust to Image Variation. 260-263
- Henri Nicolas, Valentina Guerin Detourville:
Very Small Moving Objects Detection in Videos by Means of Fuzzy Logic and Reliability Coefficients: Application to Migrating Birds Counting. 264-267
- Leon Strapper, Robert Mertens, Sebastian Pospiech, Florian Bussmann, Arthur Grah, Marius Mamsch:
A Gaze Tracking Based, Multi Modal Human Computer Interaction Concept for Efficient Input. 268-273
- Ahmad Delforouzi, Marcin Grzegorzek:
Robust and Fast Object Tracking for Challenging 360-degree Videos. 274-277
- Fan Yang, Sébastien Poullot, Shin'ichi Satoh:
Temporal Matching Kernel with Embedded Stability-Sensitive Filter. 278-283
Session 7-C: Mining & Learning
- Zhan Xu, Guoping Qiu:
A Color Prediction System for Interactive Drawing Based Image Retrieval on Mobile Devices. 284-287
- Yi Yu, Suhua Tang, Kiyoharu Aizawa, Akiko Aizawa:
VenueNet: Fine-Grained Venue Discovery by Deep Correlation Learning. 288-291
- Haijun Lei, Yujia Zhao, Yuting Wen, Baiying Lei:
Adaptive Sparse Learning for Neurodegenerative Disease Classification. 292-295
- Pushpalatha K, Ananthanarayana V. S.:
A New Multimedia Documents Clustering Approach Based on Feature Patterns Similarity. 296-299
- Somdeb Sarkhel, Wreetabrata Kar, Viswanathan Swaminathan:
User Segment Identification Based on Similarity in Content Consumption. 300-303
Session 8-A: Contents & Features
- Ichiro Ide, Yasutomo Kawanishi, Kyoka Kunishiro, Frank Nack, Daisuke Deguchi, Hiroshi Murase:
Automatic Selection of Web Contents Towards Automatic Authoring of a Video Biography. 304-307
- Markus Schedl, Florian Lemmerich, Bruce Ferwerda, Marcin Skowron, Peter Knees:
Indicators of Country Similarity in Terms of Music Taste, Cultural, and Socio-economic Factors. 308-311
- Francisco Javier Velázquez-García, Frank Eliassen:
DAMPAT: Dynamic Adaptation of Multimedia Presentations in Application Mobility. 312-317
- Frode Eika Sandnes, Evelyn Eika:
Drawing Abrasive Hologram Animations with Auto-Generated Scratch Patterns. 318-321
- Martin Oelsch, Dominik Van Opdenbosch, Eckehard G. Steinbach:
Survey of Visual Feature Extraction Algorithms in a Mars-like Environment. 322-325
Session 8-B: Video Streaming
- Justas Poderys, José Soler:
Streaming Multimedia via Overlay Networks Using Wi-Fi Peer-to-Peer Connections. 326-329
- Maryam Amiri, Hussein Al Osman, Shervin Shirmohammadi:
SDN-enabled Game-Aware Network Management for Residential Gateways. 330-333
- Jungwoo Lee, Hwangje Han, Minseok Song:
Balancing Transcoding Against Quality-of-Experience to Limit Energy Consumption in Video-on-Demand Systems. 334-337
- Abbas Javadtalab, Mona Omidyeganeh, Shervin Shirmohammadi, Mojtaba Hosseini:
A Bitrate-Conservative Fast-Adjusting Rate Controller for Video Conferencing. 338-341
Session 8-C: Enhancement & Security
- Tingting Sun, Cheolkon Jung, Peng Ke, Hyoseob Song, Jungmee Hwang:
Readability Enhancement of Low Light Videos Based on Discrete Wavelet Transform. 342-345
- Tsuo-Chen Wu, Mei-Chen Yeh:
The Impact of Feng Shui on House Price: A Data Perspective. 346-349
- Qingtao Fu, Cheolkon Jung, Ge Yang:
Adaptive Quantization-Based HDR Video Coding with HEVC Main 10 Profile. 350-353
- Ching-Chun Chang, Chang-Tsun Li:
Secure Secret Sharing in the Cloud. 358-361
Demo I: Video Related
- Arttu Ylä-Outinen, Ari Lemmetti, Marko Viitanen, Jarno Vanne, Timo D. Hämäläinen:
Kvazaar: HEVC/H.265 4K30p Intra Encoder. 362-363
- Andreas Leibetseder, Bernd Münzer, Klaus Schoeffmann, Jörg Keckstein:
Endometriosis Annotation in Endoscopic Videos. 364-365
- Bernd Münzer, Klaus Schoeffmann, Laszlo Boeszoermenyi:
EndoXplore: A Web-Based Video Explorer for Endoscopic Videos. 366-367
- Sai Samarth R. Phaye, Love Mehta, Mukesh Kumar Saini:
The One Man Show. 368-369
Demo II: Interaction, Tracking, Network Related
- Luisa Brinkschulte, Robert Mertens, Leon Strapper, Sebastian Pospiech, Lars Knipping:
A Multi Modal Interaction Paradigm Combining Gaze Tracking and Keyboard. 370-371
- Jan Sedmidubský, Pavel Zezula:
A Web Application for Subsequence Matching in 3D Human Motion Data. 372-373
- Bo Wei, Wataru Kawakami, Kenji Kanai, Jiro Katto:
A History-Based TCP Throughput Prediction Incorporating Communication Quality Features by Support Vector Regression for Mobile Network. 374-375
- Chao-Yung Hsu, Li-Wei Kang, Teng-Yi You, Wei-Chen Jhong:
Vision-Based Automatic Identification Tracking of Steel Products for Intelligent Steel Manufacturing. 376-377
Workshop: Emerging Multimedia Applications and Services for Smart Cities (EMASC 2017)
- Mohammed F. Alhamid, Saad Alsahli, Majdi Rawashdeh, Mubarak Alrashoud:
Detection and Visualization of Arabic Emotions on Social Emotion Map. 378-381
- Neeraj Goel, Rajat Sharma, N. Nikhil, S. D. Mahanoor, Mukesh Kumar Saini:
A Crowd-Sourced Adaptive Safe Navigation for Smart Cities. 382-387
- Wanjun Pei, Benjamin Guthier, Abdulmotaleb El Saddik:
The Solar System as a 3D Metaphor to Visualize User Interactions in a Social Network. 388-393
- Amjad A. Alghanim, Sk. Md. Mizanur Rahman, M. Anwar Hossain:
Privacy Analysis of Smart City Healthcare Services. 394-398
- Lingchao Kong, Jingyi Zhu, Rui Dai, Mohammad Nazmus Sadat:
Impact of Distributed Caching on Video Streaming Quality in Information Centric Networks. 399-402
Workshop: Intelligent Multimedia Applications and Design for Quality Living (IMAD 2017)
- Moussa Ouedraogo, Wassila Aggoune-Mtalaa, Djamel Khadraoui:
MAESTRO: Constructing a Reference Framework for Self Monitoring Devices Dedicated to Seniors. 403-406
- Mukesh Kumar Saini, Ali Danesh, Abdulmotaleb El Saddik:
Shall IoT User Interfaces Start Recommending Multimedia Devices as Well? 407-412
- Debajyoti Pal, Tuul Triyason, Suree Funilkul:
Smart Homes and Quality of Life for the Elderly: A Systematic Review. 413-419
- Randy Tan, Naimul Mefraz Khan, Ling Guan:
Real-Time System for Human Activity Analysis. 420-425
- Ho-Yin Yue, Wai-Man Pang, Chiu-Yin Tam, Clive Wai-Ngok Fan:
Does Spending More Time on Facebook Makes Users Engage in Politics? 426-431
- Geoffrey Poon, Ki-Mei Li, Wai-Man Pang:
A Memory-Friendly Multi-modal Emotion Analysis for Smart Toy. 432-437
- Kin Chi Chan, Tak Leung Cheung, Siu Hong Lai, Kin Chung Kwan, Ho-Yin Yue, Wai-Man Pang:
Where2Buy: A Location-Based Shopping App with Products-wise Searching. 438-443
- Jinta Zheng, Jing Qin, Kup-Sze Choi:
Towards Interactive and Realistic Rendering of 3D Fetal Ultrasound via Photon Mapping. 444-449
- Kup-Sze Choi, Shuang Liang:
Enhancing the Performance of Brain-Computer Interface with Haptics. 450-452
- Tatsuya Nagashima, Kenji Kanai, Jiro Katto:
QoS and QoE Evaluations of 2K and 4K DASH Contents Distributions. 453-458
- Xi Liu, Chen Li, Lihua Tian:
Hand Gesture Recognition Based on Wavelet Invariant Moments. 459-464
- Haijun Lei, Tao Han, Weifeng Huang, Jong Yih Kuo, Zhen Yu, Xinzi He, Baiying Lei:
Cross-Modal Transfer Learning for HEp-2 Cell Classification Based on Deep Residual Network. 465-468
Workshop: Mining and Applications on Multimedia (MAM 2017)
- Sheng-Chih Chen, Yi-Cheng Chen, Wei-Lin Chen:
Data Mining Techniques vs. Policy Development: Evaluating Advanced Applied Technological Policies and Emerging Communication Technology. 469-474
- Chung-Hua Chu, Hsiao-Ting Shih, Chih-Hua Tai:
Study of Touch Identify for Mobile Device Security. 475-478
- Markus Schedl, Bruce Ferwerda:
Large-Scale Analysis of Group-Specific Music Genre Taste from Collaborative Tags. 479-482
- Jong Yih Kuo, Chia Wei Pan, Baiying Lei:
Using Stacked Denoising Autoencoder for the Student Dropout Prediction. 483-488
Workshop: Machine Learning and Computing for Visual Semantic Analysis (MLCSA 2017)
- Rui Liang, Qingxin Zhu, Honglei Wei, Shujiao Liao:
A Video Shot Boundary Detection Approach Based on CNN Feature. 489-494
- Chairath Sirirattanapol, Yusuke Matsui, Shin'ichi Satoh, Kuninori Matsuda, Kazuaki Yamamoto:
Deep Image Retrieval Applied on Kotenseki Ancient Japanese Literature. 495-499
- Xian-Hua Han, Jan Wang, Boxin Shi, Yinqiang Zheng, Yen-Wei Chen:
Hyper-spectral Image Super-resolution Using Non-negative Spectral Representation with Data-Guided Sparsity. 500-506
- Jian Wang, Xian-Hua Han, Yingying Xu, Lanfen Lin, Hongjie Hu, Chongwu Jin, Yen-Wei Chen:
Tensor Sparse Representation of Temporal Features for Content-Based Retrieval of Focal Liver Lesions Using Multi-phase Medical Images. 507-510
- Jun-Ho Choi, Manri Cheon, Jong-Seok Lee:
Influence of Video Quality on Multi-view Activity Recognition. 511-515
- Kenki Nakamura, Qiang Ma:
Context-aware Image Generation by Using Generative Adversarial Networks. 516-523
Workshop: Multimedia Search and Applications (MSA 2017)
- Sixin Xue, Yayuan Yan, Ji-Jiang Yang, Yue Wang:
A New Governance Architecture for Government Information Resources Based on Big Data Ecological Environment in China. 524-530
- Yi Yang, Guigang Zhang, Jian Wang, Weixing Huang:
Distributed Representation for Neighborhood-Based Collaborative Filtering. 531-535
- Yanzhou Gong, Ziqiang Ni, Weixing Huang, Jian Wang, Guigang Zhang:
A Real-Time Chinese Calligraphy Creation System. 536-542
- Zhiwen Lei, Xiaoxiao Yang, Yanzhou Gong, Weixing Huang, Jian Wang, Guigang Zhang:
A Robust Hand Cursor Interaction Method Using Kinect. 543-548
- Joni Rasanen, Marko Viitanen, Jarno Vanne, Timo D. Hämäläinen:
Kvazzup: Open Software for HEVC Video Calls. 549-552
- Yuan Wang, Lihua Tian, Chen Li:
LBP-SVD Based Copy Move Forgery Detection Algorithm. 553-556
- Mi Zhang, Lihua Tian, Chen Li:
Key Frame Extraction Based on Entropy Difference and Perceptual Hash. 557-560
- Masahiko Sugimura, Takayuki Baba, Ryuta Tanaka:
Fast Binary Descriptor Search for Keypoint Matching by Norm Ordering. 561-566
Workshop: Multimedia Technologies for E-Learning (MTEL 2017)
- Toby Dragon, Carrie Lindeman:
Automated Assessment of Students' Conceptual Understanding: Supporting Students and Teachers Using Data from an Interactive Textbook. 567-572
- Aman Chaudhary, Akshatha K, Kiran Kodlekere, SuryaPrasad J:
Keyword Based Indexing of a Multimedia File. 573-576
- Florian Schimanke, Robert Mertens, Leonard Hill:
A Unit Testing Framework for Context Variant Code in a Mobile Learning App. 577-582
- Zhe Yang, Lihua Tian, Chen Li:
A Fast Video Shot Boundary Detection Employing OTSU's Method and Dual Pauta Criterion. 583-586
- Johannes Klein, Jean Botev, Steffen Rothkugel:
Enabling Near Real-Time Collaboration in a Distributed Multimedia Editing Environment. 587-594
- Wen-Hung Liao, Chin-Wen Chang, Yi-Chieh Wu:
Classification of Reading Patterns Based on Gaze Information. 595-600
