


default search action
26th MMM 2020: Daejeon, South Korea
- Yong Man Ro, Wen-Huang Cheng, Junmo Kim, Wei-Ta Chu, Peng Cui, Jung-Woo Choi, Min-Chun Hu, Wesley De Neve:
MultiMedia Modeling - 26th International Conference, MMM 2020, Daejeon, South Korea, January 5-8, 2020, Proceedings, Part II. Lecture Notes in Computer Science 11962, Springer 2020, ISBN 978-3-030-37733-5
Poster Session
- Pengfei Chen, Minglei Yuan, Tong Lu:
Multi-scale Comparison Network for Few-Shot Learning. 3-13 - Jiayu Song, QingHua Xu
, Wei Liu, Yueran Zu, Mengdong Chen:
Semantic and Morphological Information Guided Chinese Text Classification. 14-26 - Duc V. Nguyen
, Huyen T. T. Tran
, Truong Cong Thang
:
A Delay-Aware Adaptation Framework for Cloud Gaming Under the Computation Constraint of User Devices. 27-38 - Dongbiao He, Jinlei Jiang, Cédric Westphal, Guangwen Yang:
Efficient Edge Caching for High-Quality 360-Degree Video Delivery. 39-51 - Suping Zhou, Jia Jia, Long Zhang, Yanfeng Wang, Wei Chen, Fanbo Meng, Fei Yu, Jialie Shen:
Inferring Emphasis for Real Voice Data: An Attentive Multimodal Neural Network Approach. 52-62 - Xi Yang, Yeo-Jin Kim, Michelle Taub, Roger Azevedo, Min Chi:
PRIME: Block-Wise Missingness Handling for Multi-modalities in Intelligent Tutoring Systems. 63-75 - Yuwei Yang, Fanman Meng, Hongliang Li, Qingbo Wu, Xiaolong Xu, Shuai Chen:
A New Local Transformation Module for Few-Shot Segmentation. 76-87 - Mingjie Wu, Yongfei Zhang
, Tianyu Zhang, Wenqi Zhang:
Background Segmentation for Vehicle Re-identification. 88-99 - Joanna Hong, Hong Joo Lee, Yelin Kim, Yong Man Ro
:
Face Tells Detailed Expression: Generating Comprehensive Facial Expression Sentence Through Facial Action Units. 100-111 - Yang Wang, Ye Qian, Jiahao Shi, Feng Su:
A Deep Convolutional Deblurring and Detection Neural Network for Localizing Text in Videos. 112-124 - Wei Hou, Dakui Wang, Xiaojun Chen:
Generate Images with Obfuscated Attributes for Private Image Classification. 125-135 - Xiaozhong Ji, Yirui Wu, Tong Lu:
Context-Aware Residual Network with Promotion Gates for Single Image Super-Resolution. 136-147 - Xiaoyu Xu
, Jian Qian, Li Yu, Shengju Yu, Hao Tao, Ran Zhu:
A Compact Deep Neural Network for Single Image Super-Resolution. 148-160 - Kai Huang, Jianjun Li, Shichao Cheng, Jie Yu, Wanyong Tian, Lulu Zhao, Junfeng Hu, Chin-Chen Chang:
An Efficient Algorithm of Facial Expression Recognition by TSG-RNN Network. 161-174 - Yiming Li, Xiaoshan Yang, Changsheng Xu:
Structured Neural Motifs: Scene Graph Parsing via Enhanced Context. 175-188 - Duanzheng Guan, Dengshi Li, Xuebei Cai, Xiaochen Wang, Ruimin Hu:
Perceptual Localization of Virtual Sound Source Based on Loudspeaker Triplet. 189-200 - Xiaoge Song, Yirui Wu, Wenhai Wang, Tong Lu:
TK-Text: Multi-shaped Scene Text Detection via Instance Segmentation. 201-213 - Hirotaka Kato, Takatsugu Hirayama, Ichiro Ide
, Keisuke Doman, Yasutomo Kawanishi
, Daisuke Deguchi
, Hiroshi Murase:
More-Natural Mimetic Words Generation for Fine-Grained Gait Description. 214-225 - Ying Zhao
, Zhiwei Luo, Changqin Quan
, Dianchao Liu, Gang Wang:
Lite Hourglass Network for Multi-person Pose Estimation. 226-238
Special Session Papers // SS1: AI-Powered 3D Vision
- Yunhan Sun
, Jinlong Shi, Suqin Bai, Qiang Qian, Zhengxing Sun:
Single View Depth Estimation via Dense Convolution Network with Self-supervision. 241-253 - Menghan Zhang, Yunbo Rao, Jiansu Pu, Xun Luo, Qifei Wang:
Multi-data UAV Images for Large Scale Reconstruction of Buildings. 254-266 - Sen Xiang, Qiong Liu, Huiping Deng, Jin Wu, Li Yu:
Deformed Phase Prediction Using SVM for Structured Light Depth Generation. 267-278 - Liang Wang
, Biying Yan, Fuqing Duan, Ke Lu:
Extraction of Multi-class Multi-instance Geometric Primitives from Point Clouds Using Energy Minimization. 279-290 - Xiangyu Sun, Qiong Liu, You Yang:
Similarity Graph Convolutional Construction Network for Interactive Action Recognition. 291-303 - Zihao Chen, Xu Wang, Yu Zhou
, Longhao Zou, Jianmin Jiang:
Content-Aware Cubemap Projection for Panoramic Image via Deep Q-Learning. 304-315 - Teng Wan, Shaoyi Du, Wenting Cui, Qixing Xie, Yuying Liu, Zuoyong Li:
Robust RGB-D Data Registration Based on Correntropy and Bi-directional Distance. 316-326 - Hui Cao, Haikuan Du, Siyu Zhang, Shen Cai:
InSphereNet: A Concise Representation and Classification Method for 3D Object. 327-339 - Wenting Cui, Shaoyi Du, Teng Wan, Yan Liu, Yuying Liu, Yang Yang, Qingnan Mou, Mengqi Han, Yu-Cheng Guo:
3-D Oral Shape Retrieval Using Registration Algorithm. 340-349 - Yu Wang, Tao Lu
, Ruobo Xu, Yanduo Zhang:
Face Super-Resolution by Learning Multi-view Texture Compensation. 350-360 - Junlin Zhang, Xu Wang:
Light Field Salient Object Detection via Hybrid Priors. 361-372
SS2: Multimedia Analytics: Perspectives, Tools and Applications
- Werner Bailer, Maarten Wijnants, Hendrik Lievens
, Sandy Claes:
Multimedia Analytics Challenges and Opportunities for Creating Interactive Radio Content. 375-387 - Iva Gornishka
, Stevan Rudinac, Marcel Worring
:
Interactive Search and Exploration in Discussion Forums Using Multimodal Embeddings. 388-399 - Xixun Wu, Binheng Song, Zhixiang Wang, Chun Yuan:
An Inverse Mapping with Manifold Alignment for Zero-Shot Learning. 400-411 - Aaron Duane, Cathal Gurrin
:
Baseline Analysis of a Conventional and Virtual Reality Lifelog Retrieval System. 412-423 - Aikaterini Katmada, George Kalpakis
, Theodora Tsikrika
, Stelios Andreadis
, Stefanos Vrochidis
, Ioannis Kompatsiaris:
An Extensible Framework for Interactive Real-Time Visualizations of Large-Scale Heterogeneous Multimedia Information from Online Sources. 424-435
SS3: Multimedia Datasets for Repeatable Experimentation (MDRE)
- Andreas Leibetseder
, Sabrina Kletz
, Klaus Schoeffmann
, Simon Keckstein, Jörg Keckstein:
GLENDA: Gynecologic Laparoscopy Endometriosis Dataset. 439-450 - Debesh Jha
, Pia H. Smedsrud, Michael A. Riegler, Pål Halvorsen, Thomas de Lange, Dag Johansen, Håvard D. Johansen:
Kvasir-SEG: A Segmented Polyp Dataset. 451-462 - Frank Hopfgartner
, Cathal Gurrin
, Hideo Joho:
Rethinking the Test Collection Methodology for Personal Self-tracking Data. 463-474 - Graham Healy
, Zhengwei Wang, Tomás Ward, Alan F. Smeaton, Cathal Gurrin
:
Experiences and Insights from the Collection of a Novel Multimedia EEG Dataset. 475-486
SS4: MMAC: Multi-modal Affective Computing of Large-Scale Multimedia Data
- Zhilei Liu
, Jiahui Dong, Cuicui Zhang, Longbiao Wang, Jianwu Dang:
Relation Modeling with Graph Convolutional Networks for Facial Action Unit Detection. 489-501 - Jian Guan
, Liming Yin, Jianguo Sun, Shuhan Qi, Xuan Wang, Qing Liao
:
Enhanced Gaze Following via Object Detection and Human Pose Estimation. 502-513 - Zhilei Liu
, Diyi Liu, Yunpeng Wu:
Region Based Adversarial Synthesis of Facial Action Units. 514-526 - Zhilei Liu
, Le Li, Yunpeng Wu, Cuicui Zhang:
Facial Expression Restoration Based on Improved Graph Convolutional Networks. 527-539 - Xiaona Guo, Wei Zhong, Long Ye, Li Fang, Yan Heng, Qin Zhang:
Global Affective Video Content Regression Based on Complementary Audio-Visual Features. 540-550
SS5: MULTIMED2020: Multimedia and Multimodal Analytics in the Medical Domain and Pervasive Environments
- Henning Müller
, Vincent Andrearczyk, Oscar Alfonso Jiménez del Toro, Anjani Dhrangadhariya, Roger Schaer, Manfredo Atzori:
Studying Public Medical Images from the Open Access Literature and Social Networks for Model Training and Knowledge Extraction. 553-564 - Jun Wu, Yao Zhang, Jie Wang, Jianchun Zhao, Dayong Ding, Ningjiang Chen, Lingling Wang, Xuan Chen, Chunhui Jiang, Xuan Zou, Xing Liu, Hui Xiao, Yuan Tian
, Zongjiang Shang, Kaiwei Wang, Xirong Li, Gang Yang, Jianping Fan:
AttenNet: Deep Attention Based Retinal Disease Classification in OCT Images. 565-576 - Tobias Baur, Sina Clausen, Alexander Heimerl, Florian Lingenfelser, Wolfgang Lutz
, Elisabeth André:
NOVA: A Tool for Explanatory Multimodal Behavior Analysis and Its Application to Psychotherapy. 577-588 - Sabrina Kletz, Klaus Schoeffmann, Andreas Leibetseder, Jenny Benois-Pineau, Heinrich Husslein:
Instrument Recognition in Laparoscopy for Technical Skill Assessment. 589-600 - Panagiotis Giannakeris, Georgios Meditskos, Konstantinos Avgerinakis, Stefanos Vrochidis
, Ioannis Kompatsiaris:
Real-Time Recognition of Daily Actions Based on 3D Joint Movements and Fisher Encoding. 601-613 - Athina Tsanousa, Angelos Chatzimichail, Georgios Meditskos, Stefanos Vrochidis
, Ioannis Kompatsiaris:
Model-Based and Class-Based Fusion of Multisensor Data. 614-625 - Natalia Sokolova
, Klaus Schoeffmann, Mario Taschwer, Doris Putzgruber-Adamitsch, Yosuf El-Shabrawi:
Evaluating the Generalization Performance of Instrument Classification in Cataract Surgery Videos. 626-636
SS6: Intelligent Multimedia Security
- Yajun Xu, Zhendong Mao, Peng Zhang, Bin Wang:
Compact Position-Aware Attention Network for Image Semantic Segmentation. 639-650 - Chuanbin Liu, Youliang Tian, Hongtao Xie:
Law Is Order: Protecting Multimedia Network Transmission by Game Theory and Mechanism Design. 651-668 - Qiuxian Li, Youliang Tian:
Rational Delegation Computing Using Information Theory and Game Theory Approach. 669-680 - Xuecheng Ning, Xiaoshan Yang, Changsheng Xu:
Multi-hop Interactive Cross-Modal Retrieval. 681-693
DEMO Papers
- Marc A. Kastner
, Ichiro Ide
, Yasutomo Kawanishi
, Takatsugu Hirayama, Daisuke Deguchi
, Hiroshi Murase:
Browsing Visual Sentiment Datasets Using Psycholinguistic Groundings. 697-702 - Chih-Yao Chang, Bo-I Chuang, Chi-Chun Hsia, Wen-Cheng Chen, Min-Chun Hu:
Framework Design for Multiplayer Motion Sensing Game in Mixture Reality. 703-708 - Yi Yu, Florian Harscoët, Simon Canales, Gurunath Reddy M, Suhua Tang
, Junjun Jiang
:
Lyrics-Conditioned Neural Melody Generation. 709-714 - Abdullah Alfarrarjeh, Zeyu Ma, Seon Ho Kim, Yeonsoo Park, Cyrus Shahabi:
A Web-Based Visualization Tool for 3D Spatial Coverage Measurement of Aerial Images. 715-721 - Zhongbo Sun, Yannan Wang, Li Cao:
An Attention Based Speaker-Independent Audio-Visual Deep Learning Model for Speech Enhancement. 722-728 - Tony Zhao, Jaeyoung Choi, Gerald Friedland:
DIME: An Online Tool for the Visual Comparison of Cross-modal Retrieval Models. 729-733 - Jung-Woo Choi
:
Real-Time Demonstration of Personal Audio and 3D Audio Rendering Using Line Array Systems. 734-738 - Yongwoo Kim, Jae-Seok Choi, Jaehyup Lee, Munchurl Kim:
A CNN-Based Multi-scale Super-Resolution Architecture on FPGA for 4K/8K UHD Applications. 739-744 - Abdul Muqeet
, Sung-Ho Bae:
Effective Utilization of Hybrid Residual Modules in Deep Neural Networks for Super Resolution. 745-750
VBS Papers
- Andreas Leibetseder, Bernd Münzer, Jürgen Primus, Sabrina Kletz, Klaus Schoeffmann:
diveXplore 4.0: The ITEC Deep Interactive Video Exploration System at VBS2020. 753-759 - Loris Sauter
, Mahnaz Amiri Parian
, Ralph Gasser
, Silvan Heller
, Luca Rossetto
, Heiko Schuldt
:
Combining Boolean and Multimedia Retrieval in vitrivr for Large-Scale Video Search. 760-765 - Nguyen-Khang Le
, Dieu-Hien Nguyen
, Minh-Triet Tran
:
An Interactive Video Search Platform for Multi-modal Retrieval with Advanced Concepts. 766-771 - Phuong Anh Nguyen
, Jiaxin Wu
, Chong-Wah Ngo, Danny Francis, Benoit Huet:
VIREO @ Video Browser Showdown 2020. 772-777 - Stelios Andreadis
, Anastasia Moumtzidou, Konstantinos Apostolidis, Konstantinos Gkountakos
, Damianos Galanopoulos, Emmanouil Michail, Ilias Gialampoukidis
, Stefanos Vrochidis
, Vasileios Mezaris, Ioannis Kompatsiaris:
VERGE in VBS 2020. 778-783 - Jakub Lokoc, Gregor Kovalcík, Tomás Soucek:
VIRET at Video Browser Showdown 2020. 784-789 - Miroslav Kratochvíl
, Patrik Veselý, Frantisek Mejzlík, Jakub Lokoc:
SOM-Hunter: Video Browsing with Relevance-to-SOM Feedback Loop. 790-795 - Björn Þór Jónsson, Omar Shahbaz Khan
, Dennis C. Koelma, Stevan Rudinac, Marcel Worring
, Jan Zahálka
:
Exquisitor at the Video Browser Showdown 2020. 796-802 - Byoungjun Kim, Ji Yea Shim, Minho Park, Yong Man Ro
:
Deep Learning-Based Video Retrieval Using Object Relationships and Associated Audio Classes. 803-808 - Sungjune Park, Jaeyub Song, Minho Park, Yong Man Ro
:
IVIST: Interactive VIdeo Search Tool in VBS 2020. 809-814

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.