![](https://dblp.org/img/logo.ua.320x120.png)
![](https://dblp.org/img/dropdown.dark.16x16.png)
![](https://dblp.org/img/peace.dark.16x16.png)
Остановите войну!
for scientists:
![search dblp search dblp](https://dblp.org/img/search.dark.16x16.png)
![search dblp](https://dblp.org/img/search.dark.16x16.png)
default search action
MMM 2024, Amsterdam, The Netherlands - Part III
- Stevan Rudinac
, Alan Hanjalic
, Cynthia C. S. Liem
, Marcel Worring
, Björn Þór Jónsson
, Bei Liu
, Yoko Yamakata
:
MultiMedia Modeling - 30th International Conference, MMM 2024, Amsterdam, The Netherlands, January 29 - February 2, 2024, Proceedings, Part III. Lecture Notes in Computer Science 14556, Springer 2024, ISBN 978-3-031-53310-5 - Qiang Chen
, Fuxiao He
, Guoqiang Xiao
:
Global-to-Local Feature Mining Network for RGB-Infrared Person Re-Identification. 1-13 - Lu Chen
, Jiawei Tan
, Pingan Yang
, Hongxing Wang
:
Semantic Transition Detection for Self-supervised Video Scene Segmentation. 14-27 - Xueyang Qin, Lishuang Li, Jing Hao, Meiling Ge, Jiayi Huang, Guangyao Pang:
Multi-task Collaborative Network for Image-Text Retrieval. 28-42 - Hao-Yuan Ma, Li Zhang, Xiang-Yi Wei:
FGENet: Fine-Grained Extraction Network for Congested Crowd Counting. 43-56 - Jingjing Xie, Jixuan Hong, Manjin Sheng, Chenhui Yang:
MSMV-UNet: A 2.5D Stroke Lesion Segmentation Method Based on Multi-slice Feature Fusion. 57-69 - Xiang Gao, Sining Wu, Fan Wang, Xiaopeng Hu:
Non-Local Spatial-Wise and Global Channel-Wise Transformer for Efficient Image Super-Resolution. 70-85 - Ting Peng, Yihang Zhou, Rong Sun, Yizhi Luo, Yuqi Li:
MobileViT-FocR: MobileViT with Fixed-One-Centre Loss and Gradient Reversal for Generalised Fake Face Detection. 86-100 - Xiran Zhang, Haiyan Liu, Caixia Liu, Haiyang Zhang, Zhiwei Huo:
ASF-Conformer: Audio Scoring Conformer with FFC for Speaker Verification in Noisy Environments. 101-111 - Yuanjian He, Weile Zhang, Junyuan Deng, Yulai Cong:
Prior-Knowledge-Free Video Frame Interpolation with Bidirectional Regularized Implicit Neural Representations. 112-126 - Shengrong Ling, Sisi You, Bing-Kun Bao:
Two-Stage Reasoning Network with Modality Decomposition for Text VQA. 127-140 - Honglei Zheng, Wenkang Fan, Yinran Chen, Xiongbiao Luo:
Localization and Local Motion Magnification of Pulsatile Regions in Endoscopic Surgery Videos. 141-154 - Shinichi Ka
, Koichi Shinoda
:
Co-speech Gesture Generation with Variational Auto Encoder. 155-168 - Chunyin Sheng, Xiang Gao, Xiaopeng Hu, Fan Wang:
Differentiable Neural Architecture Search Based on Efficient Architecture for Lightweight Image Super-Resolution. 169-183 - Zhengwei Yang, Yange Wang, Lei Ma, Xiangzheng Li:
Learning Collaborative Reinforcement Attention for 3D Face Reconstruction and Dense Alignment. 184-197 - Konstantinos Triaridis
, Vasileios Mezaris
:
Exploring Multi-modal Fusion for Image Manipulation Detection and Localization. 198-211 - Feifei Xu, Zheng Zhong, Yitao Zhu, Yingchen Zhou, Guangzhen Li:
Appearance-Motion Dual-Stream Heterogeneous Network for VideoQA. 212-227 - Xiang Li, Ming Lu, Ziming Guo, Xiaoming Zhang:
Adaptive Token Selection and Fusion Network for Multimodal Sentiment Analysis. 228-241 - Pei Chen, Zhiyong Feng, Meng Xing, Yiming Zhang, Jinqing Zheng:
Exploring Imperceptible Adversarial Examples in YCbCr Color Space. 242-256 - Liyun Xu
, Min Zhang:
Fractional-Order Image Moments and Applications. 257-269 - Maria Pegia
, Ferran Agullo Lopez
, Anastasia Moumtzidou
, Alberto Gutierrez-Torre
, Björn Þór Jónsson
, Josep Lluis Berral-Garcia
, Ilias Gialampoukidis
, Stefanos Vrochidis
, Ioannis Kompatsiaris
:
Time-Quality Tradeoff of MuseHash Query Processing Performance. 270-283 - Zhanjie Jin, Anming Dong
, Jiguo Yu
, Shuxiang Dong, You Zhou:
Dual-Fisheye Image Stitching via Unsupervised Deep Learning. 284-298 - Junpeng Liu, Hengkang Bao:
CA-GAN: Conditional Adaptive Generative Adversarial Network for Text-to-Image Synthesis. 299-312 - Dexu Yao, Aimin Li, Deqi Liu, Mengfan Cheng:
RDC-YOLOv5: Improved Safety Helmet Detection in Adverse Weather. 313-326 - Aril Bernhard Ovesen, Tor-Arne Schmidt Nordmo, Michael Alexander Riegler, Pål Halvorsen, Dag Johansen:
Sustainable Commercial Fishery Control Using Multimedia Forensics Data from Non-trusted, Mobile Edge Nodes. 327-340 - Shan Cao, Qingfeng Wu:
MC-TCMNER: A Multi-modal Fusion Model Combining Contrast Learning Method for Traditional Chinese Medicine NER. 341-354 - Xiangyu Chen, Md Ayshik Rahman Khan
, Md. Rakibul Hasan
, Tom Gedeon, Md. Zakir Hossain
:
C3-PO: A Convolutional Neural Network for COVID Onset Prediction from Cough Sounds. 355-368 - Mingyuan Ge, Jianan Shui, Junyu Chen, Mingyong Li:
Pseudo-label Based Unsupervised Momentum Representation Learning for Multi-domain Image Retrieval. 369-380 - Jianbo Xiong, Shinan Zou, Jin Tang:
DFGait: Decomposition Fusion Representation Learning for Multimodal Gait Recognition. 381-395 - Jiangfeng Li
, Bowen Wang
, Yongrui Qin
, Chenxi Zhang
, Gang Yu
, Qinpei Zhao
:
MoPE: Mixture of Pooling Experts Framework for Image-Text Retrieval. 396-409 - Linzi Xing, Quan Hung Tran, Fabian Caba, Franck Dernoncourt, Seunghyun Yoon, Zhaowen Wang, Trung Bui, Giuseppe Carenini:
Multi-modal Video Topic Segmentation with Dual-Contrastive Domain Adaptation. 410-424 - Wenlong Lu, Suping Wu, Xitie Zhang, Shengjia Zhang:
Unsupervised Multi-collaborative Learning Network for 3D Face Reconstruction. 425-436 - Yiru Zhang, Zeke Li, Bijing Liu, Haiwei Fan, Yong Yang, Qun Yang:
A Region Based Non-overlapping Reference Speech Estimation Method for Speaker Extraction. 437-447 - Pan Li, Suping Wu, Xitie Zhang, Yuxin Peng, Boyang Zhang, Bin Wang:
Self-supervised Edge Structure Learning for Multi-view Stereo and Parallel Optimization. 448-461 - Shuai Wang, Jiayi Shen, Athanasios Efthymiou, Stevan Rudinac, Monika Kackovic, Nachoem Wijnberg, Marcel Worring:
Prototype-Enhanced Hypergraph Learning for Heterogeneous Information Networks. 462-476 - Ali Abdari
, Alex Falcon
, Giuseppe Serra
:
A Language-Based Solution to Enable Metaverse Retrieval. 477-488 - Chenlin Zhao
, Jiabo Ye
, Yaguang Song
, Ming Yan
, Xiaoshan Yang
, Changsheng Xu
:
Part-Aware Prompt Tuning for Weakly Supervised Referring Expression Grounding. 489-502 - Sarwar Khan, Jun-Cheng Chen, Wen-Hung Liao, Chu-Song Chen:
Adversarially Robust Deepfake Detection via Adversarial Feature Similarity Learning. 503-516 - Adriano Baratè
, Luca Andrea Ludovico
:
A Multidimensional Taxonomy Model for Music Tangible User Interfaces. 517-531
![](https://dblp.org/img/cog.dark.24x24.png)
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.