default search action
MMM 2024, Amsterdam, The Netherlands - Part I
- Stevan Rudinac, Alan Hanjalic, Cynthia C. S. Liem, Marcel Worring, Björn Þór Jónsson, Bei Liu, Yoko Yamakata:
MultiMedia Modeling - 30th International Conference, MMM 2024, Amsterdam, The Netherlands, January 29 - February 2, 2024, Proceedings, Part I. Lecture Notes in Computer Science 14554, Springer 2024, ISBN 978-3-031-53304-4 - Chi-Yu Chen, Pu Ching, Pei-Hsin Huang, Min-Chun Tien:
Where Are Biases? Adversarial Debiasing with Spurious Feature Visualization. 1-14 - Mengying Xu, Hanjiang Lai, Jian Yin:
Cross-Modal Hash Retrieval with Category Semantics. 15-27 - Min Li, Fengfa Li, Bo Meng, Ruwen Bai, Junxing Ren, Zihao Huang, Chenghua Gao:
Spatiotemporal Representation Enhanced ViT for Video Recognition. 28-40 - Kedi Qiu, Shoudong Shi, Tianxiang Zhao, Yongfang Ye:
SCFormer: A Vision Transformer with Split Channel in Sitting Posture Recognition. 41-52 - Zebin Li, Jianping Luo:
Dive into Coarse-to-Fine Strategy in Single Image Deblurring. 53-65 - Yuhang Yang, Xiao Yan, Sanyuan Zhang:
TICondition: Expanding Control Capabilities for Text-to-Image Generation with Multi-Modal Conditions. 66-79 - Zhe Kong, Neng Gao, Yifei Zhang, Yuhan Liu:
Enhancing Generative Generalized Zero Shot Learning via Multi-Space Constraints and Adaptive Integration. 80-93 - Chen-Hsiu Huang, Ja-Ling Wu:
Joint Image Data Hiding and Rate-Distortion Optimization in Neural Compressed Latent Representations. 94-108 - Jixuan Hong, Jingjing Xie, Xueqin He, Chenhui Yang:
GSUNet: A Brain Tumor Segmentation Method Based on 3D Ghost Shuffle U-Net. 109-120 - Youkai Wang, Yue Hu, Wansen Wu, Ting Liu, Yong Peng:
ACT: Action-assoCiated and Target-Related Representations for Object Navigation. 121-133 - Die Yu, Zhaoyan Fang, Yong Jiang:
Foreground Feature Enhancement and Peak & Background Suppression for Fine-Grained Visual Classification. 134-146 - Jinyu Shi, Wenjie Wu:
YOLOv5-SRR: Enhancing YOLOv5 for Effective Underwater Target Detection. 147-158 - Yongqi Liu, Jiashuang Zhou, Xiaoqin Du:
Image Clustering and Generation with HDGMVAE-I. 159-171 - Anqi Zhang, Guangyu Gao, Zhuocheng Lv, Yukun An:
"Car or Bus?" CLearSeg: CLIP-Enhanced Discrimination Among Resembling Classes for Few-Shot Semantic Segmentation. 172-186 - Ting Liu, Yue Hu, Wansen Wu, Youkai Wang, Kai Xu, Quanjun Yin:
PANDA: Prompt-Based Context- and Indoor-Aware Pretraining for Vision and Language Navigation. 187-200 - Wenjun Gan, Jiawei Liu, Yangchun Zhu, Yong Wu, Guozhi Zhao, Zheng-Jun Zha:
Cross-Modal Semantic Alignment Learning for Text-Based Person Search. 201-215 - Lisa Liu, William Y. Wang, Pingping Cai:
Point Cloud Classification via Learnable Memory Bank. 216-229 - William Y. Wang, Lisa Liu, Pingping Cai:
Adversarially Regularized Low-Light Image Enhancement. 230-243 - Yuan Zhou, Xin Chen, Yanrong Guo, Jun Yu, Richang Hong, Qi Tian:
Advancing Incremental Few-Shot Semantic Segmentation via Semantic-Guided Relation Alignment and Adaptation. 244-257 - Zhengye Shen, Guangtong Lu, Qian Qiao, Fanzhang Li:
PMGCN:Preserving Measuring Mapping Prototype Graph Calibration Network for Few-Shot Learning. 258-272 - Zituo Li, Jianbin Sun, Yuqi Qin, Lunhao Ju, Ke-Wei Yang:
ARE-CAM: An Interpretable Approach to Quantitatively Evaluating the Adversarial Robustness of Deep Models Based on CAM. 273-285 - Bei Liu, Jian Zhang, Tianwen Yuan, Peng Huang, Chengwei Feng, Minghe Li:
SSK-Yolo: Global Feature-Driven Small Object Detection Network for Images. 286-299 - Zixuan Hong, Weipeng Cao, Zhiwu Xu, Zhenru Chen, Xi Tao, Zhong Ming, Chuqing Cao, Liang Zheng:
MetaVSR: A Novel Approach to Video Super-Resolution for Arbitrary Magnification. 300-313 - Yehong Pan, Jian Wang, Guihong Liu, Qiushuo Wu, Yazi Zheng, Xin Lan, Weibo Liang, Jiancheng Lv, Yuan Li:
From Skulls to Faces: A Deep Generative Framework for Realistic 3D Craniofacial Reconstruction. 314-326 - Wei Liu, Jiahuan Wang, Chao Wang, Yan Peng, Shaorong Xie:
Structure-Aware Adaptive Hybrid Interaction Modeling for Image-Text Matching. 327-341 - Vaibhav Mudgal, Qingyang Wang, Lorin Sweeney, Alan F. Smeaton:
Using Saliency and Cropping to Improve Video Memorability. 342-355 - Shuaiwei Wang, Zhao Liu, Jie Lei, Zunlei Feng, Juan Xu, Xuan Li, Ronghua Liang:
Contextual Augmentation with Bias Adaptive for Few-Shot Video Object Segmentation. 356-369 - Feng Chen, Xin Song, Liang Zhu:
A Lightweight Local Attention Network for Image Super-Resolution. 370-384 - Qiulin Li, Junhao Qiang, Qun Yang:
Domain Adaptation for Speaker Verification Based on Self-supervised Learning with Adversarial Training. 385-395 - Qian Cao, Dongdong Zhang, Chengyu Sun:
Quality Scalable Video Coding Based on Neural Representation. 396-409 - Zijian Lin, Jianping Luo:
Hierarchical Bi-directional Temporal Context Mining for Improved Video Compression. 410-421 - Yongyu Liu, Guoliang Lin, Hanjiang Lai, Yan Pan:
MAMixer: Multivariate Time Series Forecasting via Multi-axis Mixing. 422-435 - Kun Zhang, Chunling Gao, Shuangyuan Yang:
A Custom GAN-Based Robust Algorithm for Medical Image Watermarking. 436-447 - Xiaoting Li, Shouhong Wan, Hantao Zhang, Peiquan Jin:
A Detail-Guided Multi-source Fusion Network for Remote Sensing Object Detection. 448-461 - Qiuxian Li, Quanxing Zhou, Hongfa Ding:
A Secure and Fair Federated Learning Protocol Under the Universal Composability Framework. 462-474 - Kang Yi, Haoran Tang, Hongyu Bai, Yinjie Wang, Jing Xu, Ping Li:
Bi-directional Interaction and Dense Aggregation Network for RGB-D Salient Object Detection. 475-489 - Sizheng Guo, Haozhe Yang, Xianming Lin:
Face Forgery Detection via Texture and Saliency Enhancement. 490-502
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.