


default search action
International Journal of Multimedia Information Retrieval, Volume 14
Volume 14, Number 1, March 2025
- Qiuhong Tian, Weilun Miao, Lizao Zhang, Ziyu Yang, Yang Yu, Yanying Zhao, Lan Yao:
STCA: an action recognition network with spatio-temporal convolution and attention. 1 - Fan Yang, Nor Azman Ismail, Pang Yee Yong, Alhuseen Omar Alsayed:
CAMIR: fine-tuning CLIP and multi-head cross-attention mechanism for multimodal image retrieval with sketch and text features. 2 - Hao Wen, Ziqian Lu, Fengli Shen, Zheming Lu, Jia-Lin Cui:
Improving skeleton-based action recognition with interactive object information. 3 - Ziyong Lin, Xiaolong Jiang, Jie Zhang, Mingyong Li:
Dual-matrix guided reconstruction hashing for unsupervised cross-modal retrieval. 4 - Hao Chen, Wu Huang, Tao Zhang
:
Optimized RT-DETR for accurate and efficient video object detection via decoupled feature aggregation. 5 - Zhong Ji, Yuanheng Liu, Xuan Wang, Jingren Liu, Jiale Cao, YunLong Yu:
Multi-task classification network for few-shot learning. 6 - Changqin Huang, Zhenheng Lin, Zhongmei Han, Qionghao Huang, Fan Jiang, Xiaodi Huang
:
PAMoE-MSA: polarity-aware mixture of experts network for multimodal sentiment analysis. 7 - Digambar Pawar, Raghavendra Gowda, Krishna Chandra:
Image forgery classification and localization through vision transformers. 8 - Lixia Xue, Jiang Dong, Ronggui Wang, Juan Yang:
MFAFD: a few-shot learning method for cascading models with parameter free attention and finite discrete space. 9 - Qiang Zhang, Qin Shi, Teng Cheng, Junning Zhang, Jiong Chen:
VPC-VoxelNet: multi-modal fusion 3D object detection networks based on virtual point clouds. 10
Volume 14, Number 2, June 2025
- Weichen Zhao, Yuxing Lu
, Zhiyuan Liu, Yuan Yang, Ge Jiao:
Cross-modal alignment with synthetic caption for text-based person search. 11 - Hemraj Singh, Mridula Verma, Ramalingaswamy Cheruku:
DMFNet: geometric multi-scale pixel-level contrastive learning for video salient object detection. 12 - Manh-Duy Nguyen, Binh T. Nguyen, Cathal Gurrin:
Concept-based and embedding-based models in lifelog retrieval: an empirical comparison of performance. 13 - Pu Yan, Kang Ruan, Lili Wang, Yang Zhao, Xu Wang:
Multi-view learning for camouflaged object detection with PVTv2. 14 - Chao Yang, Yakun Chen, Zihao Li, Xianzhi Wang, Kaize Shi, Lina Yao, Guandong Xu, Zhongwen Guo:
Deep multimodal learning for time series analysis in social computing: a survey. 15 - Xinxin Hao, Haishun Du, Jiangtao Guo
, Jieru Li:
A CNN-transformer hybrid model and a multi-modal multi-stage training strategy for visible-infrared person re-identification. 16 - Minh-Tam Nguyen, Quynh T. Nguyen, Minh-Son Dao, Binh T. Nguyen:
Multimodal scene-graph matching for cheapfakes detection. 17 - Qiaoyun Zhang, Chih-Yung Chang, Shih-Jung Wu, Hsiang-Chuan Chang, Diptendu Sinha Roy:
MMDL: a multi-modal deep learning for video highlight detection in sports. 18 - Lingling Kan, Ruixuan Liu, Hongwei Liang, Fengcai Huo, Wenfeng Wang:
Human behavior recognition based on DualBiNet model. 19 - Mikel Williams-Lekuona, Georgina Cosma:
FiCo-ITR: bridging fine-grained and coarse-grained image-text retrieval for comparative performance analysis. 20 - Muhammad Irzam Liaqat
, Shah Nawaz, Muhammad Zaigham Zaheer, Muhammad Saad Saeed, Hassan Sajjad, Tom De Schepper, Karthik Nandakumar, Muhammad Haris Khan, Ignazio Gallo, Markus Schedl:
Chameleon: A Multimodal Learning Framework Robust to Missing Modalities. 21
Volume 14, Number 3, September 2025
- Jian Wang, Jia Su, Zonghui Wen, Yongqing Sun:
Enhanced YOLOv10 for small object detection with context-aware and adaptive modules. 22 - Guan Yang, Weihao Sun, Xiaoming Liu, Yang Liu, Chen Wang:
Semantic Fusion and Contrastive Generation for Generalized Zero-Shot Learning. 23 - Xiaofei Zhang, Xiaoguang Di, Runwen Zhu:
TPE-YOLO: improved low-light object detection using a two-way pyramid enhancement network. 24 - Yunxue Shao, Zhiyang Wang, Lingfeng Wang:
MCDINO: Self-supervised learning of masks based on combination of multi-path channel attention and local feature weighting. 25

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.