


default search action
Visual Intelligence, Volume 3
Volume 3, Number 1, 2025
- Zhe Cao, Lixin Xu, Jin Zhang, Biwen Yang, Kaizheng Chen, Ruiheng Zhang:
DBDB: de-bimodal defocus blur in joint infrared-visible imaging. - Haonan Cheng, Hanyue Liu, Juanjuan Cai, Long Ye:
CLFormer: a cross-lingual transformer framework for temporal forgery localization. - Yichen Shi, Yuhao Gao, Yingxin Lai, Hongyang Wang, Jun Feng, Lei He, Jun Wan, Changsheng Chen, Zitong Yu, Xiaochun Cao:
SHIELD: an evaluation benchmark for face spoofing and forgery detection with multimodal large language models. - Yingjia Xu, Mengxia Wu, Zixin Guo, Min Cao, Mang Ye, Jorma Laaksonen:
Efficient text-to-video retrieval via multi-modal multi-tagger derived pre-screening. - Mingjin Zhang, Qian Xu, Yuchun Wang, Xi Li, Haojuan Yuan:
MIRSAM: multimodal vision-language segment anything model for infrared small target detection. - Yuli Zhou, Guolei Sun, Yawei Li, Guo-Sen Xie, Luca Benini, Ender Konukoglu:
When SAM2 meets video camouflaged object segmentation: a comprehensive evaluation and adaptation. - Xiaohan Fang, Peilin Chen, Meng Wang, Shiqi Wang:
Immersive video interaction system: a survey. - Jiaxin Mei, Tao Zhou, Kaiwen Huang, Yizhe Zhang, Yi Zhou, Ye Wu, Huazhu Fu:
A survey on deep learning for polyp segmentation: techniques, challenges and future trends. - Suyan Li, Fuxiang Huang, Lei Zhang:
A survey of multimodal composite editing and retrieval. - Ruikun Zhang, Zhiyuan Yang, Liyuan Pan:
DehazeMamba: large multi-modal model guided single image dehazing via mamba. - Yifei Deng, Zhengyu Chen, Chenglong Li, Jin Tang:
Uncertainty-aware coarse-to-fine alignment for text-image person retrieval. - Yasheng Sun, Bohan Li, Mingchen Zhuge, Deng-Ping Fan, Salman H. Khan, Fahad Shahbaz Khan, Hideki Koike:
Connecting dreams with visual brainstorming instruction. - Hang Zhang, Wenxiao Zhang, Haoxuan Qu, Jun Liu:
Enhancing human-centered dynamic scene understanding via multiple LLMs collaborated reasoning. - Xiao Wang, Yuehang Li, Wentao Wu, Jiandong Jin, Yao Rong, Bo Jiang, Chuanfu Li, Jin Tang:
Pre-training on high-resolution X-ray images: an experimental study. - Qianggang Ding, Zhichao Shen, Weiqiang Zhu, Bang Liu:
DASFormer: self-supervised pretraining for earthquake monitoring.

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.