"Textual Tokens Classification for Multi-Modal Alignment in Vision-Language ..."

Zhongjie Mao et al. (2024)

Details and statistics

DOI: 10.1109/ICASSP48485.2024.10446122

access: closed

type: Conference or Workshop Paper

metadata version: 2024-10-18