"SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning."

Kevin Lin et al. (2022)

Details and statistics

DOI: 10.1109/CVPR52688.2022.01742

access: closed

type: Conference or Workshop Paper

metadata version: 2022-10-05

a service of  Schloss Dagstuhl - Leibniz Center for Informatics