"MM-ViT: Multi-Modal Video Transformer for Compressed Video Action Recognition."

Jiawei Chen, Chiu Man Ho (2022)

Details and statistics

DOI: 10.1109/WACV51458.2022.00086

access: closed

type: Conference or Workshop Paper

metadata version: 2022-02-17

a service of Schloss Dagstuhl - Leibniz Center for Informatics