MART: Masked Affective RepresenTation Learning via Masked Temporal Distribution Distillation
Published in IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024
This paper presents MART, an MAE-style (masked autoencoder) method that learns robust affective representations of videos via masking.