Survey: Transformer based video-language pre-training

Inspired by the success of transformer-based pre-training methods on natural language tasks and further computer vision tasks, researchers have started to apply transformer to video processing. This survey aims to provide a comprehensive overview of transformer-based pre-training methods for Video-L...

Full description

Bibliographic Details
Main Authors: Ludan Ruan, Qin Jin
Format: Article
Language:English
Published: KeAi Communications Co. Ltd. 2022-01-01
Series:AI Open
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2666651022000018