A Video Question Answering Model Based on Knowledge Distillation

Video question answering (QA) is a cross-modal task that requires understanding the video content to answer questions. Current techniques address this challenge by employing stacked modules, such as attention mechanisms and graph convolutional networks. These methods reason about the semantics of vi...

Full description

Bibliographic Details
Main Authors:	Zhuang Shao, Jiahui Wan, Linlin Zong
Format:	Article
Language:	English
Published:	MDPI AG 2023-06-01
Series:	Information
Subjects:	video question answering multimodal fusion knowledge distillation
Online Access:	https://www.mdpi.com/2078-2489/14/6/328

Internet

https://www.mdpi.com/2078-2489/14/6/328

A Video Question Answering Model Based on Knowledge Distillation

Internet

Similar Items