Video description method based on multidimensional and multimodal information

To address the difficulty of representing complex information in automatic video description tasks, a multi-dimensional and multi-modal visual feature extraction and fusion method was proposed. Firstly, multi-dimensional features such as static and dynamic attributes of the video sequence were ex...


Bibliographic Details
Main Authors: Enjie DING, Zhongyu LIU, Yafeng LIU, Wanli YU
Format: Article
Language: zho
Published: Editorial Department of Journal on Communications 2020-02-01
Series: Tongxin xuebao
Subjects:
Online Access: http://www.joconline.com.cn/thesisDetails#10.11959/j.issn.1000-436x.2020037