OSVidCap: A Framework for the Simultaneous Recognition and Description of Concurrent Actions in Videos in an Open-Set Scenario

Automatically understanding and describing the visual content of videos in natural language is a challenging task in computer vision. Existing approaches are often designed to describe single events in a closed-set setting. However, in real-world scenarios, concurrent activities and previously unsee...

Full description

Bibliographic Details
Main Authors:	Andrei De Souza Inacio, Matheus Gutoski, Andre Eugenio Lazzaretti, Heitor Silverio Lopes
Format:	Article
Language:	English
Published:	IEEE 2021-01-01
Series:	IEEE Access
Subjects:	Video captioning open-set recognition deep learning
Online Access:	https://ieeexplore.ieee.org/document/9552885/

Internet

https://ieeexplore.ieee.org/document/9552885/

OSVidCap: A Framework for the Simultaneous Recognition and Description of Concurrent Actions in Videos in an Open-Set Scenario

Internet

Similar Items