A novel Multi-Layer Attention Framework for visual description prediction using bidirectional LSTM

Abstract The massive influx of text, images, and videos to the internet has recently increased the challenge of computer vision-based tasks in big data. Integrating visual data with natural language to generate video explanations has been a challenge for decades. However, recent experiments on image...

Full description

Bibliographic Details
Main Authors:	Dinesh Naik, C. D. Jaidhar
Format:	Article
Language:	English
Published:	SpringerOpen 2022-11-01
Series:	Journal of Big Data
Subjects:	Attention Computer vision Convolutional Neural Network LSTM Video captioning
Online Access:	https://doi.org/10.1186/s40537-022-00664-6

Internet

https://doi.org/10.1186/s40537-022-00664-6

A novel Multi-Layer Attention Framework for visual description prediction using bidirectional LSTM

Internet

Similar Items