Maintain a Better Balance between Performance and Cost for Image Captioning by a Size-Adjustable Convolutional Module
Image captioning is a challenging AI problem that connects computer vision and natural language processing. Many deep learning (DL) models have been proposed in the literature for solving this problem. So far, the primary concern of image captioning has been focused on increasing the accuracy of gen...
Main Authors: | Yan Lyu, Yong Liu, Qiangfu Zhao |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2023-07-01
|
Series: | Electronics |
Subjects: | |
Online Access: | https://www.mdpi.com/2079-9292/12/14/3187 |
Similar Items
-
Parallel Dense Video Caption Generation with Multi-Modal Features
by: Xuefei Huang, et al.
Published: (2023-08-01) -
Fusion of Multi-Modal Features to Enhance Dense Video Caption
by: Xuefei Huang, et al.
Published: (2023-06-01) -
A Multimodal Framework for Video Caption Generation
by: Reshmi S. Bhooshan, et al.
Published: (2022-01-01) -
Caption for Cover of Volume 2 Issue 1
by: Amy Christian
Published: (2011-12-01) -
A Systematic Literature Review on Using the Encoder-Decoder Models for Image Captioning in English and Arabic Languages
by: Ashwaq Alsayed, et al.
Published: (2023-09-01)