Separate Syntax and Semantics: Part-of-Speech-Guided Transformer for Image Captioning

Transformer-based image captioning models have recently achieved remarkable performance by using new fully attentive paradigms. However, existing models generally follow the conventional language model of predicting the next word conditioned on the visual features and partially generated words. They...

Bibliographic Details
Main Authors:	Dong Wang, Bing Liu, Yong Zhou, Mingming Liu, Peng Liu, Rui Yao
Format:	Article
Language:	English
Published:	MDPI AG 2022-11-01
Series:	Applied Sciences
Subjects:	image captioning; transformer; part of speech; multitask learning
Online Access:	https://www.mdpi.com/2076-3417/12/23/11875

Similar Items

Image Captioning Model Using Part-of-Speech Guidance Module for Description With Diverse Vocabulary
by: Ju-Won Bae, et al.
Published: (2022-01-01)

Learn and Tell: Learning Priors for Image Caption Generation
by: Pei Liu, et al.
Published: (2020-10-01)

Full-Memory Transformer for Image Captioning
by: Tongwei Lu, et al.
Published: (2023-01-01)

An Attentive Fourier-Augmented Image-Captioning Transformer
by: Raymond Ian Osolo, et al.
Published: (2021-09-01)

Insights into Object Semantics: Leveraging Transformer Networks for Advanced Image Captioning
by: Deema Abdal Hafeth, et al.
Published: (2024-03-01)

UAT: Universal Attention Transformer for Video Captioning
by: Heeju Im, et al.
Published: (2022-06-01)

Novel Object Captioning with Semantic Match from External Knowledge
by: Sen Du, et al.
Published: (2023-07-01)

Part-of-Speech Tags Guide Low-Resource Machine Translation
by: Zaokere Kadeer, et al.
Published: (2023-08-01)

Semantic Representations With Attention Networks for Boosting Image Captioning
by: Deema Abdal Hafeth, et al.
Published: (2023-01-01)

Style-Enhanced Transformer for Image Captioning in Construction Scenes
by: Kani Song, et al.
Published: (2024-03-01)

Semantic-Guided Selective Representation for Image Captioning
by: Yinan Li, et al.
Published: (2023-01-01)

Exploring Spatial-Based Position Encoding for Image Captioning
by: Xiaobao Yang, et al.
Published: (2023-11-01)

Cascade Semantic Fusion for Image Captioning
by: Shiwei Wang, et al.
Published: (2019-01-01)

Cross Encoder-Decoder Transformer with Global-Local Visual Extractor for Medical Image Captioning
by: Hojun Lee, et al.
Published: (2022-02-01)

From Plane to Hierarchy: Deformable Transformer for Remote Sensing Image Captioning
by: Runyan Du, et al.
Published: (2023-01-01)

Adaptive Curriculum Learning for Video Captioning
by: Shanhao Li, et al.
Published: (2022-01-01)

Image Captioning with multi-level similarity-guided semantic matching
by: Jiesi Li, et al.
Published: (2021-12-01)

The effects of captioning texts and caption ordering on L2 listening comprehension and vocabulary learning
by: Fatemeh Alikhani, et al.
Published: (2013-07-01)

A performance analysis of transformer-based deep learning models for Arabic image captioning
by: Ashwaq Alsayed, et al.
Published: (2023-10-01)

An Analysis of the Use of Feed-Forward Sub-Modules for Transformer-Based Image Captioning Tasks
by: Raymond Ian Osolo, et al.
Published: (2021-12-01)

Caption for Cover of Volume 2 Issue 1
by: Amy Christian
Published: (2011-12-01)

Multi-Gate Attention Network for Image Captioning
by: Weitao Jiang, et al.
Published: (2021-01-01)

A Context Semantic Auxiliary Network for Image Captioning
by: Jianying Li, et al.
Published: (2023-07-01)

A Systematic Literature Review on Using the Encoder-Decoder Models for Image Captioning in English and Arabic Languages
by: Ashwaq Alsayed, et al.
Published: (2023-09-01)

A Mask-Guided Transformer Network with Topic Token for Remote Sensing Image Captioning
by: Zihao Ren, et al.
Published: (2022-06-01)

Step by Step: A Gradual Approach for Dense Video Captioning
by: Wangyu Choi, et al.
Published: (2023-01-01)

Extracting Structured Supervision From Captions for Weakly Supervised Semantic Segmentation
by: Daniel R. Vilar, et al.
Published: (2021-01-01)

Context-Driven Image Caption With Global Semantic Relations of the Named Entities
by: Yun Jing, et al.
Published: (2020-01-01)

Hybrid Attention Distribution and Factorized Embedding Matrix in Image Captioning
by: Jian Wang, et al.
Published: (2020-01-01)

Interactive Change-Aware Transformer Network for Remote Sensing Image Change Captioning
by: Chen Cai, et al.
Published: (2023-12-01)

Stack-VS: stacked visual-semantic attention for image caption generation
by: Cheng, Ling, et al.
Published: (2021)

ATT-BM-SOM: A Framework of Effectively Choosing Image Information and Optimizing Syntax for Image Captioning
by: Zhenyu Yang, et al.
Published: (2020-01-01)

Fashion-Oriented Image Captioning with External Knowledge Retrieval and Fully Attentive Gates
by: Nicholas Moratelli, et al.
Published: (2023-01-01)

Part-of-Speech Tagging with Rule-Based Data Preprocessing and Transformer
by: Hongwei Li, et al.
Published: (2021-12-01)

Non-speech information w angielskich i rosyjskich napisach Closed Captions zawartych w serialu Эпидемия. Analiza kontrastywna
by: Daniel Piecewicz
Published: (2023-12-01)

CapERA: Captioning Events in Aerial Videos
by: Laila Bashmal, et al.
Published: (2023-04-01)

Text Augmentation Using BERT for Image Captioning
by: Viktar Atliha, et al.
Published: (2020-08-01)

A Sparse Transformer-Based Approach for Image Captioning
by: Zhou Lei, et al.
Published: (2020-01-01)

Semantic transparency and Oneida morphological parts of speech
by: Koenig Jean-Pierre, et al.
Published: (2023-01-01)

Real-time Arabic Video Captioning Using CNN and Transformer Networks Based on Parallel Implementation
by: Adel Jalal Yousif, et al.
Published: (2024-03-01)