Social Image Captioning: Exploring Visual Attention and User Attention

Image captioning with a natural language has been an emerging trend. However, the social image, associated with a set of user-contributed tags, has been rarely investigated for a similar task. The user-contributed tags, which could reflect the user attention, have been neglected in conventional imag...

Full description

Bibliographic Details
Main Authors: Leiquan Wang, Xiaoliang Chu, Weishan Zhang, Yiwei Wei, Weichen Sun, Chunlei Wu
Format: Article
Language:English
Published: MDPI AG 2018-02-01
Series:Sensors
Subjects:
Online Access:http://www.mdpi.com/1424-8220/18/2/646
_version_ 1798039025694539776
author Leiquan Wang
Xiaoliang Chu
Weishan Zhang
Yiwei Wei
Weichen Sun
Chunlei Wu
author_facet Leiquan Wang
Xiaoliang Chu
Weishan Zhang
Yiwei Wei
Weichen Sun
Chunlei Wu
author_sort Leiquan Wang
collection DOAJ
description Image captioning with a natural language has been an emerging trend. However, the social image, associated with a set of user-contributed tags, has been rarely investigated for a similar task. The user-contributed tags, which could reflect the user attention, have been neglected in conventional image captioning. Most existing image captioning models cannot be applied directly to social image captioning. In this work, a dual attention model is proposed for social image captioning by combining the visual attention and user attention simultaneously.Visual attention is used to compress a large mount of salient visual information, while user attention is applied to adjust the description of the social images with user-contributed tags. Experiments conducted on the Microsoft (MS) COCO dataset demonstrate the superiority of the proposed method of dual attention.
first_indexed 2024-04-11T21:48:15Z
format Article
id doaj.art-06ce2509ae7e48b5b5f40e54686795de
institution Directory Open Access Journal
issn 1424-8220
language English
last_indexed 2024-04-11T21:48:15Z
publishDate 2018-02-01
publisher MDPI AG
record_format Article
series Sensors
spelling doaj.art-06ce2509ae7e48b5b5f40e54686795de2022-12-22T04:01:19ZengMDPI AGSensors1424-82202018-02-0118264610.3390/s18020646s18020646Social Image Captioning: Exploring Visual Attention and User AttentionLeiquan Wang0Xiaoliang Chu1Weishan Zhang2Yiwei Wei3Weichen Sun4Chunlei Wu5College of Computer & Communication Engineering, China University of Petroleum (East China), Qingdao 266555, ChinaCollege of Computer & Communication Engineering, China University of Petroleum (East China), Qingdao 266555, ChinaCollege of Computer & Communication Engineering, China University of Petroleum (East China), Qingdao 266555, ChinaCollege of Computer & Communication Engineering, China University of Petroleum (East China), Qingdao 266555, ChinaFirst Research Institute of the Ministry of Public Security of PRC, Beijing 100048, ChinaCollege of Computer & Communication Engineering, China University of Petroleum (East China), Qingdao 266555, ChinaImage captioning with a natural language has been an emerging trend. However, the social image, associated with a set of user-contributed tags, has been rarely investigated for a similar task. The user-contributed tags, which could reflect the user attention, have been neglected in conventional image captioning. Most existing image captioning models cannot be applied directly to social image captioning. In this work, a dual attention model is proposed for social image captioning by combining the visual attention and user attention simultaneously.Visual attention is used to compress a large mount of salient visual information, while user attention is applied to adjust the description of the social images with user-contributed tags. Experiments conducted on the Microsoft (MS) COCO dataset demonstrate the superiority of the proposed method of dual attention.http://www.mdpi.com/1424-8220/18/2/646social image captioninguser-contributed tagsuser attentionvisual attention
spellingShingle Leiquan Wang
Xiaoliang Chu
Weishan Zhang
Yiwei Wei
Weichen Sun
Chunlei Wu
Social Image Captioning: Exploring Visual Attention and User Attention
Sensors
social image captioning
user-contributed tags
user attention
visual attention
title Social Image Captioning: Exploring Visual Attention and User Attention
title_full Social Image Captioning: Exploring Visual Attention and User Attention
title_fullStr Social Image Captioning: Exploring Visual Attention and User Attention
title_full_unstemmed Social Image Captioning: Exploring Visual Attention and User Attention
title_short Social Image Captioning: Exploring Visual Attention and User Attention
title_sort social image captioning exploring visual attention and user attention
topic social image captioning
user-contributed tags
user attention
visual attention
url http://www.mdpi.com/1424-8220/18/2/646
work_keys_str_mv AT leiquanwang socialimagecaptioningexploringvisualattentionanduserattention
AT xiaoliangchu socialimagecaptioningexploringvisualattentionanduserattention
AT weishanzhang socialimagecaptioningexploringvisualattentionanduserattention
AT yiweiwei socialimagecaptioningexploringvisualattentionanduserattention
AT weichensun socialimagecaptioningexploringvisualattentionanduserattention
AT chunleiwu socialimagecaptioningexploringvisualattentionanduserattention