#PraCegoVer: A Large Dataset for Image Captioning in Portuguese

Automatically describing images using natural sentences is essential to visually impaired people’s inclusion on the Internet. This problem is known as <i>Image Captioning</i>. There are many datasets in the literature, but most contain only English captions, whereas datasets with caption...

Full description

Bibliographic Details
Main Authors: Gabriel Oliveira dos Santos, Esther Luna Colombini, Sandra Avila
Format: Article
Language:English
Published: MDPI AG 2022-01-01
Series:Data
Subjects:
Online Access:https://www.mdpi.com/2306-5729/7/2/13
_version_ 1827655779960750080
author Gabriel Oliveira dos Santos
Esther Luna Colombini
Sandra Avila
author_facet Gabriel Oliveira dos Santos
Esther Luna Colombini
Sandra Avila
author_sort Gabriel Oliveira dos Santos
collection DOAJ
description Automatically describing images using natural sentences is essential to visually impaired people’s inclusion on the Internet. This problem is known as <i>Image Captioning</i>. There are many datasets in the literature, but most contain only English captions, whereas datasets with captions described in other languages are scarce. We introduce the #PraCegoVer, a multi-modal dataset with Portuguese captions based on posts from Instagram. It is the first large dataset for image captioning in Portuguese. In contrast to popular datasets, #PraCegoVer has only one reference per image, and both mean and variance of reference sentence length are significantly high, which makes our dataset challenging due to its linguistic aspect. We carry a detailed analysis to find the main classes and topics in our data. We compare #PraCegoVer to MS COCO dataset in terms of sentence length and word frequency. We hope that #PraCegoVer dataset encourages more works addressing the automatic generation of descriptions in Portuguese.
first_indexed 2024-03-09T22:13:26Z
format Article
id doaj.art-7e8a7f87993a4c638b0f9c7f94cbac5d
institution Directory Open Access Journal
issn 2306-5729
language English
last_indexed 2024-03-09T22:13:26Z
publishDate 2022-01-01
publisher MDPI AG
record_format Article
series Data
spelling doaj.art-7e8a7f87993a4c638b0f9c7f94cbac5d2023-11-23T19:27:41ZengMDPI AGData2306-57292022-01-01721310.3390/data7020013#PraCegoVer: A Large Dataset for Image Captioning in PortugueseGabriel Oliveira dos Santos0Esther Luna Colombini1Sandra Avila2Institute of Computing, University of Campinas (Unicamp), Campinas 13083-852, BrazilInstitute of Computing, University of Campinas (Unicamp), Campinas 13083-852, BrazilInstitute of Computing, University of Campinas (Unicamp), Campinas 13083-852, BrazilAutomatically describing images using natural sentences is essential to visually impaired people’s inclusion on the Internet. This problem is known as <i>Image Captioning</i>. There are many datasets in the literature, but most contain only English captions, whereas datasets with captions described in other languages are scarce. We introduce the #PraCegoVer, a multi-modal dataset with Portuguese captions based on posts from Instagram. It is the first large dataset for image captioning in Portuguese. In contrast to popular datasets, #PraCegoVer has only one reference per image, and both mean and variance of reference sentence length are significantly high, which makes our dataset challenging due to its linguistic aspect. We carry a detailed analysis to find the main classes and topics in our data. We compare #PraCegoVer to MS COCO dataset in terms of sentence length and word frequency. We hope that #PraCegoVer dataset encourages more works addressing the automatic generation of descriptions in Portuguese.https://www.mdpi.com/2306-5729/7/2/13#PraCegoVerimage captioning in Portugueseimage captioningimage-to-text
spellingShingle Gabriel Oliveira dos Santos
Esther Luna Colombini
Sandra Avila
#PraCegoVer: A Large Dataset for Image Captioning in Portuguese
Data
#PraCegoVer
image captioning in Portuguese
image captioning
image-to-text
title #PraCegoVer: A Large Dataset for Image Captioning in Portuguese
title_full #PraCegoVer: A Large Dataset for Image Captioning in Portuguese
title_fullStr #PraCegoVer: A Large Dataset for Image Captioning in Portuguese
title_full_unstemmed #PraCegoVer: A Large Dataset for Image Captioning in Portuguese
title_short #PraCegoVer: A Large Dataset for Image Captioning in Portuguese
title_sort pracegover a large dataset for image captioning in portuguese
topic #PraCegoVer
image captioning in Portuguese
image captioning
image-to-text
url https://www.mdpi.com/2306-5729/7/2/13
work_keys_str_mv AT gabrieloliveiradossantos pracegoveralargedatasetforimagecaptioninginportuguese
AT estherlunacolombini pracegoveralargedatasetforimagecaptioninginportuguese
AT sandraavila pracegoveralargedatasetforimagecaptioninginportuguese