#PraCegoVer: A Large Dataset for Image Captioning in Portuguese
Automatically describing images using natural sentences is essential to visually impaired people’s inclusion on the Internet. This problem is known as *Image Captioning*. There are many datasets in the literature, but most contain only English captions, whereas datasets with caption...
Main Authors: | Gabriel Oliveira dos Santos, Esther Luna Colombini, Sandra Avila |
---|---|
Format: | Article |
Language: | English |
Published: | MDPI AG, 2022-01-01 |
Series: | Data |
Subjects: | #PraCegoVer, image captioning in Portuguese, image captioning, image-to-text |
Online Access: | https://www.mdpi.com/2306-5729/7/2/13 |
_version_ | 1827655779960750080 |
---|---|
author | Gabriel Oliveira dos Santos, Esther Luna Colombini, Sandra Avila |
author_sort | Gabriel Oliveira dos Santos |
collection | DOAJ |
description | Automatically describing images using natural sentences is essential to visually impaired people’s inclusion on the Internet. This problem is known as *Image Captioning*. There are many datasets in the literature, but most contain only English captions, whereas datasets with captions in other languages are scarce. We introduce #PraCegoVer, a multi-modal dataset with Portuguese captions based on posts from Instagram. It is the first large dataset for image captioning in Portuguese. In contrast to popular datasets, #PraCegoVer has only one reference per image, and both the mean and the variance of reference sentence length are high, which makes our dataset linguistically challenging. We carry out a detailed analysis to find the main classes and topics in our data. We compare #PraCegoVer to the MS COCO dataset in terms of sentence length and word frequency. We hope that the #PraCegoVer dataset encourages more work on the automatic generation of descriptions in Portuguese. |
first_indexed | 2024-03-09T22:13:26Z |
format | Article |
id | doaj.art-7e8a7f87993a4c638b0f9c7f94cbac5d |
institution | Directory of Open Access Journals |
issn | 2306-5729 |
language | English |
last_indexed | 2024-03-09T22:13:26Z |
publishDate | 2022-01-01 |
publisher | MDPI AG |
record_format | Article |
series | Data |
spelling | doaj.art-7e8a7f87993a4c638b0f9c7f94cbac5d; 2023-11-23T19:27:41Z; eng; MDPI AG; Data; ISSN 2306-5729; 2022-01-01; vol. 7, no. 2, art. 13; DOI 10.3390/data7020013; #PraCegoVer: A Large Dataset for Image Captioning in Portuguese; Gabriel Oliveira dos Santos, Esther Luna Colombini, Sandra Avila (all: Institute of Computing, University of Campinas (Unicamp), Campinas 13083-852, Brazil); https://www.mdpi.com/2306-5729/7/2/13; keywords: #PraCegoVer, image captioning in Portuguese, image captioning, image-to-text |
title | #PraCegoVer: A Large Dataset for Image Captioning in Portuguese |
topic | #PraCegoVer, image captioning in Portuguese, image captioning, image-to-text |
url | https://www.mdpi.com/2306-5729/7/2/13 |