SentiLex-PT: Principais características e potencialidades

This paper describes the main characteristics of SentiLex-PT, a sentiment lexicon designed for the extraction of sentiment and opinion about human entities in Portuguese texts. The potential of this resource is illustrated on its application to two types of corpora, the SentiCorpus-PT, a social medi...

Full description

Bibliographic Details
Main Authors: Paula Carvalho, Mário J. Silva
Format: Article
Language:English
Published: University of Oslo 2015-03-01
Series:Oslo Studies in Language
Online Access:https://journals.uio.no/osla/article/view/1444
_version_ 1819265744833085440
author Paula Carvalho
Mário J. Silva
author_facet Paula Carvalho
Mário J. Silva
author_sort Paula Carvalho
collection DOAJ
description This paper describes the main characteristics of SentiLex-PT, a sentiment lexicon designed for the extraction of sentiment and opinion about human entities in Portuguese texts. The potential of this resource is illustrated on its application to two types of corpora, the SentiCorpus-PT, a social media corpus, consisting of user comments to news articles, and a literary piece of the early twentieth century, The Poor (Os Pobres), by Raul Brandão. The data were processed by UNITEX, a natural language processing system based on dictionaries and grammars.
first_indexed 2024-12-23T20:50:15Z
format Article
id doaj.art-da902481fb2c43ecac9bf0b746dbb97a
institution Directory Open Access Journal
issn 1890-9639
language English
last_indexed 2024-12-23T20:50:15Z
publishDate 2015-03-01
publisher University of Oslo
record_format Article
series Oslo Studies in Language
spelling doaj.art-da902481fb2c43ecac9bf0b746dbb97a2022-12-21T17:31:41ZengUniversity of OsloOslo Studies in Language1890-96392015-03-017110.5617/osla.1444SentiLex-PT: Principais características e potencialidadesPaula CarvalhoMário J. SilvaThis paper describes the main characteristics of SentiLex-PT, a sentiment lexicon designed for the extraction of sentiment and opinion about human entities in Portuguese texts. The potential of this resource is illustrated on its application to two types of corpora, the SentiCorpus-PT, a social media corpus, consisting of user comments to news articles, and a literary piece of the early twentieth century, The Poor (Os Pobres), by Raul Brandão. The data were processed by UNITEX, a natural language processing system based on dictionaries and grammars.https://journals.uio.no/osla/article/view/1444
spellingShingle Paula Carvalho
Mário J. Silva
SentiLex-PT: Principais características e potencialidades
Oslo Studies in Language
title SentiLex-PT: Principais características e potencialidades
title_full SentiLex-PT: Principais características e potencialidades
title_fullStr SentiLex-PT: Principais características e potencialidades
title_full_unstemmed SentiLex-PT: Principais características e potencialidades
title_short SentiLex-PT: Principais características e potencialidades
title_sort sentilex pt principais caracteristicas e potencialidades
url https://journals.uio.no/osla/article/view/1444
work_keys_str_mv AT paulacarvalho sentilexptprincipaiscaracteristicasepotencialidades
AT mariojsilva sentilexptprincipaiscaracteristicasepotencialidades