Annotators’ Selection Impact on the Creation of a Sentiment Corpus for the Cryptocurrency Financial Domain

Well-labeled natural language corpus data is essential for most natural language processing techniques, especially in specialized fields. However, cohort biases remain a significant challenge in machine learning. The narrow origin of data sampling or human annotators in cohorts is a prevalent issue...

Bibliographic Details
Main Authors:	Manoel Fernando Alonso Gadi, Miguel Angel Sicilia
Format:	Article
Language:	English
Published:	IEEE 2023-01-01
Series:	IEEE Access
Subjects:	Annotation; annotator selection criteria; cryptocurrency; news event; labeled data set; NLP
Online Access:	https://ieeexplore.ieee.org/document/10322757/

author	Manoel Fernando Alonso Gadi, Miguel Angel Sicilia
collection	DOAJ
description	Well-labeled natural language corpus data is essential for most natural language processing techniques, especially in specialized fields. However, cohort biases remain a significant challenge in machine learning. The narrow origin of data sampling or human annotators in cohorts is a prevalent issue for machine learning researchers due to its potential to induce bias in the final product. During the development of the CryptoLin corpus for another research project, the authors became concerned about the potential influence of cohort bias on the selection of annotators. Therefore, this paper addresses the question of whether cohort diversity improves the labeling result through a repeated annotation process involving two annotator cohorts and a statistically robust comparison methodology. Statistical tests, such as the Chi-Square test of independence for absolute frequency tables, together with confidence intervals for Kappa point estimates, support a rigorous analysis of the differences between Kappa estimates. In addition, a two-proportion z-test is applied to compare the accuracy scores of UTAD and IE annotators for various pre-trained models, including Vader Sentiment Analysis, TextBlob Sentiment Analysis, the Flair NLP library, and FinBERT Financial Sentiment Analysis with BERT. The paper utilizes Cryptocurrency Linguo (CryptoLin), a corpus containing 2683 cryptocurrency-related news articles spanning more than three years, and compares two different selection criteria for the annotators. CryptoLin was annotated twice with discrete values representing negative, neutral, and positive news, respectively. The first annotation was done by twenty-seven annotators from the same cohort. Each news title was randomly assigned and blindly annotated by three human annotators. The second annotation was carried out by eighty-three annotators from three cohorts. Each news title was randomly assigned and blindly annotated by three human annotators, one from each cohort. In both annotations, a consensus mechanism using simple voting was applied. The first annotation used a single cohort of students of the same nationality and background. The second used three cohorts of students from a very diverse set of nationalities and educational backgrounds. The results demonstrate that manual labeling done by both groups was acceptable according to the inter-rater reliability coefficients Fleiss’s Kappa, Krippendorff’s Alpha, and Gwet’s AC1. Preliminary analysis utilizing Vader, TextBlob, Flair, and FinBERT confirmed the utility of the data set labeling for further refinement of sentiment analysis algorithms. Our results also highlight that the more diverse annotator pool performed better in all measured aspects.
first_indexed	2024-03-09T14:16:39Z
format	Article
id	doaj.art-5864f7b7b78d49968543b0b5365fe7d5
institution	Directory of Open Access Journals
issn	2169-3536
language	English
last_indexed	2024-03-09T14:16:39Z
publishDate	2023-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj.art-5864f7b7b78d49968543b0b5365fe7d5; 2023-11-29T00:01:26Z; eng; IEEE; IEEE Access; 2169-3536; 2023-01-01; volume 11; pages 131081-131088; DOI 10.1109/ACCESS.2023.3334260; document 10322757; Annotators’ Selection Impact on the Creation of a Sentiment Corpus for the Cryptocurrency Financial Domain; Manoel Fernando Alonso Gadi (https://orcid.org/0000-0003-2988-0630), Department of Computer Science, University of Alcalá de Henares, Alcalá de Henares, Spain; Miguel Angel Sicilia (https://orcid.org/0000-0003-3067-4180), Department of Computer Science, University of Alcalá de Henares, Alcalá de Henares, Spain; https://ieeexplore.ieee.org/document/10322757/; Annotation; annotator selection criteria; cryptocurrency; news event; labeled data set; NLP
title	Annotators’ Selection Impact on the Creation of a Sentiment Corpus for the Cryptocurrency Financial Domain
topic	Annotation; annotator selection criteria; cryptocurrency; news event; labeled data set; NLP
url	https://ieeexplore.ieee.org/document/10322757/


Similar Items

Annotators’ Selection Impact on the Creation of a Sentiment Corpus for the Cryptocurrency Financial Domain