Sentiment Analysis in Finance: From Transformers Back to eXplainable Lexicons (XLex)

Lexicon-based sentiment analysis in finance leverages specialized, manually annotated lexicons created by human experts to extract sentiment from financial texts effectively. Although lexicon-based methods are simple to implement and fast to operate on textual data, they require considerable manual...

Full description

Bibliographic Details
Main Authors:	Maryan Rizinski, Hristijan Peshov, Kostadin Mishev, Milos Jovanovik, Dimitar Trajanov
Format:	Article
Language:	English
Published:	IEEE 2024-01-01
Series:	IEEE Access
Subjects:	Machine learning natural language processing text classification sentiment analysis finance lexicons
Online Access:	https://ieeexplore.ieee.org/document/10380556/

_version_	1797296339752583168
author	Maryan Rizinski Hristijan Peshov Kostadin Mishev Milos Jovanovik Dimitar Trajanov
author_facet	Maryan Rizinski Hristijan Peshov Kostadin Mishev Milos Jovanovik Dimitar Trajanov
author_sort	Maryan Rizinski
collection	DOAJ
description	Lexicon-based sentiment analysis in finance leverages specialized, manually annotated lexicons created by human experts to extract sentiment from financial texts effectively. Although lexicon-based methods are simple to implement and fast to operate on textual data, they require considerable manual annotation efforts to create, maintain, and update the lexicons. These methods are also considered inferior to the deep learning-based approaches, such as transformer models, which have become dominant in various natural language processing (NLP) tasks due to their remarkable performance. However, their efficacy comes at a cost: these models require extensive data and computational resources for both training and testing. Additionally, they involve significant prediction times, making them unsuitable for real-time production environments or systems with limited processing capabilities. In this paper, we introduce a novel methodology named eXplainable Lexicons (XLex) that combines the advantages of both lexicon-based methods and transformer models. We propose an approach that utilizes transformers and SHapley Additive exPlanations (SHAP) for explainability to automatically learn financial lexicons. Our study presents four main contributions. Firstly, we demonstrate that transformer-aided explainable lexicons can enhance the vocabulary coverage of the benchmark Loughran-McDonald (LM) lexicon. This enhancement leads to a significant reduction in the need for human involvement in the process of annotating, maintaining, and updating the lexicons. Secondly, we show that the resulting lexicon outperforms the standard LM lexicon in sentiment analysis of financial datasets. Our experiments show that XLex outperforms LM when applied to general financial texts, resulting in enhanced word coverage and an overall increase in classification accuracy by 0.431. Furthermore, by employing XLex to extend LM, we create a combined dictionary, XLex+LM, which achieves an even higher accuracy improvement of 0.450. Thirdly, we illustrate that the lexicon-based approach is significantly more efficient in terms of model speed and size compared to transformers. Lastly, the proposed XLex approach is inherently more interpretable than transformer models. This interpretability is advantageous as lexicon models rely on predefined rules, unlike transformers, which have complex inner workings. The interpretability of the models allows for better understanding and insights into the results of sentiment analysis, making the XLex approach a valuable tool for financial decision-making.
first_indexed	2024-03-07T22:03:13Z
format	Article
id	doaj.art-638e705920e24d07a7db8c87dc4d69c2
institution	Directory Open Access Journal
issn	2169-3536
language	English
last_indexed	2024-03-07T22:03:13Z
publishDate	2024-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj.art-638e705920e24d07a7db8c87dc4d69c22024-02-24T00:00:15ZengIEEEIEEE Access2169-35362024-01-01127170719810.1109/ACCESS.2024.334997010380556Sentiment Analysis in Finance: From Transformers Back to eXplainable Lexicons (XLex)Maryan Rizinski0https://orcid.org/0000-0002-9324-0091Hristijan Peshov1https://orcid.org/0000-0002-1248-1775Kostadin Mishev2https://orcid.org/0000-0003-3982-3330Milos Jovanovik3https://orcid.org/0000-0001-7360-8015Dimitar Trajanov4https://orcid.org/0000-0002-3105-6010Department of Computer Science, Metropolitan College, Boston University, Boston, MA, USAFaculty of Computer Science and Engineering, Ss. Cyril and Methodius University, Skopje, North MacedoniaFaculty of Computer Science and Engineering, Ss. Cyril and Methodius University, Skopje, North MacedoniaFaculty of Computer Science and Engineering, Ss. Cyril and Methodius University, Skopje, North MacedoniaDepartment of Computer Science, Metropolitan College, Boston University, Boston, MA, USALexicon-based sentiment analysis in finance leverages specialized, manually annotated lexicons created by human experts to extract sentiment from financial texts effectively. Although lexicon-based methods are simple to implement and fast to operate on textual data, they require considerable manual annotation efforts to create, maintain, and update the lexicons. These methods are also considered inferior to the deep learning-based approaches, such as transformer models, which have become dominant in various natural language processing (NLP) tasks due to their remarkable performance. However, their efficacy comes at a cost: these models require extensive data and computational resources for both training and testing. Additionally, they involve significant prediction times, making them unsuitable for real-time production environments or systems with limited processing capabilities. In this paper, we introduce a novel methodology named eXplainable Lexicons (XLex) that combines the advantages of both lexicon-based methods and transformer models. We propose an approach that utilizes transformers and SHapley Additive exPlanations (SHAP) for explainability to automatically learn financial lexicons. Our study presents four main contributions. Firstly, we demonstrate that transformer-aided explainable lexicons can enhance the vocabulary coverage of the benchmark Loughran-McDonald (LM) lexicon. This enhancement leads to a significant reduction in the need for human involvement in the process of annotating, maintaining, and updating the lexicons. Secondly, we show that the resulting lexicon outperforms the standard LM lexicon in sentiment analysis of financial datasets. Our experiments show that XLex outperforms LM when applied to general financial texts, resulting in enhanced word coverage and an overall increase in classification accuracy by 0.431. Furthermore, by employing XLex to extend LM, we create a combined dictionary, XLex+LM, which achieves an even higher accuracy improvement of 0.450. Thirdly, we illustrate that the lexicon-based approach is significantly more efficient in terms of model speed and size compared to transformers. Lastly, the proposed XLex approach is inherently more interpretable than transformer models. This interpretability is advantageous as lexicon models rely on predefined rules, unlike transformers, which have complex inner workings. The interpretability of the models allows for better understanding and insights into the results of sentiment analysis, making the XLex approach a valuable tool for financial decision-making.https://ieeexplore.ieee.org/document/10380556/Machine learningnatural language processingtext classificationsentiment analysisfinancelexicons
spellingShingle	Maryan Rizinski Hristijan Peshov Kostadin Mishev Milos Jovanovik Dimitar Trajanov Sentiment Analysis in Finance: From Transformers Back to eXplainable Lexicons (XLex) IEEE Access Machine learning natural language processing text classification sentiment analysis finance lexicons
title	Sentiment Analysis in Finance: From Transformers Back to eXplainable Lexicons (XLex)
title_full	Sentiment Analysis in Finance: From Transformers Back to eXplainable Lexicons (XLex)
title_fullStr	Sentiment Analysis in Finance: From Transformers Back to eXplainable Lexicons (XLex)
title_full_unstemmed	Sentiment Analysis in Finance: From Transformers Back to eXplainable Lexicons (XLex)
title_short	Sentiment Analysis in Finance: From Transformers Back to eXplainable Lexicons (XLex)
title_sort	sentiment analysis in finance from transformers back to explainable lexicons xlex
topic	Machine learning natural language processing text classification sentiment analysis finance lexicons
url	https://ieeexplore.ieee.org/document/10380556/
work_keys_str_mv	AT maryanrizinski sentimentanalysisinfinancefromtransformersbacktoexplainablelexiconsxlex AT hristijanpeshov sentimentanalysisinfinancefromtransformersbacktoexplainablelexiconsxlex AT kostadinmishev sentimentanalysisinfinancefromtransformersbacktoexplainablelexiconsxlex AT milosjovanovik sentimentanalysisinfinancefromtransformersbacktoexplainablelexiconsxlex AT dimitartrajanov sentimentanalysisinfinancefromtransformersbacktoexplainablelexiconsxlex

Sentiment Analysis in Finance: From Transformers Back to eXplainable Lexicons (XLex)

Similar Items