Survey of feature selection and extraction techniques for stock market prediction
Abstract In stock market forecasting, the identification of critical features that affect the performance of machine learning (ML) models is crucial to achieve accurate stock price predictions. Several review papers in the literature have focused on various ML, statistical, and deep learning-based m...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
SpringerOpen
2023-01-01
|
Series: | Financial Innovation |
Subjects: | |
Online Access: | https://doi.org/10.1186/s40854-022-00441-7 |
_version_ | 1797952414015291392 |
---|---|
author | Htet Htet Htun Michael Biehl Nicolai Petkov |
author_facet | Htet Htet Htun Michael Biehl Nicolai Petkov |
author_sort | Htet Htet Htun |
collection | DOAJ |
description | Abstract In stock market forecasting, the identification of critical features that affect the performance of machine learning (ML) models is crucial to achieve accurate stock price predictions. Several review papers in the literature have focused on various ML, statistical, and deep learning-based methods used in stock market forecasting. However, no survey study has explored feature selection and extraction techniques for stock market forecasting. This survey presents a detailed analysis of 32 research works that use a combination of feature study and ML approaches in various stock market applications. We conduct a systematic search for articles in the Scopus and Web of Science databases for the years 2011–2022. We review a variety of feature selection and feature extraction approaches that have been successfully applied in the stock market analyses presented in the articles. We also describe the combination of feature analysis techniques and ML methods and evaluate their performance. Moreover, we present other survey articles, stock market input and output data, and analyses based on various factors. We find that correlation criteria, random forest, principal component analysis, and autoencoder are the most widely used feature selection and extraction techniques with the best prediction accuracy for various stock market applications. |
first_indexed | 2024-04-10T22:45:58Z |
format | Article |
id | doaj.art-5fec26d2e196475ca5110ca1a65f7c70 |
institution | Directory Open Access Journal |
issn | 2199-4730 |
language | English |
last_indexed | 2024-04-10T22:45:58Z |
publishDate | 2023-01-01 |
publisher | SpringerOpen |
record_format | Article |
series | Financial Innovation |
spelling | doaj.art-5fec26d2e196475ca5110ca1a65f7c702023-01-15T12:19:26ZengSpringerOpenFinancial Innovation2199-47302023-01-019112510.1186/s40854-022-00441-7Survey of feature selection and extraction techniques for stock market predictionHtet Htet Htun0Michael Biehl1Nicolai Petkov2Bernoulli Institute for Mathematics, Computer Science, Artificial Intelligence, University of GroningenBernoulli Institute for Mathematics, Computer Science, Artificial Intelligence, University of GroningenBernoulli Institute for Mathematics, Computer Science, Artificial Intelligence, University of GroningenAbstract In stock market forecasting, the identification of critical features that affect the performance of machine learning (ML) models is crucial to achieve accurate stock price predictions. Several review papers in the literature have focused on various ML, statistical, and deep learning-based methods used in stock market forecasting. However, no survey study has explored feature selection and extraction techniques for stock market forecasting. This survey presents a detailed analysis of 32 research works that use a combination of feature study and ML approaches in various stock market applications. We conduct a systematic search for articles in the Scopus and Web of Science databases for the years 2011–2022. We review a variety of feature selection and feature extraction approaches that have been successfully applied in the stock market analyses presented in the articles. We also describe the combination of feature analysis techniques and ML methods and evaluate their performance. Moreover, we present other survey articles, stock market input and output data, and analyses based on various factors. We find that correlation criteria, random forest, principal component analysis, and autoencoder are the most widely used feature selection and extraction techniques with the best prediction accuracy for various stock market applications.https://doi.org/10.1186/s40854-022-00441-7Feature selectionFeature extractionDimensionality reductionStock market forecastingMachine learning |
spellingShingle | Htet Htet Htun Michael Biehl Nicolai Petkov Survey of feature selection and extraction techniques for stock market prediction Financial Innovation Feature selection Feature extraction Dimensionality reduction Stock market forecasting Machine learning |
title | Survey of feature selection and extraction techniques for stock market prediction |
title_full | Survey of feature selection and extraction techniques for stock market prediction |
title_fullStr | Survey of feature selection and extraction techniques for stock market prediction |
title_full_unstemmed | Survey of feature selection and extraction techniques for stock market prediction |
title_short | Survey of feature selection and extraction techniques for stock market prediction |
title_sort | survey of feature selection and extraction techniques for stock market prediction |
topic | Feature selection Feature extraction Dimensionality reduction Stock market forecasting Machine learning |
url | https://doi.org/10.1186/s40854-022-00441-7 |
work_keys_str_mv | AT htethtethtun surveyoffeatureselectionandextractiontechniquesforstockmarketprediction AT michaelbiehl surveyoffeatureselectionandextractiontechniquesforstockmarketprediction AT nicolaipetkov surveyoffeatureselectionandextractiontechniquesforstockmarketprediction |