Survey of feature selection and extraction techniques for stock market prediction

Abstract In stock market forecasting, the identification of critical features that affect the performance of machine learning (ML) models is crucial to achieve accurate stock price predictions. Several review papers in the literature have focused on various ML, statistical, and deep learning-based m...

Full description

Bibliographic Details
Main Authors: Htet Htet Htun, Michael Biehl, Nicolai Petkov
Format: Article
Language:English
Published: SpringerOpen 2023-01-01
Series:Financial Innovation
Subjects:
Online Access:https://doi.org/10.1186/s40854-022-00441-7
_version_ 1797952414015291392
author Htet Htet Htun
Michael Biehl
Nicolai Petkov
author_facet Htet Htet Htun
Michael Biehl
Nicolai Petkov
author_sort Htet Htet Htun
collection DOAJ
description Abstract In stock market forecasting, the identification of critical features that affect the performance of machine learning (ML) models is crucial to achieve accurate stock price predictions. Several review papers in the literature have focused on various ML, statistical, and deep learning-based methods used in stock market forecasting. However, no survey study has explored feature selection and extraction techniques for stock market forecasting. This survey presents a detailed analysis of 32 research works that use a combination of feature study and ML approaches in various stock market applications. We conduct a systematic search for articles in the Scopus and Web of Science databases for the years 2011–2022. We review a variety of feature selection and feature extraction approaches that have been successfully applied in the stock market analyses presented in the articles. We also describe the combination of feature analysis techniques and ML methods and evaluate their performance. Moreover, we present other survey articles, stock market input and output data, and analyses based on various factors. We find that correlation criteria, random forest, principal component analysis, and autoencoder are the most widely used feature selection and extraction techniques with the best prediction accuracy for various stock market applications.
first_indexed 2024-04-10T22:45:58Z
format Article
id doaj.art-5fec26d2e196475ca5110ca1a65f7c70
institution Directory Open Access Journal
issn 2199-4730
language English
last_indexed 2024-04-10T22:45:58Z
publishDate 2023-01-01
publisher SpringerOpen
record_format Article
series Financial Innovation
spelling doaj.art-5fec26d2e196475ca5110ca1a65f7c702023-01-15T12:19:26ZengSpringerOpenFinancial Innovation2199-47302023-01-019112510.1186/s40854-022-00441-7Survey of feature selection and extraction techniques for stock market predictionHtet Htet Htun0Michael Biehl1Nicolai Petkov2Bernoulli Institute for Mathematics, Computer Science, Artificial Intelligence, University of GroningenBernoulli Institute for Mathematics, Computer Science, Artificial Intelligence, University of GroningenBernoulli Institute for Mathematics, Computer Science, Artificial Intelligence, University of GroningenAbstract In stock market forecasting, the identification of critical features that affect the performance of machine learning (ML) models is crucial to achieve accurate stock price predictions. Several review papers in the literature have focused on various ML, statistical, and deep learning-based methods used in stock market forecasting. However, no survey study has explored feature selection and extraction techniques for stock market forecasting. This survey presents a detailed analysis of 32 research works that use a combination of feature study and ML approaches in various stock market applications. We conduct a systematic search for articles in the Scopus and Web of Science databases for the years 2011–2022. We review a variety of feature selection and feature extraction approaches that have been successfully applied in the stock market analyses presented in the articles. We also describe the combination of feature analysis techniques and ML methods and evaluate their performance. Moreover, we present other survey articles, stock market input and output data, and analyses based on various factors. We find that correlation criteria, random forest, principal component analysis, and autoencoder are the most widely used feature selection and extraction techniques with the best prediction accuracy for various stock market applications.https://doi.org/10.1186/s40854-022-00441-7Feature selectionFeature extractionDimensionality reductionStock market forecastingMachine learning
spellingShingle Htet Htet Htun
Michael Biehl
Nicolai Petkov
Survey of feature selection and extraction techniques for stock market prediction
Financial Innovation
Feature selection
Feature extraction
Dimensionality reduction
Stock market forecasting
Machine learning
title Survey of feature selection and extraction techniques for stock market prediction
title_full Survey of feature selection and extraction techniques for stock market prediction
title_fullStr Survey of feature selection and extraction techniques for stock market prediction
title_full_unstemmed Survey of feature selection and extraction techniques for stock market prediction
title_short Survey of feature selection and extraction techniques for stock market prediction
title_sort survey of feature selection and extraction techniques for stock market prediction
topic Feature selection
Feature extraction
Dimensionality reduction
Stock market forecasting
Machine learning
url https://doi.org/10.1186/s40854-022-00441-7
work_keys_str_mv AT htethtethtun surveyoffeatureselectionandextractiontechniquesforstockmarketprediction
AT michaelbiehl surveyoffeatureselectionandextractiontechniquesforstockmarketprediction
AT nicolaipetkov surveyoffeatureselectionandextractiontechniquesforstockmarketprediction