Reduction in Data Imbalance for Client-Side Training in Federated Learning for the Prediction of Stock Market Prices

The approach of federated learning (FL) addresses significant challenges, including access rights, privacy, security, and the availability of diverse data. However, edge devices produce and collect data in a non-independent and identically distributed (non-IID) manner. Therefore, it is possible that...

Full description

Bibliographic Details
Main Authors:	Momina Shaheen, Muhammad Shoaib Farooq, Tariq Umer
Format:	Article
Language:	English
Published:	MDPI AG 2023-12-01
Series:	Journal of Sensor and Actuator Networks
Subjects:	data imbalance edge networks federated learning stock market prediction
Online Access:	https://www.mdpi.com/2224-2708/13/1/1

_version_	1827343315960332288
author	Momina Shaheen Muhammad Shoaib Farooq Tariq Umer
author_facet	Momina Shaheen Muhammad Shoaib Farooq Tariq Umer
author_sort	Momina Shaheen
collection	DOAJ
description	The approach of federated learning (FL) addresses significant challenges, including access rights, privacy, security, and the availability of diverse data. However, edge devices produce and collect data in a non-independent and identically distributed (non-IID) manner. Therefore, it is possible that the number of data samples may vary among the edge devices. This study elucidates an approach for implementing FL to achieve a balance between training accuracy and imbalanced data. This approach entails the implementation of data augmentation in data distribution by utilizing class estimation and by balancing on the client side during local training. Secondly, simple linear regression is utilized for model training at the client side to manage the optimal computation cost to achieve a reduction in computation cost. To validate the proposed approach, the technique was applied to a stock market dataset comprising stocks (AAL, ADBE, ASDK, and BSX) to predict the day-to-day values of stocks. The proposed approach has demonstrated favorable results, exhibiting a strong fit of 0.95 and above with a low error rate. The R-squared values, predominantly ranging from 0.97 to 0.98, indicate the model’s effectiveness in capturing variations in stock prices. Strong fits are observed within 75 to 80 iterations for stocks displaying consistently high R-squared values, signifying accuracy. On the 100th iteration, the declining MSE, MAE, and RMSE (AAL at 122.03, 4.89, 11.04, respectively; ADBE at 457.35, 17.79, and 21.38, respectively; ASDK at 182.78, 5.81, 13.51, respectively; and BSX at 34.50, 4.87, 5.87, respectively) values corroborated the positive results of the proposed approach with minimal data loss.
first_indexed	2024-03-07T22:24:29Z
format	Article
id	doaj.art-26ebc8a23e4a40f989214cde133b7299
institution	Directory Open Access Journal
issn	2224-2708
language	English
last_indexed	2024-03-07T22:24:29Z
publishDate	2023-12-01
publisher	MDPI AG
record_format	Article
series	Journal of Sensor and Actuator Networks
spelling	doaj.art-26ebc8a23e4a40f989214cde133b72992024-02-23T15:23:54ZengMDPI AGJournal of Sensor and Actuator Networks2224-27082023-12-01131110.3390/jsan13010001Reduction in Data Imbalance for Client-Side Training in Federated Learning for the Prediction of Stock Market PricesMomina Shaheen0Muhammad Shoaib Farooq1Tariq Umer2School of Systems and Technology, University of Management and Technology, Lahore 54000, PakistanSchool of Systems and Technology, University of Management and Technology, Lahore 54000, PakistanDepartment of Computer Science, COMSATS University Islamabad Lahore Campus, Lahore 54000, PakistanThe approach of federated learning (FL) addresses significant challenges, including access rights, privacy, security, and the availability of diverse data. However, edge devices produce and collect data in a non-independent and identically distributed (non-IID) manner. Therefore, it is possible that the number of data samples may vary among the edge devices. This study elucidates an approach for implementing FL to achieve a balance between training accuracy and imbalanced data. This approach entails the implementation of data augmentation in data distribution by utilizing class estimation and by balancing on the client side during local training. Secondly, simple linear regression is utilized for model training at the client side to manage the optimal computation cost to achieve a reduction in computation cost. To validate the proposed approach, the technique was applied to a stock market dataset comprising stocks (AAL, ADBE, ASDK, and BSX) to predict the day-to-day values of stocks. The proposed approach has demonstrated favorable results, exhibiting a strong fit of 0.95 and above with a low error rate. The R-squared values, predominantly ranging from 0.97 to 0.98, indicate the model’s effectiveness in capturing variations in stock prices. Strong fits are observed within 75 to 80 iterations for stocks displaying consistently high R-squared values, signifying accuracy. On the 100th iteration, the declining MSE, MAE, and RMSE (AAL at 122.03, 4.89, 11.04, respectively; ADBE at 457.35, 17.79, and 21.38, respectively; ASDK at 182.78, 5.81, 13.51, respectively; and BSX at 34.50, 4.87, 5.87, respectively) values corroborated the positive results of the proposed approach with minimal data loss.https://www.mdpi.com/2224-2708/13/1/1data imbalanceedge networksfederated learningstock market prediction
spellingShingle	Momina Shaheen Muhammad Shoaib Farooq Tariq Umer Reduction in Data Imbalance for Client-Side Training in Federated Learning for the Prediction of Stock Market Prices Journal of Sensor and Actuator Networks data imbalance edge networks federated learning stock market prediction
title	Reduction in Data Imbalance for Client-Side Training in Federated Learning for the Prediction of Stock Market Prices
title_full	Reduction in Data Imbalance for Client-Side Training in Federated Learning for the Prediction of Stock Market Prices
title_fullStr	Reduction in Data Imbalance for Client-Side Training in Federated Learning for the Prediction of Stock Market Prices
title_full_unstemmed	Reduction in Data Imbalance for Client-Side Training in Federated Learning for the Prediction of Stock Market Prices
title_short	Reduction in Data Imbalance for Client-Side Training in Federated Learning for the Prediction of Stock Market Prices
title_sort	reduction in data imbalance for client side training in federated learning for the prediction of stock market prices
topic	data imbalance edge networks federated learning stock market prediction
url	https://www.mdpi.com/2224-2708/13/1/1
work_keys_str_mv	AT mominashaheen reductionindataimbalanceforclientsidetraininginfederatedlearningforthepredictionofstockmarketprices AT muhammadshoaibfarooq reductionindataimbalanceforclientsidetraininginfederatedlearningforthepredictionofstockmarketprices AT tariqumer reductionindataimbalanceforclientsidetraininginfederatedlearningforthepredictionofstockmarketprices

Reduction in Data Imbalance for Client-Side Training in Federated Learning for the Prediction of Stock Market Prices

Similar Items