Statistical Machine Learning Regression Models for Salary Prediction Featuring Economy Wide Activities and Occupations

A holistic occupational and economy-wide framework for salary prediction is developed and tested using statistical machine learning (ML). Predictive models are developed based on occupational features and organizational characteristics. Five different supervised ML algorithms are trained using surve...

Full description

Bibliographic Details
Main Authors: Yasser T. Matbouli, Suliman M. Alghamdi
Format: Article
Language:English
Published: MDPI AG 2022-10-01
Series:Information
Subjects:
Online Access:https://www.mdpi.com/2078-2489/13/10/495
_version_ 1797431539735199744
author Yasser T. Matbouli
Suliman M. Alghamdi
author_facet Yasser T. Matbouli
Suliman M. Alghamdi
author_sort Yasser T. Matbouli
collection DOAJ
description A holistic occupational and economy-wide framework for salary prediction is developed and tested using statistical machine learning (ML). Predictive models are developed based on occupational features and organizational characteristics. Five different supervised ML algorithms are trained using survey data from the Saudi Arabian labor market to estimate mean annual salary across economic activities and major occupational groups. In predicting the mean salary over economic activities, the Bayesian Gaussian process regression ML showed a marked improvement in <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><msup><mi>R</mi><mn>2</mn></msup></semantics></math></inline-formula> over multiple linear regression (from 0.50 to 0.98). Moreover, lower error levels were obtained: root-mean-square error was reduced by 80% and mean absolute error was reduced by almost 90% compared to multiple linear regression. However, the salary prediction over major occupational groups resulted in artificial neural networks performing the best in terms of both <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><msup><mi>R</mi><mn>2</mn></msup></semantics></math></inline-formula>, with an improvement from 0.62 in multiple linear regression to 0.94 and errors were reduced by approximately 60%. The proposed framework can help estimate annual salary levels across different types of economic activities and organization sizes, as well as different occupations.
first_indexed 2024-03-09T09:45:31Z
format Article
id doaj.art-26ca22e9b8504221bc2a92a8b99cdaf5
institution Directory Open Access Journal
issn 2078-2489
language English
last_indexed 2024-03-09T09:45:31Z
publishDate 2022-10-01
publisher MDPI AG
record_format Article
series Information
spelling doaj.art-26ca22e9b8504221bc2a92a8b99cdaf52023-12-02T00:33:06ZengMDPI AGInformation2078-24892022-10-01131049510.3390/info13100495Statistical Machine Learning Regression Models for Salary Prediction Featuring Economy Wide Activities and OccupationsYasser T. Matbouli0Suliman M. Alghamdi1Department of Industrial Engineering, Faculty of Engineering-Rabigh Branch, King Abdulaziz University, Jeddah 21589, Saudi ArabiaAdvanced Resources for Information Technology Co., Riyadh 13524, Saudi ArabiaA holistic occupational and economy-wide framework for salary prediction is developed and tested using statistical machine learning (ML). Predictive models are developed based on occupational features and organizational characteristics. Five different supervised ML algorithms are trained using survey data from the Saudi Arabian labor market to estimate mean annual salary across economic activities and major occupational groups. In predicting the mean salary over economic activities, the Bayesian Gaussian process regression ML showed a marked improvement in <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><msup><mi>R</mi><mn>2</mn></msup></semantics></math></inline-formula> over multiple linear regression (from 0.50 to 0.98). Moreover, lower error levels were obtained: root-mean-square error was reduced by 80% and mean absolute error was reduced by almost 90% compared to multiple linear regression. However, the salary prediction over major occupational groups resulted in artificial neural networks performing the best in terms of both <inline-formula><math xmlns="http://www.w3.org/1998/Math/MathML" display="inline"><semantics><msup><mi>R</mi><mn>2</mn></msup></semantics></math></inline-formula>, with an improvement from 0.62 in multiple linear regression to 0.94 and errors were reduced by approximately 60%. The proposed framework can help estimate annual salary levels across different types of economic activities and organization sizes, as well as different occupations.https://www.mdpi.com/2078-2489/13/10/495machine learning regressionsalary predictiongaussian process regressionartificial neural networkseconomic activitiesoccupational groups
spellingShingle Yasser T. Matbouli
Suliman M. Alghamdi
Statistical Machine Learning Regression Models for Salary Prediction Featuring Economy Wide Activities and Occupations
Information
machine learning regression
salary prediction
gaussian process regression
artificial neural networks
economic activities
occupational groups
title Statistical Machine Learning Regression Models for Salary Prediction Featuring Economy Wide Activities and Occupations
title_full Statistical Machine Learning Regression Models for Salary Prediction Featuring Economy Wide Activities and Occupations
title_fullStr Statistical Machine Learning Regression Models for Salary Prediction Featuring Economy Wide Activities and Occupations
title_full_unstemmed Statistical Machine Learning Regression Models for Salary Prediction Featuring Economy Wide Activities and Occupations
title_short Statistical Machine Learning Regression Models for Salary Prediction Featuring Economy Wide Activities and Occupations
title_sort statistical machine learning regression models for salary prediction featuring economy wide activities and occupations
topic machine learning regression
salary prediction
gaussian process regression
artificial neural networks
economic activities
occupational groups
url https://www.mdpi.com/2078-2489/13/10/495
work_keys_str_mv AT yassertmatbouli statisticalmachinelearningregressionmodelsforsalarypredictionfeaturingeconomywideactivitiesandoccupations
AT sulimanmalghamdi statisticalmachinelearningregressionmodelsforsalarypredictionfeaturingeconomywideactivitiesandoccupations