K-means clustering analysis and multiple linear regression model on household income in Malaysia

Household income plays a significant role in determining a country's socioeconomic standing. This measure is often used by the government to formulate the federal budget and policies that are most appropriate for national development. In spite of this, Malaysia's current economic circumsta...

Full description

Bibliographic Details
Main Authors: Gan Pei Yee, Gan Pei Yee, Rusiman, Mohd Saifullah, Ismail, Shuhaida, Suparman, Suparman, Mohamad Hamzah, Firdaus, Shaf, Muhammad Ammar
Format: Article
Language:English
Published: 2023
Subjects:
Online Access:http://eprints.uthm.edu.my/9234/1/J16020_a18540f8debfd456df2dd54f8eb422f5.pdf
_version_ 1796869927649411072
author Gan Pei Yee, Gan Pei Yee
Rusiman, Mohd Saifullah
Ismail, Shuhaida
Suparman, Suparman
Mohamad Hamzah, Firdaus
Shaf, Muhammad Ammar
author_facet Gan Pei Yee, Gan Pei Yee
Rusiman, Mohd Saifullah
Ismail, Shuhaida
Suparman, Suparman
Mohamad Hamzah, Firdaus
Shaf, Muhammad Ammar
author_sort Gan Pei Yee, Gan Pei Yee
collection UTHM
description Household income plays a significant role in determining a country's socioeconomic standing. This measure is often used by the government to formulate the federal budget and policies that are most appropriate for national development. In spite of this, Malaysia's current economic circumstances continue to be characterized by income disparity. Therefore, this shortcoming can be addressed by analyzing the household income survey (HIS) conducted by Department of Statistics Malaysia (DoSM). In this study, the hybrid model is proposed where K-means and multiple linear regression (MLR) for clustering and predicting household income in Malaysia. Based on the experimental results, the K-means clustering analysis in conjunction with the MLR model outperformed the MLR model without clustering with a smaller mean square error. As a result, clustering analysis results in a more accurate estimate of household income because it reduces the variation between = households. It is important that household = income information reflect the concern of policymakers about the impact of universal and targeted interventions on different socioeconomic groups.
first_indexed 2024-03-05T22:02:00Z
format Article
id uthm.eprints-9234
institution Universiti Tun Hussein Onn Malaysia
language English
last_indexed 2024-03-05T22:02:00Z
publishDate 2023
record_format dspace
spelling uthm.eprints-92342023-07-17T07:41:32Z http://eprints.uthm.edu.my/9234/ K-means clustering analysis and multiple linear regression model on household income in Malaysia Gan Pei Yee, Gan Pei Yee Rusiman, Mohd Saifullah Ismail, Shuhaida Suparman, Suparman Mohamad Hamzah, Firdaus Shaf, Muhammad Ammar T Technology (General) Household income plays a significant role in determining a country's socioeconomic standing. This measure is often used by the government to formulate the federal budget and policies that are most appropriate for national development. In spite of this, Malaysia's current economic circumstances continue to be characterized by income disparity. Therefore, this shortcoming can be addressed by analyzing the household income survey (HIS) conducted by Department of Statistics Malaysia (DoSM). In this study, the hybrid model is proposed where K-means and multiple linear regression (MLR) for clustering and predicting household income in Malaysia. Based on the experimental results, the K-means clustering analysis in conjunction with the MLR model outperformed the MLR model without clustering with a smaller mean square error. As a result, clustering analysis results in a more accurate estimate of household income because it reduces the variation between = households. It is important that household = income information reflect the concern of policymakers about the impact of universal and targeted interventions on different socioeconomic groups. 2023 Article PeerReviewed text en http://eprints.uthm.edu.my/9234/1/J16020_a18540f8debfd456df2dd54f8eb422f5.pdf Gan Pei Yee, Gan Pei Yee and Rusiman, Mohd Saifullah and Ismail, Shuhaida and Suparman, Suparman and Mohamad Hamzah, Firdaus and Shaf, Muhammad Ammar (2023) K-means clustering analysis and multiple linear regression model on household income in Malaysia. IAES International Journal of Artificial Intelligence (IJ-AI), 12 (2). pp. 731-738. ISSN 2252-8938 https://doi.org/ 10.11591/ijai.v12.i2
spellingShingle T Technology (General)
Gan Pei Yee, Gan Pei Yee
Rusiman, Mohd Saifullah
Ismail, Shuhaida
Suparman, Suparman
Mohamad Hamzah, Firdaus
Shaf, Muhammad Ammar
K-means clustering analysis and multiple linear regression model on household income in Malaysia
title K-means clustering analysis and multiple linear regression model on household income in Malaysia
title_full K-means clustering analysis and multiple linear regression model on household income in Malaysia
title_fullStr K-means clustering analysis and multiple linear regression model on household income in Malaysia
title_full_unstemmed K-means clustering analysis and multiple linear regression model on household income in Malaysia
title_short K-means clustering analysis and multiple linear regression model on household income in Malaysia
title_sort k means clustering analysis and multiple linear regression model on household income in malaysia
topic T Technology (General)
url http://eprints.uthm.edu.my/9234/1/J16020_a18540f8debfd456df2dd54f8eb422f5.pdf
work_keys_str_mv AT ganpeiyeeganpeiyee kmeansclusteringanalysisandmultiplelinearregressionmodelonhouseholdincomeinmalaysia
AT rusimanmohdsaifullah kmeansclusteringanalysisandmultiplelinearregressionmodelonhouseholdincomeinmalaysia
AT ismailshuhaida kmeansclusteringanalysisandmultiplelinearregressionmodelonhouseholdincomeinmalaysia
AT suparmansuparman kmeansclusteringanalysisandmultiplelinearregressionmodelonhouseholdincomeinmalaysia
AT mohamadhamzahfirdaus kmeansclusteringanalysisandmultiplelinearregressionmodelonhouseholdincomeinmalaysia
AT shafmuhammadammar kmeansclusteringanalysisandmultiplelinearregressionmodelonhouseholdincomeinmalaysia