Optimal Weighted Learning of PCA and PLS for Multicollinearity Discriminators and Imbalanced Groups in Big Data (S/O: 13224)

This study developed an algorithm for statistical classification that enable ones to classify a future data to one of predetermined groups based on the measured data which facing two major threats; (i) multicollinearity among the measured variables and (ii) imbalanced groups. The developed algorithm...

Full description

Bibliographic Details
Main Authors: Mahat, Nor Idayu, Engku Abu Bakar, Engku Muhammad Nazri, Zakaria, Ammar, Mohd Nazir, Mohd Amril Nurman, Misiran, Masnita
Format: Monograph
Language:English
Published: UUM
Subjects:
Online Access:https://repo.uum.edu.my/id/eprint/31770/1/13224.pdf
_version_ 1824451194385334272
author Mahat, Nor Idayu
Engku Abu Bakar, Engku Muhammad Nazri
Zakaria, Ammar
Mohd Nazir, Mohd Amril Nurman
Misiran, Masnita
author_facet Mahat, Nor Idayu
Engku Abu Bakar, Engku Muhammad Nazri
Zakaria, Ammar
Mohd Nazir, Mohd Amril Nurman
Misiran, Masnita
author_sort Mahat, Nor Idayu
collection UUM
description This study developed an algorithm for statistical classification that enable ones to classify a future data to one of predetermined groups based on the measured data which facing two major threats; (i) multicollinearity among the measured variables and (ii) imbalanced groups. The developed algorithm weighted the n objects contribution in explaining the separation between groups. Then, the weights are used together with either Principal Component Analysis (PCA) or Partial Least Square (PLS) to tackle the collinearity among variables. Next, the weighted and transformed features were used to train Linear Discriminant Function (LDA) and to evaluate the constructed rule. The designed algorithm was structured in k-fold cross-validation in attempt to minimise the biasness of the classification performance, measured using error rate. Both simulation on bivariate and multivariate cases show some promising results that the weighted PCA on LDA and the weighted PLS on LDA are better than the traditional LDA, kernel discriminant, and PCA+LDA methods. Whilst, critical investigation on the minority group using sensitivity value has given some evidence how the two proposed methods are competitive, but they are similar if the groups are well separated. Evidence obtained from the real data sets also providing similar results to the simulated ones. Hence, both weighted PCA on LDA and the weighted PLS on LDA can be recommended to discriminate imbalanced groups with correlated variables
first_indexed 2025-02-19T02:30:45Z
format Monograph
id uum-31770
institution Universiti Utara Malaysia
language English
last_indexed 2025-02-19T02:30:45Z
publisher UUM
record_format eprints
spelling uum-317702024-12-12T10:47:04Z https://repo.uum.edu.my/id/eprint/31770/ Optimal Weighted Learning of PCA and PLS for Multicollinearity Discriminators and Imbalanced Groups in Big Data (S/O: 13224) Mahat, Nor Idayu Engku Abu Bakar, Engku Muhammad Nazri Zakaria, Ammar Mohd Nazir, Mohd Amril Nurman Misiran, Masnita QA Mathematics This study developed an algorithm for statistical classification that enable ones to classify a future data to one of predetermined groups based on the measured data which facing two major threats; (i) multicollinearity among the measured variables and (ii) imbalanced groups. The developed algorithm weighted the n objects contribution in explaining the separation between groups. Then, the weights are used together with either Principal Component Analysis (PCA) or Partial Least Square (PLS) to tackle the collinearity among variables. Next, the weighted and transformed features were used to train Linear Discriminant Function (LDA) and to evaluate the constructed rule. The designed algorithm was structured in k-fold cross-validation in attempt to minimise the biasness of the classification performance, measured using error rate. Both simulation on bivariate and multivariate cases show some promising results that the weighted PCA on LDA and the weighted PLS on LDA are better than the traditional LDA, kernel discriminant, and PCA+LDA methods. Whilst, critical investigation on the minority group using sensitivity value has given some evidence how the two proposed methods are competitive, but they are similar if the groups are well separated. Evidence obtained from the real data sets also providing similar results to the simulated ones. Hence, both weighted PCA on LDA and the weighted PLS on LDA can be recommended to discriminate imbalanced groups with correlated variables UUM Monograph NonPeerReviewed application/pdf en https://repo.uum.edu.my/id/eprint/31770/1/13224.pdf Mahat, Nor Idayu and Engku Abu Bakar, Engku Muhammad Nazri and Zakaria, Ammar and Mohd Nazir, Mohd Amril Nurman and Misiran, Masnita Optimal Weighted Learning of PCA and PLS for Multicollinearity Discriminators and Imbalanced Groups in Big Data (S/O: 13224). Project Report. UUM. (Submitted)
spellingShingle QA Mathematics
Mahat, Nor Idayu
Engku Abu Bakar, Engku Muhammad Nazri
Zakaria, Ammar
Mohd Nazir, Mohd Amril Nurman
Misiran, Masnita
Optimal Weighted Learning of PCA and PLS for Multicollinearity Discriminators and Imbalanced Groups in Big Data (S/O: 13224)
title Optimal Weighted Learning of PCA and PLS for Multicollinearity Discriminators and Imbalanced Groups in Big Data (S/O: 13224)
title_full Optimal Weighted Learning of PCA and PLS for Multicollinearity Discriminators and Imbalanced Groups in Big Data (S/O: 13224)
title_fullStr Optimal Weighted Learning of PCA and PLS for Multicollinearity Discriminators and Imbalanced Groups in Big Data (S/O: 13224)
title_full_unstemmed Optimal Weighted Learning of PCA and PLS for Multicollinearity Discriminators and Imbalanced Groups in Big Data (S/O: 13224)
title_short Optimal Weighted Learning of PCA and PLS for Multicollinearity Discriminators and Imbalanced Groups in Big Data (S/O: 13224)
title_sort optimal weighted learning of pca and pls for multicollinearity discriminators and imbalanced groups in big data s o 13224
topic QA Mathematics
url https://repo.uum.edu.my/id/eprint/31770/1/13224.pdf
work_keys_str_mv AT mahatnoridayu optimalweightedlearningofpcaandplsformulticollinearitydiscriminatorsandimbalancedgroupsinbigdataso13224
AT engkuabubakarengkumuhammadnazri optimalweightedlearningofpcaandplsformulticollinearitydiscriminatorsandimbalancedgroupsinbigdataso13224
AT zakariaammar optimalweightedlearningofpcaandplsformulticollinearitydiscriminatorsandimbalancedgroupsinbigdataso13224
AT mohdnazirmohdamrilnurman optimalweightedlearningofpcaandplsformulticollinearitydiscriminatorsandimbalancedgroupsinbigdataso13224
AT misiranmasnita optimalweightedlearningofpcaandplsformulticollinearitydiscriminatorsandimbalancedgroupsinbigdataso13224