Weighted L1-norm logistic regression for gene selection of microarray gene expression classification

The classification of cancer is a significant application of the DNA microarray data. Gene selection methods are ordinarily used handle the issue of high-dimensionality of microarray data to enable experts to diagnose and classify cancer with high accuracy. The penalized logistic regression (PLR) te...

Full description

Bibliographic Details
Main Authors: Alharthi, Aiedh Mrisi, Lee, Muhammad Hisyam, Algamal, Zakariya Yahya
Format: Article
Language:English
Published: Insight Society 2020
Subjects:
Online Access:http://eprints.utm.my/92655/1/AiedhMrisiAlharthi2020_WeightedL1NormLogisticRegression.pdf
_version_ 1796865467095187456
author Alharthi, Aiedh Mrisi
Lee, Muhammad Hisyam
Algamal, Zakariya Yahya
author_facet Alharthi, Aiedh Mrisi
Lee, Muhammad Hisyam
Algamal, Zakariya Yahya
author_sort Alharthi, Aiedh Mrisi
collection ePrints
description The classification of cancer is a significant application of the DNA microarray data. Gene selection methods are ordinarily used handle the issue of high-dimensionality of microarray data to enable experts to diagnose and classify cancer with high accuracy. The penalized logistic regression (PLR) technique is usually used in the dimensionality reduction of the high-dimensional gene expression data sets to remove irrelevant and redundant predictors from the binary logistic regression model. One of the regularization techniques used to achieve this goal is the least absolute shrinkage and selection operator (Lasso). However, this technique has been criticized for being biased in the selection of genes. The adaptive Lasso was usually proposed by assigning an initial weight to each gene to address the selection bias. This paper is concerned with adapting PLR to improve its capability in classification and gene selection, in the sense of accuracy, by introducing the one-dimensional weighted Mahalanobis distance (1-DWM) for each gene as an initial weight inside L1-norm. By experiments, this proposed method, denoted by adaptive penalized logistic regression (APLR), gives more accurate results compared with other famous methods in this regard. The proposed method is applied to some real high-dimensional gene expression data sets in order to demonstrate its efficiency in terms of classification accuracy and selection of gene. Therefore, the proposed method could be utilized in other studies implementing gene selection in the area of classification of high dimensional cancer data sets.
first_indexed 2024-03-05T20:57:27Z
format Article
id utm.eprints-92655
institution Universiti Teknologi Malaysia - ePrints
language English
last_indexed 2024-03-05T20:57:27Z
publishDate 2020
publisher Insight Society
record_format dspace
spelling utm.eprints-926552021-10-28T10:18:33Z http://eprints.utm.my/92655/ Weighted L1-norm logistic regression for gene selection of microarray gene expression classification Alharthi, Aiedh Mrisi Lee, Muhammad Hisyam Algamal, Zakariya Yahya QA Mathematics The classification of cancer is a significant application of the DNA microarray data. Gene selection methods are ordinarily used handle the issue of high-dimensionality of microarray data to enable experts to diagnose and classify cancer with high accuracy. The penalized logistic regression (PLR) technique is usually used in the dimensionality reduction of the high-dimensional gene expression data sets to remove irrelevant and redundant predictors from the binary logistic regression model. One of the regularization techniques used to achieve this goal is the least absolute shrinkage and selection operator (Lasso). However, this technique has been criticized for being biased in the selection of genes. The adaptive Lasso was usually proposed by assigning an initial weight to each gene to address the selection bias. This paper is concerned with adapting PLR to improve its capability in classification and gene selection, in the sense of accuracy, by introducing the one-dimensional weighted Mahalanobis distance (1-DWM) for each gene as an initial weight inside L1-norm. By experiments, this proposed method, denoted by adaptive penalized logistic regression (APLR), gives more accurate results compared with other famous methods in this regard. The proposed method is applied to some real high-dimensional gene expression data sets in order to demonstrate its efficiency in terms of classification accuracy and selection of gene. Therefore, the proposed method could be utilized in other studies implementing gene selection in the area of classification of high dimensional cancer data sets. Insight Society 2020-08 Article PeerReviewed application/pdf en http://eprints.utm.my/92655/1/AiedhMrisiAlharthi2020_WeightedL1NormLogisticRegression.pdf Alharthi, Aiedh Mrisi and Lee, Muhammad Hisyam and Algamal, Zakariya Yahya (2020) Weighted L1-norm logistic regression for gene selection of microarray gene expression classification. International Journal on Advanced Science, Engineering and Information Technology, 10 (4). pp. 1483-1488. ISSN 2088-5334 http://dx.doi.org/10.18517/ijaseit.10.4.10907 DOI:10.18517/ijaseit.10.4.10907
spellingShingle QA Mathematics
Alharthi, Aiedh Mrisi
Lee, Muhammad Hisyam
Algamal, Zakariya Yahya
Weighted L1-norm logistic regression for gene selection of microarray gene expression classification
title Weighted L1-norm logistic regression for gene selection of microarray gene expression classification
title_full Weighted L1-norm logistic regression for gene selection of microarray gene expression classification
title_fullStr Weighted L1-norm logistic regression for gene selection of microarray gene expression classification
title_full_unstemmed Weighted L1-norm logistic regression for gene selection of microarray gene expression classification
title_short Weighted L1-norm logistic regression for gene selection of microarray gene expression classification
title_sort weighted l1 norm logistic regression for gene selection of microarray gene expression classification
topic QA Mathematics
url http://eprints.utm.my/92655/1/AiedhMrisiAlharthi2020_WeightedL1NormLogisticRegression.pdf
work_keys_str_mv AT alharthiaiedhmrisi weightedl1normlogisticregressionforgeneselectionofmicroarraygeneexpressionclassification
AT leemuhammadhisyam weightedl1normlogisticregressionforgeneselectionofmicroarraygeneexpressionclassification
AT algamalzakariyayahya weightedl1normlogisticregressionforgeneselectionofmicroarraygeneexpressionclassification