High-dimensional QSAR modelling using penalized linear regression model with L1/2-norm

In high-dimensional quantitative structure–activity relationship (QSAR) modelling, penalization methods have been a popular choice to simultaneously address molecular descriptor selection and QSAR model estimation. In this study, a penalized linear regression model with L1/2-norm is proposed. Furthe...

Full description

Bibliographic Details
Main Authors: Algamal, Z. Y., Lee, M. H., Al-Fakih, A. M., Aziz, M.
Format: Article
Published: Taylor and Francis Ltd. 2016
Subjects:
_version_ 1796861878632185856
author Algamal, Z. Y.
Lee, M. H.
Al-Fakih, A. M.
Aziz, M.
author_facet Algamal, Z. Y.
Lee, M. H.
Al-Fakih, A. M.
Aziz, M.
author_sort Algamal, Z. Y.
collection ePrints
description In high-dimensional quantitative structure–activity relationship (QSAR) modelling, penalization methods have been a popular choice to simultaneously address molecular descriptor selection and QSAR model estimation. In this study, a penalized linear regression model with L1/2-norm is proposed. Furthermore, the local linear approximation algorithm is utilized to avoid the non-convexity of the proposed method. The potential applicability of the proposed method is tested on several benchmark data sets. Compared with other commonly used penalized methods, the proposed method can not only obtain the best predictive ability, but also provide an easily interpretable QSAR model. In addition, it is noteworthy that the results obtained in terms of applicability domain and Y-randomization test provide an efficient and a robust QSAR model. It is evident from the results that the proposed method may possibly be a promising penalized method in the field of computational chemistry research, especially when the number of molecular descriptors exceeds the number of compounds.
first_indexed 2024-03-05T20:03:05Z
format Article
id utm.eprints-72108
institution Universiti Teknologi Malaysia - ePrints
last_indexed 2024-03-05T20:03:05Z
publishDate 2016
publisher Taylor and Francis Ltd.
record_format dspace
spelling utm.eprints-721082017-11-23T06:19:24Z http://eprints.utm.my/72108/ High-dimensional QSAR modelling using penalized linear regression model with L1/2-norm Algamal, Z. Y. Lee, M. H. Al-Fakih, A. M. Aziz, M. QD Chemistry In high-dimensional quantitative structure–activity relationship (QSAR) modelling, penalization methods have been a popular choice to simultaneously address molecular descriptor selection and QSAR model estimation. In this study, a penalized linear regression model with L1/2-norm is proposed. Furthermore, the local linear approximation algorithm is utilized to avoid the non-convexity of the proposed method. The potential applicability of the proposed method is tested on several benchmark data sets. Compared with other commonly used penalized methods, the proposed method can not only obtain the best predictive ability, but also provide an easily interpretable QSAR model. In addition, it is noteworthy that the results obtained in terms of applicability domain and Y-randomization test provide an efficient and a robust QSAR model. It is evident from the results that the proposed method may possibly be a promising penalized method in the field of computational chemistry research, especially when the number of molecular descriptors exceeds the number of compounds. Taylor and Francis Ltd. 2016 Article PeerReviewed Algamal, Z. Y. and Lee, M. H. and Al-Fakih, A. M. and Aziz, M. (2016) High-dimensional QSAR modelling using penalized linear regression model with L1/2-norm. SAR and QSAR in Environmental Research, 27 (9). pp. 703-719. ISSN 1062-936X https://www.scopus.com/inward/record.uri?eid=2-s2.0-84987643691&doi=10.1080%2f1062936X.2016.1228696&partnerID=40&md5=4d4834740f41f51ed40fd692c7811449
spellingShingle QD Chemistry
Algamal, Z. Y.
Lee, M. H.
Al-Fakih, A. M.
Aziz, M.
High-dimensional QSAR modelling using penalized linear regression model with L1/2-norm
title High-dimensional QSAR modelling using penalized linear regression model with L1/2-norm
title_full High-dimensional QSAR modelling using penalized linear regression model with L1/2-norm
title_fullStr High-dimensional QSAR modelling using penalized linear regression model with L1/2-norm
title_full_unstemmed High-dimensional QSAR modelling using penalized linear regression model with L1/2-norm
title_short High-dimensional QSAR modelling using penalized linear regression model with L1/2-norm
title_sort high dimensional qsar modelling using penalized linear regression model with l1 2 norm
topic QD Chemistry
work_keys_str_mv AT algamalzy highdimensionalqsarmodellingusingpenalizedlinearregressionmodelwithl12norm
AT leemh highdimensionalqsarmodellingusingpenalizedlinearregressionmodelwithl12norm
AT alfakiham highdimensionalqsarmodellingusingpenalizedlinearregressionmodelwithl12norm
AT azizm highdimensionalqsarmodellingusingpenalizedlinearregressionmodelwithl12norm