Feasibility study of fuzzy clustering techniques in chemical database for compound classification

Compound selection methods are important in drug discovery especially in lead identification process. Finding the best method in compound selection has become a need to the pharmaceutical industry because of the increasing number of chemical compound to be screened. One of the best and widely used m...

Full description

Bibliographic Details
Main Authors: Dollah @ Md. Zain, Rozilawati, Bakri, Aryati, Bahari, Mahadi, Salim, Naomie
Format: Monograph
Language:English
Published: Faculty of Computer Science and Information System 2006
Subjects:
Online Access:http://eprints.utm.my/4402/1/75107.pdf
_version_ 1796853726488559616
author Dollah @ Md. Zain, Rozilawati
Bakri, Aryati
Bahari, Mahadi
Salim, Naomie
author_facet Dollah @ Md. Zain, Rozilawati
Bakri, Aryati
Bahari, Mahadi
Salim, Naomie
author_sort Dollah @ Md. Zain, Rozilawati
collection ePrints
description Compound selection methods are important in drug discovery especially in lead identification process. Finding the best method in compound selection has become a need to the pharmaceutical industry because of the increasing number of chemical compound to be screened. One of the best and widely used methods in compound selection is cluster-based selection where the compound datasets are grouped into clusters and representative compounds are selected from each cluster. Non-overlapping methods, such as Ward’s clustering method, have been widely used and it was agreed as the most efficient clustering method in compound selection. However, little focus has been given to overlapping method in compound selection or even in lead identification process. The research focused on the fuzzy c-means clustering where the effectiveness of the clusters produced with regard to compound selection is analyzed and compared with other conventional cluster-based compound selection method. Fuzzy c-means have been chosen because it produces clusters by identifying the cluster centroid and their corresponding degree of membership, therefore the compounds may belong to more than one cluster. The results from fuzzy c-means method are compared to Ward’s clustering method and also to the results from the fuzzification of Ward’s cluster. The analysis shows that fuzzy c-means clustering gives the best result in intermolecular dissimilarity; however it shows poor results of separation of active/inactive structure.
first_indexed 2024-03-05T18:03:50Z
format Monograph
id utm.eprints-4402
institution Universiti Teknologi Malaysia - ePrints
language English
last_indexed 2024-03-05T18:03:50Z
publishDate 2006
publisher Faculty of Computer Science and Information System
record_format dspace
spelling utm.eprints-44022010-06-01T03:17:26Z http://eprints.utm.my/4402/ Feasibility study of fuzzy clustering techniques in chemical database for compound classification Dollah @ Md. Zain, Rozilawati Bakri, Aryati Bahari, Mahadi Salim, Naomie ZA4050 Electronic information resources Compound selection methods are important in drug discovery especially in lead identification process. Finding the best method in compound selection has become a need to the pharmaceutical industry because of the increasing number of chemical compound to be screened. One of the best and widely used methods in compound selection is cluster-based selection where the compound datasets are grouped into clusters and representative compounds are selected from each cluster. Non-overlapping methods, such as Ward’s clustering method, have been widely used and it was agreed as the most efficient clustering method in compound selection. However, little focus has been given to overlapping method in compound selection or even in lead identification process. The research focused on the fuzzy c-means clustering where the effectiveness of the clusters produced with regard to compound selection is analyzed and compared with other conventional cluster-based compound selection method. Fuzzy c-means have been chosen because it produces clusters by identifying the cluster centroid and their corresponding degree of membership, therefore the compounds may belong to more than one cluster. The results from fuzzy c-means method are compared to Ward’s clustering method and also to the results from the fuzzification of Ward’s cluster. The analysis shows that fuzzy c-means clustering gives the best result in intermolecular dissimilarity; however it shows poor results of separation of active/inactive structure. Faculty of Computer Science and Information System 2006 Monograph NonPeerReviewed application/pdf en http://eprints.utm.my/4402/1/75107.pdf Dollah @ Md. Zain, Rozilawati and Bakri, Aryati and Bahari, Mahadi and Salim, Naomie (2006) Feasibility study of fuzzy clustering techniques in chemical database for compound classification. Project Report. Faculty of Computer Science and Information System, Skudai Johor. (Unpublished)
spellingShingle ZA4050 Electronic information resources
Dollah @ Md. Zain, Rozilawati
Bakri, Aryati
Bahari, Mahadi
Salim, Naomie
Feasibility study of fuzzy clustering techniques in chemical database for compound classification
title Feasibility study of fuzzy clustering techniques in chemical database for compound classification
title_full Feasibility study of fuzzy clustering techniques in chemical database for compound classification
title_fullStr Feasibility study of fuzzy clustering techniques in chemical database for compound classification
title_full_unstemmed Feasibility study of fuzzy clustering techniques in chemical database for compound classification
title_short Feasibility study of fuzzy clustering techniques in chemical database for compound classification
title_sort feasibility study of fuzzy clustering techniques in chemical database for compound classification
topic ZA4050 Electronic information resources
url http://eprints.utm.my/4402/1/75107.pdf
work_keys_str_mv AT dollahmdzainrozilawati feasibilitystudyoffuzzyclusteringtechniquesinchemicaldatabaseforcompoundclassification
AT bakriaryati feasibilitystudyoffuzzyclusteringtechniquesinchemicaldatabaseforcompoundclassification
AT baharimahadi feasibilitystudyoffuzzyclusteringtechniquesinchemicaldatabaseforcompoundclassification
AT salimnaomie feasibilitystudyoffuzzyclusteringtechniquesinchemicaldatabaseforcompoundclassification