DTLM-DBP: deep transfer learning models for DNA binding proteins identification

The identification of DNA binding proteins (DNABPs) is considered a major challenge in genome annotation because they are linked to several important applied and research applications of cellular functions e.g., in the study of the biological, biophysical, and biochemical effects of antibiotics, dru...

Full description

Bibliographic Details
Main Authors: Saber, S., Khairuddin, U., Yusof, R., Madani, A.
Format: Article
Language:English
Published: Tech Science Press 2021
Subjects:
Online Access:http://eprints.utm.my/94887/1/UswahKhairuddin2021_DTLMDBPDeepTransferLearningModels.pdf
_version_ 1796865911333847040
author Saber, S.
Khairuddin, U.
Yusof, R.
Madani, A.
author_facet Saber, S.
Khairuddin, U.
Yusof, R.
Madani, A.
author_sort Saber, S.
collection ePrints
description The identification of DNA binding proteins (DNABPs) is considered a major challenge in genome annotation because they are linked to several important applied and research applications of cellular functions e.g., in the study of the biological, biophysical, and biochemical effects of antibiotics, drugs, and steroids on DNA. This paper presents an efficient approach for DNABPs identification based on deep transfer learning, named "DTLM-DBP." Two transfer learning methods are used in the identification process. The first is based on the pre-trained deep learning model as a feature's extractor and classifier. Two different pre-trained Convolutional Neural Networks (CNN), AlexNet 8 and VGG 16, are tested and compared. The second method uses the deep learning model as a feature's extractor only and two different classifiers for the identification process. Two classifiers, Support Vector Machine (SVM) and Random Forest (RF), are tested and compared. The proposed approach is tested using different DNA proteins datasets. The performance of the identification process is evaluated in terms of identification accuracy, sensitivity, specificity andMCC, with four available DNAproteins datasets:PDB1075,PDB186,PDNA-543, and PDNA-316. The results show that the RF classifier, with VGG-Net pre-trained deep transfer learning features, gives the highest performance. DTLM-DBP was compared with other published methods and it provides a considerable improvement in the performance of DNABPs identification.
first_indexed 2024-03-05T21:04:17Z
format Article
id utm.eprints-94887
institution Universiti Teknologi Malaysia - ePrints
language English
last_indexed 2024-03-05T21:04:17Z
publishDate 2021
publisher Tech Science Press
record_format dspace
spelling utm.eprints-948872022-04-29T21:54:56Z http://eprints.utm.my/94887/ DTLM-DBP: deep transfer learning models for DNA binding proteins identification Saber, S. Khairuddin, U. Yusof, R. Madani, A. T58.5-58.64 Information technology The identification of DNA binding proteins (DNABPs) is considered a major challenge in genome annotation because they are linked to several important applied and research applications of cellular functions e.g., in the study of the biological, biophysical, and biochemical effects of antibiotics, drugs, and steroids on DNA. This paper presents an efficient approach for DNABPs identification based on deep transfer learning, named "DTLM-DBP." Two transfer learning methods are used in the identification process. The first is based on the pre-trained deep learning model as a feature's extractor and classifier. Two different pre-trained Convolutional Neural Networks (CNN), AlexNet 8 and VGG 16, are tested and compared. The second method uses the deep learning model as a feature's extractor only and two different classifiers for the identification process. Two classifiers, Support Vector Machine (SVM) and Random Forest (RF), are tested and compared. The proposed approach is tested using different DNA proteins datasets. The performance of the identification process is evaluated in terms of identification accuracy, sensitivity, specificity andMCC, with four available DNAproteins datasets:PDB1075,PDB186,PDNA-543, and PDNA-316. The results show that the RF classifier, with VGG-Net pre-trained deep transfer learning features, gives the highest performance. DTLM-DBP was compared with other published methods and it provides a considerable improvement in the performance of DNABPs identification. Tech Science Press 2021 Article PeerReviewed application/pdf en http://eprints.utm.my/94887/1/UswahKhairuddin2021_DTLMDBPDeepTransferLearningModels.pdf Saber, S. and Khairuddin, U. and Yusof, R. and Madani, A. (2021) DTLM-DBP: deep transfer learning models for DNA binding proteins identification. Computers, Materials and Continua, 68 (3). ISSN 1546-2218 http://dx.doi.org/10.32604/cmc.2021.017769 DOI: 10.32604/cmc.2021.017769
spellingShingle T58.5-58.64 Information technology
Saber, S.
Khairuddin, U.
Yusof, R.
Madani, A.
DTLM-DBP: deep transfer learning models for DNA binding proteins identification
title DTLM-DBP: deep transfer learning models for DNA binding proteins identification
title_full DTLM-DBP: deep transfer learning models for DNA binding proteins identification
title_fullStr DTLM-DBP: deep transfer learning models for DNA binding proteins identification
title_full_unstemmed DTLM-DBP: deep transfer learning models for DNA binding proteins identification
title_short DTLM-DBP: deep transfer learning models for DNA binding proteins identification
title_sort dtlm dbp deep transfer learning models for dna binding proteins identification
topic T58.5-58.64 Information technology
url http://eprints.utm.my/94887/1/UswahKhairuddin2021_DTLMDBPDeepTransferLearningModels.pdf
work_keys_str_mv AT sabers dtlmdbpdeeptransferlearningmodelsfordnabindingproteinsidentification
AT khairuddinu dtlmdbpdeeptransferlearningmodelsfordnabindingproteinsidentification
AT yusofr dtlmdbpdeeptransferlearningmodelsfordnabindingproteinsidentification
AT madania dtlmdbpdeeptransferlearningmodelsfordnabindingproteinsidentification