Accurately Predicting Glutarylation Sites Using Sequential Bi-Peptide-Based Evolutionary Features

Post Translational Modification (PTM) is defined as the alteration of protein sequence upon interaction with different macromolecules after the translation process. Glutarylation is considered one of the most important PTMs, which is associated with a wide range of cellular functioning, including me...

Full description

Bibliographic Details
Main Authors: Md. Easin Arafat, Md. Wakil Ahmad, S.M. Shovan, Abdollah Dehzangi, Shubhashis Roy Dipta, Md. Al Mehedi Hasan, Ghazaleh Taherzadeh, Swakkhar Shatabda, Alok Sharma
Format: Article
Language:English
Published: MDPI AG 2020-08-01
Series:Genes
Subjects:
Online Access:https://www.mdpi.com/2073-4425/11/9/1023
_version_ 1797554963027591168
author Md. Easin Arafat
Md. Wakil Ahmad
S.M. Shovan
Abdollah Dehzangi
Shubhashis Roy Dipta
Md. Al Mehedi Hasan
Ghazaleh Taherzadeh
Swakkhar Shatabda
Alok Sharma
author_facet Md. Easin Arafat
Md. Wakil Ahmad
S.M. Shovan
Abdollah Dehzangi
Shubhashis Roy Dipta
Md. Al Mehedi Hasan
Ghazaleh Taherzadeh
Swakkhar Shatabda
Alok Sharma
author_sort Md. Easin Arafat
collection DOAJ
description Post Translational Modification (PTM) is defined as the alteration of protein sequence upon interaction with different macromolecules after the translation process. Glutarylation is considered one of the most important PTMs, which is associated with a wide range of cellular functioning, including metabolism, translation, and specified separate subcellular localizations. During the past few years, a wide range of computational approaches has been proposed to predict Glutarylation sites. However, despite all the efforts that have been made so far, the prediction performance of the Glutarylation sites has remained limited. One of the main challenges to tackle this problem is to extract features with significant discriminatory information. To address this issue, we propose a new machine learning method called BiPepGlut using the concept of a bi-peptide-based evolutionary method for feature extraction. To build this model, we also use the Extra-Trees (ET) classifier for the classification purpose, which, to the best of our knowledge, has never been used for this task. Our results demonstrate BiPepGlut is able to significantly outperform previously proposed models to tackle this problem. BiPepGlut achieves 92.0%, 84.8%, 95.6%, 0.82, and 0.88 in accuracy, sensitivity, specificity, Matthew’s Correlation Coefficient, and F1-score, respectively. BiPepGlut is implemented as a publicly available online predictor.
first_indexed 2024-03-10T16:39:28Z
format Article
id doaj.art-1f7e3302773c47968b9c65b25be025e3
institution Directory Open Access Journal
issn 2073-4425
language English
last_indexed 2024-03-10T16:39:28Z
publishDate 2020-08-01
publisher MDPI AG
record_format Article
series Genes
spelling doaj.art-1f7e3302773c47968b9c65b25be025e32023-11-20T12:04:48ZengMDPI AGGenes2073-44252020-08-01119102310.3390/genes11091023Accurately Predicting Glutarylation Sites Using Sequential Bi-Peptide-Based Evolutionary FeaturesMd. Easin Arafat0Md. Wakil Ahmad1S.M. Shovan2Abdollah Dehzangi3Shubhashis Roy Dipta4Md. Al Mehedi Hasan5Ghazaleh Taherzadeh6Swakkhar Shatabda7Alok Sharma8Department of Computer Science and Engineering, United International University, Dhaka 1212, BangladeshDepartment of Computer Science and Engineering, United International University, Dhaka 1212, BangladeshDepartment of Computer Science and Engineering, Rajshahi University of Engineering and Technology, Rajshahi 6204, BangladeshDepartment of Computer Science, Rutgers University, Camden, NJ 08102, USADepartment of Computer Science and Engineering, United International University, Dhaka 1212, BangladeshDepartment of Computer Science and Engineering, Rajshahi University of Engineering and Technology, Rajshahi 6204, BangladeshInstitute for Bioscience and Biotechnology Research, University of Maryland, College Park, MD 20742, USADepartment of Computer Science and Engineering, United International University, Dhaka 1212, BangladeshInstitute for Integrated and Intelligent Systems, Griffith University, Brisbane, QLD 4111, AustraliaPost Translational Modification (PTM) is defined as the alteration of protein sequence upon interaction with different macromolecules after the translation process. Glutarylation is considered one of the most important PTMs, which is associated with a wide range of cellular functioning, including metabolism, translation, and specified separate subcellular localizations. During the past few years, a wide range of computational approaches has been proposed to predict Glutarylation sites. However, despite all the efforts that have been made so far, the prediction performance of the Glutarylation sites has remained limited. One of the main challenges to tackle this problem is to extract features with significant discriminatory information. To address this issue, we propose a new machine learning method called BiPepGlut using the concept of a bi-peptide-based evolutionary method for feature extraction. To build this model, we also use the Extra-Trees (ET) classifier for the classification purpose, which, to the best of our knowledge, has never been used for this task. Our results demonstrate BiPepGlut is able to significantly outperform previously proposed models to tackle this problem. BiPepGlut achieves 92.0%, 84.8%, 95.6%, 0.82, and 0.88 in accuracy, sensitivity, specificity, Matthew’s Correlation Coefficient, and F1-score, respectively. BiPepGlut is implemented as a publicly available online predictor.https://www.mdpi.com/2073-4425/11/9/1023post-translational modificationlysine Glutarylationmachine learningextra-trees classifierbi-peptide evolutionary features
spellingShingle Md. Easin Arafat
Md. Wakil Ahmad
S.M. Shovan
Abdollah Dehzangi
Shubhashis Roy Dipta
Md. Al Mehedi Hasan
Ghazaleh Taherzadeh
Swakkhar Shatabda
Alok Sharma
Accurately Predicting Glutarylation Sites Using Sequential Bi-Peptide-Based Evolutionary Features
Genes
post-translational modification
lysine Glutarylation
machine learning
extra-trees classifier
bi-peptide evolutionary features
title Accurately Predicting Glutarylation Sites Using Sequential Bi-Peptide-Based Evolutionary Features
title_full Accurately Predicting Glutarylation Sites Using Sequential Bi-Peptide-Based Evolutionary Features
title_fullStr Accurately Predicting Glutarylation Sites Using Sequential Bi-Peptide-Based Evolutionary Features
title_full_unstemmed Accurately Predicting Glutarylation Sites Using Sequential Bi-Peptide-Based Evolutionary Features
title_short Accurately Predicting Glutarylation Sites Using Sequential Bi-Peptide-Based Evolutionary Features
title_sort accurately predicting glutarylation sites using sequential bi peptide based evolutionary features
topic post-translational modification
lysine Glutarylation
machine learning
extra-trees classifier
bi-peptide evolutionary features
url https://www.mdpi.com/2073-4425/11/9/1023
work_keys_str_mv AT mdeasinarafat accuratelypredictingglutarylationsitesusingsequentialbipeptidebasedevolutionaryfeatures
AT mdwakilahmad accuratelypredictingglutarylationsitesusingsequentialbipeptidebasedevolutionaryfeatures
AT smshovan accuratelypredictingglutarylationsitesusingsequentialbipeptidebasedevolutionaryfeatures
AT abdollahdehzangi accuratelypredictingglutarylationsitesusingsequentialbipeptidebasedevolutionaryfeatures
AT shubhashisroydipta accuratelypredictingglutarylationsitesusingsequentialbipeptidebasedevolutionaryfeatures
AT mdalmehedihasan accuratelypredictingglutarylationsitesusingsequentialbipeptidebasedevolutionaryfeatures
AT ghazalehtaherzadeh accuratelypredictingglutarylationsitesusingsequentialbipeptidebasedevolutionaryfeatures
AT swakkharshatabda accuratelypredictingglutarylationsitesusingsequentialbipeptidebasedevolutionaryfeatures
AT aloksharma accuratelypredictingglutarylationsitesusingsequentialbipeptidebasedevolutionaryfeatures