SumSec: Accurate Prediction of Sumoylation Sites Using Predicted Secondary Structure

Post Translational Modification (PTM) is defined as the modification of amino acids along the protein sequences after the translation process. These modifications significantly impact on the functioning of proteins. Therefore, having a comprehensive understanding of the underlying mechanism of PTMs...

Full description

Bibliographic Details
Main Authors: Abdollah Dehzangi, Yosvany López, Ghazaleh Taherzadeh, Alok Sharma, Tatsuhiko Tsunoda
Format: Article
Language:English
Published: MDPI AG 2018-12-01
Series:Molecules
Subjects:
Online Access:https://www.mdpi.com/1420-3049/23/12/3260
_version_ 1811232290200616960
author Abdollah Dehzangi
Yosvany López
Ghazaleh Taherzadeh
Alok Sharma
Tatsuhiko Tsunoda
author_facet Abdollah Dehzangi
Yosvany López
Ghazaleh Taherzadeh
Alok Sharma
Tatsuhiko Tsunoda
author_sort Abdollah Dehzangi
collection DOAJ
description Post Translational Modification (PTM) is defined as the modification of amino acids along the protein sequences after the translation process. These modifications significantly impact on the functioning of proteins. Therefore, having a comprehensive understanding of the underlying mechanism of PTMs turns out to be critical in studying the biological roles of proteins. Among a wide range of PTMs, sumoylation is one of the most important modifications due to its known cellular functions which include transcriptional regulation, protein stability, and protein subcellular localization. Despite its importance, determining sumoylation sites via experimental methods is time-consuming and costly. This has led to a great demand for the development of fast computational methods able to accurately determine sumoylation sites in proteins. In this study, we present a new machine learning-based method for predicting sumoylation sites called SumSec. To do this, we employed the predicted secondary structure of amino acids to extract two types of structural features from neighboring amino acids along the protein sequence which has never been used for this task. As a result, our proposed method is able to enhance the sumoylation site prediction task, outperforming previously proposed methods in the literature. SumSec demonstrated high sensitivity (0.91), accuracy (0.94) and MCC (0.88). The prediction accuracy achieved in this study is 21% better than those reported in previous studies. The script and extracted features are publicly available at: https://github.com/YosvanyLopez/SumSec.
first_indexed 2024-04-12T10:59:53Z
format Article
id doaj.art-19fa7b58c4524fb1aa45e21c7a2af0fa
institution Directory Open Access Journal
issn 1420-3049
language English
last_indexed 2024-04-12T10:59:53Z
publishDate 2018-12-01
publisher MDPI AG
record_format Article
series Molecules
spelling doaj.art-19fa7b58c4524fb1aa45e21c7a2af0fa2022-12-22T03:35:58ZengMDPI AGMolecules1420-30492018-12-012312326010.3390/molecules23123260molecules23123260SumSec: Accurate Prediction of Sumoylation Sites Using Predicted Secondary StructureAbdollah Dehzangi0Yosvany López1Ghazaleh Taherzadeh2Alok Sharma3Tatsuhiko Tsunoda4Department of Computer Science, Morgan State University, Baltimore, MD 21251, USAGenesis Institute of Genetic Research, Genesis Healthcare Co., Tokyo 150-6015, JapanSchool of Information and Communication Technology, Griffith University, Gold Coast 4222, AustraliaInstitute for Integrated and Intelligent Systems, Griffith University, Brisbane 4111, AustraliaLaboratory for Medical Science Mathematics, RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa 230-0045, JapanPost Translational Modification (PTM) is defined as the modification of amino acids along the protein sequences after the translation process. These modifications significantly impact on the functioning of proteins. Therefore, having a comprehensive understanding of the underlying mechanism of PTMs turns out to be critical in studying the biological roles of proteins. Among a wide range of PTMs, sumoylation is one of the most important modifications due to its known cellular functions which include transcriptional regulation, protein stability, and protein subcellular localization. Despite its importance, determining sumoylation sites via experimental methods is time-consuming and costly. This has led to a great demand for the development of fast computational methods able to accurately determine sumoylation sites in proteins. In this study, we present a new machine learning-based method for predicting sumoylation sites called SumSec. To do this, we employed the predicted secondary structure of amino acids to extract two types of structural features from neighboring amino acids along the protein sequence which has never been used for this task. As a result, our proposed method is able to enhance the sumoylation site prediction task, outperforming previously proposed methods in the literature. SumSec demonstrated high sensitivity (0.91), accuracy (0.94) and MCC (0.88). The prediction accuracy achieved in this study is 21% better than those reported in previous studies. The script and extracted features are publicly available at: https://github.com/YosvanyLopez/SumSec.https://www.mdpi.com/1420-3049/23/12/3260post translational modificationsumoylationensemble classifierbaggingsecondary structureprofile-bigram
spellingShingle Abdollah Dehzangi
Yosvany López
Ghazaleh Taherzadeh
Alok Sharma
Tatsuhiko Tsunoda
SumSec: Accurate Prediction of Sumoylation Sites Using Predicted Secondary Structure
Molecules
post translational modification
sumoylation
ensemble classifier
bagging
secondary structure
profile-bigram
title SumSec: Accurate Prediction of Sumoylation Sites Using Predicted Secondary Structure
title_full SumSec: Accurate Prediction of Sumoylation Sites Using Predicted Secondary Structure
title_fullStr SumSec: Accurate Prediction of Sumoylation Sites Using Predicted Secondary Structure
title_full_unstemmed SumSec: Accurate Prediction of Sumoylation Sites Using Predicted Secondary Structure
title_short SumSec: Accurate Prediction of Sumoylation Sites Using Predicted Secondary Structure
title_sort sumsec accurate prediction of sumoylation sites using predicted secondary structure
topic post translational modification
sumoylation
ensemble classifier
bagging
secondary structure
profile-bigram
url https://www.mdpi.com/1420-3049/23/12/3260
work_keys_str_mv AT abdollahdehzangi sumsecaccuratepredictionofsumoylationsitesusingpredictedsecondarystructure
AT yosvanylopez sumsecaccuratepredictionofsumoylationsitesusingpredictedsecondarystructure
AT ghazalehtaherzadeh sumsecaccuratepredictionofsumoylationsitesusingpredictedsecondarystructure
AT aloksharma sumsecaccuratepredictionofsumoylationsitesusingpredictedsecondarystructure
AT tatsuhikotsunoda sumsecaccuratepredictionofsumoylationsitesusingpredictedsecondarystructure