RAM-PGK: Prediction of Lysine Phosphoglycerylation Based on Residue Adjacency Matrix

Background: Post-translational modification (PTM) is a biological process that is associated with the modification of proteome, which results in the alteration of normal cell biology and pathogenesis. There have been numerous PTM reports in recent years, out of which, lysine phosphoglycerylation has...

Full description

Bibliographic Details
Main Authors: Abel Avitesh Chandra, Alok Sharma, Abdollah Dehzangi, Tatushiko Tsunoda
Format: Article
Language:English
Published: MDPI AG 2020-12-01
Series:Genes
Subjects:
Online Access:https://www.mdpi.com/2073-4425/11/12/1524
_version_ 1797544107331026944
author Abel Avitesh Chandra
Alok Sharma
Abdollah Dehzangi
Tatushiko Tsunoda
author_facet Abel Avitesh Chandra
Alok Sharma
Abdollah Dehzangi
Tatushiko Tsunoda
author_sort Abel Avitesh Chandra
collection DOAJ
description Background: Post-translational modification (PTM) is a biological process that is associated with the modification of proteome, which results in the alteration of normal cell biology and pathogenesis. There have been numerous PTM reports in recent years, out of which, lysine phosphoglycerylation has emerged as one of the recent developments. The traditional methods of identifying phosphoglycerylated residues, which are experimental procedures such as mass spectrometry, have shown to be time-consuming and cost-inefficient, despite the abundance of proteins being sequenced in this post-genomic era. Due to these drawbacks, computational techniques are being sought to establish an effective identification system of phosphoglycerylated lysine residues. The development of a predictor for phosphoglycerylation prediction is not a first, but it is necessary as the latest predictor falls short in adequately detecting phosphoglycerylated and non-phosphoglycerylated lysine residues. Results: In this work, we introduce a new predictor named RAM-PGK, which uses sequence-based information relating to amino acid residues to predict phosphoglycerylated and non-phosphoglycerylated sites. A benchmark dataset was employed for this purpose, which contained experimentally identified phosphoglycerylated and non-phosphoglycerylated lysine residues. From the dataset, we extracted the residue adjacency matrix pertaining to each lysine residue in the protein sequences and converted them into feature vectors, which is used to build the phosphoglycerylation predictor. Conclusion: RAM-PGK, which is based on sequential features and support vector machine classifiers, has shown a noteworthy improvement in terms of performance in comparison to some of the recent prediction methods. The performance metrics of the RAM-PGK predictor are: 0.5741 sensitivity, 0.6436 specificity, 0.0531 precision, 0.6414 accuracy, and 0.0824 Mathews correlation coefficient.
first_indexed 2024-03-10T13:54:46Z
format Article
id doaj.art-f682b3195d804dfabe658ed63e137b3a
institution Directory Open Access Journal
issn 2073-4425
language English
last_indexed 2024-03-10T13:54:46Z
publishDate 2020-12-01
publisher MDPI AG
record_format Article
series Genes
spelling doaj.art-f682b3195d804dfabe658ed63e137b3a2023-11-21T01:44:51ZengMDPI AGGenes2073-44252020-12-011112152410.3390/genes11121524RAM-PGK: Prediction of Lysine Phosphoglycerylation Based on Residue Adjacency MatrixAbel Avitesh Chandra0Alok Sharma1Abdollah Dehzangi2Tatushiko Tsunoda3School of Engineering & Physics, University of the South Pacific, Laucala Bay, Suva, FijiSchool of Engineering & Physics, University of the South Pacific, Laucala Bay, Suva, FijiDepartment of Computer Science, Rutgers University, Camden, NJ 08102, USALaboratory for Medical Science Mathematics, RIKEN Center for Integrative Medical Sciences, Yokohama 230-0045, JapanBackground: Post-translational modification (PTM) is a biological process that is associated with the modification of proteome, which results in the alteration of normal cell biology and pathogenesis. There have been numerous PTM reports in recent years, out of which, lysine phosphoglycerylation has emerged as one of the recent developments. The traditional methods of identifying phosphoglycerylated residues, which are experimental procedures such as mass spectrometry, have shown to be time-consuming and cost-inefficient, despite the abundance of proteins being sequenced in this post-genomic era. Due to these drawbacks, computational techniques are being sought to establish an effective identification system of phosphoglycerylated lysine residues. The development of a predictor for phosphoglycerylation prediction is not a first, but it is necessary as the latest predictor falls short in adequately detecting phosphoglycerylated and non-phosphoglycerylated lysine residues. Results: In this work, we introduce a new predictor named RAM-PGK, which uses sequence-based information relating to amino acid residues to predict phosphoglycerylated and non-phosphoglycerylated sites. A benchmark dataset was employed for this purpose, which contained experimentally identified phosphoglycerylated and non-phosphoglycerylated lysine residues. From the dataset, we extracted the residue adjacency matrix pertaining to each lysine residue in the protein sequences and converted them into feature vectors, which is used to build the phosphoglycerylation predictor. Conclusion: RAM-PGK, which is based on sequential features and support vector machine classifiers, has shown a noteworthy improvement in terms of performance in comparison to some of the recent prediction methods. The performance metrics of the RAM-PGK predictor are: 0.5741 sensitivity, 0.6436 specificity, 0.0531 precision, 0.6414 accuracy, and 0.0824 Mathews correlation coefficient.https://www.mdpi.com/2073-4425/11/12/1524post-translational modificationprotein sequenceresidue adjacency matrixprotein lysine modification databaseamino acidslysine
spellingShingle Abel Avitesh Chandra
Alok Sharma
Abdollah Dehzangi
Tatushiko Tsunoda
RAM-PGK: Prediction of Lysine Phosphoglycerylation Based on Residue Adjacency Matrix
Genes
post-translational modification
protein sequence
residue adjacency matrix
protein lysine modification database
amino acids
lysine
title RAM-PGK: Prediction of Lysine Phosphoglycerylation Based on Residue Adjacency Matrix
title_full RAM-PGK: Prediction of Lysine Phosphoglycerylation Based on Residue Adjacency Matrix
title_fullStr RAM-PGK: Prediction of Lysine Phosphoglycerylation Based on Residue Adjacency Matrix
title_full_unstemmed RAM-PGK: Prediction of Lysine Phosphoglycerylation Based on Residue Adjacency Matrix
title_short RAM-PGK: Prediction of Lysine Phosphoglycerylation Based on Residue Adjacency Matrix
title_sort ram pgk prediction of lysine phosphoglycerylation based on residue adjacency matrix
topic post-translational modification
protein sequence
residue adjacency matrix
protein lysine modification database
amino acids
lysine
url https://www.mdpi.com/2073-4425/11/12/1524
work_keys_str_mv AT abelaviteshchandra rampgkpredictionoflysinephosphoglycerylationbasedonresidueadjacencymatrix
AT aloksharma rampgkpredictionoflysinephosphoglycerylationbasedonresidueadjacencymatrix
AT abdollahdehzangi rampgkpredictionoflysinephosphoglycerylationbasedonresidueadjacencymatrix
AT tatushikotsunoda rampgkpredictionoflysinephosphoglycerylationbasedonresidueadjacencymatrix