DenSec: Secreted Protein Prediction in Cerebrospinal Fluid Based on DenseNet and Transformer

Cerebrospinal fluid (CSF) exists in the surrounding spaces of mammalian central nervous systems (CNS); therefore, there are numerous potential protein biomarkers associated with CNS disease in CSF. Currently, approximately 4300 proteins have been identified in CSF by protein profiling. However, due...

Full description

Bibliographic Details
Main Authors: Lan Huang, Yanli Qu, Kai He, Yan Wang, Dan Shao
Format: Article
Language:English
Published: MDPI AG 2022-07-01
Series:Mathematics
Subjects:
Online Access:https://www.mdpi.com/2227-7390/10/14/2490
_version_ 1797417157565349888
author Lan Huang
Yanli Qu
Kai He
Yan Wang
Dan Shao
author_facet Lan Huang
Yanli Qu
Kai He
Yan Wang
Dan Shao
author_sort Lan Huang
collection DOAJ
description Cerebrospinal fluid (CSF) exists in the surrounding spaces of mammalian central nervous systems (CNS); therefore, there are numerous potential protein biomarkers associated with CNS disease in CSF. Currently, approximately 4300 proteins have been identified in CSF by protein profiling. However, due to the diverse modifications, as well as the existing technical limits, large-scale protein identification in CSF is still considered a challenge. Inspired by computational methods, this paper proposes a deep learning framework, named DenSec, for secreted protein prediction in CSF. In the first phase of DenSec, all input proteins are encoded as a matrix with a fixed size of 1000 × 20 by calculating a position-specific score matrix (PSSM) of protein sequences. In the second phase, a dense convolutional network (DenseNet) is adopted to extract the feature from these PSSMs automatically. After that, Transformer with a fully connected dense layer acts as classifier to perform a binary classification in terms of secretion into CSF or not. According to the experiment results, DenSec achieves a mean accuracy of 86.00% in the test dataset and outperforms the state-of-the-art methods.
first_indexed 2024-03-09T06:15:43Z
format Article
id doaj.art-7d5f0b39206c4dbf90e5857b0fba1972
institution Directory Open Access Journal
issn 2227-7390
language English
last_indexed 2024-03-09T06:15:43Z
publishDate 2022-07-01
publisher MDPI AG
record_format Article
series Mathematics
spelling doaj.art-7d5f0b39206c4dbf90e5857b0fba19722023-12-03T11:53:57ZengMDPI AGMathematics2227-73902022-07-011014249010.3390/math10142490DenSec: Secreted Protein Prediction in Cerebrospinal Fluid Based on DenseNet and TransformerLan Huang0Yanli Qu1Kai He2Yan Wang3Dan Shao4Key Laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, Changchun 130012, ChinaKey Laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, Changchun 130012, ChinaKey Laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, Changchun 130012, ChinaKey Laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education, College of Computer Science and Technology, Jilin University, Changchun 130012, ChinaCollege of Computer Science and Technology, Changchun University, Changchun 130022, ChinaCerebrospinal fluid (CSF) exists in the surrounding spaces of mammalian central nervous systems (CNS); therefore, there are numerous potential protein biomarkers associated with CNS disease in CSF. Currently, approximately 4300 proteins have been identified in CSF by protein profiling. However, due to the diverse modifications, as well as the existing technical limits, large-scale protein identification in CSF is still considered a challenge. Inspired by computational methods, this paper proposes a deep learning framework, named DenSec, for secreted protein prediction in CSF. In the first phase of DenSec, all input proteins are encoded as a matrix with a fixed size of 1000 × 20 by calculating a position-specific score matrix (PSSM) of protein sequences. In the second phase, a dense convolutional network (DenseNet) is adopted to extract the feature from these PSSMs automatically. After that, Transformer with a fully connected dense layer acts as classifier to perform a binary classification in terms of secretion into CSF or not. According to the experiment results, DenSec achieves a mean accuracy of 86.00% in the test dataset and outperforms the state-of-the-art methods.https://www.mdpi.com/2227-7390/10/14/2490cerebrospinal fluidsecreted protein predictionDenseNettransformer
spellingShingle Lan Huang
Yanli Qu
Kai He
Yan Wang
Dan Shao
DenSec: Secreted Protein Prediction in Cerebrospinal Fluid Based on DenseNet and Transformer
Mathematics
cerebrospinal fluid
secreted protein prediction
DenseNet
transformer
title DenSec: Secreted Protein Prediction in Cerebrospinal Fluid Based on DenseNet and Transformer
title_full DenSec: Secreted Protein Prediction in Cerebrospinal Fluid Based on DenseNet and Transformer
title_fullStr DenSec: Secreted Protein Prediction in Cerebrospinal Fluid Based on DenseNet and Transformer
title_full_unstemmed DenSec: Secreted Protein Prediction in Cerebrospinal Fluid Based on DenseNet and Transformer
title_short DenSec: Secreted Protein Prediction in Cerebrospinal Fluid Based on DenseNet and Transformer
title_sort densec secreted protein prediction in cerebrospinal fluid based on densenet and transformer
topic cerebrospinal fluid
secreted protein prediction
DenseNet
transformer
url https://www.mdpi.com/2227-7390/10/14/2490
work_keys_str_mv AT lanhuang densecsecretedproteinpredictionincerebrospinalfluidbasedondensenetandtransformer
AT yanliqu densecsecretedproteinpredictionincerebrospinalfluidbasedondensenetandtransformer
AT kaihe densecsecretedproteinpredictionincerebrospinalfluidbasedondensenetandtransformer
AT yanwang densecsecretedproteinpredictionincerebrospinalfluidbasedondensenetandtransformer
AT danshao densecsecretedproteinpredictionincerebrospinalfluidbasedondensenetandtransformer