AMP-GSM: Prediction of Antimicrobial Peptides via a Grouping–Scoring–Modeling Approach

Due to the increasing resistance of bacteria to antibiotics, scientists began seeking new solutions against this problem. One of the most promising solutions in this field are antimicrobial peptides (AMP). To identify antimicrobial peptides, and to aid the design and production of novel antimicrobia...

Full description

Bibliographic Details
Main Authors: Ümmü Gülsüm Söylemez, Malik Yousef, Burcu Bakir-Gungor
Format: Article
Language:English
Published: MDPI AG 2023-04-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/13/8/5106
_version_ 1797606473042231296
author Ümmü Gülsüm Söylemez
Malik Yousef
Burcu Bakir-Gungor
author_facet Ümmü Gülsüm Söylemez
Malik Yousef
Burcu Bakir-Gungor
author_sort Ümmü Gülsüm Söylemez
collection DOAJ
description Due to the increasing resistance of bacteria to antibiotics, scientists began seeking new solutions against this problem. One of the most promising solutions in this field are antimicrobial peptides (AMP). To identify antimicrobial peptides, and to aid the design and production of novel antimicrobial peptides, there is a growing interest in the development of computational prediction approaches, in parallel with the studies performing wet-lab experiments. The computational approaches aim to understand what controls antimicrobial activity from the perspective of machine learning, and to uncover the biological properties that define antimicrobial activity. Throughout this study, we aim to develop a novel prediction approach that can identify peptides with high antimicrobial activity against selected target bacteria. Along this line, we propose a novel method called AMP-GSM (antimicrobial peptide-grouping–scoring–modeling). AMP-GSM includes three main components: grouping, scoring, and modeling. The grouping component creates sub-datasets via placing the physicochemical, linguistic, sequence, and structure-based features into different groups. The scoring component gives a score for each group according to their ability to distinguish whether it is an antimicrobial peptide or not. As the final part of our method, the model built using the top-ranked groups is evaluated (modeling component). The method was tested for three AMP prediction datasets, and the prediction performance of AMP-GSM was comparatively evaluated with several feature selection methods and several classifiers. When we used 10 features (which are members of the physicochemical group), we obtained the highest area under curve (AUC) value for both the Gram-negative (99%) and Gram-positive (98%) datasets. AMP-GSM investigates the most significant feature groups that improve AMP prediction. A number of physico-chemical features from the AMP-GSM’s final selection demonstrate how important these variables are in terms of defining peptide characteristics and how they should be taken into account when creating models to predict peptide activity.
first_indexed 2024-03-11T05:15:41Z
format Article
id doaj.art-1e461bd3d63c47bc96ede5c79051e52b
institution Directory Open Access Journal
issn 2076-3417
language English
last_indexed 2024-03-11T05:15:41Z
publishDate 2023-04-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj.art-1e461bd3d63c47bc96ede5c79051e52b2023-11-17T18:13:44ZengMDPI AGApplied Sciences2076-34172023-04-01138510610.3390/app13085106AMP-GSM: Prediction of Antimicrobial Peptides via a Grouping–Scoring–Modeling ApproachÜmmü Gülsüm Söylemez0Malik Yousef1Burcu Bakir-Gungor2Department of Software Engineering, Faculty of Engineering, Muş Alparslan University, Muş 49100, TurkeyDepartment of Information Systems, Zefat Academic College, Zefat 13206, IsraelDepartment of Computer Engineering, Faculty of Engineering, Abdullah Gul University, Kayseri 38170, TurkeyDue to the increasing resistance of bacteria to antibiotics, scientists began seeking new solutions against this problem. One of the most promising solutions in this field are antimicrobial peptides (AMP). To identify antimicrobial peptides, and to aid the design and production of novel antimicrobial peptides, there is a growing interest in the development of computational prediction approaches, in parallel with the studies performing wet-lab experiments. The computational approaches aim to understand what controls antimicrobial activity from the perspective of machine learning, and to uncover the biological properties that define antimicrobial activity. Throughout this study, we aim to develop a novel prediction approach that can identify peptides with high antimicrobial activity against selected target bacteria. Along this line, we propose a novel method called AMP-GSM (antimicrobial peptide-grouping–scoring–modeling). AMP-GSM includes three main components: grouping, scoring, and modeling. The grouping component creates sub-datasets via placing the physicochemical, linguistic, sequence, and structure-based features into different groups. The scoring component gives a score for each group according to their ability to distinguish whether it is an antimicrobial peptide or not. As the final part of our method, the model built using the top-ranked groups is evaluated (modeling component). The method was tested for three AMP prediction datasets, and the prediction performance of AMP-GSM was comparatively evaluated with several feature selection methods and several classifiers. When we used 10 features (which are members of the physicochemical group), we obtained the highest area under curve (AUC) value for both the Gram-negative (99%) and Gram-positive (98%) datasets. AMP-GSM investigates the most significant feature groups that improve AMP prediction. A number of physico-chemical features from the AMP-GSM’s final selection demonstrate how important these variables are in terms of defining peptide characteristics and how they should be taken into account when creating models to predict peptide activity.https://www.mdpi.com/2076-3417/13/8/5106antimicrobial peptide (AMP) predictionphysico-chemical propertiesgroupingscoringmodeling (GSM)antibiotic resistance
spellingShingle Ümmü Gülsüm Söylemez
Malik Yousef
Burcu Bakir-Gungor
AMP-GSM: Prediction of Antimicrobial Peptides via a Grouping–Scoring–Modeling Approach
Applied Sciences
antimicrobial peptide (AMP) prediction
physico-chemical properties
grouping
scoring
modeling (GSM)
antibiotic resistance
title AMP-GSM: Prediction of Antimicrobial Peptides via a Grouping–Scoring–Modeling Approach
title_full AMP-GSM: Prediction of Antimicrobial Peptides via a Grouping–Scoring–Modeling Approach
title_fullStr AMP-GSM: Prediction of Antimicrobial Peptides via a Grouping–Scoring–Modeling Approach
title_full_unstemmed AMP-GSM: Prediction of Antimicrobial Peptides via a Grouping–Scoring–Modeling Approach
title_short AMP-GSM: Prediction of Antimicrobial Peptides via a Grouping–Scoring–Modeling Approach
title_sort amp gsm prediction of antimicrobial peptides via a grouping scoring modeling approach
topic antimicrobial peptide (AMP) prediction
physico-chemical properties
grouping
scoring
modeling (GSM)
antibiotic resistance
url https://www.mdpi.com/2076-3417/13/8/5106
work_keys_str_mv AT ummugulsumsoylemez ampgsmpredictionofantimicrobialpeptidesviaagroupingscoringmodelingapproach
AT malikyousef ampgsmpredictionofantimicrobialpeptidesviaagroupingscoringmodelingapproach
AT burcubakirgungor ampgsmpredictionofantimicrobialpeptidesviaagroupingscoringmodelingapproach