AMP-GSM: Prediction of Antimicrobial Peptides via a Grouping–Scoring–Modeling Approach
Due to the increasing resistance of bacteria to antibiotics, scientists began seeking new solutions against this problem. One of the most promising solutions in this field are antimicrobial peptides (AMP). To identify antimicrobial peptides, and to aid the design and production of novel antimicrobia...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2023-04-01
|
Series: | Applied Sciences |
Subjects: | |
Online Access: | https://www.mdpi.com/2076-3417/13/8/5106 |
_version_ | 1797606473042231296 |
---|---|
author | Ümmü Gülsüm Söylemez Malik Yousef Burcu Bakir-Gungor |
author_facet | Ümmü Gülsüm Söylemez Malik Yousef Burcu Bakir-Gungor |
author_sort | Ümmü Gülsüm Söylemez |
collection | DOAJ |
description | Due to the increasing resistance of bacteria to antibiotics, scientists began seeking new solutions against this problem. One of the most promising solutions in this field are antimicrobial peptides (AMP). To identify antimicrobial peptides, and to aid the design and production of novel antimicrobial peptides, there is a growing interest in the development of computational prediction approaches, in parallel with the studies performing wet-lab experiments. The computational approaches aim to understand what controls antimicrobial activity from the perspective of machine learning, and to uncover the biological properties that define antimicrobial activity. Throughout this study, we aim to develop a novel prediction approach that can identify peptides with high antimicrobial activity against selected target bacteria. Along this line, we propose a novel method called AMP-GSM (antimicrobial peptide-grouping–scoring–modeling). AMP-GSM includes three main components: grouping, scoring, and modeling. The grouping component creates sub-datasets via placing the physicochemical, linguistic, sequence, and structure-based features into different groups. The scoring component gives a score for each group according to their ability to distinguish whether it is an antimicrobial peptide or not. As the final part of our method, the model built using the top-ranked groups is evaluated (modeling component). The method was tested for three AMP prediction datasets, and the prediction performance of AMP-GSM was comparatively evaluated with several feature selection methods and several classifiers. When we used 10 features (which are members of the physicochemical group), we obtained the highest area under curve (AUC) value for both the Gram-negative (99%) and Gram-positive (98%) datasets. AMP-GSM investigates the most significant feature groups that improve AMP prediction. A number of physico-chemical features from the AMP-GSM’s final selection demonstrate how important these variables are in terms of defining peptide characteristics and how they should be taken into account when creating models to predict peptide activity. |
first_indexed | 2024-03-11T05:15:41Z |
format | Article |
id | doaj.art-1e461bd3d63c47bc96ede5c79051e52b |
institution | Directory Open Access Journal |
issn | 2076-3417 |
language | English |
last_indexed | 2024-03-11T05:15:41Z |
publishDate | 2023-04-01 |
publisher | MDPI AG |
record_format | Article |
series | Applied Sciences |
spelling | doaj.art-1e461bd3d63c47bc96ede5c79051e52b2023-11-17T18:13:44ZengMDPI AGApplied Sciences2076-34172023-04-01138510610.3390/app13085106AMP-GSM: Prediction of Antimicrobial Peptides via a Grouping–Scoring–Modeling ApproachÜmmü Gülsüm Söylemez0Malik Yousef1Burcu Bakir-Gungor2Department of Software Engineering, Faculty of Engineering, Muş Alparslan University, Muş 49100, TurkeyDepartment of Information Systems, Zefat Academic College, Zefat 13206, IsraelDepartment of Computer Engineering, Faculty of Engineering, Abdullah Gul University, Kayseri 38170, TurkeyDue to the increasing resistance of bacteria to antibiotics, scientists began seeking new solutions against this problem. One of the most promising solutions in this field are antimicrobial peptides (AMP). To identify antimicrobial peptides, and to aid the design and production of novel antimicrobial peptides, there is a growing interest in the development of computational prediction approaches, in parallel with the studies performing wet-lab experiments. The computational approaches aim to understand what controls antimicrobial activity from the perspective of machine learning, and to uncover the biological properties that define antimicrobial activity. Throughout this study, we aim to develop a novel prediction approach that can identify peptides with high antimicrobial activity against selected target bacteria. Along this line, we propose a novel method called AMP-GSM (antimicrobial peptide-grouping–scoring–modeling). AMP-GSM includes three main components: grouping, scoring, and modeling. The grouping component creates sub-datasets via placing the physicochemical, linguistic, sequence, and structure-based features into different groups. The scoring component gives a score for each group according to their ability to distinguish whether it is an antimicrobial peptide or not. As the final part of our method, the model built using the top-ranked groups is evaluated (modeling component). The method was tested for three AMP prediction datasets, and the prediction performance of AMP-GSM was comparatively evaluated with several feature selection methods and several classifiers. When we used 10 features (which are members of the physicochemical group), we obtained the highest area under curve (AUC) value for both the Gram-negative (99%) and Gram-positive (98%) datasets. AMP-GSM investigates the most significant feature groups that improve AMP prediction. A number of physico-chemical features from the AMP-GSM’s final selection demonstrate how important these variables are in terms of defining peptide characteristics and how they should be taken into account when creating models to predict peptide activity.https://www.mdpi.com/2076-3417/13/8/5106antimicrobial peptide (AMP) predictionphysico-chemical propertiesgroupingscoringmodeling (GSM)antibiotic resistance |
spellingShingle | Ümmü Gülsüm Söylemez Malik Yousef Burcu Bakir-Gungor AMP-GSM: Prediction of Antimicrobial Peptides via a Grouping–Scoring–Modeling Approach Applied Sciences antimicrobial peptide (AMP) prediction physico-chemical properties grouping scoring modeling (GSM) antibiotic resistance |
title | AMP-GSM: Prediction of Antimicrobial Peptides via a Grouping–Scoring–Modeling Approach |
title_full | AMP-GSM: Prediction of Antimicrobial Peptides via a Grouping–Scoring–Modeling Approach |
title_fullStr | AMP-GSM: Prediction of Antimicrobial Peptides via a Grouping–Scoring–Modeling Approach |
title_full_unstemmed | AMP-GSM: Prediction of Antimicrobial Peptides via a Grouping–Scoring–Modeling Approach |
title_short | AMP-GSM: Prediction of Antimicrobial Peptides via a Grouping–Scoring–Modeling Approach |
title_sort | amp gsm prediction of antimicrobial peptides via a grouping scoring modeling approach |
topic | antimicrobial peptide (AMP) prediction physico-chemical properties grouping scoring modeling (GSM) antibiotic resistance |
url | https://www.mdpi.com/2076-3417/13/8/5106 |
work_keys_str_mv | AT ummugulsumsoylemez ampgsmpredictionofantimicrobialpeptidesviaagroupingscoringmodelingapproach AT malikyousef ampgsmpredictionofantimicrobialpeptidesviaagroupingscoringmodelingapproach AT burcubakirgungor ampgsmpredictionofantimicrobialpeptidesviaagroupingscoringmodelingapproach |