ENet-6mA: Identification of 6mA Modification Sites in Plant Genomes Using ElasticNet and Neural Networks

N6-methyladenine (6mA) has been recognized as a key epigenetic alteration that affects a variety of biological activities. Precise prediction of 6mA modification sites is essential for understanding the logical consistency of biological activity. There are various experimental methods for identifyin...

Full description

Bibliographic Details
Main Authors: Zeeshan Abbas, Hilal Tayara, Kil To Chong
Format: Article
Language:English
Published: MDPI AG 2022-07-01
Series:International Journal of Molecular Sciences
Subjects:
Online Access:https://www.mdpi.com/1422-0067/23/15/8314
_version_ 1797432825542082560
author Zeeshan Abbas
Hilal Tayara
Kil To Chong
author_facet Zeeshan Abbas
Hilal Tayara
Kil To Chong
author_sort Zeeshan Abbas
collection DOAJ
description N6-methyladenine (6mA) has been recognized as a key epigenetic alteration that affects a variety of biological activities. Precise prediction of 6mA modification sites is essential for understanding the logical consistency of biological activity. There are various experimental methods for identifying 6mA modification sites, but in silico prediction has emerged as a potential option due to the very high cost and labor-intensive nature of experimental procedures. Taking this into consideration, developing an efficient and accurate model for identifying N6-methyladenine is one of the top objectives in the field of bioinformatics. Therefore, we have created an in silico model for the classification of 6mA modifications in plant genomes. ENet-6mA uses three encoding methods, including one-hot, nucleotide chemical properties (NCP), and electron–ion interaction potential (EIIP), which are concatenated and fed as input to ElasticNet for feature reduction, and then the optimized features are given directly to the neural network to get classified. We used a benchmark dataset of rice for five-fold cross-validation testing and three other datasets from plant genomes for cross-species testing purposes. The results show that the model can predict the N6-methyladenine sites very well, even cross-species. Additionally, we separated the datasets into different ratios and calculated the performance using the area under the precision–recall curve (AUPRC), achieving 0.81, 0.79, and 0.50 with 1:10 (positive:negative) samples for <i>F. vesca</i>, <i>R. chinensis</i>, and <i>A. thaliana</i>, respectively.
first_indexed 2024-03-09T10:08:25Z
format Article
id doaj.art-3a66199ba82b4bd7990269b55f5c85e4
institution Directory Open Access Journal
issn 1661-6596
1422-0067
language English
last_indexed 2024-03-09T10:08:25Z
publishDate 2022-07-01
publisher MDPI AG
record_format Article
series International Journal of Molecular Sciences
spelling doaj.art-3a66199ba82b4bd7990269b55f5c85e42023-12-01T22:57:28ZengMDPI AGInternational Journal of Molecular Sciences1661-65961422-00672022-07-012315831410.3390/ijms23158314ENet-6mA: Identification of 6mA Modification Sites in Plant Genomes Using ElasticNet and Neural NetworksZeeshan Abbas0Hilal Tayara1Kil To Chong2Department of Electronics and Information Engineering, Jeonbuk National University, Jeonju 54896, KoreaSchool of International Engineering and Science, Jeonbuk National University, Jeonju 54896, KoreaDepartment of Electronics and Information Engineering, Jeonbuk National University, Jeonju 54896, KoreaN6-methyladenine (6mA) has been recognized as a key epigenetic alteration that affects a variety of biological activities. Precise prediction of 6mA modification sites is essential for understanding the logical consistency of biological activity. There are various experimental methods for identifying 6mA modification sites, but in silico prediction has emerged as a potential option due to the very high cost and labor-intensive nature of experimental procedures. Taking this into consideration, developing an efficient and accurate model for identifying N6-methyladenine is one of the top objectives in the field of bioinformatics. Therefore, we have created an in silico model for the classification of 6mA modifications in plant genomes. ENet-6mA uses three encoding methods, including one-hot, nucleotide chemical properties (NCP), and electron–ion interaction potential (EIIP), which are concatenated and fed as input to ElasticNet for feature reduction, and then the optimized features are given directly to the neural network to get classified. We used a benchmark dataset of rice for five-fold cross-validation testing and three other datasets from plant genomes for cross-species testing purposes. The results show that the model can predict the N6-methyladenine sites very well, even cross-species. Additionally, we separated the datasets into different ratios and calculated the performance using the area under the precision–recall curve (AUPRC), achieving 0.81, 0.79, and 0.50 with 1:10 (positive:negative) samples for <i>F. vesca</i>, <i>R. chinensis</i>, and <i>A. thaliana</i>, respectively.https://www.mdpi.com/1422-0067/23/15/8314bioinformaticsDNA methylationElasticNetepigenomicsepigenome engineeringneural networks
spellingShingle Zeeshan Abbas
Hilal Tayara
Kil To Chong
ENet-6mA: Identification of 6mA Modification Sites in Plant Genomes Using ElasticNet and Neural Networks
International Journal of Molecular Sciences
bioinformatics
DNA methylation
ElasticNet
epigenomics
epigenome engineering
neural networks
title ENet-6mA: Identification of 6mA Modification Sites in Plant Genomes Using ElasticNet and Neural Networks
title_full ENet-6mA: Identification of 6mA Modification Sites in Plant Genomes Using ElasticNet and Neural Networks
title_fullStr ENet-6mA: Identification of 6mA Modification Sites in Plant Genomes Using ElasticNet and Neural Networks
title_full_unstemmed ENet-6mA: Identification of 6mA Modification Sites in Plant Genomes Using ElasticNet and Neural Networks
title_short ENet-6mA: Identification of 6mA Modification Sites in Plant Genomes Using ElasticNet and Neural Networks
title_sort enet 6ma identification of 6ma modification sites in plant genomes using elasticnet and neural networks
topic bioinformatics
DNA methylation
ElasticNet
epigenomics
epigenome engineering
neural networks
url https://www.mdpi.com/1422-0067/23/15/8314
work_keys_str_mv AT zeeshanabbas enet6maidentificationof6mamodificationsitesinplantgenomesusingelasticnetandneuralnetworks
AT hilaltayara enet6maidentificationof6mamodificationsitesinplantgenomesusingelasticnetandneuralnetworks
AT kiltochong enet6maidentificationof6mamodificationsitesinplantgenomesusingelasticnetandneuralnetworks