Evolutionary binary feature selection using adaptive ebola optimization search algorithm for high-dimensional datasets.

Feature selection problem represents the field of study that requires approximate algorithms to identify discriminative and optimally combined features. The evaluation and suitability of these selected features are often analyzed using classifiers. These features are locked with data increasingly be...

Full description

Bibliographic Details
Main Authors: Olaide N Oyelade, Jeffrey O Agushaka, Absalom E Ezugwu
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2023-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0282812
_version_ 1797843183163408384
author Olaide N Oyelade
Jeffrey O Agushaka
Absalom E Ezugwu
author_facet Olaide N Oyelade
Jeffrey O Agushaka
Absalom E Ezugwu
author_sort Olaide N Oyelade
collection DOAJ
description Feature selection problem represents the field of study that requires approximate algorithms to identify discriminative and optimally combined features. The evaluation and suitability of these selected features are often analyzed using classifiers. These features are locked with data increasingly being generated from different sources such as social media, surveillance systems, network applications, and medical records. The high dimensionality of these datasets often impairs the quality of the optimal combination of these features selected. The use of the binary optimization method has been proposed in the literature to address this challenge. However, the underlying deficiency of the single binary optimizer is transferred to the quality of the features selected. Though hybrid methods have been proposed, most still suffer from the inherited design limitation of the single combined methods. To address this, we proposed a novel hybrid binary optimization capable of effectively selecting features from increasingly high-dimensional datasets. The approach used in this study designed a sub-population selective mechanism that dynamically assigns individuals to a 2-level optimization process. The level-1 method first mutates items in the population and then reassigns them to a level-2 optimizer. The selective mechanism determines what sub-population is assigned for the level-2 optimizer based on the exploration and exploitation phase of the level-1 optimizer. In addition, we designed nested transfer (NT) functions and investigated the influence of the function on the level-1 optimizer. The binary Ebola optimization search algorithm (BEOSA) is applied for the level-1 mutation, while the simulated annealing (SA) and firefly (FFA) algorithms are investigated for the level-2 optimizer. The outcome of these are the HBEOSA-SA and HBEOSA-FFA, which are then investigated on the NT, and their corresponding variants HBEOSA-SA-NT and HBEOSA-FFA-NT with no NT applied. The hybrid methods were experimentally tested over high-dimensional datasets to address the challenge of feature selection. A comparative analysis was done on the methods to obtain performance variability with the low-dimensional datasets. Results obtained for classification accuracy for large, medium, and small-scale datasets are 0.995 using HBEOSA-FFA, 0.967 using HBEOSA-FFA-NT, and 0.953 using HBEOSA-FFA, respectively. Fitness and cost values relative to large, medium, and small-scale datasets are 0.066 and 0.934 using HBEOSA-FFA, 0.068 and 0.932 using HBEOSA-FFA, with 0.222 and 0.970 using HBEOSA-SA-NT, respectively. Findings from the study indicate that the HBEOSA-SA, HBEOSA-FFA, HBEOSA-SA-NT and HBEOSA-FFA-NT outperformed the BEOSA.
first_indexed 2024-04-09T17:00:41Z
format Article
id doaj.art-7b1e927314114336909f78c12e5c07b1
institution Directory Open Access Journal
issn 1932-6203
language English
last_indexed 2024-04-09T17:00:41Z
publishDate 2023-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj.art-7b1e927314114336909f78c12e5c07b12023-04-21T05:32:50ZengPublic Library of Science (PLoS)PLoS ONE1932-62032023-01-01183e028281210.1371/journal.pone.0282812Evolutionary binary feature selection using adaptive ebola optimization search algorithm for high-dimensional datasets.Olaide N OyeladeJeffrey O AgushakaAbsalom E EzugwuFeature selection problem represents the field of study that requires approximate algorithms to identify discriminative and optimally combined features. The evaluation and suitability of these selected features are often analyzed using classifiers. These features are locked with data increasingly being generated from different sources such as social media, surveillance systems, network applications, and medical records. The high dimensionality of these datasets often impairs the quality of the optimal combination of these features selected. The use of the binary optimization method has been proposed in the literature to address this challenge. However, the underlying deficiency of the single binary optimizer is transferred to the quality of the features selected. Though hybrid methods have been proposed, most still suffer from the inherited design limitation of the single combined methods. To address this, we proposed a novel hybrid binary optimization capable of effectively selecting features from increasingly high-dimensional datasets. The approach used in this study designed a sub-population selective mechanism that dynamically assigns individuals to a 2-level optimization process. The level-1 method first mutates items in the population and then reassigns them to a level-2 optimizer. The selective mechanism determines what sub-population is assigned for the level-2 optimizer based on the exploration and exploitation phase of the level-1 optimizer. In addition, we designed nested transfer (NT) functions and investigated the influence of the function on the level-1 optimizer. The binary Ebola optimization search algorithm (BEOSA) is applied for the level-1 mutation, while the simulated annealing (SA) and firefly (FFA) algorithms are investigated for the level-2 optimizer. The outcome of these are the HBEOSA-SA and HBEOSA-FFA, which are then investigated on the NT, and their corresponding variants HBEOSA-SA-NT and HBEOSA-FFA-NT with no NT applied. The hybrid methods were experimentally tested over high-dimensional datasets to address the challenge of feature selection. A comparative analysis was done on the methods to obtain performance variability with the low-dimensional datasets. Results obtained for classification accuracy for large, medium, and small-scale datasets are 0.995 using HBEOSA-FFA, 0.967 using HBEOSA-FFA-NT, and 0.953 using HBEOSA-FFA, respectively. Fitness and cost values relative to large, medium, and small-scale datasets are 0.066 and 0.934 using HBEOSA-FFA, 0.068 and 0.932 using HBEOSA-FFA, with 0.222 and 0.970 using HBEOSA-SA-NT, respectively. Findings from the study indicate that the HBEOSA-SA, HBEOSA-FFA, HBEOSA-SA-NT and HBEOSA-FFA-NT outperformed the BEOSA.https://doi.org/10.1371/journal.pone.0282812
spellingShingle Olaide N Oyelade
Jeffrey O Agushaka
Absalom E Ezugwu
Evolutionary binary feature selection using adaptive ebola optimization search algorithm for high-dimensional datasets.
PLoS ONE
title Evolutionary binary feature selection using adaptive ebola optimization search algorithm for high-dimensional datasets.
title_full Evolutionary binary feature selection using adaptive ebola optimization search algorithm for high-dimensional datasets.
title_fullStr Evolutionary binary feature selection using adaptive ebola optimization search algorithm for high-dimensional datasets.
title_full_unstemmed Evolutionary binary feature selection using adaptive ebola optimization search algorithm for high-dimensional datasets.
title_short Evolutionary binary feature selection using adaptive ebola optimization search algorithm for high-dimensional datasets.
title_sort evolutionary binary feature selection using adaptive ebola optimization search algorithm for high dimensional datasets
url https://doi.org/10.1371/journal.pone.0282812
work_keys_str_mv AT olaidenoyelade evolutionarybinaryfeatureselectionusingadaptiveebolaoptimizationsearchalgorithmforhighdimensionaldatasets
AT jeffreyoagushaka evolutionarybinaryfeatureselectionusingadaptiveebolaoptimizationsearchalgorithmforhighdimensionaldatasets
AT absalomeezugwu evolutionarybinaryfeatureselectionusingadaptiveebolaoptimizationsearchalgorithmforhighdimensionaldatasets