Binary Starling Murmuration Optimizer Algorithm to Select Effective Features from Medical Data

Feature selection is an NP-hard problem to remove irrelevant and redundant features with no predictive information to increase the performance of machine learning algorithms. Many wrapper-based methods using metaheuristic algorithms have been proposed to select effective features. However, they achi...

Full description

Bibliographic Details
Main Authors: Mohammad H. Nadimi-Shahraki, Zahra Asghari Varzaneh, Hoda Zamani, Seyedali Mirjalili
Format: Article
Language:English
Published: MDPI AG 2022-12-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/13/1/564
_version_ 1797626207790956544
author Mohammad H. Nadimi-Shahraki
Zahra Asghari Varzaneh
Hoda Zamani
Seyedali Mirjalili
author_facet Mohammad H. Nadimi-Shahraki
Zahra Asghari Varzaneh
Hoda Zamani
Seyedali Mirjalili
author_sort Mohammad H. Nadimi-Shahraki
collection DOAJ
description Feature selection is an NP-hard problem to remove irrelevant and redundant features with no predictive information to increase the performance of machine learning algorithms. Many wrapper-based methods using metaheuristic algorithms have been proposed to select effective features. However, they achieve differently on medical data, and most of them cannot find those effective features that may fulfill the required accuracy in diagnosing important diseases such as Diabetes, Heart problems, Hepatitis, and Coronavirus, which are targeted datasets in this study. To tackle this drawback, an algorithm is needed that can strike a balance between local and global search strategies in selecting effective features from medical datasets. In this paper, a new binary optimizer algorithm named BSMO is proposed. It is based on the newly proposed starling murmuration optimizer (SMO) that has a high ability to solve different complex and engineering problems, and it is expected that BSMO can also effectively find an optimal subset of features. Two distinct approaches are utilized by the BSMO algorithm when searching medical datasets to find effective features. Each dimension in a continuous solution generated by SMO is simply mapped to 0 or 1 using a variable threshold in the second approach, whereas in the first, binary versions of BSMO are developed using several S-shaped and V-shaped transfer functions. The performance of the proposed BSMO was evaluated using four targeted medical datasets, and results were compared with well-known binary metaheuristic algorithms in terms of different metrics, including fitness, accuracy, sensitivity, specificity, precision, and error. Finally, the superiority of the proposed BSMO algorithm was statistically analyzed using Friedman non-parametric test. The statistical and experimental tests proved that the proposed BSMO attains better performance in comparison to the competitive algorithms such as ACO, BBA, bGWO, and BWOA for selecting effective features from the medical datasets targeted in this study.
first_indexed 2024-03-11T10:07:11Z
format Article
id doaj.art-bf249f810ad149b3978449fb3b4cb10b
institution Directory Open Access Journal
issn 2076-3417
language English
last_indexed 2024-03-11T10:07:11Z
publishDate 2022-12-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj.art-bf249f810ad149b3978449fb3b4cb10b2023-11-16T14:58:45ZengMDPI AGApplied Sciences2076-34172022-12-0113156410.3390/app13010564Binary Starling Murmuration Optimizer Algorithm to Select Effective Features from Medical DataMohammad H. Nadimi-Shahraki0Zahra Asghari Varzaneh1Hoda Zamani2Seyedali Mirjalili3Faculty of Computer Engineering, Najafabad Branch, Islamic Azad University, Najafabad 8514143131, IranDepartment of Computer Science, Faculty of Mathematics and Computer, Shahid Bahonar University of Kerman, Kerman 7616914111, IranFaculty of Computer Engineering, Najafabad Branch, Islamic Azad University, Najafabad 8514143131, IranCentre for Artificial Intelligence Research and Optimisation, Torrens University Australia, Brisbane, QLD 4006, AustraliaFeature selection is an NP-hard problem to remove irrelevant and redundant features with no predictive information to increase the performance of machine learning algorithms. Many wrapper-based methods using metaheuristic algorithms have been proposed to select effective features. However, they achieve differently on medical data, and most of them cannot find those effective features that may fulfill the required accuracy in diagnosing important diseases such as Diabetes, Heart problems, Hepatitis, and Coronavirus, which are targeted datasets in this study. To tackle this drawback, an algorithm is needed that can strike a balance between local and global search strategies in selecting effective features from medical datasets. In this paper, a new binary optimizer algorithm named BSMO is proposed. It is based on the newly proposed starling murmuration optimizer (SMO) that has a high ability to solve different complex and engineering problems, and it is expected that BSMO can also effectively find an optimal subset of features. Two distinct approaches are utilized by the BSMO algorithm when searching medical datasets to find effective features. Each dimension in a continuous solution generated by SMO is simply mapped to 0 or 1 using a variable threshold in the second approach, whereas in the first, binary versions of BSMO are developed using several S-shaped and V-shaped transfer functions. The performance of the proposed BSMO was evaluated using four targeted medical datasets, and results were compared with well-known binary metaheuristic algorithms in terms of different metrics, including fitness, accuracy, sensitivity, specificity, precision, and error. Finally, the superiority of the proposed BSMO algorithm was statistically analyzed using Friedman non-parametric test. The statistical and experimental tests proved that the proposed BSMO attains better performance in comparison to the competitive algorithms such as ACO, BBA, bGWO, and BWOA for selecting effective features from the medical datasets targeted in this study.https://www.mdpi.com/2076-3417/13/1/564disease diagnosismedical datafeature selectionbinary metaheuristic algorithmsstarling murmuration optimizer (SMO)transfer function
spellingShingle Mohammad H. Nadimi-Shahraki
Zahra Asghari Varzaneh
Hoda Zamani
Seyedali Mirjalili
Binary Starling Murmuration Optimizer Algorithm to Select Effective Features from Medical Data
Applied Sciences
disease diagnosis
medical data
feature selection
binary metaheuristic algorithms
starling murmuration optimizer (SMO)
transfer function
title Binary Starling Murmuration Optimizer Algorithm to Select Effective Features from Medical Data
title_full Binary Starling Murmuration Optimizer Algorithm to Select Effective Features from Medical Data
title_fullStr Binary Starling Murmuration Optimizer Algorithm to Select Effective Features from Medical Data
title_full_unstemmed Binary Starling Murmuration Optimizer Algorithm to Select Effective Features from Medical Data
title_short Binary Starling Murmuration Optimizer Algorithm to Select Effective Features from Medical Data
title_sort binary starling murmuration optimizer algorithm to select effective features from medical data
topic disease diagnosis
medical data
feature selection
binary metaheuristic algorithms
starling murmuration optimizer (SMO)
transfer function
url https://www.mdpi.com/2076-3417/13/1/564
work_keys_str_mv AT mohammadhnadimishahraki binarystarlingmurmurationoptimizeralgorithmtoselecteffectivefeaturesfrommedicaldata
AT zahraasgharivarzaneh binarystarlingmurmurationoptimizeralgorithmtoselecteffectivefeaturesfrommedicaldata
AT hodazamani binarystarlingmurmurationoptimizeralgorithmtoselecteffectivefeaturesfrommedicaldata
AT seyedalimirjalili binarystarlingmurmurationoptimizeralgorithmtoselecteffectivefeaturesfrommedicaldata