Different hybrid machine intelligence techniques for handling IoT‐based imbalanced data

Abstract In the era of automatic task processing or designing complex algorithms, to analyse data, it is always pertinent to find real‐life solutions using cutting‐edge tools and techniques to generate insights into the data. The data‐driven machine learning models are now offering more or less wort...

Full description

Bibliographic Details
Main Authors: Gaurav Mohindru, Koushik Mondal, Haider Banka
Format: Article
Language:English
Published: Wiley 2021-12-01
Series:CAAI Transactions on Intelligence Technology
Subjects:
Online Access:https://doi.org/10.1049/cit2.12032
_version_ 1817998357068513280
author Gaurav Mohindru
Koushik Mondal
Haider Banka
author_facet Gaurav Mohindru
Koushik Mondal
Haider Banka
author_sort Gaurav Mohindru
collection DOAJ
description Abstract In the era of automatic task processing or designing complex algorithms, to analyse data, it is always pertinent to find real‐life solutions using cutting‐edge tools and techniques to generate insights into the data. The data‐driven machine learning models are now offering more or less worthy results when they are certainly balanced in the input data sets. Imbalanced data occurs when an unequal distribution of classes occurs in the input datasets. Building a predictive model on the imbalanced data set would cause a model that appears to yield high accuracy but does not generalize well to the new data in the minority class. Now the time has come to look into the datasets which are not so‐called ‘balanced’ in nature but such datasets are generally encountered frequently in a workspace. To prevent creating models with false levels of accuracy, the imbalanced data should be rearranged before creating a predictive model. Those data are, sometimes, voluminous, heterogeneous and complex in nature and generate from different autonomous sources with distributed and decentralized control. The driving force is to efficiently handle these data sets using latest tools and techniques for research and commercial insights. The present article provides different such tools and techniques, in different computing frameworks, to handle such Internet of Things and other related datasets to review common techniques for handling imbalanced data in data ecosystems and offers a comparative data modelling framework in Keras for balanced and imbalanced datasets.
first_indexed 2024-04-14T02:52:01Z
format Article
id doaj.art-014d1b354220429580ded2cebdc4d73b
institution Directory Open Access Journal
issn 2468-2322
language English
last_indexed 2024-04-14T02:52:01Z
publishDate 2021-12-01
publisher Wiley
record_format Article
series CAAI Transactions on Intelligence Technology
spelling doaj.art-014d1b354220429580ded2cebdc4d73b2022-12-22T02:16:15ZengWileyCAAI Transactions on Intelligence Technology2468-23222021-12-016440541610.1049/cit2.12032Different hybrid machine intelligence techniques for handling IoT‐based imbalanced dataGaurav Mohindru0Koushik Mondal1Haider Banka2Department of Computer Science and Engineering IIT(ISM)Dhanbad IndiaDepartment of Computer Centre IIT(ISM)Dhanbad IndiaDepartment of Computer Science and Engineering IIT(ISM)Dhanbad IndiaAbstract In the era of automatic task processing or designing complex algorithms, to analyse data, it is always pertinent to find real‐life solutions using cutting‐edge tools and techniques to generate insights into the data. The data‐driven machine learning models are now offering more or less worthy results when they are certainly balanced in the input data sets. Imbalanced data occurs when an unequal distribution of classes occurs in the input datasets. Building a predictive model on the imbalanced data set would cause a model that appears to yield high accuracy but does not generalize well to the new data in the minority class. Now the time has come to look into the datasets which are not so‐called ‘balanced’ in nature but such datasets are generally encountered frequently in a workspace. To prevent creating models with false levels of accuracy, the imbalanced data should be rearranged before creating a predictive model. Those data are, sometimes, voluminous, heterogeneous and complex in nature and generate from different autonomous sources with distributed and decentralized control. The driving force is to efficiently handle these data sets using latest tools and techniques for research and commercial insights. The present article provides different such tools and techniques, in different computing frameworks, to handle such Internet of Things and other related datasets to review common techniques for handling imbalanced data in data ecosystems and offers a comparative data modelling framework in Keras for balanced and imbalanced datasets.https://doi.org/10.1049/cit2.12032neural netsartificial intelligencemedical computingInternet of Thingspattern classificationlearning (artificial intelligence)
spellingShingle Gaurav Mohindru
Koushik Mondal
Haider Banka
Different hybrid machine intelligence techniques for handling IoT‐based imbalanced data
CAAI Transactions on Intelligence Technology
neural nets
artificial intelligence
medical computing
Internet of Things
pattern classification
learning (artificial intelligence)
title Different hybrid machine intelligence techniques for handling IoT‐based imbalanced data
title_full Different hybrid machine intelligence techniques for handling IoT‐based imbalanced data
title_fullStr Different hybrid machine intelligence techniques for handling IoT‐based imbalanced data
title_full_unstemmed Different hybrid machine intelligence techniques for handling IoT‐based imbalanced data
title_short Different hybrid machine intelligence techniques for handling IoT‐based imbalanced data
title_sort different hybrid machine intelligence techniques for handling iot based imbalanced data
topic neural nets
artificial intelligence
medical computing
Internet of Things
pattern classification
learning (artificial intelligence)
url https://doi.org/10.1049/cit2.12032
work_keys_str_mv AT gauravmohindru differenthybridmachineintelligencetechniquesforhandlingiotbasedimbalanceddata
AT koushikmondal differenthybridmachineintelligencetechniquesforhandlingiotbasedimbalanceddata
AT haiderbanka differenthybridmachineintelligencetechniquesforhandlingiotbasedimbalanceddata