XtremeAugment: Getting More From Your Data Through Combination of Image Collection and Image Augmentation

Deep convolutional neural networks are highly efficient for computer vision tasks using plenty of training data. However, there remains a problem of small training datasets. For addressing this problem the training pipeline which handles rare object types and an overall lack of training data to buil...

Full description

Bibliographic Details
Main Authors:	Sergey Nesteruk, Svetlana Illarionova, Timur Akhtyamov, Dmitrii Shadrin, Andrey Somov, Mariia Pukalchik, Ivan Oseledets
Format:	Article
Language:	English
Published:	IEEE 2022-01-01
Series:	IEEE Access
Subjects:	Image augmentation computer vision image segmentation data collection Internet of Things few-shot learning
Online Access:	https://ieeexplore.ieee.org/document/9721254/

_version_	1819105227921424384
author	Sergey Nesteruk Svetlana Illarionova Timur Akhtyamov Dmitrii Shadrin Andrey Somov Mariia Pukalchik Ivan Oseledets
author_facet	Sergey Nesteruk Svetlana Illarionova Timur Akhtyamov Dmitrii Shadrin Andrey Somov Mariia Pukalchik Ivan Oseledets
author_sort	Sergey Nesteruk
collection	DOAJ
description	Deep convolutional neural networks are highly efficient for computer vision tasks using plenty of training data. However, there remains a problem of small training datasets. For addressing this problem the training pipeline which handles rare object types and an overall lack of training data to build well-performing models that provide stable predictions is required. This article reports on the comprehensive framework <italic>XtremeAugment</italic> which provides an easy, reliable, and scalable way to collect image datasets and to efficiently label and augment collected data. The presented framework consists of two augmentation techniques that can be used independently and complement each other when applied together. These are Hardware Dataset Augmentation (HDA) and Object-Based Augmentation (OBA). HDA allows the users to collect more data and spend less time on manual data labeling. OBA significantly increases the training data variability and remains the distribution of the augmented images being close to the original dataset. We assess the proposed approach for the apple spoil segmentation scenario. Our results demonstrate a substantial increase in the model accuracy reaching 0.91 F1-score and outperforming the baseline model for up to 0.62 F1-score for a few-shot learning case in the wild data. The highest benefit of applying <italic>XtremeAugment</italic> is achieved for the cases where we collect images in the controlled indoor environment, but have to use the model in the wild.
first_indexed	2024-12-22T02:18:54Z
format	Article
id	doaj.art-7dfc0072490e4b60bd0b6a7515fcecfc
institution	Directory Open Access Journal
issn	2169-3536
language	English
last_indexed	2024-12-22T02:18:54Z
publishDate	2022-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj.art-7dfc0072490e4b60bd0b6a7515fcecfc2022-12-21T18:42:11ZengIEEEIEEE Access2169-35362022-01-0110240102402810.1109/ACCESS.2022.31547099721254XtremeAugment: Getting More From Your Data Through Combination of Image Collection and Image AugmentationSergey Nesteruk0https://orcid.org/0000-0002-9740-6685Svetlana Illarionova1Timur Akhtyamov2Dmitrii Shadrin3Andrey Somov4https://orcid.org/0000-0002-4615-3008Mariia Pukalchik5Ivan Oseledets6Skolkovo Institute of Science and Technology, Skolkovo, RussiaSkolkovo Institute of Science and Technology, Skolkovo, RussiaSkolkovo Institute of Science and Technology, Skolkovo, RussiaSkolkovo Institute of Science and Technology, Skolkovo, RussiaSkolkovo Institute of Science and Technology, Skolkovo, RussiaSkolkovo Institute of Science and Technology, Skolkovo, RussiaSkolkovo Institute of Science and Technology, Skolkovo, RussiaDeep convolutional neural networks are highly efficient for computer vision tasks using plenty of training data. However, there remains a problem of small training datasets. For addressing this problem the training pipeline which handles rare object types and an overall lack of training data to build well-performing models that provide stable predictions is required. This article reports on the comprehensive framework <italic>XtremeAugment</italic> which provides an easy, reliable, and scalable way to collect image datasets and to efficiently label and augment collected data. The presented framework consists of two augmentation techniques that can be used independently and complement each other when applied together. These are Hardware Dataset Augmentation (HDA) and Object-Based Augmentation (OBA). HDA allows the users to collect more data and spend less time on manual data labeling. OBA significantly increases the training data variability and remains the distribution of the augmented images being close to the original dataset. We assess the proposed approach for the apple spoil segmentation scenario. Our results demonstrate a substantial increase in the model accuracy reaching 0.91 F1-score and outperforming the baseline model for up to 0.62 F1-score for a few-shot learning case in the wild data. The highest benefit of applying <italic>XtremeAugment</italic> is achieved for the cases where we collect images in the controlled indoor environment, but have to use the model in the wild.https://ieeexplore.ieee.org/document/9721254/Image augmentationcomputer visionimage segmentationdata collectionInternet of Thingsfew-shot learning
spellingShingle	Sergey Nesteruk Svetlana Illarionova Timur Akhtyamov Dmitrii Shadrin Andrey Somov Mariia Pukalchik Ivan Oseledets XtremeAugment: Getting More From Your Data Through Combination of Image Collection and Image Augmentation IEEE Access Image augmentation computer vision image segmentation data collection Internet of Things few-shot learning
title	XtremeAugment: Getting More From Your Data Through Combination of Image Collection and Image Augmentation
title_full	XtremeAugment: Getting More From Your Data Through Combination of Image Collection and Image Augmentation
title_fullStr	XtremeAugment: Getting More From Your Data Through Combination of Image Collection and Image Augmentation
title_full_unstemmed	XtremeAugment: Getting More From Your Data Through Combination of Image Collection and Image Augmentation
title_short	XtremeAugment: Getting More From Your Data Through Combination of Image Collection and Image Augmentation
title_sort	xtremeaugment getting more from your data through combination of image collection and image augmentation
topic	Image augmentation computer vision image segmentation data collection Internet of Things few-shot learning
url	https://ieeexplore.ieee.org/document/9721254/
work_keys_str_mv	AT sergeynesteruk xtremeaugmentgettingmorefromyourdatathroughcombinationofimagecollectionandimageaugmentation AT svetlanaillarionova xtremeaugmentgettingmorefromyourdatathroughcombinationofimagecollectionandimageaugmentation AT timurakhtyamov xtremeaugmentgettingmorefromyourdatathroughcombinationofimagecollectionandimageaugmentation AT dmitriishadrin xtremeaugmentgettingmorefromyourdatathroughcombinationofimagecollectionandimageaugmentation AT andreysomov xtremeaugmentgettingmorefromyourdatathroughcombinationofimagecollectionandimageaugmentation AT mariiapukalchik xtremeaugmentgettingmorefromyourdatathroughcombinationofimagecollectionandimageaugmentation AT ivanoseledets xtremeaugmentgettingmorefromyourdatathroughcombinationofimagecollectionandimageaugmentation

XtremeAugment: Getting More From Your Data Through Combination of Image Collection and Image Augmentation

Similar Items