Brain-inspired semantic data augmentation for multi-style images

Data augmentation is an effective technique for automatically expanding training data in deep learning. Brain-inspired methods are approaches that draw inspiration from the functionality and structure of the human brain and apply these mechanisms and principles to artificial intelligence and compute...

Disgrifiad llawn

Manylion Llyfryddiaeth
Prif Awduron: Wei Wang, Zhaowei Shang, Chengxing Li
Fformat: Erthygl
Iaith:English
Cyhoeddwyd: Frontiers Media S.A. 2024-03-01
Cyfres:Frontiers in Neurorobotics
Pynciau:
Mynediad Ar-lein:https://www.frontiersin.org/articles/10.3389/fnbot.2024.1382406/full
_version_ 1827308910431698944
author Wei Wang
Zhaowei Shang
Chengxing Li
author_facet Wei Wang
Zhaowei Shang
Chengxing Li
author_sort Wei Wang
collection DOAJ
description Data augmentation is an effective technique for automatically expanding training data in deep learning. Brain-inspired methods are approaches that draw inspiration from the functionality and structure of the human brain and apply these mechanisms and principles to artificial intelligence and computer science. When there is a large style difference between training data and testing data, common data augmentation methods cannot effectively enhance the generalization performance of the deep model. To solve this problem, we improve modeling Domain Shifts with Uncertainty (DSU) and propose a new brain-inspired computer vision image data augmentation method which consists of two key components, namely, using Robust statistics and controlling the Coefficient of variance for DSU (RCDSU) and Feature Data Augmentation (FeatureDA). RCDSU calculates feature statistics (mean and standard deviation) with robust statistics to weaken the influence of outliers, making the statistics close to the real values and improving the robustness of deep learning models. By controlling the coefficient of variance, RCDSU makes the feature statistics shift with semantic preservation and increases shift range. FeatureDA controls the coefficient of variance similarly to generate the augmented features with semantics unchanged and increase the coverage of augmented features. RCDSU and FeatureDA are proposed to perform style transfer and content transfer in the feature space, and improve the generalization ability of the model at the style and content level respectively. On Photo, Art Painting, Cartoon, and Sketch (PACS) multi-style classification task, RCDSU plus FeatureDA achieves competitive accuracy. After adding Gaussian noise to PACS dataset, RCDSU plus FeatureDA shows strong robustness against outliers. FeatureDA achieves excellent results on CIFAR-100 image classification task. RCDSU plus FeatureDA can be applied as a novel brain-inspired semantic data augmentation method with implicit robot automation which is suitable for datasets with large style differences between training and testing data.
first_indexed 2024-04-24T19:20:01Z
format Article
id doaj.art-bc2b45d354834e4da44fab4106df9d12
institution Directory Open Access Journal
issn 1662-5218
language English
last_indexed 2024-04-24T19:20:01Z
publishDate 2024-03-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Neurorobotics
spelling doaj.art-bc2b45d354834e4da44fab4106df9d122024-03-26T04:20:51ZengFrontiers Media S.A.Frontiers in Neurorobotics1662-52182024-03-011810.3389/fnbot.2024.13824061382406Brain-inspired semantic data augmentation for multi-style imagesWei WangZhaowei ShangChengxing LiData augmentation is an effective technique for automatically expanding training data in deep learning. Brain-inspired methods are approaches that draw inspiration from the functionality and structure of the human brain and apply these mechanisms and principles to artificial intelligence and computer science. When there is a large style difference between training data and testing data, common data augmentation methods cannot effectively enhance the generalization performance of the deep model. To solve this problem, we improve modeling Domain Shifts with Uncertainty (DSU) and propose a new brain-inspired computer vision image data augmentation method which consists of two key components, namely, using Robust statistics and controlling the Coefficient of variance for DSU (RCDSU) and Feature Data Augmentation (FeatureDA). RCDSU calculates feature statistics (mean and standard deviation) with robust statistics to weaken the influence of outliers, making the statistics close to the real values and improving the robustness of deep learning models. By controlling the coefficient of variance, RCDSU makes the feature statistics shift with semantic preservation and increases shift range. FeatureDA controls the coefficient of variance similarly to generate the augmented features with semantics unchanged and increase the coverage of augmented features. RCDSU and FeatureDA are proposed to perform style transfer and content transfer in the feature space, and improve the generalization ability of the model at the style and content level respectively. On Photo, Art Painting, Cartoon, and Sketch (PACS) multi-style classification task, RCDSU plus FeatureDA achieves competitive accuracy. After adding Gaussian noise to PACS dataset, RCDSU plus FeatureDA shows strong robustness against outliers. FeatureDA achieves excellent results on CIFAR-100 image classification task. RCDSU plus FeatureDA can be applied as a novel brain-inspired semantic data augmentation method with implicit robot automation which is suitable for datasets with large style differences between training and testing data.https://www.frontiersin.org/articles/10.3389/fnbot.2024.1382406/fulldata augmentationdeep learningrobust statisticsstyle transferuncertainty modelingbrain-inspired computer vision
spellingShingle Wei Wang
Zhaowei Shang
Chengxing Li
Brain-inspired semantic data augmentation for multi-style images
Frontiers in Neurorobotics
data augmentation
deep learning
robust statistics
style transfer
uncertainty modeling
brain-inspired computer vision
title Brain-inspired semantic data augmentation for multi-style images
title_full Brain-inspired semantic data augmentation for multi-style images
title_fullStr Brain-inspired semantic data augmentation for multi-style images
title_full_unstemmed Brain-inspired semantic data augmentation for multi-style images
title_short Brain-inspired semantic data augmentation for multi-style images
title_sort brain inspired semantic data augmentation for multi style images
topic data augmentation
deep learning
robust statistics
style transfer
uncertainty modeling
brain-inspired computer vision
url https://www.frontiersin.org/articles/10.3389/fnbot.2024.1382406/full
work_keys_str_mv AT weiwang braininspiredsemanticdataaugmentationformultistyleimages
AT zhaoweishang braininspiredsemanticdataaugmentationformultistyleimages
AT chengxingli braininspiredsemanticdataaugmentationformultistyleimages