DefenseFea: An Input Transformation Feature Searching Algorithm Based Latent Space for Adversarial Defense

Bibliographic Details
Main Authors: Pan Zhang, Yangjie Cao, Chenxi Zhu, Yan Zhuang, Haobo Wang, Jie Li
Format: Article
Language: English
Published: Sciendo 2024-02-01
Series: Foundations of Computing and Decision Sciences
Online Access: https://doi.org/10.2478/fcds-2024-0002
Description
Summary: Deep neural network based image classification systems can suffer from adversarial attack algorithms, which generate input examples by adding deliberately crafted yet imperceptible noise to original input images. These crafted examples can fool such systems and threaten their security. In this paper, we propose using the latent space to protect image classification. Specifically, we train a feature searching network with a label guided loss function to make up the difference between adversarial examples and clean examples. We name the method DefenseFea (input transformation based defense with label guided loss function). Experimental results show that DefenseFea improves the defense rate against adversarial examples, achieving a success rate of about 99% on a specific set of 5000 images from ILSVRC 2012. This study contributes to further investigation of the relationship between adversarial examples and clean examples.
ISSN: 2300-3405