A Textual Backdoor Defense Method Based on Deep Feature Classification
Natural language processing (NLP) models based on deep neural networks (DNNs) are vulnerable to backdoor attacks. Existing backdoor defense methods have limited effectiveness and coverage scenarios. We propose a textual backdoor defense method based on deep feature classification. The method include...
Main Authors: | Kun Shao, Junan Yang, Pengjiang Hu, Xiaoshuai Li |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2023-01-01
|
Series: | Entropy |
Subjects: | |
Online Access: | https://www.mdpi.com/1099-4300/25/2/220 |
Similar Items
-
Textual Backdoor Defense via Poisoned Sample Recognition
by: Kun Shao, et al.
Published: (2021-10-01) -
Backdoor Pony: Evaluating backdoor attacks and defenses in different domains
by: Arthur Mercier, et al.
Published: (2023-05-01) -
Defending Against Backdoor Attacks by Quarantine Training
by: Chengxu Yu, et al.
Published: (2024-01-01) -
Survey on Backdoor Attacks and Countermeasures in Deep Neural Network
by: QIAN Hanwei, SUN Weisong
Published: (2023-05-01) -
A Comprehensive Survey on Backdoor Attacks and Their Defenses in Face Recognition Systems
by: Quentin Le Roux, et al.
Published: (2024-01-01)