Make some noise: reliable and efficient single-step adversarial training

Recently, Wong et al. (2020) showed that adversarial training with single-step FGSM leads to a characteristic failure mode named catastrophic overfitting (CO), in which a model becomes suddenly vulnerable to multi-step attacks. Experimentally they showed that simply adding a random perturbation prio...
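The abstract describes single-step FGSM training and the random-perturbation variant of Wong et al. (2020). As a rough illustration only, here is a minimal NumPy sketch of those two attack steps; the toy gradient function, `eps`/`alpha` values, and the linear-loss setup are illustrative assumptions, not the paper's actual training procedure or models.

```python
import numpy as np

rng = np.random.default_rng(0)

def fgsm_perturb(x, grad, eps):
    # Plain single-step FGSM: move each coordinate by eps in the
    # direction of the loss gradient's sign, then clip to valid pixels.
    return np.clip(x + eps * np.sign(grad), 0.0, 1.0)

def rs_fgsm_perturb(x, grad_fn, eps, alpha):
    # Random-start FGSM in the style of Wong et al. (2020): begin at a
    # uniform random point in the eps-ball, take one signed gradient
    # step of size alpha, then project back into the eps-ball.
    delta = rng.uniform(-eps, eps, size=x.shape)
    delta = delta + alpha * np.sign(grad_fn(x + delta))
    delta = np.clip(delta, -eps, eps)
    return np.clip(x + delta, 0.0, 1.0)

# Toy example: a linear score loss whose input-gradient is constant.
# (Hypothetical stand-in for a network's loss gradient.)
x = rng.uniform(0.0, 1.0, size=8)   # "image" in [0, 1]
w = rng.normal(size=8)
y = 1.0
grad_fn = lambda z: -y * w          # d/dz of loss = -y * (w . z)

x_adv = rs_fgsm_perturb(x, grad_fn, eps=8 / 255, alpha=10 / 255)
```

The projection step guarantees the adversarial example stays within the eps-ball of the clean input, which is the constraint both attacks above share.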


Bibliographic Details
Main Authors: de Jorge, P, Bibi, A, Volpi, R, Sanyal, A, Torr, PHS, Rogez, G, Dokania, PK
Format: Conference item
Language: English
Published: Curran Associates 2023

Similar Items