Bayesian inference with certifiable adversarial robustness

We consider adversarial training of deep neural networks through the lens of Bayesian learning and present a principled framework for adversarial training of Bayesian Neural Networks (BNNs) with certifiable guarantees. We rely on techniques from constraint relaxation of non-convex optimisation probl...

Cijeli opis

Bibliografski detalji
Glavni autori: Wicker, M, Laurenti, L, Patane, A, Chen, Z, Zhang, Z, Kwiatkowska, M
Format: Conference item
Jezik:English
Izdano: Journal of Machine Learning Research 2021
_version_ 1826309008957374464
author Wicker, M
Laurenti, L
Patane, A
Chen, Z
Zhang, Z
Kwiatkowska, M
author_facet Wicker, M
Laurenti, L
Patane, A
Chen, Z
Zhang, Z
Kwiatkowska, M
author_sort Wicker, M
collection OXFORD
description We consider adversarial training of deep neural networks through the lens of Bayesian learning and present a principled framework for adversarial training of Bayesian Neural Networks (BNNs) with certifiable guarantees. We rely on techniques from constraint relaxation of non-convex optimisation problems and modify the standard cross-entropy error model to enforce posterior robustness to worst-case perturbations in ϵ−balls around input points. We illustrate how the resulting framework can be combined with methods commonly employed for approximate inference of BNNs. In an empirical investigation, we demonstrate that the presented approach enables training of certifiably robust models on MNIST, FashionMNIST, and CIFAR-10 and can also be beneficial for uncertainty calibration. Our method is the first to directly train certifiable BNNs, thus facilitating their deployment in safety-critical applications.
first_indexed 2024-03-07T07:27:49Z
format Conference item
id oxford-uuid:b9c010c1-7e37-4dd8-aa13-3ca0d9c49f83
institution University of Oxford
language English
last_indexed 2024-03-07T07:27:49Z
publishDate 2021
publisher Journal of Machine Learning Research
record_format dspace
spelling oxford-uuid:b9c010c1-7e37-4dd8-aa13-3ca0d9c49f832022-12-01T17:09:27ZBayesian inference with certifiable adversarial robustnessConference itemhttp://purl.org/coar/resource_type/c_5794uuid:b9c010c1-7e37-4dd8-aa13-3ca0d9c49f83EnglishSymplectic ElementsJournal of Machine Learning Research2021Wicker, MLaurenti, LPatane, AChen, ZZhang, ZKwiatkowska, MWe consider adversarial training of deep neural networks through the lens of Bayesian learning and present a principled framework for adversarial training of Bayesian Neural Networks (BNNs) with certifiable guarantees. We rely on techniques from constraint relaxation of non-convex optimisation problems and modify the standard cross-entropy error model to enforce posterior robustness to worst-case perturbations in ϵ−balls around input points. We illustrate how the resulting framework can be combined with methods commonly employed for approximate inference of BNNs. In an empirical investigation, we demonstrate that the presented approach enables training of certifiably robust models on MNIST, FashionMNIST, and CIFAR-10 and can also be beneficial for uncertainty calibration. Our method is the first to directly train certifiable BNNs, thus facilitating their deployment in safety-critical applications.
spellingShingle Wicker, M
Laurenti, L
Patane, A
Chen, Z
Zhang, Z
Kwiatkowska, M
Bayesian inference with certifiable adversarial robustness
title Bayesian inference with certifiable adversarial robustness
title_full Bayesian inference with certifiable adversarial robustness
title_fullStr Bayesian inference with certifiable adversarial robustness
title_full_unstemmed Bayesian inference with certifiable adversarial robustness
title_short Bayesian inference with certifiable adversarial robustness
title_sort bayesian inference with certifiable adversarial robustness
work_keys_str_mv AT wickerm bayesianinferencewithcertifiableadversarialrobustness
AT laurentil bayesianinferencewithcertifiableadversarialrobustness
AT patanea bayesianinferencewithcertifiableadversarialrobustness
AT chenz bayesianinferencewithcertifiableadversarialrobustness
AT zhangz bayesianinferencewithcertifiableadversarialrobustness
AT kwiatkowskam bayesianinferencewithcertifiableadversarialrobustness