Improved Uncertainty Quantification for Neural Networks With Bayesian Last Layer

Uncertainty quantification is an important task in machine learning - a task in which standard neural networks (NNs) have traditionally not excelled. This can be a limitation for safety-critical applications, where uncertainty-aware methods like Gaussian processes or Bayesian linear regression are o...

Full description

Bibliographic Details
Main Authors: Felix Fiedler, Sergio Lucia
Format: Article
Language:English
Published: IEEE 2023-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10305157/
_version_ 1797632795988721664
author Felix Fiedler
Sergio Lucia
author_facet Felix Fiedler
Sergio Lucia
author_sort Felix Fiedler
collection DOAJ
description Uncertainty quantification is an important task in machine learning - a task in which standard neural networks (NNs) have traditionally not excelled. This can be a limitation for safety-critical applications, where uncertainty-aware methods like Gaussian processes or Bayesian linear regression are often preferred. Bayesian neural networks are an approach to address this limitation. They assume probability distributions for all parameters and yield distributed predictions. However, training and inference are typically intractable and approximations must be employed. A promising approximation is NNs with Bayesian last layer (BLL). They assume distributed weights only in the linear output layer and yield a normally distributed prediction. To approximate the intractable Bayesian neural network, point estimates of the distributed weights in all but the last layer should be obtained by maximizing the marginal likelihood. This has previously been challenging, as the marginal likelihood is expensive to evaluate in this setting. We present a reformulation of the log-marginal likelihood of a NN with BLL which allows for efficient training using backpropagation. Furthermore, we address the challenge of uncertainty quantification for extrapolation points. We provide a metric to quantify the degree of extrapolation and derive a method to improve the uncertainty quantification for these points. Our methods are derived for the multivariate case and demonstrated in a simulation study. In comparison to Bayesian linear regression with fixed features, and a Bayesian neural network trained with variational inference, our proposed method achieves the highest log-predictive density on test data.
first_indexed 2024-03-11T11:42:51Z
format Article
id doaj.art-c378e6cfbd804ed69f0513b4b7bce864
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-03-11T11:42:51Z
publishDate 2023-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-c378e6cfbd804ed69f0513b4b7bce8642023-11-10T00:01:02ZengIEEEIEEE Access2169-35362023-01-011112314912316010.1109/ACCESS.2023.332968510305157Improved Uncertainty Quantification for Neural Networks With Bayesian Last LayerFelix Fiedler0https://orcid.org/0000-0003-3490-1256Sergio Lucia1https://orcid.org/0000-0002-3347-5593Chair of Process Automation Systems, TU Dortmund University, Dortmund, GermanyChair of Process Automation Systems, TU Dortmund University, Dortmund, GermanyUncertainty quantification is an important task in machine learning - a task in which standard neural networks (NNs) have traditionally not excelled. This can be a limitation for safety-critical applications, where uncertainty-aware methods like Gaussian processes or Bayesian linear regression are often preferred. Bayesian neural networks are an approach to address this limitation. They assume probability distributions for all parameters and yield distributed predictions. However, training and inference are typically intractable and approximations must be employed. A promising approximation is NNs with Bayesian last layer (BLL). They assume distributed weights only in the linear output layer and yield a normally distributed prediction. To approximate the intractable Bayesian neural network, point estimates of the distributed weights in all but the last layer should be obtained by maximizing the marginal likelihood. This has previously been challenging, as the marginal likelihood is expensive to evaluate in this setting. We present a reformulation of the log-marginal likelihood of a NN with BLL which allows for efficient training using backpropagation. Furthermore, we address the challenge of uncertainty quantification for extrapolation points. We provide a metric to quantify the degree of extrapolation and derive a method to improve the uncertainty quantification for these points. Our methods are derived for the multivariate case and demonstrated in a simulation study. In comparison to Bayesian linear regression with fixed features, and a Bayesian neural network trained with variational inference, our proposed method achieves the highest log-predictive density on test data.https://ieeexplore.ieee.org/document/10305157/Bayesian last layerbayesian neural networkuncertainty quantification
spellingShingle Felix Fiedler
Sergio Lucia
Improved Uncertainty Quantification for Neural Networks With Bayesian Last Layer
IEEE Access
Bayesian last layer
bayesian neural network
uncertainty quantification
title Improved Uncertainty Quantification for Neural Networks With Bayesian Last Layer
title_full Improved Uncertainty Quantification for Neural Networks With Bayesian Last Layer
title_fullStr Improved Uncertainty Quantification for Neural Networks With Bayesian Last Layer
title_full_unstemmed Improved Uncertainty Quantification for Neural Networks With Bayesian Last Layer
title_short Improved Uncertainty Quantification for Neural Networks With Bayesian Last Layer
title_sort improved uncertainty quantification for neural networks with bayesian last layer
topic Bayesian last layer
bayesian neural network
uncertainty quantification
url https://ieeexplore.ieee.org/document/10305157/
work_keys_str_mv AT felixfiedler improveduncertaintyquantificationforneuralnetworkswithbayesianlastlayer
AT sergiolucia improveduncertaintyquantificationforneuralnetworkswithbayesianlastlayer