Output-weighted optimal sampling for Bayesian regression and rare event statistics using few samples

For many important problems the quantity of interest is an unknown function of the parameters, which is a random vector with known statistics. Since the dependence of the output on this random vector is unknown, the challenge is to identify its statistics, using the minimum number of function evalua...

Full description

Bibliographic Details
Main Author:	Sapsis, Themistoklis Panagiotis
Other Authors:	Massachusetts Institute of Technology. Department of Mechanical Engineering
Format:	Article
Language:	English
Published:	The Royal Society 2020
Online Access:	https://hdl.handle.net/1721.1/126917

_version_	1826214846087036928
author	Sapsis, Themistoklis Panagiotis
author2	Massachusetts Institute of Technology. Department of Mechanical Engineering
author_facet	Massachusetts Institute of Technology. Department of Mechanical Engineering Sapsis, Themistoklis Panagiotis
author_sort	Sapsis, Themistoklis Panagiotis
collection	MIT
description	For many important problems the quantity of interest is an unknown function of the parameters, which is a random vector with known statistics. Since the dependence of the output on this random vector is unknown, the challenge is to identify its statistics, using the minimum number of function evaluations. This problem can be seen in the context of active learning or optimal experimental design. We employ Bayesian regression to represent the derived model uncertainty due to finite and small number of input–output pairs. In this context we evaluate existing methods for optimal sample selection, such as model error minimization and mutual information maximization. We show that for the case of known output variance, the commonly employed criteria in the literature do not take into account the output values of the existing input–output pairs, while for the case of unknown output variance this dependence can be very weak. We introduce a criterion that takes into account the values of the output for the existing samples and adaptively selects inputs from regions of the parameter space which have an important contribution to the output. The new method allows for application to high-dimensional inputs, paving the way for optimal experimental design in high dimensions. ©2020 The Author(s) Published by the Royal Society. All rights reserved.
first_indexed	2024-09-23T16:12:05Z
format	Article
id	mit-1721.1/126917
institution	Massachusetts Institute of Technology
language	English
last_indexed	2024-09-23T16:12:05Z
publishDate	2020
publisher	The Royal Society
record_format	dspace
spelling	mit-1721.1/1269172022-09-29T18:56:26Z Output-weighted optimal sampling for Bayesian regression and rare event statistics using few samples Sapsis, Themistoklis Panagiotis Massachusetts Institute of Technology. Department of Mechanical Engineering For many important problems the quantity of interest is an unknown function of the parameters, which is a random vector with known statistics. Since the dependence of the output on this random vector is unknown, the challenge is to identify its statistics, using the minimum number of function evaluations. This problem can be seen in the context of active learning or optimal experimental design. We employ Bayesian regression to represent the derived model uncertainty due to finite and small number of input–output pairs. In this context we evaluate existing methods for optimal sample selection, such as model error minimization and mutual information maximization. We show that for the case of known output variance, the commonly employed criteria in the literature do not take into account the output values of the existing input–output pairs, while for the case of unknown output variance this dependence can be very weak. We introduce a criterion that takes into account the values of the output for the existing samples and adaptively selects inputs from regions of the parameter space which have an important contribution to the output. The new method allows for application to high-dimensional inputs, paving the way for optimal experimental design in high dimensions. ©2020 The Author(s) Published by the Royal Society. All rights reserved. 2020-09-03T16:16:13Z 2020-09-03T16:16:13Z 2020-02 2019-11 2020-08-04T17:14:07Z Article http://purl.org/eprint/type/JournalArticle 1471-2946 https://hdl.handle.net/1721.1/126917 Sapsis, Themistoklis P., "Output-weighted optimal sampling for Bayesian regression and rare event statistics using few samples." Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences 476, 2234 (February 2020): no. 20190834 doi. 10.1098/rspa.2019.0834 ©2020 Author(s) en https://dx.doi.org/10.1098/rspa.2019.0834 Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by-nc-sa/4.0/ application/pdf The Royal Society arXiv
spellingShingle	Sapsis, Themistoklis Panagiotis Output-weighted optimal sampling for Bayesian regression and rare event statistics using few samples
title	Output-weighted optimal sampling for Bayesian regression and rare event statistics using few samples
title_full	Output-weighted optimal sampling for Bayesian regression and rare event statistics using few samples
title_fullStr	Output-weighted optimal sampling for Bayesian regression and rare event statistics using few samples
title_full_unstemmed	Output-weighted optimal sampling for Bayesian regression and rare event statistics using few samples
title_short	Output-weighted optimal sampling for Bayesian regression and rare event statistics using few samples
title_sort	output weighted optimal sampling for bayesian regression and rare event statistics using few samples
url	https://hdl.handle.net/1721.1/126917
work_keys_str_mv	AT sapsisthemistoklispanagiotis outputweightedoptimalsamplingforbayesianregressionandrareeventstatisticsusingfewsamples

Output-weighted optimal sampling for Bayesian regression and rare event statistics using few samples

Similar Items