On the low-dimensional structure of Bayesian inference

Thesis: Ph. D., Massachusetts Institute of Technology, Department of Aeronautics and Astronautics, 2017.

Bibliographic Details
Main Author: Spantini, Alessio
Other Authors: Youssef M. Marzouk.
Format: Thesis
Language:eng
Published: Massachusetts Institute of Technology 2018
Subjects:
Online Access:http://hdl.handle.net/1721.1/113716
_version_ 1811079963250524160
author Spantini, Alessio
author2 Youssef M. Marzouk.
author_facet Youssef M. Marzouk.
Spantini, Alessio
author_sort Spantini, Alessio
collection MIT
description Thesis: Ph. D., Massachusetts Institute of Technology, Department of Aeronautics and Astronautics, 2017.
first_indexed 2024-09-23T11:23:17Z
format Thesis
id mit-1721.1/113716
institution Massachusetts Institute of Technology
language eng
last_indexed 2024-09-23T11:23:17Z
publishDate 2018
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/1137162019-04-11T09:44:15Z On the low-dimensional structure of Bayesian inference Spantini, Alessio Youssef M. Marzouk. Massachusetts Institute of Technology. Department of Aeronautics and Astronautics. Massachusetts Institute of Technology. Department of Aeronautics and Astronautics. Aeronautics and Astronautics. Thesis: Ph. D., Massachusetts Institute of Technology, Department of Aeronautics and Astronautics, 2017. This electronic version was submitted by the student author. The certified thesis is available in the Institute Archives and Special Collections. Cataloged from student-submitted PDF version of thesis. Includes bibliographical references (pages 239-266). The Bayesian approach to inference characterizes model parameters and predictions through the exploration of their posterior distributions, i.e., their distributions conditioned on available data. The Bayesian paradigm provides a flexible, principled framework for quantifying uncertainty, wherein heterogeneous and incomplete sources of information (e.g., prior knowledge, noisy observations, imperfect models) can be properly rationalized. Yet a major obstacle to deploying Bayesian inference in realistic applications is computational: characterizing the associated high-dimensional and non-Gaussian posterior distributions remains a challenging task. While the Bayesian formulation is quite general, essential features of a statistical model can bring additional structure to the Bayesian update. For instance, the prior distribution often encodes some kind of regularity in the parameters; observations might be sparse and corrupted by noise; observations might also be indirect, related to the parameters by a forward operator that filters out some information; the posterior distribution might satisfy conditional independence assumptions that reflect local probabilistic interactions; and in some cases we might be uninterested in the posterior distribution per se, but rather in specific prediction goals. In this thesis we: (1) provide a rigorous mathematical characterization of low-dimensional structures that enable efficient Bayesian inference in high-dimensional and continuous parameter spaces; and (2) exploit this characterization to devise new structure-exploiting and computationally efficient inference algorithms. Our contributions encompass multiple related topics. First we characterize optimal low-rank approximations of linear-Gaussian Bayesian inverse problems, and of their goal-oriented extensions. Then we turn to inference in the nonlinear non-Gaussian setting--analyzing the sparsity, decomposability, and low-rank structure of deterministic couplings between distributions. These couplings facilitate efficient computation of posterior expectations in generically non-Gaussian settings. Based on this analysis, we introduce a number of approaches for representing non-Gaussian Markov random fields and for exploiting their conditional independence structure in computation by means of sparse nonlinear transport maps. We also develop new variational algorithms for nonlinear smoothing and sequential parameter estimation. These algorithms can be understood as the natural generalization--to the non-Gaussian case--of the square-root Rauch-Tung-Striebel Gaussian smoother. Finally, we outline a new class of nonlinear filters induced by local couplings, for inference in high-dimensional spatiotemporal processes with chaotic dynamics. by Alessio Spantini. Ph. D. 2018-02-16T19:26:53Z 2018-02-16T19:26:53Z 2017 2017 Thesis http://hdl.handle.net/1721.1/113716 1021851916 eng MIT theses are protected by copyright. They may be viewed, downloaded, or printed from this source but further reproduction or distribution in any format is prohibited without written permission. http://dspace.mit.edu/handle/1721.1/7582 266 pages application/pdf Massachusetts Institute of Technology
spellingShingle Aeronautics and Astronautics.
Spantini, Alessio
On the low-dimensional structure of Bayesian inference
title On the low-dimensional structure of Bayesian inference
title_full On the low-dimensional structure of Bayesian inference
title_fullStr On the low-dimensional structure of Bayesian inference
title_full_unstemmed On the low-dimensional structure of Bayesian inference
title_short On the low-dimensional structure of Bayesian inference
title_sort on the low dimensional structure of bayesian inference
topic Aeronautics and Astronautics.
url http://hdl.handle.net/1721.1/113716
work_keys_str_mv AT spantinialessio onthelowdimensionalstructureofbayesianinference