Guaranteed bounds on posterior distributions of discrete probabilistic programs with loops

We study the problem of bounding the posterior distribution of discrete probabilistic programs with unbounded support, loops, and conditioning. Loops pose the main difficulty in this setting: even if exact Bayesian inference is possible, the state of the art requires user-provided loop invariant tem...

Full description

Bibliographic Details
Main Authors: Zaiser, F, Murawski, AS, Ong, C-HL
Format: Journal article
Language:English
Published: Association for Computing Machinery 2024
_version_ 1817931543724687360
author Zaiser, F
Murawski, AS
Ong, C-HL
author_facet Zaiser, F
Murawski, AS
Ong, C-HL
author_sort Zaiser, F
collection OXFORD
description We study the problem of bounding the posterior distribution of discrete probabilistic programs with unbounded support, loops, and conditioning. Loops pose the main difficulty in this setting: even if exact Bayesian inference is possible, the state of the art requires user-provided loop invariant templates. By contrast, we aim to find guaranteed bounds, which sandwich the true distribution. They are fully automated, applicable to more programs and provide more provable guarantees than approximate sampling-based inference. Since lower bounds can be obtained by unrolling loops, the main challenge is upper bounds, and we attack it in two ways. The first is called residual mass semantics, which is a flat bound based on the residual probability mass of a loop. The approach is simple, efficient, and has provable guarantees. <br> The main novelty of our work is the second approach, called geometric bound semantics. It operates on a novel family of distributions, called eventually geometric distributions (EGDs), and can bound the distribution of loops with a new form of loop invariants called contraction invariants. The invariant synthesis problem reduces to a system of polynomial inequality constraints, which is a decidable problem with automated solvers. If a solution exists, it yields an exponentially decreasing bound on the whole distribution, and can therefore bound moments and tail asymptotics as well, not just probabilities as in the first approach. <br> Both semantics enjoy desirable theoretical properties. In particular, we prove soundness and convergence, i.e. the bounds converge to the exact posterior as loops are unrolled further. We also investigate sufficient and necessary conditions for the existence of geometric bounds. On the practical side, we describe Diabolo, a fully-automated implementation of both semantics, and evaluate them on a variety of bench
first_indexed 2024-12-09T03:23:42Z
format Journal article
id oxford-uuid:47f1cf0a-c839-4537-a997-a9c7e1402c6b
institution University of Oxford
language English
last_indexed 2024-12-09T03:23:42Z
publishDate 2024
publisher Association for Computing Machinery
record_format dspace
spelling oxford-uuid:47f1cf0a-c839-4537-a997-a9c7e1402c6b2024-11-25T11:30:10ZGuaranteed bounds on posterior distributions of discrete probabilistic programs with loopsJournal articlehttp://purl.org/coar/resource_type/c_dcae04bcuuid:47f1cf0a-c839-4537-a997-a9c7e1402c6bEnglishSymplectic ElementsAssociation for Computing Machinery2024Zaiser, FMurawski, ASOng, C-HLWe study the problem of bounding the posterior distribution of discrete probabilistic programs with unbounded support, loops, and conditioning. Loops pose the main difficulty in this setting: even if exact Bayesian inference is possible, the state of the art requires user-provided loop invariant templates. By contrast, we aim to find guaranteed bounds, which sandwich the true distribution. They are fully automated, applicable to more programs and provide more provable guarantees than approximate sampling-based inference. Since lower bounds can be obtained by unrolling loops, the main challenge is upper bounds, and we attack it in two ways. The first is called residual mass semantics, which is a flat bound based on the residual probability mass of a loop. The approach is simple, efficient, and has provable guarantees. <br> The main novelty of our work is the second approach, called geometric bound semantics. It operates on a novel family of distributions, called eventually geometric distributions (EGDs), and can bound the distribution of loops with a new form of loop invariants called contraction invariants. The invariant synthesis problem reduces to a system of polynomial inequality constraints, which is a decidable problem with automated solvers. If a solution exists, it yields an exponentially decreasing bound on the whole distribution, and can therefore bound moments and tail asymptotics as well, not just probabilities as in the first approach. <br> Both semantics enjoy desirable theoretical properties. In particular, we prove soundness and convergence, i.e. the bounds converge to the exact posterior as loops are unrolled further. We also investigate sufficient and necessary conditions for the existence of geometric bounds. On the practical side, we describe Diabolo, a fully-automated implementation of both semantics, and evaluate them on a variety of bench
spellingShingle Zaiser, F
Murawski, AS
Ong, C-HL
Guaranteed bounds on posterior distributions of discrete probabilistic programs with loops
title Guaranteed bounds on posterior distributions of discrete probabilistic programs with loops
title_full Guaranteed bounds on posterior distributions of discrete probabilistic programs with loops
title_fullStr Guaranteed bounds on posterior distributions of discrete probabilistic programs with loops
title_full_unstemmed Guaranteed bounds on posterior distributions of discrete probabilistic programs with loops
title_short Guaranteed bounds on posterior distributions of discrete probabilistic programs with loops
title_sort guaranteed bounds on posterior distributions of discrete probabilistic programs with loops
work_keys_str_mv AT zaiserf guaranteedboundsonposteriordistributionsofdiscreteprobabilisticprogramswithloops
AT murawskias guaranteedboundsonposteriordistributionsofdiscreteprobabilisticprogramswithloops
AT ongchl guaranteedboundsonposteriordistributionsofdiscreteprobabilisticprogramswithloops