Sequential decision problems, dependent types and generic solutions

We present a computer-checked generic implementation for solving finite-horizon sequential decision problems. This is a wide class of problems, including inter-temporal optimizations, knapsack, optimal bracketing, scheduling, etc. The implementation can handle time-step dependent control and state s...

Full description

Bibliographic Details
Main Authors: Nicola Botta, Patrik Jansson, Cezar Ionescu, David R. Christiansen, Edwin Brady
Format: Article
Language:English
Published: Logical Methods in Computer Science e.V. 2017-03-01
Series:Logical Methods in Computer Science
Subjects:
Online Access:https://lmcs.episciences.org/3191/pdf
_version_ 1797268612576182272
author Nicola Botta
Patrik Jansson
Cezar Ionescu
David R. Christiansen
Edwin Brady
author_facet Nicola Botta
Patrik Jansson
Cezar Ionescu
David R. Christiansen
Edwin Brady
author_sort Nicola Botta
collection DOAJ
description We present a computer-checked generic implementation for solving finite-horizon sequential decision problems. This is a wide class of problems, including inter-temporal optimizations, knapsack, optimal bracketing, scheduling, etc. The implementation can handle time-step dependent control and state spaces, and monadic representations of uncertainty (such as stochastic, non-deterministic, fuzzy, or combinations thereof). This level of genericity is achievable in a programming language with dependent types (we have used both Idris and Agda). Dependent types are also the means that allow us to obtain a formalization and computer-checked proof of the central component of our implementation: Bellman's principle of optimality and the associated backwards induction algorithm. The formalization clarifies certain aspects of backwards induction and, by making explicit notions such as viability and reachability, can serve as a starting point for a theory of controllability of monadic dynamical systems, commonly encountered in, e.g., climate impact research.
first_indexed 2024-04-25T01:35:15Z
format Article
id doaj.art-26cc1087053f4b338c3fe8bd22bfac07
institution Directory Open Access Journal
issn 1860-5974
language English
last_indexed 2024-04-25T01:35:15Z
publishDate 2017-03-01
publisher Logical Methods in Computer Science e.V.
record_format Article
series Logical Methods in Computer Science
spelling doaj.art-26cc1087053f4b338c3fe8bd22bfac072024-03-08T09:49:43ZengLogical Methods in Computer Science e.V.Logical Methods in Computer Science1860-59742017-03-01Volume 13, Issue 110.23638/LMCS-13(1:7)20173191Sequential decision problems, dependent types and generic solutionsNicola BottaPatrik JanssonCezar IonescuDavid R. ChristiansenEdwin BradyWe present a computer-checked generic implementation for solving finite-horizon sequential decision problems. This is a wide class of problems, including inter-temporal optimizations, knapsack, optimal bracketing, scheduling, etc. The implementation can handle time-step dependent control and state spaces, and monadic representations of uncertainty (such as stochastic, non-deterministic, fuzzy, or combinations thereof). This level of genericity is achievable in a programming language with dependent types (we have used both Idris and Agda). Dependent types are also the means that allow us to obtain a formalization and computer-checked proof of the central component of our implementation: Bellman's principle of optimality and the associated backwards induction algorithm. The formalization clarifies certain aspects of backwards induction and, by making explicit notions such as viability and reachability, can serve as a starting point for a theory of controllability of monadic dynamical systems, commonly encountered in, e.g., climate impact research.https://lmcs.episciences.org/3191/pdfcomputer science - logic in computer science
spellingShingle Nicola Botta
Patrik Jansson
Cezar Ionescu
David R. Christiansen
Edwin Brady
Sequential decision problems, dependent types and generic solutions
Logical Methods in Computer Science
computer science - logic in computer science
title Sequential decision problems, dependent types and generic solutions
title_full Sequential decision problems, dependent types and generic solutions
title_fullStr Sequential decision problems, dependent types and generic solutions
title_full_unstemmed Sequential decision problems, dependent types and generic solutions
title_short Sequential decision problems, dependent types and generic solutions
title_sort sequential decision problems dependent types and generic solutions
topic computer science - logic in computer science
url https://lmcs.episciences.org/3191/pdf
work_keys_str_mv AT nicolabotta sequentialdecisionproblemsdependenttypesandgenericsolutions
AT patrikjansson sequentialdecisionproblemsdependenttypesandgenericsolutions
AT cezarionescu sequentialdecisionproblemsdependenttypesandgenericsolutions
AT davidrchristiansen sequentialdecisionproblemsdependenttypesandgenericsolutions
AT edwinbrady sequentialdecisionproblemsdependenttypesandgenericsolutions