Gradient bounded dynamic programming with submodular and concave extensible value functions

We consider dynamic programming problems with finite, discrete-time horizons and prohibitively high-dimensional, discrete state-spaces for direct computation of the value function from the Bellman equation. For the case that the value function of the dynamic program is concave extensible and submodu...

全面介紹

書目詳細資料
Main Authors: Lebedev, D, Goulart, P, Margellos, K
格式: Conference item
語言:English
出版: Elsevier 2021
_version_ 1826296294845448192
author Lebedev, D
Goulart, P
Margellos, K
author_facet Lebedev, D
Goulart, P
Margellos, K
author_sort Lebedev, D
collection OXFORD
description We consider dynamic programming problems with finite, discrete-time horizons and prohibitively high-dimensional, discrete state-spaces for direct computation of the value function from the Bellman equation. For the case that the value function of the dynamic program is concave extensible and submodular in its state-space, we present a new algorithm that computes deterministic upper and stochastic lower bounds of the value function similar to dual dynamic programming. We then show that the proposed algorithm terminates after a fnite number of iterations. Finally, we demonstrate the efficacy of our approach on a high-dimensional numerical example from delivery slot pricing in attended home delivery.
first_indexed 2024-03-07T04:14:08Z
format Conference item
id oxford-uuid:c8d6b6bd-8c1c-4782-bd4f-b82ed1c39a6c
institution University of Oxford
language English
last_indexed 2024-03-07T04:14:08Z
publishDate 2021
publisher Elsevier
record_format dspace
spelling oxford-uuid:c8d6b6bd-8c1c-4782-bd4f-b82ed1c39a6c2022-03-27T06:54:55ZGradient bounded dynamic programming with submodular and concave extensible value functionsConference itemhttp://purl.org/coar/resource_type/c_5794uuid:c8d6b6bd-8c1c-4782-bd4f-b82ed1c39a6cEnglishSymplectic ElementsElsevier2021Lebedev, DGoulart, PMargellos, KWe consider dynamic programming problems with finite, discrete-time horizons and prohibitively high-dimensional, discrete state-spaces for direct computation of the value function from the Bellman equation. For the case that the value function of the dynamic program is concave extensible and submodular in its state-space, we present a new algorithm that computes deterministic upper and stochastic lower bounds of the value function similar to dual dynamic programming. We then show that the proposed algorithm terminates after a fnite number of iterations. Finally, we demonstrate the efficacy of our approach on a high-dimensional numerical example from delivery slot pricing in attended home delivery.
spellingShingle Lebedev, D
Goulart, P
Margellos, K
Gradient bounded dynamic programming with submodular and concave extensible value functions
title Gradient bounded dynamic programming with submodular and concave extensible value functions
title_full Gradient bounded dynamic programming with submodular and concave extensible value functions
title_fullStr Gradient bounded dynamic programming with submodular and concave extensible value functions
title_full_unstemmed Gradient bounded dynamic programming with submodular and concave extensible value functions
title_short Gradient bounded dynamic programming with submodular and concave extensible value functions
title_sort gradient bounded dynamic programming with submodular and concave extensible value functions
work_keys_str_mv AT lebedevd gradientboundeddynamicprogrammingwithsubmodularandconcaveextensiblevaluefunctions
AT goulartp gradientboundeddynamicprogrammingwithsubmodularandconcaveextensiblevaluefunctions
AT margellosk gradientboundeddynamicprogrammingwithsubmodularandconcaveextensiblevaluefunctions