Bounds for Markov Decision Processes
We consider the problem of producing lower bounds on the optimal cost-to-go function of a Markov decision problem. We present two approaches to this problem: one based on the methodology of approximate linear programming (ALP) and another based on the so-called martingale duality approach. We show t...
Main Authors: | , , |
---|---|
Other Authors: | |
Format: | Article |
Published: |
John Wiley & Sons, Inc.
2019
|
Online Access: | http://hdl.handle.net/1721.1/120518 https://orcid.org/0000-0002-5856-9246 |