Bounds for Markov Decision Processes

Bounds for Markov Decision Processes

We consider the problem of producing lower bounds on the optimal cost-to-go function of a Markov decision problem. We present two approaches to this problem: one based on the methodology of approximate linear programming (ALP) and another based on the so-called martingale duality approach. We show t...

Full description

Bibliographic Details
Main Authors:	Desai, Vijay V., Farias, Vivek F., Moallemi, Ciamac C.
Other Authors:	Sloan School of Management
Format:	Article
Published:	John Wiley & Sons, Inc. 2019
Online Access:	http://hdl.handle.net/1721.1/120518 https://orcid.org/0000-0002-5856-9246

Similar Items

Approximate Dynamic Programming via a Smoothed Linear Program
by: Desai, Vijay V., et al.
Published: (2012)

Non-Parametric Approximate Dynamic Programming via the Kernel Method
by: Bhat, Nikhil, et al.
Published: (2014)

Near-Optimal A-B Testing
by: Bhat, Nikhil, et al.
Published: (2021)

Universal Reinforcement Learning
by: Farias, Vivek F., et al.
Published: (2010)

Learning bounded optimal behavior using Markov decision processes
by: Vuong, Hon Fai, 1975-
Published: (2009)

Loss bounds for uncertain transition probabilities in Markov decision processes
by: Jaillet, Patrick, et al.
Published: (2014)

On the flow-level dynamics of a packet-switched network
by: Moallemi, Ciamac C., et al.
Published: (2012)

The complexity of Markov decision processes
Published: (2003)

Bounds and Low-Rank Approximation for Controlled Markov Processes
by: Holtorf, Flemming
Published: (2024)

Hazard Avoidance Alerting With Markov Decision Processes
by: Winder, Lee F., et al.
Published: (2007)

Hazard avoidance alerting with Markov decision processes
by: Winder, Lee F. (Lee Francis), 1973-
Published: (2005)

Bounding the difference between the values of robust and non-robust Markov decision problems
by: Neufeld, Ariel, et al.
Published: (2025)

Pavement Maintenance Optimization Model using Markov Decision Processes
by: Mandiartha, Putu, et al.
Published: (2017)

Approximate solution methods for partially observable Markov and semi-Markov decision processes
by: Yu, Huizhen, Ph. D. Massachusetts Institute of Technology
Published: (2007)

Online Reinforcement Learning in Factored Markov Decision Processes and Unknown Markov Games
by: Tian, Yi
Published: (2022)

Mean-Variance Optimization in Markov Decision Processes
by: Mannor, Shie, et al.
Published: (2013)

Quantum partially observable Markov decision processes
by: Barry, Jennifer, et al.
Published: (2014)

Hierarchical Solution of Large Markov Decision Processes
by: Barry, Jennifer, et al.
Published: (2011)

Tensor decomposition and parallelization of Markov Decision Processes
by: Smart, David P. (David Paul)
Published: (2016)

Simulation-based optimization of Markov decision processes
by: Marbach, Peter, 1966-
Published: (2005)

Monotone optimal control for a class of Markov decision processes
by: Li, Michael Z. F., et al.
Published: (2013)

Robust Adaptive Markov Decision Processes in Multi-vehicle Applications
by: How, Jonathan P., et al.
Published: (2010)

Robust Adaptive Markov Decision Processes in Multi-vehicle Applications
by: Bertuccelli, Luca F., et al.
Published: (2010)

Upper bounds for symmetric Markov transition functions
Published: (2003)

Generalization bounds of ERM-based learning processes for continuous-time Markov chains
by: Zhang, Chao, et al.
Published: (2013)

A Markov Decision Process model for traffic prioritisation provisioning
by: Gani, Abdullah, et al.
Published: (2004)

An extended Kalman filter extension of the augmented Markov decision process
by: Lommel, Peter Hans
Published: (2006)

Algorithmic aspects of mean–variance optimization in Markov decision processes
by: Tsitsiklis, John N, et al.
Published: (2017)

Regret Based Robust Solutions for Uncertain Markov Decision Processes
by: Ahmed, Asrar, et al.
Published: (2015)

Collision Avoidance for Unmanned Aircraft using Markov Decision Processes
by: Temizer, Selim, et al.
Published: (2011)

Methods and Experiments With Bounded Tree-width Markov Networks
by: Liang, Percy, et al.
Published: (2005)

Learning Adversarial Markov Decision Processes with Bandit Feedback and Unknown Transition
by: Jin, Chi, et al.
Published: (2022)

Polynomial time algorithms for finite horizon, stationary Markov decision processes
Published: (2003)

Robust, risk-sensitive, and data-driven control of Markov Decision Processes
by: Le Tallec, Yann
Published: (2007)

DetH*: Approximate Hierarchical Solution of Large Markov Decision Processes
by: Barry, Jennifer, et al.
Published: (2014)

Risk assessment in transactions under threat as Partially Observable Markov Decision Process
by: Vassilev, Vassil, et al.
Published: (2021)

Sampling Based Approaches for Minimizing Regret in Uncertain Markov Decision Processes (MDPs)
by: Ahmed, Asrar, et al.
Published: (2021)

Reinforcement Learning for Non-Stationary Markov Decision Processes: The Blessing of (More) Optimism
by: Cheung, Wang Chi, et al.
Published: (2021)

Equivalence and Reduction of Hidden Markov Models
by: Balasubramanian, Vijay
Published: (2004)

Incremental Q-learning Partially Observable Markov Decision Process intraday trading system
by: Goh, Choon Tat.
Published: (2010)