Sampling-based algorithms for continuous-time POMDPs

This paper focuses on a continuous-time, continuous-space formulation of the stochastic optimal control problem with nonlinear dynamics and observation noise. We lay the mathematical foundations to construct, via incremental sampling, an approximating sequence of discrete-time finite-state partially...

Full description

Bibliographic Details
Main Authors:	Chaudhari, Pratik Anil, Karaman, Sertac, Hsu, David, Frazzoli, Emilio
Other Authors:	Massachusetts Institute of Technology. Department of Aeronautics and Astronautics
Format:	Article
Language:	en_US
Published:	Institute of Electrical and Electronics Engineers (IEEE) 2013
Online Access:	http://hdl.handle.net/1721.1/81825 https://orcid.org/0000-0002-0505-1400 https://orcid.org/0000-0002-2225-7275

_version_	1811092796959883264
author	Chaudhari, Pratik Anil Karaman, Sertac Hsu, David Frazzoli, Emilio
author2	Massachusetts Institute of Technology. Department of Aeronautics and Astronautics
author_facet	Massachusetts Institute of Technology. Department of Aeronautics and Astronautics Chaudhari, Pratik Anil Karaman, Sertac Hsu, David Frazzoli, Emilio
author_sort	Chaudhari, Pratik Anil
collection	MIT
description	This paper focuses on a continuous-time, continuous-space formulation of the stochastic optimal control problem with nonlinear dynamics and observation noise. We lay the mathematical foundations to construct, via incremental sampling, an approximating sequence of discrete-time finite-state partially observable Markov decision processes (POMDPs), such that the behavior of successive approximations converges to the behavior of the original continuous system in an appropriate sense. We also show that the optimal cost function and control policies for these POMDP approximations converge almost surely to their counterparts for the underlying continuous system in the limit. We demonstrate this approach on two popular continuous-time problems, viz., the Linear-Quadratic-Gaussian (LQG) control problem and the light-dark domain problem.
first_indexed	2024-09-23T15:26:35Z
format	Article
id	mit-1721.1/81825
institution	Massachusetts Institute of Technology
language	en_US
last_indexed	2024-09-23T15:26:35Z
publishDate	2013
publisher	Institute of Electrical and Electronics Engineers (IEEE)
record_format	dspace
spelling	mit-1721.1/818252022-09-29T14:41:38Z Sampling-based algorithms for continuous-time POMDPs Chaudhari, Pratik Anil Karaman, Sertac Hsu, David Frazzoli, Emilio Massachusetts Institute of Technology. Department of Aeronautics and Astronautics Chaudhari, Pratik Anil Karaman, Sertac Frazzoli, Emilio This paper focuses on a continuous-time, continuous-space formulation of the stochastic optimal control problem with nonlinear dynamics and observation noise. We lay the mathematical foundations to construct, via incremental sampling, an approximating sequence of discrete-time finite-state partially observable Markov decision processes (POMDPs), such that the behavior of successive approximations converges to the behavior of the original continuous system in an appropriate sense. We also show that the optimal cost function and control policies for these POMDP approximations converge almost surely to their counterparts for the underlying continuous system in the limit. We demonstrate this approach on two popular continuous-time problems, viz., the Linear-Quadratic-Gaussian (LQG) control problem and the light-dark domain problem. United States. Army Research Office. Multidisciplinary University Research Initiative (Grant W911NF-11-1-0046) 2013-10-29T13:48:44Z 2013-10-29T13:48:44Z 2013-06 Article http://purl.org/eprint/type/ConferencePaper 978-1-4799-0177-7 http://hdl.handle.net/1721.1/81825 Chaudhari, Pratik Anil et al. "Sampling-based algorithms for continuous-time POMDPs." IEEE American Control Conference (ACC), 2013. https://orcid.org/0000-0002-0505-1400 https://orcid.org/0000-0002-2225-7275 en_US http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=6580549 Proceedings of the 2013 American Control Conference (ACC) Creative Commons Attribution-Noncommercial-Share Alike 3.0 http://creativecommons.org/licenses/by-nc-sa/3.0/ application/pdf Institute of Electrical and Electronics Engineers (IEEE) MIT web domain
spellingShingle	Chaudhari, Pratik Anil Karaman, Sertac Hsu, David Frazzoli, Emilio Sampling-based algorithms for continuous-time POMDPs
title	Sampling-based algorithms for continuous-time POMDPs
title_full	Sampling-based algorithms for continuous-time POMDPs
title_fullStr	Sampling-based algorithms for continuous-time POMDPs
title_full_unstemmed	Sampling-based algorithms for continuous-time POMDPs
title_short	Sampling-based algorithms for continuous-time POMDPs
title_sort	sampling based algorithms for continuous time pomdps
url	http://hdl.handle.net/1721.1/81825 https://orcid.org/0000-0002-0505-1400 https://orcid.org/0000-0002-2225-7275
work_keys_str_mv	AT chaudharipratikanil samplingbasedalgorithmsforcontinuoustimepomdps AT karamansertac samplingbasedalgorithmsforcontinuoustimepomdps AT hsudavid samplingbasedalgorithmsforcontinuoustimepomdps AT frazzoliemilio samplingbasedalgorithmsforcontinuoustimepomdps

Sampling-based algorithms for continuous-time POMDPs

Similar Items