Sampling-based algorithms for continuous-time POMDPs

This paper focuses on a continuous-time, continuous-space formulation of the stochastic optimal control problem with nonlinear dynamics and observation noise. We lay the mathematical foundations to construct, via incremental sampling, an approximating sequence of discrete-time finite-state partially...

Full description

Bibliographic Details
Main Authors: Chaudhari, Pratik Anil, Karaman, Sertac, Hsu, David, Frazzoli, Emilio
Other Authors: Massachusetts Institute of Technology. Department of Aeronautics and Astronautics
Format: Article
Language:en_US
Published: Institute of Electrical and Electronics Engineers (IEEE) 2013
Online Access:http://hdl.handle.net/1721.1/81825
https://orcid.org/0000-0002-0505-1400
https://orcid.org/0000-0002-2225-7275
_version_ 1811092796959883264
author Chaudhari, Pratik Anil
Karaman, Sertac
Hsu, David
Frazzoli, Emilio
author2 Massachusetts Institute of Technology. Department of Aeronautics and Astronautics
author_facet Massachusetts Institute of Technology. Department of Aeronautics and Astronautics
Chaudhari, Pratik Anil
Karaman, Sertac
Hsu, David
Frazzoli, Emilio
author_sort Chaudhari, Pratik Anil
collection MIT
description This paper focuses on a continuous-time, continuous-space formulation of the stochastic optimal control problem with nonlinear dynamics and observation noise. We lay the mathematical foundations to construct, via incremental sampling, an approximating sequence of discrete-time finite-state partially observable Markov decision processes (POMDPs), such that the behavior of successive approximations converges to the behavior of the original continuous system in an appropriate sense. We also show that the optimal cost function and control policies for these POMDP approximations converge almost surely to their counterparts for the underlying continuous system in the limit. We demonstrate this approach on two popular continuous-time problems, viz., the Linear-Quadratic-Gaussian (LQG) control problem and the light-dark domain problem.
first_indexed 2024-09-23T15:26:35Z
format Article
id mit-1721.1/81825
institution Massachusetts Institute of Technology
language en_US
last_indexed 2024-09-23T15:26:35Z
publishDate 2013
publisher Institute of Electrical and Electronics Engineers (IEEE)
record_format dspace
spelling mit-1721.1/818252022-09-29T14:41:38Z Sampling-based algorithms for continuous-time POMDPs Chaudhari, Pratik Anil Karaman, Sertac Hsu, David Frazzoli, Emilio Massachusetts Institute of Technology. Department of Aeronautics and Astronautics Chaudhari, Pratik Anil Karaman, Sertac Frazzoli, Emilio This paper focuses on a continuous-time, continuous-space formulation of the stochastic optimal control problem with nonlinear dynamics and observation noise. We lay the mathematical foundations to construct, via incremental sampling, an approximating sequence of discrete-time finite-state partially observable Markov decision processes (POMDPs), such that the behavior of successive approximations converges to the behavior of the original continuous system in an appropriate sense. We also show that the optimal cost function and control policies for these POMDP approximations converge almost surely to their counterparts for the underlying continuous system in the limit. We demonstrate this approach on two popular continuous-time problems, viz., the Linear-Quadratic-Gaussian (LQG) control problem and the light-dark domain problem. United States. Army Research Office. Multidisciplinary University Research Initiative (Grant W911NF-11-1-0046) 2013-10-29T13:48:44Z 2013-10-29T13:48:44Z 2013-06 Article http://purl.org/eprint/type/ConferencePaper 978-1-4799-0177-7 http://hdl.handle.net/1721.1/81825 Chaudhari, Pratik Anil et al. "Sampling-based algorithms for continuous-time POMDPs." IEEE American Control Conference (ACC), 2013. https://orcid.org/0000-0002-0505-1400 https://orcid.org/0000-0002-2225-7275 en_US http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=6580549 Proceedings of the 2013 American Control Conference (ACC) Creative Commons Attribution-Noncommercial-Share Alike 3.0 http://creativecommons.org/licenses/by-nc-sa/3.0/ application/pdf Institute of Electrical and Electronics Engineers (IEEE) MIT web domain
spellingShingle Chaudhari, Pratik Anil
Karaman, Sertac
Hsu, David
Frazzoli, Emilio
Sampling-based algorithms for continuous-time POMDPs
title Sampling-based algorithms for continuous-time POMDPs
title_full Sampling-based algorithms for continuous-time POMDPs
title_fullStr Sampling-based algorithms for continuous-time POMDPs
title_full_unstemmed Sampling-based algorithms for continuous-time POMDPs
title_short Sampling-based algorithms for continuous-time POMDPs
title_sort sampling based algorithms for continuous time pomdps
url http://hdl.handle.net/1721.1/81825
https://orcid.org/0000-0002-0505-1400
https://orcid.org/0000-0002-2225-7275
work_keys_str_mv AT chaudharipratikanil samplingbasedalgorithmsforcontinuoustimepomdps
AT karamansertac samplingbasedalgorithmsforcontinuoustimepomdps
AT hsudavid samplingbasedalgorithmsforcontinuoustimepomdps
AT frazzoliemilio samplingbasedalgorithmsforcontinuoustimepomdps