DetH*: Approximate Hierarchical Solution of Large Markov Decision Processes

This paper presents an algorithm for finding approximately optimal policies in very large Markov decision processes by constructing a hierarchical model and then solving it approximately. It exploits factored representations to achieve compactness and efficiency and to discover connectivity properti...

Full description

Bibliographic Details
Main Authors: Barry, Jennifer, Kaelbling, Leslie P., Lozano-Perez, Tomas
Other Authors: Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Format: Article
Language:en_US
Published: AAAI Press/International Joint Conferences on Artificial Intelligence 2014
Online Access:http://hdl.handle.net/1721.1/90898
https://orcid.org/0000-0002-8657-2450
https://orcid.org/0000-0001-6054-7145
_version_ 1826209886540660736
author Barry, Jennifer
Kaelbling, Leslie P.
Lozano-Perez, Tomas
author2 Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
author_facet Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory
Barry, Jennifer
Kaelbling, Leslie P.
Lozano-Perez, Tomas
author_sort Barry, Jennifer
collection MIT
description This paper presents an algorithm for finding approximately optimal policies in very large Markov decision processes by constructing a hierarchical model and then solving it approximately. It exploits factored representations to achieve compactness and efficiency and to discover connectivity properties of the domain. We provide a bound on the quality of the solutions and give asymptotic analysis of the runtimes; in addition we demonstrate performance on a collection of very large domains. Results show that the quality of resulting policies is very good and the total running times, for both creating and solving the hierarchy, are significantly less than for an optimal factored MDP solver.
first_indexed 2024-09-23T14:34:02Z
format Article
id mit-1721.1/90898
institution Massachusetts Institute of Technology
language en_US
last_indexed 2024-09-23T14:34:02Z
publishDate 2014
publisher AAAI Press/International Joint Conferences on Artificial Intelligence
record_format dspace
spelling mit-1721.1/908982022-10-01T21:43:50Z DetH*: Approximate Hierarchical Solution of Large Markov Decision Processes Barry, Jennifer Kaelbling, Leslie P. Lozano-Perez, Tomas Massachusetts Institute of Technology. Computer Science and Artificial Intelligence Laboratory Massachusetts Institute of Technology. Department of Electrical Engineering and Computer Science Barry, Jennifer Kaelbling, Leslie P. Lozano-Perez, Tomas This paper presents an algorithm for finding approximately optimal policies in very large Markov decision processes by constructing a hierarchical model and then solving it approximately. It exploits factored representations to achieve compactness and efficiency and to discover connectivity properties of the domain. We provide a bound on the quality of the solutions and give asymptotic analysis of the runtimes; in addition we demonstrate performance on a collection of very large domains. Results show that the quality of resulting policies is very good and the total running times, for both creating and solving the hierarchy, are significantly less than for an optimal factored MDP solver. United States. Office of Naval Research (ONR MURI grant N00014-09-1-1051) United States. Air Force Office of Scientific Research (AFOSR grant AOARD-104135) 2014-10-10T17:43:26Z 2014-10-10T17:43:26Z 2011-07 Article http://purl.org/eprint/type/ConferencePaper 978-1-57735-512-0 978-1-57735-516-8 http://hdl.handle.net/1721.1/90898 Barry, Jennifer, Leslie Pack Kaelbling, and Tomás Lozano-Pérez. "DetH*: Approximate Hierarchical Solution of Large Markov Decision Processes. In 22nd 2011 International Joint Conference on Artificial Intelligence, IJCAI-11, Barcelona, Catalonia, Spain, 16–22 July 2011. AAAI Press, (2011): p.1928-1935. https://orcid.org/0000-0002-8657-2450 https://orcid.org/0000-0001-6054-7145 en_US http://ijcai.org/papers11/Papers/IJCAI11-323.pdf Proceedings of the 22nd 2011 International Joint Conference on Artificial Intelligence Creative Commons Attribution-Noncommercial-Share Alike http://creativecommons.org/licenses/by-nc-sa/4.0/ application/pdf AAAI Press/International Joint Conferences on Artificial Intelligence MIT web domain
spellingShingle Barry, Jennifer
Kaelbling, Leslie P.
Lozano-Perez, Tomas
DetH*: Approximate Hierarchical Solution of Large Markov Decision Processes
title DetH*: Approximate Hierarchical Solution of Large Markov Decision Processes
title_full DetH*: Approximate Hierarchical Solution of Large Markov Decision Processes
title_fullStr DetH*: Approximate Hierarchical Solution of Large Markov Decision Processes
title_full_unstemmed DetH*: Approximate Hierarchical Solution of Large Markov Decision Processes
title_short DetH*: Approximate Hierarchical Solution of Large Markov Decision Processes
title_sort deth approximate hierarchical solution of large markov decision processes
url http://hdl.handle.net/1721.1/90898
https://orcid.org/0000-0002-8657-2450
https://orcid.org/0000-0001-6054-7145
work_keys_str_mv AT barryjennifer dethapproximatehierarchicalsolutionoflargemarkovdecisionprocesses
AT kaelblinglesliep dethapproximatehierarchicalsolutionoflargemarkovdecisionprocesses
AT lozanopereztomas dethapproximatehierarchicalsolutionoflargemarkovdecisionprocesses