Dynamic execution of temporal plans with sensing actions and bounded risk

Thesis: Ph. D. in Aerospace Engineering, Massachusetts Institute of Technology, Department of Aeronautics and Astronautics, 2016.

Bibliographic Details
Main Author: Rodrigues Quemel e Assis Santana, Pedro Henrique de
Other Authors: Brian C. Williams.
Format: Thesis
Language:eng
Published: Massachusetts Institute of Technology 2017
Subjects:
Online Access:http://hdl.handle.net/1721.1/107049
_version_ 1826215656130871296
author Rodrigues Quemel e Assis Santana, Pedro Henrique de
author2 Brian C. Williams.
author_facet Brian C. Williams.
Rodrigues Quemel e Assis Santana, Pedro Henrique de
author_sort Rodrigues Quemel e Assis Santana, Pedro Henrique de
collection MIT
description Thesis: Ph. D. in Aerospace Engineering, Massachusetts Institute of Technology, Department of Aeronautics and Astronautics, 2016.
first_indexed 2024-09-23T16:37:22Z
format Thesis
id mit-1721.1/107049
institution Massachusetts Institute of Technology
language eng
last_indexed 2024-09-23T16:37:22Z
publishDate 2017
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/1070492019-04-12T20:23:30Z Dynamic execution of temporal plans with sensing actions and bounded risk Rodrigues Quemel e Assis Santana, Pedro Henrique de Brian C. Williams. Massachusetts Institute of Technology. Department of Aeronautics and Astronautics. Massachusetts Institute of Technology. Department of Aeronautics and Astronautics. Aeronautics and Astronautics. Thesis: Ph. D. in Aerospace Engineering, Massachusetts Institute of Technology, Department of Aeronautics and Astronautics, 2016. Cataloged from PDF version of thesis. Includes bibliographical references (pages 289-306). A special report on the cover of the June 2016 issue of the IEEE Spectrum magazine reads: "can we trust robots?" In a world that has been experiencing a seemingly irreversible process by which autonomous systems have been given increasingly more space in strategic areas such as transportation, manufacturing, energy supply, planetary exploration, and even medical surgeries, it is natural that we start asking ourselves if these systems could be held at the same or even higher levels of safety than we expect from humans. In an effort to make a contribution towards a world of autonomy that we can trust, this thesis argues that one necessary step in this direction is the endowment of autonomous agents with the ability to dynamically adapt to their environment while meeting strict safety guarantees. From a technical standpoint, we propose that autonomous agents in safety-critical applications be able to execute conditional plans (or policies) within risk bounds (also referred to as chance constraints). By being conditional, the plan allows the autonomous agent to adapt to its environment in real-time by conditioning the choice of activity to be executed on the agent's current level of knowledge, or belief, about the true state of world. This belief state is, in turn, a function of the history of potentially noisy sensor observations gathered by the agent from the environment. With respect to bounded risk, it refers to the fact that executing such conditional plans should guarantee to keep the agent "safe" - as defined by sets of state constraints - with high probability, while moving away from the conservatism of minimum risk approaches. In this thesis, we propose Chance-Constrained Partially Observable Markov Decision Processes (CC-POMDP's) as a formalism for conditional risk-bounded planning under uncertainty. Moreover, we present Risk-bounded AO* (RAO*), a heuristic forward search-based algorithm that searches for solutions to a CC-POMDP by leveraging admissible utility and risk heuristics to simultaneously guide the search and perform early pruning of overly-risky policy branches. In an effort to facilitate the specification of risk-bounded behavior by human modelers, we also present the Chance-constrained Reactive Model-based Programming Language (cRMPL), a novel variant of RMPL that incorporates chance constraints as part of its syntax. Finally, in support of the temporal planning applications with duration uncertainty that this thesis is concerned about, we present the Polynomial-time Algorithm for Risk-aware Scheduling (PARIS) and its extension to conditional scheduling of Probabilistic Temporal Plan Networks (PTPN's). The different tools and algorithms developed in the context of this thesis are combined to form the Conditional Planning for Autonomy with Risk (CLARK) system, a risk-aware conditional planning system that can generate chance-constrained, dynamic temporal plans for autonomous agents that must operate under uncertainty. With respect to our empirical validation, each component of CLARK is benchmarked against the relevant state of the art throughout the chapters, followed by several demonstrations of the whole CLARK system working in tandem with other building blocks of an architecture for autonomy. by Pedro Henrique de Rodrigues Quemel e Assis Santana. Ph. D. in Aerospace Engineering 2017-02-22T19:01:09Z 2017-02-22T19:01:09Z 2016 2016 Thesis http://hdl.handle.net/1721.1/107049 971021121 eng MIT theses are protected by copyright. They may be viewed, downloaded, or printed from this source but further reproduction or distribution in any format is prohibited without written permission. http://dspace.mit.edu/handle/1721.1/7582 306 pages application/pdf Massachusetts Institute of Technology
spellingShingle Aeronautics and Astronautics.
Rodrigues Quemel e Assis Santana, Pedro Henrique de
Dynamic execution of temporal plans with sensing actions and bounded risk
title Dynamic execution of temporal plans with sensing actions and bounded risk
title_full Dynamic execution of temporal plans with sensing actions and bounded risk
title_fullStr Dynamic execution of temporal plans with sensing actions and bounded risk
title_full_unstemmed Dynamic execution of temporal plans with sensing actions and bounded risk
title_short Dynamic execution of temporal plans with sensing actions and bounded risk
title_sort dynamic execution of temporal plans with sensing actions and bounded risk
topic Aeronautics and Astronautics.
url http://hdl.handle.net/1721.1/107049
work_keys_str_mv AT rodriguesquemeleassissantanapedrohenriquede dynamicexecutionoftemporalplanswithsensingactionsandboundedrisk