An approach for nonlinear control design via approximate dynamic programming

Thesis (Ph.D.)--Massachusetts Institute of Technology, Dept. of Mechanical Engineering, 1998.

Bibliographic Details
Main Author: Boussios, Constantinos I
Other Authors: Munther A. Dahleh and John N. Tsitsiklis.
Format: Thesis
Language:eng
Published: Massachusetts Institute of Technology 2005
Subjects:
Online Access:http://hdl.handle.net/1721.1/9792
_version_ 1826192471640506368
author Boussios, Constantinos I
author2 Munther A. Dahleh and John N. Tsitsiklis.
author_facet Munther A. Dahleh and John N. Tsitsiklis.
Boussios, Constantinos I
author_sort Boussios, Constantinos I
collection MIT
description Thesis (Ph.D.)--Massachusetts Institute of Technology, Dept. of Mechanical Engineering, 1998.
first_indexed 2024-09-23T09:15:16Z
format Thesis
id mit-1721.1/9792
institution Massachusetts Institute of Technology
language eng
last_indexed 2024-09-23T09:15:16Z
publishDate 2005
publisher Massachusetts Institute of Technology
record_format dspace
spelling mit-1721.1/97922020-04-27T20:59:22Z An approach for nonlinear control design via approximate dynamic programming Boussios, Constantinos I Munther A. Dahleh and John N. Tsitsiklis. Massachusetts Institute of Technology. Department of Mechanical Engineering Mechanical Engineering Thesis (Ph.D.)--Massachusetts Institute of Technology, Dept. of Mechanical Engineering, 1998. Includes bibliographical references (p. 173-181). This thesis proposes and studies a methodology for designing controllers for nonlinear dynamic systems. We are interested in state feedback controllers (policies) that stabilize the state in a given region around an equilibrium point while minimizing a cost functional that captures the performance of the closed loop system. The optimal control problem can be solved in principle using dynamic programming algorithms such as policy iteration. Exact policy iteration is computationally infeasible for systems of even moderate dimension, which leads us to consider methods based on Approximate Policy Iteration. In such methods, we first select an approximation architecture (i.e., a parametrized class of functions) that is used to approximate the cost-to-go function under a given policy, on the basis of cost samples that are obtained through simulation. The resulting approximate cost function is used to derive another, hopefully better policy, and the procedure is repeated iteratively. There are several case studies exploring the use of this methodology, but they are of limited generality, and without much of a theoretical foundation. This thesis explores the soundness of Approximate Policy Iteration. We address the prob­lem of improving the performance of a given stabilizing controller, as well as the problem of designing stabilizing controllers for unstable systems. For the first problem, we develop bounds on the approximation error that can be tolerated if we wish to guarantee that the re­sulting controllers are stabilizing and/or offer improved performance. We give bounds on the suboptimality of the resulting controllers, in terms of the assumed approximation errors. We also extend the methodology to the unstable case by introducing an appropriate modification of the optimal control problem. The computational burden of cost function approximation can be often reduced, thereby enhancing the practicality of the method, by exploiting special structure. We illustrate this for a special class of nonlinear systems with fast linear and slow nonlinear dynamics. We also present an approximation based on state space gridding, whose performance can be evaluated via a systematic test. Finally, analysis is supported by applying Approximate Policy Iteration to two specific problems, one involving a missile model and the other involving a beam-and-ball model. by Constantinos I. Boussios. Ph.D. 2005-08-19T20:12:40Z 2005-08-19T20:12:40Z 1998 1998 Thesis http://hdl.handle.net/1721.1/9792 42916260 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 181 p. 9778437 bytes 9778195 bytes application/pdf application/pdf application/pdf Massachusetts Institute of Technology
spellingShingle Mechanical Engineering
Boussios, Constantinos I
An approach for nonlinear control design via approximate dynamic programming
title An approach for nonlinear control design via approximate dynamic programming
title_full An approach for nonlinear control design via approximate dynamic programming
title_fullStr An approach for nonlinear control design via approximate dynamic programming
title_full_unstemmed An approach for nonlinear control design via approximate dynamic programming
title_short An approach for nonlinear control design via approximate dynamic programming
title_sort approach for nonlinear control design via approximate dynamic programming
topic Mechanical Engineering
url http://hdl.handle.net/1721.1/9792
work_keys_str_mv AT boussiosconstantinosi anapproachfornonlinearcontroldesignviaapproximatedynamicprogramming
AT boussiosconstantinosi approachfornonlinearcontroldesignviaapproximatedynamicprogramming