An approach for nonlinear control design via approximate dynamic programming
Thesis (Ph.D.)--Massachusetts Institute of Technology, Dept. of Mechanical Engineering, 1998.
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis |
Language: | eng |
Published: |
Massachusetts Institute of Technology
2005
|
Subjects: | |
Online Access: | http://hdl.handle.net/1721.1/9792 |
_version_ | 1826192471640506368 |
---|---|
author | Boussios, Constantinos I |
author2 | Munther A. Dahleh and John N. Tsitsiklis. |
author_facet | Munther A. Dahleh and John N. Tsitsiklis. Boussios, Constantinos I |
author_sort | Boussios, Constantinos I |
collection | MIT |
description | Thesis (Ph.D.)--Massachusetts Institute of Technology, Dept. of Mechanical Engineering, 1998. |
first_indexed | 2024-09-23T09:15:16Z |
format | Thesis |
id | mit-1721.1/9792 |
institution | Massachusetts Institute of Technology |
language | eng |
last_indexed | 2024-09-23T09:15:16Z |
publishDate | 2005 |
publisher | Massachusetts Institute of Technology |
record_format | dspace |
spelling | mit-1721.1/97922020-04-27T20:59:22Z An approach for nonlinear control design via approximate dynamic programming Boussios, Constantinos I Munther A. Dahleh and John N. Tsitsiklis. Massachusetts Institute of Technology. Department of Mechanical Engineering Mechanical Engineering Thesis (Ph.D.)--Massachusetts Institute of Technology, Dept. of Mechanical Engineering, 1998. Includes bibliographical references (p. 173-181). This thesis proposes and studies a methodology for designing controllers for nonlinear dynamic systems. We are interested in state feedback controllers (policies) that stabilize the state in a given region around an equilibrium point while minimizing a cost functional that captures the performance of the closed loop system. The optimal control problem can be solved in principle using dynamic programming algorithms such as policy iteration. Exact policy iteration is computationally infeasible for systems of even moderate dimension, which leads us to consider methods based on Approximate Policy Iteration. In such methods, we first select an approximation architecture (i.e., a parametrized class of functions) that is used to approximate the cost-to-go function under a given policy, on the basis of cost samples that are obtained through simulation. The resulting approximate cost function is used to derive another, hopefully better policy, and the procedure is repeated iteratively. There are several case studies exploring the use of this methodology, but they are of limited generality, and without much of a theoretical foundation. This thesis explores the soundness of Approximate Policy Iteration. We address the problem of improving the performance of a given stabilizing controller, as well as the problem of designing stabilizing controllers for unstable systems. For the first problem, we develop bounds on the approximation error that can be tolerated if we wish to guarantee that the resulting controllers are stabilizing and/or offer improved performance. We give bounds on the suboptimality of the resulting controllers, in terms of the assumed approximation errors. We also extend the methodology to the unstable case by introducing an appropriate modification of the optimal control problem. The computational burden of cost function approximation can be often reduced, thereby enhancing the practicality of the method, by exploiting special structure. We illustrate this for a special class of nonlinear systems with fast linear and slow nonlinear dynamics. We also present an approximation based on state space gridding, whose performance can be evaluated via a systematic test. Finally, analysis is supported by applying Approximate Policy Iteration to two specific problems, one involving a missile model and the other involving a beam-and-ball model. by Constantinos I. Boussios. Ph.D. 2005-08-19T20:12:40Z 2005-08-19T20:12:40Z 1998 1998 Thesis http://hdl.handle.net/1721.1/9792 42916260 eng M.I.T. theses are protected by copyright. They may be viewed from this source for any purpose, but reproduction or distribution in any format is prohibited without written permission. See provided URL for inquiries about permission. http://dspace.mit.edu/handle/1721.1/7582 181 p. 9778437 bytes 9778195 bytes application/pdf application/pdf application/pdf Massachusetts Institute of Technology |
spellingShingle | Mechanical Engineering Boussios, Constantinos I An approach for nonlinear control design via approximate dynamic programming |
title | An approach for nonlinear control design via approximate dynamic programming |
title_full | An approach for nonlinear control design via approximate dynamic programming |
title_fullStr | An approach for nonlinear control design via approximate dynamic programming |
title_full_unstemmed | An approach for nonlinear control design via approximate dynamic programming |
title_short | An approach for nonlinear control design via approximate dynamic programming |
title_sort | approach for nonlinear control design via approximate dynamic programming |
topic | Mechanical Engineering |
url | http://hdl.handle.net/1721.1/9792 |
work_keys_str_mv | AT boussiosconstantinosi anapproachfornonlinearcontroldesignviaapproximatedynamicprogramming AT boussiosconstantinosi approachfornonlinearcontroldesignviaapproximatedynamicprogramming |