Design and Comparison of Reinforcement-Learning-Based Time-Varying PID Controllers with Gain-Scheduled Actions

Bibliographic Details
Main Authors:	Yi-Liang Yeh, Po-Kai Yang
Format:	Article
Language:	English
Published:	MDPI AG 2021-11-01
Series:	Machines
Subjects:	reinforcement learning; Q-learning; Sarsa; gain-scheduled action; time-varying PID controller; PZT stage
Online Access:	https://www.mdpi.com/2075-1702/9/12/319

Description
Summary:	This paper presents innovative reinforcement learning methods for automatically tuning the parameters of a proportional integral derivative controller. Conventionally, the high dimension of the Q-table is a primary drawback when implementing a reinforcement learning algorithm. To overcome the obstacle, the idea underlying the n-armed bandit problem is used in this paper. Moreover, gain-scheduled actions are presented to tune the algorithms to improve the overall system behavior; therefore, the proposed controllers fulfill the multiple performance requirements. An experiment was conducted for the piezo-actuated stage to illustrate the effectiveness of the proposed control designs relative to competing algorithms.
ISSN:	2075-1702

Similar Items