Stability-certified reinforcement learning control via spectral normalization

Bibliographic Details
Main Authors:	Ryoichi Takase, Nobuyuki Yoshikawa, Toshisada Mariyama, Takeshi Tsuchiya
Format:	Article
Language:	English
Published:	Elsevier 2022-12-01
Series:	Machine Learning with Applications
Subjects:	Reinforcement learning; Stability; Spectral normalization; Linear matrix inequality
Online Access:	http://www.sciencedirect.com/science/article/pii/S2666827022000846

Description
Summary:	This study describes two types of methods, based on spectral normalization (SN) and taken from different perspectives, for ensuring the stability of a feedback system controlled by a neural network (NN). In the first type, the L2 gain of the feedback system is bounded below 1 to satisfy a stability condition derived from the small-gain theorem. Because this stability condition is strict, including it explicitly may leave the NN controller with insufficient performance. To overcome this difficulty, a second type of method is proposed that ensures local stability with a larger region of attraction. In this second type, stability is ensured by solving linear matrix inequalities after the NN controller has been trained. SN improves the feasibility of this a posteriori stability test by constructing tighter local sectors. Numerical experiments show that the second type of method provides sufficient performance compared with the first one and ensures sufficient stability compared with existing reinforcement learning algorithms. Project page: https://sites.google.com/g.ecc.u-tokyo.ac.jp/stability-certified-rl-via-sn.
ISSN:	2666-8270
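
As a rough illustration of the first type of method in the summary, the following sketch uses NumPy to rescale the weight matrices of a small NN controller so that the product of their spectral norms, multiplied by a plant gain, stays below 1. It is not the authors' implementation. The names `spectral_normalize`, `gain_upper_bound`, `plant_gain` and `margin`, the layer sizes, and the bias-free network with 1-Lipschitz activations are all assumptions made for this example.

```python
import numpy as np

def spectral_normalize(W, target=1.0):
    """Rescale W so that its largest singular value is at most `target`."""
    sigma = np.linalg.norm(W, 2)  # ord=2 gives the largest singular value
    return W if sigma <= target else W * (target / sigma)

def gain_upper_bound(weights):
    """Product of the layer spectral norms.

    For a bias-free network with 1-Lipschitz activations (tanh, ReLU),
    this product is an upper bound on the L2 gain of the network.
    """
    return float(np.prod([np.linalg.norm(W, 2) for W in weights]))

rng = np.random.default_rng(0)
# Hypothetical controller: 4 inputs -> 16 -> 16 -> 2 outputs.
raw = [rng.normal(size=(16, 4)),
       rng.normal(size=(16, 16)),
       rng.normal(size=(2, 16))]

plant_gain = 2.0   # assumed L2 gain of the plant (hypothetical value)
margin = 0.9       # assumed target for the loop-gain bound, below 1

# Share the allowed controller gain equally across the layers.
per_layer = (margin / plant_gain) ** (1.0 / len(raw))
normalized = [spectral_normalize(W, per_layer) for W in raw]

controller_bound = gain_upper_bound(normalized)
loop_gain_bound = plant_gain * controller_bound
small_gain_ok = loop_gain_bound < 1.0  # small-gain condition on the bound

print(f"controller gain bound: {controller_bound:.3f}")
print(f"loop gain bound:       {loop_gain_bound:.3f}")
print(f"small-gain condition satisfied: {small_gain_ok}")
```

The bound is conservative, since the true gain of the network can be much smaller than the product of the layer norms. That conservatism is consistent with the summary's remark that the strict condition may limit controller performance.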

Similar Items