Online Reinforcement-Learning-Based Adaptive Terminal Sliding Mode Control for Disturbed Bicycle Robots on a Curved Pavement

The reaction wheel is able to help improve the balancing ability of a bicycle robot on curved pavement. However, preserving good control performances for such a robot that is driving on unstructured surfaces under matched and mismatched disturbances is challenging due to the underactuated characteri...

Full description

Bibliographic Details
Main Authors:	Xianjin Zhu, Yang Deng, Xudong Zheng, Qingyuan Zheng, Bin Liang, Yu Liu
Format:	Article
Language:	English
Published:	MDPI AG 2022-10-01
Series:	Electronics
Subjects:	reaction wheel bicycle robot reinforcement learning sliding model control robustness
Online Access:	https://www.mdpi.com/2079-9292/11/21/3495

_version_	1797468523780374528
author	Xianjin Zhu Yang Deng Xudong Zheng Qingyuan Zheng Bin Liang Yu Liu
author_facet	Xianjin Zhu Yang Deng Xudong Zheng Qingyuan Zheng Bin Liang Yu Liu
author_sort	Xianjin Zhu
collection	DOAJ
description	The reaction wheel is able to help improve the balancing ability of a bicycle robot on curved pavement. However, preserving good control performances for such a robot that is driving on unstructured surfaces under matched and mismatched disturbances is challenging due to the underactuated characteristic and the nonlinearity of the robot. In this paper, a controller combining proximal policy optimization algorithms with terminal sliding mode controls is developed for controlling the balance of the robot. Online reinforcement-learning-based adaptive terminal sliding mode control is proposed to attenuate the influence of the matched and mismatched disturbance by adjusting parameters of the controller online. Different from several existing adaptive sliding mode approaches that only tune parameters of the reaching controller, the proposed method also considers the online adjustment of the sliding surface to provide adequate robustness against mismatched disturbances. The co-simulation experimental results in MSC Adams illustrate that the proposed controller can achieve better control performances than four existing methods for a reaction wheel bicycle robot moving on curved pavement, which verifies the robustness and applicability of the method.
first_indexed	2024-03-09T19:08:37Z
format	Article
id	doaj.art-a88d191c064c4d2b922ecebf918729b0
institution	Directory Open Access Journal
issn	2079-9292
language	English
last_indexed	2024-03-09T19:08:37Z
publishDate	2022-10-01
publisher	MDPI AG
record_format	Article
series	Electronics
spelling	doaj.art-a88d191c064c4d2b922ecebf918729b02023-11-24T04:24:51ZengMDPI AGElectronics2079-92922022-10-011121349510.3390/electronics11213495Online Reinforcement-Learning-Based Adaptive Terminal Sliding Mode Control for Disturbed Bicycle Robots on a Curved PavementXianjin Zhu0Yang Deng1Xudong Zheng2Qingyuan Zheng3Bin Liang4Yu Liu5School of Mechatronics Engineering, Harbin Institute of Technology, Harbin 150006, ChinaDepartment of Automation, Tsinghua University, Beijing 100084, ChinaSchool of Modern Post (School of Automation), Beijing University of Posts and Communications, Beijing 100876, ChinaDepartment of Automation, Tsinghua University, Beijing 100084, ChinaDepartment of Automation, Tsinghua University, Beijing 100084, ChinaSchool of Mechatronics Engineering, Harbin Institute of Technology, Harbin 150006, ChinaThe reaction wheel is able to help improve the balancing ability of a bicycle robot on curved pavement. However, preserving good control performances for such a robot that is driving on unstructured surfaces under matched and mismatched disturbances is challenging due to the underactuated characteristic and the nonlinearity of the robot. In this paper, a controller combining proximal policy optimization algorithms with terminal sliding mode controls is developed for controlling the balance of the robot. Online reinforcement-learning-based adaptive terminal sliding mode control is proposed to attenuate the influence of the matched and mismatched disturbance by adjusting parameters of the controller online. Different from several existing adaptive sliding mode approaches that only tune parameters of the reaching controller, the proposed method also considers the online adjustment of the sliding surface to provide adequate robustness against mismatched disturbances. The co-simulation experimental results in MSC Adams illustrate that the proposed controller can achieve better control performances than four existing methods for a reaction wheel bicycle robot moving on curved pavement, which verifies the robustness and applicability of the method.https://www.mdpi.com/2079-9292/11/21/3495reaction wheel bicycle robotreinforcement learningsliding model controlrobustness
spellingShingle	Xianjin Zhu Yang Deng Xudong Zheng Qingyuan Zheng Bin Liang Yu Liu Online Reinforcement-Learning-Based Adaptive Terminal Sliding Mode Control for Disturbed Bicycle Robots on a Curved Pavement Electronics reaction wheel bicycle robot reinforcement learning sliding model control robustness
title	Online Reinforcement-Learning-Based Adaptive Terminal Sliding Mode Control for Disturbed Bicycle Robots on a Curved Pavement
title_full	Online Reinforcement-Learning-Based Adaptive Terminal Sliding Mode Control for Disturbed Bicycle Robots on a Curved Pavement
title_fullStr	Online Reinforcement-Learning-Based Adaptive Terminal Sliding Mode Control for Disturbed Bicycle Robots on a Curved Pavement
title_full_unstemmed	Online Reinforcement-Learning-Based Adaptive Terminal Sliding Mode Control for Disturbed Bicycle Robots on a Curved Pavement
title_short	Online Reinforcement-Learning-Based Adaptive Terminal Sliding Mode Control for Disturbed Bicycle Robots on a Curved Pavement
title_sort	online reinforcement learning based adaptive terminal sliding mode control for disturbed bicycle robots on a curved pavement
topic	reaction wheel bicycle robot reinforcement learning sliding model control robustness
url	https://www.mdpi.com/2079-9292/11/21/3495
work_keys_str_mv	AT xianjinzhu onlinereinforcementlearningbasedadaptiveterminalslidingmodecontrolfordisturbedbicyclerobotsonacurvedpavement AT yangdeng onlinereinforcementlearningbasedadaptiveterminalslidingmodecontrolfordisturbedbicyclerobotsonacurvedpavement AT xudongzheng onlinereinforcementlearningbasedadaptiveterminalslidingmodecontrolfordisturbedbicyclerobotsonacurvedpavement AT qingyuanzheng onlinereinforcementlearningbasedadaptiveterminalslidingmodecontrolfordisturbedbicyclerobotsonacurvedpavement AT binliang onlinereinforcementlearningbasedadaptiveterminalslidingmodecontrolfordisturbedbicyclerobotsonacurvedpavement AT yuliu onlinereinforcementlearningbasedadaptiveterminalslidingmodecontrolfordisturbedbicyclerobotsonacurvedpavement

Online Reinforcement-Learning-Based Adaptive Terminal Sliding Mode Control for Disturbed Bicycle Robots on a Curved Pavement

Similar Items