Use of PID control during Education in Reinforcement Learning on Two Wheel Balance Robot

This study’s primary objective was to try to shorten the training time of the Reinforcement Learning (RL) method, which is one of the Machine Learning methods, by using the proportional-integral-derivative (PID) control method during training. In this study, a balancing robot with two wheels that ca...

Full description

Bibliographic Details
Main Authors:	Emrah ATAÇ, Kazım YILDIZ, Eyüp Emre ÜLKÜ
Format:	Article
Language:	English
Published:	Gazi University 2021-12-01
Series:	Gazi Üniversitesi Fen Bilimleri Dergisi
Subjects:	reinforcement learning machine learning balancing robot pid control
Online Access:	https://dergipark.org.tr/tr/download/article-file/1835863

_version_	1797910564007051264
author	Emrah ATAÇ Kazım YILDIZ Eyüp Emre ÜLKÜ
author_facet	Emrah ATAÇ Kazım YILDIZ Eyüp Emre ÜLKÜ
author_sort	Emrah ATAÇ
collection	DOAJ
description	This study’s primary objective was to try to shorten the training time of the Reinforcement Learning (RL) method, which is one of the Machine Learning methods, by using the proportional-integral-derivative (PID) control method during training. In this study, a balancing robot with two wheels that can be controlled independently on the same axis is used. While the robot is in balance, the RL software block follows how the PID block maintains the balance, and the RL blog learned how to behave against disturbing factors without physical falling / rising. In the training of RL, it is necessary to create approximately 500 policy / reward / path equations between the current state and future state matrices. Obviously, the amount of equations will increase considerably when subjects such as old position and acceleration are added. Approximately 1000 trial / error is required for training purposes. This means many falling / rising cycles. With the method we present, the RL block has learned to keep the robot in balance without falling and requiring human intervention in 900 trials. This shortened the training time by about 60%.
first_indexed	2024-04-10T11:26:18Z
format	Article
id	doaj.art-2019263c10704c0182793dc4e474375d
institution	Directory Open Access Journal
issn	2147-9526
language	English
last_indexed	2024-04-10T11:26:18Z
publishDate	2021-12-01
publisher	Gazi University
record_format	Article
series	Gazi Üniversitesi Fen Bilimleri Dergisi
spelling	doaj.art-2019263c10704c0182793dc4e474375d2023-02-15T16:18:22ZengGazi UniversityGazi Üniversitesi Fen Bilimleri Dergisi2147-95262021-12-019459760710.29109/gujsc.955562Use of PID control during Education in Reinforcement Learning on Two Wheel Balance RobotEmrah ATAÇhttps://orcid.org/0000-0003-1342-7276Kazım YILDIZ https://orcid.org/0000-0001-6999-1410Eyüp Emre ÜLKÜhttps://orcid.org/0000-0002-1985-6461This study’s primary objective was to try to shorten the training time of the Reinforcement Learning (RL) method, which is one of the Machine Learning methods, by using the proportional-integral-derivative (PID) control method during training. In this study, a balancing robot with two wheels that can be controlled independently on the same axis is used. While the robot is in balance, the RL software block follows how the PID block maintains the balance, and the RL blog learned how to behave against disturbing factors without physical falling / rising. In the training of RL, it is necessary to create approximately 500 policy / reward / path equations between the current state and future state matrices. Obviously, the amount of equations will increase considerably when subjects such as old position and acceleration are added. Approximately 1000 trial / error is required for training purposes. This means many falling / rising cycles. With the method we present, the RL block has learned to keep the robot in balance without falling and requiring human intervention in 900 trials. This shortened the training time by about 60%.https://dergipark.org.tr/tr/download/article-file/1835863reinforcement learningmachine learningbalancing robotpid control
spellingShingle	Emrah ATAÇ Kazım YILDIZ Eyüp Emre ÜLKÜ Use of PID control during Education in Reinforcement Learning on Two Wheel Balance Robot Gazi Üniversitesi Fen Bilimleri Dergisi reinforcement learning machine learning balancing robot pid control
title	Use of PID control during Education in Reinforcement Learning on Two Wheel Balance Robot
title_full	Use of PID control during Education in Reinforcement Learning on Two Wheel Balance Robot
title_fullStr	Use of PID control during Education in Reinforcement Learning on Two Wheel Balance Robot
title_full_unstemmed	Use of PID control during Education in Reinforcement Learning on Two Wheel Balance Robot
title_short	Use of PID control during Education in Reinforcement Learning on Two Wheel Balance Robot
title_sort	use of pid control during education in reinforcement learning on two wheel balance robot
topic	reinforcement learning machine learning balancing robot pid control
url	https://dergipark.org.tr/tr/download/article-file/1835863
work_keys_str_mv	AT emrahatac useofpidcontrolduringeducationinreinforcementlearningontwowheelbalancerobot AT kazımyildiz useofpidcontrolduringeducationinreinforcementlearningontwowheelbalancerobot AT eyupemreulku useofpidcontrolduringeducationinreinforcementlearningontwowheelbalancerobot

Use of PID control during Education in Reinforcement Learning on Two Wheel Balance Robot

Similar Items