Integrating the Generative Adversarial Network for Decision Making in Reinforcement Learning for Industrial Robot Agents

Many robotics systems carrying certain payloads are employed in manufacturing industries for pick and place tasks. The system experiences inefficiency if more or less weight is introduced. If a different payload is introduced (either due to a change in the load or a change in the parameters of the r...

Full description

Bibliographic Details
Main Authors: Neelabh Paul, Vaibhav Tasgaonkar, Rahee Walambe, Ketan Kotecha
Format: Article
Language:English
Published: MDPI AG 2022-12-01
Series:Robotics
Subjects:
Online Access:https://www.mdpi.com/2218-6581/11/6/150
_version_ 1827637055261245440
author Neelabh Paul
Vaibhav Tasgaonkar
Rahee Walambe
Ketan Kotecha
author_facet Neelabh Paul
Vaibhav Tasgaonkar
Rahee Walambe
Ketan Kotecha
author_sort Neelabh Paul
collection DOAJ
description Many robotics systems carrying certain payloads are employed in manufacturing industries for pick and place tasks. The system experiences inefficiency if more or less weight is introduced. If a different payload is introduced (either due to a change in the load or a change in the parameters of the robot system), the robot must be re-trained with the new weight/parameters and the new network must be trained. Parameters such as the robot weight, length of limbs, or new payload may vary for an agent depending on the circumstance. Parameter changes pose a problem to the agent in achieving the same goal it is expected to achieve with the original parameters. Hence, it becomes mandatory to re-train the agent with the new parameters in order for it to achieve its goal. This research proposes a novel framework for the adaption of varying conditions on a robot agent in a given simulated environment without any retraining. Utilizing the properties of Generative Adversarial Network (GAN), the agent is able to train only once with reinforcement learning and by tweaking the noise vector of the generator in the GAN network, the agent can adapt to new conditions accordingly and demonstrate similar performance as if it were trained with the new physical attributes using reinforcement learning. A simple CartPole environment is considered for the experimentation, and it is shown that with the propose approached the agent remains stable for more iterations. The approach can be extended to the real world in the future.
first_indexed 2024-03-09T15:54:01Z
format Article
id doaj.art-e8dcedde8dd6401583b0bed14ea17860
institution Directory Open Access Journal
issn 2218-6581
language English
last_indexed 2024-03-09T15:54:01Z
publishDate 2022-12-01
publisher MDPI AG
record_format Article
series Robotics
spelling doaj.art-e8dcedde8dd6401583b0bed14ea178602023-11-24T17:50:42ZengMDPI AGRobotics2218-65812022-12-0111615010.3390/robotics11060150Integrating the Generative Adversarial Network for Decision Making in Reinforcement Learning for Industrial Robot AgentsNeelabh Paul0Vaibhav Tasgaonkar1Rahee Walambe2Ketan Kotecha3Symbiosis Institute of Technology, Symbiosis International (Deemed University), Pune 412115, IndiaSymbiosis Institute of Technology, Symbiosis International (Deemed University), Pune 412115, IndiaSymbiosis Institute of Technology, Symbiosis International (Deemed University), Pune 412115, IndiaSymbiosis Institute of Technology, Symbiosis International (Deemed University), Pune 412115, IndiaMany robotics systems carrying certain payloads are employed in manufacturing industries for pick and place tasks. The system experiences inefficiency if more or less weight is introduced. If a different payload is introduced (either due to a change in the load or a change in the parameters of the robot system), the robot must be re-trained with the new weight/parameters and the new network must be trained. Parameters such as the robot weight, length of limbs, or new payload may vary for an agent depending on the circumstance. Parameter changes pose a problem to the agent in achieving the same goal it is expected to achieve with the original parameters. Hence, it becomes mandatory to re-train the agent with the new parameters in order for it to achieve its goal. This research proposes a novel framework for the adaption of varying conditions on a robot agent in a given simulated environment without any retraining. Utilizing the properties of Generative Adversarial Network (GAN), the agent is able to train only once with reinforcement learning and by tweaking the noise vector of the generator in the GAN network, the agent can adapt to new conditions accordingly and demonstrate similar performance as if it were trained with the new physical attributes using reinforcement learning. A simple CartPole environment is considered for the experimentation, and it is shown that with the propose approached the agent remains stable for more iterations. The approach can be extended to the real world in the future.https://www.mdpi.com/2218-6581/11/6/150reinforcement learningGANsQ-tablesnoise vectorpayloadindustrial robots
spellingShingle Neelabh Paul
Vaibhav Tasgaonkar
Rahee Walambe
Ketan Kotecha
Integrating the Generative Adversarial Network for Decision Making in Reinforcement Learning for Industrial Robot Agents
Robotics
reinforcement learning
GANs
Q-tables
noise vector
payload
industrial robots
title Integrating the Generative Adversarial Network for Decision Making in Reinforcement Learning for Industrial Robot Agents
title_full Integrating the Generative Adversarial Network for Decision Making in Reinforcement Learning for Industrial Robot Agents
title_fullStr Integrating the Generative Adversarial Network for Decision Making in Reinforcement Learning for Industrial Robot Agents
title_full_unstemmed Integrating the Generative Adversarial Network for Decision Making in Reinforcement Learning for Industrial Robot Agents
title_short Integrating the Generative Adversarial Network for Decision Making in Reinforcement Learning for Industrial Robot Agents
title_sort integrating the generative adversarial network for decision making in reinforcement learning for industrial robot agents
topic reinforcement learning
GANs
Q-tables
noise vector
payload
industrial robots
url https://www.mdpi.com/2218-6581/11/6/150
work_keys_str_mv AT neelabhpaul integratingthegenerativeadversarialnetworkfordecisionmakinginreinforcementlearningforindustrialrobotagents
AT vaibhavtasgaonkar integratingthegenerativeadversarialnetworkfordecisionmakinginreinforcementlearningforindustrialrobotagents
AT raheewalambe integratingthegenerativeadversarialnetworkfordecisionmakinginreinforcementlearningforindustrialrobotagents
AT ketankotecha integratingthegenerativeadversarialnetworkfordecisionmakinginreinforcementlearningforindustrialrobotagents