Integrating the Generative Adversarial Network for Decision Making in Reinforcement Learning for Industrial Robot Agents
Many robotics systems carrying certain payloads are employed in manufacturing industries for pick and place tasks. The system experiences inefficiency if more or less weight is introduced. If a different payload is introduced (either due to a change in the load or a change in the parameters of the r...
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2022-12-01
|
Series: | Robotics |
Subjects: | |
Online Access: | https://www.mdpi.com/2218-6581/11/6/150 |
_version_ | 1827637055261245440 |
---|---|
author | Neelabh Paul Vaibhav Tasgaonkar Rahee Walambe Ketan Kotecha |
author_facet | Neelabh Paul Vaibhav Tasgaonkar Rahee Walambe Ketan Kotecha |
author_sort | Neelabh Paul |
collection | DOAJ |
description | Many robotics systems carrying certain payloads are employed in manufacturing industries for pick and place tasks. The system experiences inefficiency if more or less weight is introduced. If a different payload is introduced (either due to a change in the load or a change in the parameters of the robot system), the robot must be re-trained with the new weight/parameters and the new network must be trained. Parameters such as the robot weight, length of limbs, or new payload may vary for an agent depending on the circumstance. Parameter changes pose a problem to the agent in achieving the same goal it is expected to achieve with the original parameters. Hence, it becomes mandatory to re-train the agent with the new parameters in order for it to achieve its goal. This research proposes a novel framework for the adaption of varying conditions on a robot agent in a given simulated environment without any retraining. Utilizing the properties of Generative Adversarial Network (GAN), the agent is able to train only once with reinforcement learning and by tweaking the noise vector of the generator in the GAN network, the agent can adapt to new conditions accordingly and demonstrate similar performance as if it were trained with the new physical attributes using reinforcement learning. A simple CartPole environment is considered for the experimentation, and it is shown that with the propose approached the agent remains stable for more iterations. The approach can be extended to the real world in the future. |
first_indexed | 2024-03-09T15:54:01Z |
format | Article |
id | doaj.art-e8dcedde8dd6401583b0bed14ea17860 |
institution | Directory Open Access Journal |
issn | 2218-6581 |
language | English |
last_indexed | 2024-03-09T15:54:01Z |
publishDate | 2022-12-01 |
publisher | MDPI AG |
record_format | Article |
series | Robotics |
spelling | doaj.art-e8dcedde8dd6401583b0bed14ea178602023-11-24T17:50:42ZengMDPI AGRobotics2218-65812022-12-0111615010.3390/robotics11060150Integrating the Generative Adversarial Network for Decision Making in Reinforcement Learning for Industrial Robot AgentsNeelabh Paul0Vaibhav Tasgaonkar1Rahee Walambe2Ketan Kotecha3Symbiosis Institute of Technology, Symbiosis International (Deemed University), Pune 412115, IndiaSymbiosis Institute of Technology, Symbiosis International (Deemed University), Pune 412115, IndiaSymbiosis Institute of Technology, Symbiosis International (Deemed University), Pune 412115, IndiaSymbiosis Institute of Technology, Symbiosis International (Deemed University), Pune 412115, IndiaMany robotics systems carrying certain payloads are employed in manufacturing industries for pick and place tasks. The system experiences inefficiency if more or less weight is introduced. If a different payload is introduced (either due to a change in the load or a change in the parameters of the robot system), the robot must be re-trained with the new weight/parameters and the new network must be trained. Parameters such as the robot weight, length of limbs, or new payload may vary for an agent depending on the circumstance. Parameter changes pose a problem to the agent in achieving the same goal it is expected to achieve with the original parameters. Hence, it becomes mandatory to re-train the agent with the new parameters in order for it to achieve its goal. This research proposes a novel framework for the adaption of varying conditions on a robot agent in a given simulated environment without any retraining. Utilizing the properties of Generative Adversarial Network (GAN), the agent is able to train only once with reinforcement learning and by tweaking the noise vector of the generator in the GAN network, the agent can adapt to new conditions accordingly and demonstrate similar performance as if it were trained with the new physical attributes using reinforcement learning. A simple CartPole environment is considered for the experimentation, and it is shown that with the propose approached the agent remains stable for more iterations. The approach can be extended to the real world in the future.https://www.mdpi.com/2218-6581/11/6/150reinforcement learningGANsQ-tablesnoise vectorpayloadindustrial robots |
spellingShingle | Neelabh Paul Vaibhav Tasgaonkar Rahee Walambe Ketan Kotecha Integrating the Generative Adversarial Network for Decision Making in Reinforcement Learning for Industrial Robot Agents Robotics reinforcement learning GANs Q-tables noise vector payload industrial robots |
title | Integrating the Generative Adversarial Network for Decision Making in Reinforcement Learning for Industrial Robot Agents |
title_full | Integrating the Generative Adversarial Network for Decision Making in Reinforcement Learning for Industrial Robot Agents |
title_fullStr | Integrating the Generative Adversarial Network for Decision Making in Reinforcement Learning for Industrial Robot Agents |
title_full_unstemmed | Integrating the Generative Adversarial Network for Decision Making in Reinforcement Learning for Industrial Robot Agents |
title_short | Integrating the Generative Adversarial Network for Decision Making in Reinforcement Learning for Industrial Robot Agents |
title_sort | integrating the generative adversarial network for decision making in reinforcement learning for industrial robot agents |
topic | reinforcement learning GANs Q-tables noise vector payload industrial robots |
url | https://www.mdpi.com/2218-6581/11/6/150 |
work_keys_str_mv | AT neelabhpaul integratingthegenerativeadversarialnetworkfordecisionmakinginreinforcementlearningforindustrialrobotagents AT vaibhavtasgaonkar integratingthegenerativeadversarialnetworkfordecisionmakinginreinforcementlearningforindustrialrobotagents AT raheewalambe integratingthegenerativeadversarialnetworkfordecisionmakinginreinforcementlearningforindustrialrobotagents AT ketankotecha integratingthegenerativeadversarialnetworkfordecisionmakinginreinforcementlearningforindustrialrobotagents |