Enhancing Efficiency in Hierarchical Reinforcement Learning through Topological-Sorted Potential Calculation

Hierarchical reinforcement learning (HRL) offers a hierarchical structure for organizing tasks, enabling agents to learn and make decisions autonomously in complex environments. However, traditional HRL approaches face limitations in effectively handling complex tasks. Reward machines, which specify...

Full description

Bibliographic Details
Main Authors: Ziyun Zhou, Jingwei Shang, Yimang Li
Format: Article
Language:English
Published: MDPI AG 2023-09-01
Series:Electronics
Subjects:
Online Access:https://www.mdpi.com/2079-9292/12/17/3700
_version_ 1797582648393072640
author Ziyun Zhou
Jingwei Shang
Yimang Li
author_facet Ziyun Zhou
Jingwei Shang
Yimang Li
author_sort Ziyun Zhou
collection DOAJ
description Hierarchical reinforcement learning (HRL) offers a hierarchical structure for organizing tasks, enabling agents to learn and make decisions autonomously in complex environments. However, traditional HRL approaches face limitations in effectively handling complex tasks. Reward machines, which specify high-level goals and associated rewards for sub-goals, have been introduced to address these limitations by facilitating the agent’s understanding and reasoning with respect to the task hierarchy. In this paper, we propose a novel approach to enhance HRL performance through topologically sorted potential calculation for reward machines. By leveraging the topological structure of the task hierarchy, our method efficiently determines potentials for different sub-goals. This topological sorting enables the agent to prioritize actions leading to the accomplishment of higher-level goals, enhancing the learning process. To assess the efficacy of our approach, we conducted experiments in the grid-world environment with OpenAI-Gym. The results showcase the superiority of our proposed method over traditional HRL techniques and reward machine-based reinforcement learning approaches in terms of learning efficiency and overall task performance.
first_indexed 2024-03-10T23:24:25Z
format Article
id doaj.art-21d1a29946934a0a9e477929238e963f
institution Directory Open Access Journal
issn 2079-9292
language English
last_indexed 2024-03-10T23:24:25Z
publishDate 2023-09-01
publisher MDPI AG
record_format Article
series Electronics
spelling doaj.art-21d1a29946934a0a9e477929238e963f2023-11-19T08:02:49ZengMDPI AGElectronics2079-92922023-09-011217370010.3390/electronics12173700Enhancing Efficiency in Hierarchical Reinforcement Learning through Topological-Sorted Potential CalculationZiyun Zhou0Jingwei Shang1Yimang Li2School of Mechanical Engineering and Rail Transit, Changzhou University, Changzhou 213164, ChinaChina Electronic Product Reliability and Environmental Testing Research Institute, Guangzhou 510610, ChinaSchool of Mechanical Engineering and Rail Transit, Changzhou University, Changzhou 213164, ChinaHierarchical reinforcement learning (HRL) offers a hierarchical structure for organizing tasks, enabling agents to learn and make decisions autonomously in complex environments. However, traditional HRL approaches face limitations in effectively handling complex tasks. Reward machines, which specify high-level goals and associated rewards for sub-goals, have been introduced to address these limitations by facilitating the agent’s understanding and reasoning with respect to the task hierarchy. In this paper, we propose a novel approach to enhance HRL performance through topologically sorted potential calculation for reward machines. By leveraging the topological structure of the task hierarchy, our method efficiently determines potentials for different sub-goals. This topological sorting enables the agent to prioritize actions leading to the accomplishment of higher-level goals, enhancing the learning process. To assess the efficacy of our approach, we conducted experiments in the grid-world environment with OpenAI-Gym. The results showcase the superiority of our proposed method over traditional HRL techniques and reward machine-based reinforcement learning approaches in terms of learning efficiency and overall task performance.https://www.mdpi.com/2079-9292/12/17/3700reinforcement learningreward learningtopological sorting
spellingShingle Ziyun Zhou
Jingwei Shang
Yimang Li
Enhancing Efficiency in Hierarchical Reinforcement Learning through Topological-Sorted Potential Calculation
Electronics
reinforcement learning
reward learning
topological sorting
title Enhancing Efficiency in Hierarchical Reinforcement Learning through Topological-Sorted Potential Calculation
title_full Enhancing Efficiency in Hierarchical Reinforcement Learning through Topological-Sorted Potential Calculation
title_fullStr Enhancing Efficiency in Hierarchical Reinforcement Learning through Topological-Sorted Potential Calculation
title_full_unstemmed Enhancing Efficiency in Hierarchical Reinforcement Learning through Topological-Sorted Potential Calculation
title_short Enhancing Efficiency in Hierarchical Reinforcement Learning through Topological-Sorted Potential Calculation
title_sort enhancing efficiency in hierarchical reinforcement learning through topological sorted potential calculation
topic reinforcement learning
reward learning
topological sorting
url https://www.mdpi.com/2079-9292/12/17/3700
work_keys_str_mv AT ziyunzhou enhancingefficiencyinhierarchicalreinforcementlearningthroughtopologicalsortedpotentialcalculation
AT jingweishang enhancingefficiencyinhierarchicalreinforcementlearningthroughtopologicalsortedpotentialcalculation
AT yimangli enhancingefficiencyinhierarchicalreinforcementlearningthroughtopologicalsortedpotentialcalculation