Reinforcement Learning-Based Multi-Objective Optimization for Generation Scheduling in Power Systems

Multi-objective power scheduling (MOPS) aims to address the simultaneous minimization of economic costs and different types of environmental emissions during electricity generation. Recognizing it as an NP-hard problem, this article proposes a novel multi-agent deep reinforcement learning (MADRL)-ba...

Bibliographic Details
Main Authors: Awol Seid Ebrie, Young Jin Kim
Format: Article
Language: English
Published: MDPI AG 2024-03-01
Series: Systems
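Subjects: deep reinforcement learning; economic dispatch; environmental dispatch; multi-objective optimization; unit commitment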
Online Access: https://www.mdpi.com/2079-8954/12/3/106
author Awol Seid Ebrie
Young Jin Kim
author_facet Awol Seid Ebrie
Young Jin Kim
author_sort Awol Seid Ebrie
collection DOAJ
description Multi-objective power scheduling (MOPS) seeks to minimize economic costs and several types of environmental emissions simultaneously during electricity generation. Recognizing it as an NP-hard problem, this article proposes a novel multi-agent deep reinforcement learning (MADRL)-based optimization algorithm. Within a custom multi-agent simulation environment, in which power-generating units are represented as collaborative reinforcement learning (RL) agents, the MOPS problem is decomposed into sequential Markov decision processes (MDPs). The MDPs are then used to train an MADRL model, which subsequently offers the optimal solution to the optimization problem. The practical viability of the proposed method is evaluated on several experimental test systems of up to 100 units, featuring bi-objective and tri-objective problems. The results demonstrate that the proposed MADRL algorithm outperforms established methods, such as teaching learning-based optimization (TLBO), real coded grey wolf optimization (RCGWO), evolutionary algorithm based on decomposition (EAD), non-dominated sorting genetic algorithm II (NSGA-II), and non-dominated sorting genetic algorithm III (NSGA-III).
first_indexed 2024-04-24T17:48:37Z
format Article
id doaj.art-a2dfe187fcca4eed953cba4e2a06f61e
institution Directory Open Access Journal
issn 2079-8954
language English
last_indexed 2024-04-24T17:48:37Z
publishDate 2024-03-01
publisher MDPI AG
record_format Article
series Systems
spelling doaj.art-a2dfe187fcca4eed953cba4e2a06f61e
2024-03-27T14:05:46Z
eng
MDPI AG
Systems
2079-8954
2024-03-01
12
3
106
10.3390/systems12030106
Reinforcement Learning-Based Multi-Objective Optimization for Generation Scheduling in Power Systems
Awol Seid Ebrie (0)
Young Jin Kim (1)
Major in Industrial Data Science & Engineering, Department of Industrial and Data Engineering, Pukyong National University, Busan 48513, Republic of Korea
Department of Systems Management and Engineering, Pukyong National University, Busan 48513, Republic of Korea
https://www.mdpi.com/2079-8954/12/3/106
deep reinforcement learning
economic dispatch
environmental dispatch
multi-objective optimization
unit commitment
title Reinforcement Learning-Based Multi-Objective Optimization for Generation Scheduling in Power Systems
topic deep reinforcement learning
economic dispatch
environmental dispatch
multi-objective optimization
unit commitment
url https://www.mdpi.com/2079-8954/12/3/106