Reinforcement Learning-Based Multi-Objective Optimization for Generation Scheduling in Power Systems

Multi-objective power scheduling (MOPS) aims to address the simultaneous minimization of economic costs and different types of environmental emissions during electricity generation. Recognizing it as an NP-hard problem, this article proposes a novel multi-agent deep reinforcement learning (MADRL)-ba...

Bibliographic Details
Main Authors:	Awol Seid Ebrie, Young Jin Kim
Format:	Article
Language:	English
Published:	MDPI AG 2024-03-01
Series:	Systems
Subjects:	deep reinforcement learning, economic dispatch, environmental dispatch, multi-objective optimization, unit commitment
Online Access:	https://www.mdpi.com/2079-8954/12/3/106

_version_	1797239254097592320
author	Awol Seid Ebrie, Young Jin Kim
author_facet	Awol Seid Ebrie, Young Jin Kim
author_sort	Awol Seid Ebrie
collection	DOAJ
description	Multi-objective power scheduling (MOPS) aims to simultaneously minimize economic costs and different types of environmental emissions during electricity generation. Recognizing it as an NP-hard problem, this article proposes a novel multi-agent deep reinforcement learning (MADRL)-based optimization algorithm. Within a custom multi-agent simulation environment that represents power-generating units as collaborative reinforcement learning (RL) agents, the MOPS problem is decomposed into sequential Markov decision processes (MDPs). The MDPs are then used to train an MADRL model, which subsequently provides the solution to the optimization problem. The practical viability of the proposed method is evaluated across several experimental test systems consisting of up to 100 units and featuring bi-objective and tri-objective problems. The results demonstrate that the proposed MADRL algorithm performs better than established methods, such as teaching-learning-based optimization (TLBO), real-coded grey wolf optimization (RCGWO), evolutionary algorithm based on decomposition (EAD), non-dominated sorting genetic algorithm II (NSGA-II), and non-dominated sorting genetic algorithm III (NSGA-III).
first_indexed	2024-04-24T17:48:37Z
format	Article
id	doaj.art-a2dfe187fcca4eed953cba4e2a06f61e
institution	Directory Open Access Journal
issn	2079-8954
language	English
last_indexed	2024-04-24T17:48:37Z
publishDate	2024-03-01
publisher	MDPI AG
record_format	Article
series	Systems
spelling	doaj.art-a2dfe187fcca4eed953cba4e2a06f61e; 2024-03-27T14:05:46Z; eng; MDPI AG; Systems; 2079-8954; 2024-03-01; vol. 12, no. 3, art. 106; 10.3390/systems12030106; Reinforcement Learning-Based Multi-Objective Optimization for Generation Scheduling in Power Systems; Awol Seid Ebrie (0), Young Jin Kim (1); (0) Major in Industrial Data Science & Engineering, Department of Industrial and Data Engineering, Pukyong National University, Busan 48513, Republic of Korea; (1) Department of Systems Management and Engineering, Pukyong National University, Busan 48513, Republic of Korea; https://www.mdpi.com/2079-8954/12/3/106; deep reinforcement learning, economic dispatch, environmental dispatch, multi-objective optimization, unit commitment
spellingShingle	Awol Seid Ebrie Young Jin Kim Reinforcement Learning-Based Multi-Objective Optimization for Generation Scheduling in Power Systems Systems deep reinforcement learning economic dispatch environmental dispatch multi-objective optimization unit commitment
title	Reinforcement Learning-Based Multi-Objective Optimization for Generation Scheduling in Power Systems
title_full	Reinforcement Learning-Based Multi-Objective Optimization for Generation Scheduling in Power Systems
title_fullStr	Reinforcement Learning-Based Multi-Objective Optimization for Generation Scheduling in Power Systems
title_full_unstemmed	Reinforcement Learning-Based Multi-Objective Optimization for Generation Scheduling in Power Systems
title_short	Reinforcement Learning-Based Multi-Objective Optimization for Generation Scheduling in Power Systems
title_sort	reinforcement learning based multi objective optimization for generation scheduling in power systems
topic	deep reinforcement learning, economic dispatch, environmental dispatch, multi-objective optimization, unit commitment
url	https://www.mdpi.com/2079-8954/12/3/106
work_keys_str_mv	AT awolseidebrie reinforcementlearningbasedmultiobjectiveoptimizationforgenerationschedulinginpowersystems AT youngjinkim reinforcementlearningbasedmultiobjectiveoptimizationforgenerationschedulinginpowersystems

Similar Items