Improving the Computational Efficiency of the Unit Commitment Problem in Hydrothermal Systems by Using Multi-Agent Deep Reinforcement Learning

In power systems with a significant hydroelectric component, instances of the Unit Commitment (UC) problem may be much more computationally intensive due to the longer decision horizons and the additional hydro constraints. Therefore, this paper presents a methodology to reduce the solution space to...

Full description

Bibliographic Details
Main Authors: Philip Guerra, Esteban Gil, Victor H. Hinojosa
Format: Article
Language:English
Published: IEEE 2024-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10486892/
_version_ 1797201310518345728
author Philip Guerra
Esteban Gil
Victor H. Hinojosa
author_facet Philip Guerra
Esteban Gil
Victor H. Hinojosa
author_sort Philip Guerra
collection DOAJ
description In power systems with a significant hydroelectric component, instances of the Unit Commitment (UC) problem may be much more computationally intensive due to the longer decision horizons and the additional hydro constraints. Therefore, this paper presents a methodology to reduce the solution space to accelerate 168-hour-ahead UC formulated as a Mixed-Integer Linear Program (MILP). First, an offline model maps environment observations to actions in a Multi-Agent Deep Reinforcement Learning (MADRL) model. This mapping uses historical power system operation data to determine the on/off status of specific generation units. Then, the online model uses the binary variable solutions obtained by the offline model to solve a UC problem with a reduced solution space. The Multi-Agent approach allows each agent, based on Artificial Neural Networks (ANN) with a Temporal Convolutional Network (TCN) architecture, to group units that are located in the same region. A shared cumulative reward function is used to adjust simultaneously the different ANN weights during the learning phase. The effectiveness of our method is demonstrated using real operational data of the Chilean National Electricity System, achieving statistically significant lower computation times and a negligible error that is within the integrality gap of the solver.
first_indexed 2024-04-24T07:45:31Z
format Article
id doaj.art-3c1173a2c4744ec898ee067de5ccd300
institution Directory Open Access Journal
issn 2169-3536
language English
last_indexed 2024-04-24T07:45:31Z
publishDate 2024-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj.art-3c1173a2c4744ec898ee067de5ccd3002024-04-18T23:00:32ZengIEEEIEEE Access2169-35362024-01-0112532665327610.1109/ACCESS.2024.338344210486892Improving the Computational Efficiency of the Unit Commitment Problem in Hydrothermal Systems by Using Multi-Agent Deep Reinforcement LearningPhilip Guerra0Esteban Gil1https://orcid.org/0000-0001-9414-1063Victor H. Hinojosa2https://orcid.org/0000-0001-7586-0572Department of Electrical Engineering, Universidad Técnica Federico Santa María, Valparaíso, ChileDepartment of Electrical Engineering, Universidad Técnica Federico Santa María, Valparaíso, ChileDepartment of Electrical Engineering, Universidad Técnica Federico Santa María, Valparaíso, ChileIn power systems with a significant hydroelectric component, instances of the Unit Commitment (UC) problem may be much more computationally intensive due to the longer decision horizons and the additional hydro constraints. Therefore, this paper presents a methodology to reduce the solution space to accelerate 168-hour-ahead UC formulated as a Mixed-Integer Linear Program (MILP). First, an offline model maps environment observations to actions in a Multi-Agent Deep Reinforcement Learning (MADRL) model. This mapping uses historical power system operation data to determine the on/off status of specific generation units. Then, the online model uses the binary variable solutions obtained by the offline model to solve a UC problem with a reduced solution space. The Multi-Agent approach allows each agent, based on Artificial Neural Networks (ANN) with a Temporal Convolutional Network (TCN) architecture, to group units that are located in the same region. A shared cumulative reward function is used to adjust simultaneously the different ANN weights during the learning phase. The effectiveness of our method is demonstrated using real operational data of the Chilean National Electricity System, achieving statistically significant lower computation times and a negligible error that is within the integrality gap of the solver.https://ieeexplore.ieee.org/document/10486892/Artificial neural networksmulti-agent deep reinforcement learningunit commitmentvariable reduction
spellingShingle Philip Guerra
Esteban Gil
Victor H. Hinojosa
Improving the Computational Efficiency of the Unit Commitment Problem in Hydrothermal Systems by Using Multi-Agent Deep Reinforcement Learning
IEEE Access
Artificial neural networks
multi-agent deep reinforcement learning
unit commitment
variable reduction
title Improving the Computational Efficiency of the Unit Commitment Problem in Hydrothermal Systems by Using Multi-Agent Deep Reinforcement Learning
title_full Improving the Computational Efficiency of the Unit Commitment Problem in Hydrothermal Systems by Using Multi-Agent Deep Reinforcement Learning
title_fullStr Improving the Computational Efficiency of the Unit Commitment Problem in Hydrothermal Systems by Using Multi-Agent Deep Reinforcement Learning
title_full_unstemmed Improving the Computational Efficiency of the Unit Commitment Problem in Hydrothermal Systems by Using Multi-Agent Deep Reinforcement Learning
title_short Improving the Computational Efficiency of the Unit Commitment Problem in Hydrothermal Systems by Using Multi-Agent Deep Reinforcement Learning
title_sort improving the computational efficiency of the unit commitment problem in hydrothermal systems by using multi agent deep reinforcement learning
topic Artificial neural networks
multi-agent deep reinforcement learning
unit commitment
variable reduction
url https://ieeexplore.ieee.org/document/10486892/
work_keys_str_mv AT philipguerra improvingthecomputationalefficiencyoftheunitcommitmentprobleminhydrothermalsystemsbyusingmultiagentdeepreinforcementlearning
AT estebangil improvingthecomputationalefficiencyoftheunitcommitmentprobleminhydrothermalsystemsbyusingmultiagentdeepreinforcementlearning
AT victorhhinojosa improvingthecomputationalefficiencyoftheunitcommitmentprobleminhydrothermalsystemsbyusingmultiagentdeepreinforcementlearning