Improving the Computational Efficiency of the Unit Commitment Problem in Hydrothermal Systems by Using Multi-Agent Deep Reinforcement Learning
In power systems with a significant hydroelectric component, instances of the Unit Commitment (UC) problem may be much more computationally intensive due to the longer decision horizons and the additional hydro constraints. Therefore, this paper presents a methodology to reduce the solution space to...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2024-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/10486892/ |
_version_ | 1797201310518345728 |
---|---|
author | Philip Guerra Esteban Gil Victor H. Hinojosa |
author_facet | Philip Guerra Esteban Gil Victor H. Hinojosa |
author_sort | Philip Guerra |
collection | DOAJ |
description | In power systems with a significant hydroelectric component, instances of the Unit Commitment (UC) problem may be much more computationally intensive due to the longer decision horizons and the additional hydro constraints. Therefore, this paper presents a methodology to reduce the solution space to accelerate 168-hour-ahead UC formulated as a Mixed-Integer Linear Program (MILP). First, an offline model maps environment observations to actions in a Multi-Agent Deep Reinforcement Learning (MADRL) model. This mapping uses historical power system operation data to determine the on/off status of specific generation units. Then, the online model uses the binary variable solutions obtained by the offline model to solve a UC problem with a reduced solution space. The Multi-Agent approach allows each agent, based on Artificial Neural Networks (ANN) with a Temporal Convolutional Network (TCN) architecture, to group units that are located in the same region. A shared cumulative reward function is used to adjust simultaneously the different ANN weights during the learning phase. The effectiveness of our method is demonstrated using real operational data of the Chilean National Electricity System, achieving statistically significant lower computation times and a negligible error that is within the integrality gap of the solver. |
first_indexed | 2024-04-24T07:45:31Z |
format | Article |
id | doaj.art-3c1173a2c4744ec898ee067de5ccd300 |
institution | Directory Open Access Journal |
issn | 2169-3536 |
language | English |
last_indexed | 2024-04-24T07:45:31Z |
publishDate | 2024-01-01 |
publisher | IEEE |
record_format | Article |
series | IEEE Access |
spelling | doaj.art-3c1173a2c4744ec898ee067de5ccd3002024-04-18T23:00:32ZengIEEEIEEE Access2169-35362024-01-0112532665327610.1109/ACCESS.2024.338344210486892Improving the Computational Efficiency of the Unit Commitment Problem in Hydrothermal Systems by Using Multi-Agent Deep Reinforcement LearningPhilip Guerra0Esteban Gil1https://orcid.org/0000-0001-9414-1063Victor H. Hinojosa2https://orcid.org/0000-0001-7586-0572Department of Electrical Engineering, Universidad Técnica Federico Santa María, Valparaíso, ChileDepartment of Electrical Engineering, Universidad Técnica Federico Santa María, Valparaíso, ChileDepartment of Electrical Engineering, Universidad Técnica Federico Santa María, Valparaíso, ChileIn power systems with a significant hydroelectric component, instances of the Unit Commitment (UC) problem may be much more computationally intensive due to the longer decision horizons and the additional hydro constraints. Therefore, this paper presents a methodology to reduce the solution space to accelerate 168-hour-ahead UC formulated as a Mixed-Integer Linear Program (MILP). First, an offline model maps environment observations to actions in a Multi-Agent Deep Reinforcement Learning (MADRL) model. This mapping uses historical power system operation data to determine the on/off status of specific generation units. Then, the online model uses the binary variable solutions obtained by the offline model to solve a UC problem with a reduced solution space. The Multi-Agent approach allows each agent, based on Artificial Neural Networks (ANN) with a Temporal Convolutional Network (TCN) architecture, to group units that are located in the same region. A shared cumulative reward function is used to adjust simultaneously the different ANN weights during the learning phase. The effectiveness of our method is demonstrated using real operational data of the Chilean National Electricity System, achieving statistically significant lower computation times and a negligible error that is within the integrality gap of the solver.https://ieeexplore.ieee.org/document/10486892/Artificial neural networksmulti-agent deep reinforcement learningunit commitmentvariable reduction |
spellingShingle | Philip Guerra Esteban Gil Victor H. Hinojosa Improving the Computational Efficiency of the Unit Commitment Problem in Hydrothermal Systems by Using Multi-Agent Deep Reinforcement Learning IEEE Access Artificial neural networks multi-agent deep reinforcement learning unit commitment variable reduction |
title | Improving the Computational Efficiency of the Unit Commitment Problem in Hydrothermal Systems by Using Multi-Agent Deep Reinforcement Learning |
title_full | Improving the Computational Efficiency of the Unit Commitment Problem in Hydrothermal Systems by Using Multi-Agent Deep Reinforcement Learning |
title_fullStr | Improving the Computational Efficiency of the Unit Commitment Problem in Hydrothermal Systems by Using Multi-Agent Deep Reinforcement Learning |
title_full_unstemmed | Improving the Computational Efficiency of the Unit Commitment Problem in Hydrothermal Systems by Using Multi-Agent Deep Reinforcement Learning |
title_short | Improving the Computational Efficiency of the Unit Commitment Problem in Hydrothermal Systems by Using Multi-Agent Deep Reinforcement Learning |
title_sort | improving the computational efficiency of the unit commitment problem in hydrothermal systems by using multi agent deep reinforcement learning |
topic | Artificial neural networks multi-agent deep reinforcement learning unit commitment variable reduction |
url | https://ieeexplore.ieee.org/document/10486892/ |
work_keys_str_mv | AT philipguerra improvingthecomputationalefficiencyoftheunitcommitmentprobleminhydrothermalsystemsbyusingmultiagentdeepreinforcementlearning AT estebangil improvingthecomputationalefficiencyoftheunitcommitmentprobleminhydrothermalsystemsbyusingmultiagentdeepreinforcementlearning AT victorhhinojosa improvingthecomputationalefficiencyoftheunitcommitmentprobleminhydrothermalsystemsbyusingmultiagentdeepreinforcementlearning |