An Optimization Method-Assisted Ensemble Deep Reinforcement Learning Algorithm to Solve Unit Commitment Problems

Unit commitment (UC) is a fundamental problem in the day-ahead electricity market, and it is critical to solve UC problems efficiently. Mathematical optimization techniques like dynamic programming, Lagrangian relaxation, and mixed-integer quadratic programming (MIQP) are commonly adopted for UC pro...

Full description

Bibliographic Details
Main Authors:	Jingtao Qin, Yuanqi Gao, Mikhail Bragin, Nanpeng Yu
Format:	Article
Language:	English
Published:	IEEE 2023-01-01
Series:	IEEE Access
Subjects:	Deep reinforcement learning multi-step return optimization methods unit commitment
Online Access:	https://ieeexplore.ieee.org/document/10247003/

_version_	1797680919605149696
author	Jingtao Qin Yuanqi Gao Mikhail Bragin Nanpeng Yu
author_facet	Jingtao Qin Yuanqi Gao Mikhail Bragin Nanpeng Yu
author_sort	Jingtao Qin
collection	DOAJ
description	Unit commitment (UC) is a fundamental problem in the day-ahead electricity market, and it is critical to solve UC problems efficiently. Mathematical optimization techniques like dynamic programming, Lagrangian relaxation, and mixed-integer quadratic programming (MIQP) are commonly adopted for UC problems. However, the calculation time of these methods increases at an exponential rate with the number of generators and energy resources, which is still the main bottleneck in the industry. Recent advances in artificial intelligence have demonstrated the capability of reinforcement learning (RL) to solve UC problems. Unfortunately, the existing research on solving UC problems with RL suffers from the curse of dimensionality when the size of UC problems grows. To deal with these problems, we propose an optimization method-assisted ensemble deep reinforcement learning algorithm, where UC problems are formulated as a Markov Decision Process (MDP) and solved by multi-step deep Q-learning in an ensemble framework. The proposed algorithm establishes a candidate action set by solving tailored optimization problems to ensure relatively high performance and the satisfaction of operational constraints. Numerical studies on three test systems show that our algorithm outperforms the baseline RL algorithm in terms of computation efficiency and operation cost. By employing the output of our proposed algorithm as a warm start, the MIQP technique can achieve further reductions in operational costs. Furthermore, the proposed algorithm shows strong generalization capacity under unforeseen operational conditions.
first_indexed	2024-03-11T23:37:15Z
format	Article
id	doaj.art-bdfc628df8994f6e8a86a6cff9011120
institution	Directory Open Access Journal
issn	2169-3536
language	English
last_indexed	2024-03-11T23:37:15Z
publishDate	2023-01-01
publisher	IEEE
record_format	Article
series	IEEE Access
spelling	doaj.art-bdfc628df8994f6e8a86a6cff90111202023-09-19T23:01:00ZengIEEEIEEE Access2169-35362023-01-011110012510013610.1109/ACCESS.2023.331399810247003An Optimization Method-Assisted Ensemble Deep Reinforcement Learning Algorithm to Solve Unit Commitment ProblemsJingtao Qin0https://orcid.org/0000-0002-7645-7547Yuanqi Gao1https://orcid.org/0000-0003-4078-6143Mikhail Bragin2https://orcid.org/0000-0002-7783-9053Nanpeng Yu3https://orcid.org/0000-0001-5086-5465Department of Electrical and Computer Engineering, University of California, Riverside, Riverside, CA, USADepartment of Electrical and Computer Engineering, University of California, Riverside, Riverside, CA, USADepartment of Electrical and Computer Engineering, University of California, Riverside, Riverside, CA, USADepartment of Electrical and Computer Engineering, University of California, Riverside, Riverside, CA, USAUnit commitment (UC) is a fundamental problem in the day-ahead electricity market, and it is critical to solve UC problems efficiently. Mathematical optimization techniques like dynamic programming, Lagrangian relaxation, and mixed-integer quadratic programming (MIQP) are commonly adopted for UC problems. However, the calculation time of these methods increases at an exponential rate with the number of generators and energy resources, which is still the main bottleneck in the industry. Recent advances in artificial intelligence have demonstrated the capability of reinforcement learning (RL) to solve UC problems. Unfortunately, the existing research on solving UC problems with RL suffers from the curse of dimensionality when the size of UC problems grows. To deal with these problems, we propose an optimization method-assisted ensemble deep reinforcement learning algorithm, where UC problems are formulated as a Markov Decision Process (MDP) and solved by multi-step deep Q-learning in an ensemble framework. The proposed algorithm establishes a candidate action set by solving tailored optimization problems to ensure relatively high performance and the satisfaction of operational constraints. Numerical studies on three test systems show that our algorithm outperforms the baseline RL algorithm in terms of computation efficiency and operation cost. By employing the output of our proposed algorithm as a warm start, the MIQP technique can achieve further reductions in operational costs. Furthermore, the proposed algorithm shows strong generalization capacity under unforeseen operational conditions.https://ieeexplore.ieee.org/document/10247003/Deep reinforcement learningmulti-step returnoptimization methodsunit commitment
spellingShingle	Jingtao Qin Yuanqi Gao Mikhail Bragin Nanpeng Yu An Optimization Method-Assisted Ensemble Deep Reinforcement Learning Algorithm to Solve Unit Commitment Problems IEEE Access Deep reinforcement learning multi-step return optimization methods unit commitment
title	An Optimization Method-Assisted Ensemble Deep Reinforcement Learning Algorithm to Solve Unit Commitment Problems
title_full	An Optimization Method-Assisted Ensemble Deep Reinforcement Learning Algorithm to Solve Unit Commitment Problems
title_fullStr	An Optimization Method-Assisted Ensemble Deep Reinforcement Learning Algorithm to Solve Unit Commitment Problems
title_full_unstemmed	An Optimization Method-Assisted Ensemble Deep Reinforcement Learning Algorithm to Solve Unit Commitment Problems
title_short	An Optimization Method-Assisted Ensemble Deep Reinforcement Learning Algorithm to Solve Unit Commitment Problems
title_sort	optimization method assisted ensemble deep reinforcement learning algorithm to solve unit commitment problems
topic	Deep reinforcement learning multi-step return optimization methods unit commitment
url	https://ieeexplore.ieee.org/document/10247003/
work_keys_str_mv	AT jingtaoqin anoptimizationmethodassistedensembledeepreinforcementlearningalgorithmtosolveunitcommitmentproblems AT yuanqigao anoptimizationmethodassistedensembledeepreinforcementlearningalgorithmtosolveunitcommitmentproblems AT mikhailbragin anoptimizationmethodassistedensembledeepreinforcementlearningalgorithmtosolveunitcommitmentproblems AT nanpengyu anoptimizationmethodassistedensembledeepreinforcementlearningalgorithmtosolveunitcommitmentproblems AT jingtaoqin optimizationmethodassistedensembledeepreinforcementlearningalgorithmtosolveunitcommitmentproblems AT yuanqigao optimizationmethodassistedensembledeepreinforcementlearningalgorithmtosolveunitcommitmentproblems AT mikhailbragin optimizationmethodassistedensembledeepreinforcementlearningalgorithmtosolveunitcommitmentproblems AT nanpengyu optimizationmethodassistedensembledeepreinforcementlearningalgorithmtosolveunitcommitmentproblems

An Optimization Method-Assisted Ensemble Deep Reinforcement Learning Algorithm to Solve Unit Commitment Problems

Similar Items