A prior knowledge-embedded reinforcement learning method for real-time active power corrective control in complex power systems
With the increasing uncertainty and complexity of modern power grids, the real-time active power corrective control problem becomes intractable, posing significant challenges to the stable operation of future power systems. To promote effective and efficient active power corrective control, a prior knowledge-embedded reinforcement learning method is proposed in this paper to improve the performance of the deep reinforcement learning agent while preserving real-time control.
Main Authors: | Peidong Xu, Jun Zhang, Jixiang Lu, Haoran Zhang, Tianlu Gao, Siyuan Chen |
---|---|
Format: | Article |
Language: | English |
Published: | Frontiers Media S.A. 2022-09-01 |
Series: | Frontiers in Energy Research |
Subjects: | active power corrective control, deep Q-learning from demonstrations, graph pooling, power system, prior knowledge |
Online Access: | https://www.frontiersin.org/articles/10.3389/fenrg.2022.1009545/full |
_version_ | 1811206609191305216 |
---|---|
author | Peidong Xu; Jun Zhang; Jixiang Lu; Haoran Zhang; Tianlu Gao; Siyuan Chen
author_facet | Peidong Xu; Jun Zhang; Jixiang Lu; Haoran Zhang; Tianlu Gao; Siyuan Chen
author_sort | Peidong Xu |
collection | DOAJ |
description | With the increasing uncertainty and complexity of modern power grids, the real-time active power corrective control problem becomes intractable, posing significant challenges to the stable operation of future power systems. To promote effective and efficient active power corrective control, a prior knowledge-embedded reinforcement learning method is proposed in this paper to improve the performance of the deep reinforcement learning agent while preserving real-time control. A system-level feature is first established based on prior knowledge and combined with the equipment-level features to provide a thorough description of the power network state. A global-local network structure is then constructed to integrate the two levels of information by introducing a graph pooling method. Based on this multi-level representation of power system states, the Deep Q-learning from Demonstrations method is adopted to guide the deep reinforcement learning agent to learn from the expert policy while it improves through interaction. Considering that corrective control actions are infrequent in practice, a double-prioritized training mechanism combined with the λ-return is further developed to help the agent emphasize learning from critical control experience. Simulation results demonstrate that the proposed method outperforms conventional deep reinforcement learning methods in training efficiency and control performance, and has the potential to solve the complex active power corrective control problem in the future. |
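The description names two concrete mechanisms that can be sketched in code: a global-local network that fuses equipment-level (per-node) features with a system-level feature through graph pooling, and Deep Q-learning from Demonstrations (DQfD), which augments the TD loss with a supervised large-margin term on expert transitions. The following Python/PyTorch sketch is one plausible reading of the abstract, not the authors' implementation; the class and function names, feature dimensions, margin, and loss weight are illustrative assumptions.

```python
# Minimal, self-contained sketch (illustrative only, not the paper's code) of:
# (1) a global-local Q-network: per-node encoding + mean graph pooling + system-level feature,
# (2) a DQfD-style loss: double-DQN TD loss + large-margin loss on demonstration samples.
import torch
import torch.nn as nn
import torch.nn.functional as F


class GlobalLocalQNet(nn.Module):
    """Encode equipment-level (per-node) features, pool them into a graph-level embedding
    (mean graph pooling), concatenate the engineered system-level feature, and map the
    result to Q-values over discrete corrective-control actions."""

    def __init__(self, node_feat_dim: int, sys_feat_dim: int, n_actions: int, hidden: int = 64):
        super().__init__()
        self.node_encoder = nn.Sequential(
            nn.Linear(node_feat_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
        )
        self.q_head = nn.Sequential(
            nn.Linear(hidden + sys_feat_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, n_actions),
        )

    def forward(self, node_feats: torch.Tensor, sys_feats: torch.Tensor) -> torch.Tensor:
        # node_feats: (batch, n_nodes, node_feat_dim); sys_feats: (batch, sys_feat_dim)
        local = self.node_encoder(node_feats)      # equipment-level embeddings
        pooled = local.mean(dim=1)                 # graph pooling -> global (system-level) embedding
        return self.q_head(torch.cat([pooled, sys_feats], dim=-1))


def dqfd_loss(q_net, target_net, batch, gamma=0.99, margin=0.8, lambda_sup=1.0):
    """Double-DQN one-step TD loss plus the DQfD large-margin loss on demonstration samples."""
    s_node, s_sys, a, r, ns_node, ns_sys, done, is_demo = batch
    q = q_net(s_node, s_sys)                                    # (batch, n_actions)
    q_sa = q.gather(1, a.unsqueeze(1)).squeeze(1)
    with torch.no_grad():
        next_a = q_net(ns_node, ns_sys).argmax(dim=1)           # action selection by the online net
        q_next = target_net(ns_node, ns_sys).gather(1, next_a.unsqueeze(1)).squeeze(1)
        td_target = r + gamma * (1.0 - done) * q_next
    td_loss = F.smooth_l1_loss(q_sa, td_target)

    # Large-margin term: the demonstrated action should beat every other action by `margin`.
    margins = torch.full_like(q, margin)
    margins.scatter_(1, a.unsqueeze(1), 0.0)                    # zero margin for the expert action
    violation = (q + margins).max(dim=1).values - q_sa
    sup_loss = (is_demo * violation).sum() / is_demo.sum().clamp(min=1.0)
    return td_loss + lambda_sup * sup_loss
```

Per the abstract, the full method further replaces the one-step target with a λ-return and draws minibatches from a double-prioritized replay mechanism, presumably so that demonstration data and the infrequent but critical corrective-control transitions are sampled more often; those refinements would change how `td_target` and `batch` are produced, but not the loss structure sketched above.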
first_indexed | 2024-04-12T03:50:06Z |
format | Article |
id | doaj.art-be6ae9b5ea794af9a12f65de1a709ed9 |
institution | Directory Open Access Journal |
issn | 2296-598X |
language | English |
last_indexed | 2024-04-12T03:50:06Z |
publishDate | 2022-09-01 |
publisher | Frontiers Media S.A. |
record_format | Article |
series | Frontiers in Energy Research |
spelling | doaj.art-be6ae9b5ea794af9a12f65de1a709ed9 | 2022-12-22T03:49:01Z | eng | Frontiers Media S.A. | Frontiers in Energy Research | 2296-598X | 2022-09-01 | Volume 10 | DOI 10.3389/fenrg.2022.1009545 | Article 1009545 | A prior knowledge-embedded reinforcement learning method for real-time active power corrective control in complex power systems | Peidong Xu (School of Electrical Engineering and Automation, Wuhan University, Wuhan, China); Jun Zhang (School of Electrical Engineering and Automation, Wuhan University, Wuhan, China); Jixiang Lu (Technology Research Center, State Key Laboratory of Intelligent Power Grid Protection and Operation Control, NARI Group Corporation, Nanjing, China); Haoran Zhang (School of Electrical Engineering and Automation, Wuhan University, Wuhan, China); Tianlu Gao (School of Electrical Engineering and Automation, Wuhan University, Wuhan, China); Siyuan Chen (School of Electrical Engineering and Automation, Wuhan University, Wuhan, China) | active power corrective control; deep Q-learning from demonstrations; graph pooling; power system; prior knowledge | https://www.frontiersin.org/articles/10.3389/fenrg.2022.1009545/full |
spellingShingle | Peidong Xu; Jun Zhang; Jixiang Lu; Haoran Zhang; Tianlu Gao; Siyuan Chen | A prior knowledge-embedded reinforcement learning method for real-time active power corrective control in complex power systems | Frontiers in Energy Research | active power corrective control; deep Q-learning from demonstrations; graph pooling; power system; prior knowledge |
title | A prior knowledge-embedded reinforcement learning method for real-time active power corrective control in complex power systems |
title_full | A prior knowledge-embedded reinforcement learning method for real-time active power corrective control in complex power systems |
title_fullStr | A prior knowledge-embedded reinforcement learning method for real-time active power corrective control in complex power systems |
title_full_unstemmed | A prior knowledge-embedded reinforcement learning method for real-time active power corrective control in complex power systems |
title_short | A prior knowledge-embedded reinforcement learning method for real-time active power corrective control in complex power systems |
title_sort | prior knowledge embedded reinforcement learning method for real time active power corrective control in complex power systems |
topic | active power corrective control; deep Q-learning from demonstrations; graph pooling; power system; prior knowledge |
url | https://www.frontiersin.org/articles/10.3389/fenrg.2022.1009545/full |
work_keys_str_mv | AT peidongxu apriorknowledgeembeddedreinforcementlearningmethodforrealtimeactivepowercorrectivecontrolincomplexpowersystems AT junzhang apriorknowledgeembeddedreinforcementlearningmethodforrealtimeactivepowercorrectivecontrolincomplexpowersystems AT jixianglu apriorknowledgeembeddedreinforcementlearningmethodforrealtimeactivepowercorrectivecontrolincomplexpowersystems AT haoranzhang apriorknowledgeembeddedreinforcementlearningmethodforrealtimeactivepowercorrectivecontrolincomplexpowersystems AT tianlugao apriorknowledgeembeddedreinforcementlearningmethodforrealtimeactivepowercorrectivecontrolincomplexpowersystems AT siyuanchen apriorknowledgeembeddedreinforcementlearningmethodforrealtimeactivepowercorrectivecontrolincomplexpowersystems AT peidongxu priorknowledgeembeddedreinforcementlearningmethodforrealtimeactivepowercorrectivecontrolincomplexpowersystems AT junzhang priorknowledgeembeddedreinforcementlearningmethodforrealtimeactivepowercorrectivecontrolincomplexpowersystems AT jixianglu priorknowledgeembeddedreinforcementlearningmethodforrealtimeactivepowercorrectivecontrolincomplexpowersystems AT haoranzhang priorknowledgeembeddedreinforcementlearningmethodforrealtimeactivepowercorrectivecontrolincomplexpowersystems AT tianlugao priorknowledgeembeddedreinforcementlearningmethodforrealtimeactivepowercorrectivecontrolincomplexpowersystems AT siyuanchen priorknowledgeembeddedreinforcementlearningmethodforrealtimeactivepowercorrectivecontrolincomplexpowersystems |