Safe reinforcement learning for multi-energy management systems with known constraint functions

Reinforcement learning (RL) is a promising optimal control technique for multi-energy management systems. It does not require a model a priori - reducing the upfront and ongoing project-specific engineering effort and is capable of learning better representations of the underlying system dynamics. H...

Full description

Bibliographic Details
Main Authors:	Glenn Ceusters, Luis Ramirez Camargo, Rüdiger Franke, Ann Nowé, Maarten Messagie
Format:	Article
Language:	English
Published:	Elsevier 2023-04-01
Series:	Energy and AI
Subjects:	Reinforcement learning Constraints Multi-energy systems Energy management system
Online Access:	http://www.sciencedirect.com/science/article/pii/S2666546822000738

_version_	1797833420425920512
author	Glenn Ceusters Luis Ramirez Camargo Rüdiger Franke Ann Nowé Maarten Messagie
author_facet	Glenn Ceusters Luis Ramirez Camargo Rüdiger Franke Ann Nowé Maarten Messagie
author_sort	Glenn Ceusters
collection	DOAJ
description	Reinforcement learning (RL) is a promising optimal control technique for multi-energy management systems. It does not require a model a priori - reducing the upfront and ongoing project-specific engineering effort and is capable of learning better representations of the underlying system dynamics. However, vanilla RL does not provide constraint satisfaction guarantees — resulting in various potentially unsafe interactions within its environment. In this paper, we present two novel online model-free safe RL methods, namely SafeFallback and GiveSafe, where the safety constraint formulation is decoupled from the RL formulation. These provide hard-constraint satisfaction guarantees both during training and deployment of the (near) optimal policy. This is without the need of solving a mathematical program, resulting in less computational power requirements and more flexible constraint function formulations. In a simulated multi-energy systems case study we have shown that both methods start with a significantly higher utility compared to a vanilla RL benchmark and Optlayer benchmark (94,6% and 82,8% compared to 35,5% and 77,8%) and that the proposed SafeFallback method even can outperform the vanilla RL benchmark (102,9% to 100%). We conclude that both methods are viably safety constraint handling techniques applicable beyond RL, as demonstrated with random policies while still providing hard-constraint guarantees.
first_indexed	2024-04-09T14:23:55Z
format	Article
id	doaj.art-68aca0d2b26848d1b84a4a728e7440e0
institution	Directory Open Access Journal
issn	2666-5468
language	English
last_indexed	2024-04-09T14:23:55Z
publishDate	2023-04-01
publisher	Elsevier
record_format	Article
series	Energy and AI
spelling	doaj.art-68aca0d2b26848d1b84a4a728e7440e02023-05-04T10:44:11ZengElsevierEnergy and AI2666-54682023-04-0112100227Safe reinforcement learning for multi-energy management systems with known constraint functionsGlenn Ceusters0Luis Ramirez Camargo1Rüdiger Franke2Ann Nowé3Maarten Messagie4ABB, Hoge Wei 27, 1930 Zaventem, Belgium; Vrije Universiteit Brussel (VUB), ETEC-MOBI, Pleinlaan 2, 1050 Brussels, Belgium; Vrije Universiteit Brussel (VUB), AI-lab, Pleinlaan 2, 1050 Brussels, Belgium; Corresponding author at: Vrije Universiteit Brussel (VUB), ETEC-MOBI, Pleinlaan 2, 1050 Brussels, Belgium.Vrije Universiteit Brussel (VUB), ETEC-MOBI, Pleinlaan 2, 1050 Brussels, Belgium; Copernicus Institute of Sustainable Development - Utrecht University, Princetonlaan 8a, 3584, CB Utrecht, NetherlandsABB, Hoge Wei 27, 1930 Zaventem, BelgiumVrije Universiteit Brussel (VUB), AI-lab, Pleinlaan 2, 1050 Brussels, BelgiumVrije Universiteit Brussel (VUB), ETEC-MOBI, Pleinlaan 2, 1050 Brussels, BelgiumReinforcement learning (RL) is a promising optimal control technique for multi-energy management systems. It does not require a model a priori - reducing the upfront and ongoing project-specific engineering effort and is capable of learning better representations of the underlying system dynamics. However, vanilla RL does not provide constraint satisfaction guarantees — resulting in various potentially unsafe interactions within its environment. In this paper, we present two novel online model-free safe RL methods, namely SafeFallback and GiveSafe, where the safety constraint formulation is decoupled from the RL formulation. These provide hard-constraint satisfaction guarantees both during training and deployment of the (near) optimal policy. This is without the need of solving a mathematical program, resulting in less computational power requirements and more flexible constraint function formulations. In a simulated multi-energy systems case study we have shown that both methods start with a significantly higher utility compared to a vanilla RL benchmark and Optlayer benchmark (94,6% and 82,8% compared to 35,5% and 77,8%) and that the proposed SafeFallback method even can outperform the vanilla RL benchmark (102,9% to 100%). We conclude that both methods are viably safety constraint handling techniques applicable beyond RL, as demonstrated with random policies while still providing hard-constraint guarantees.http://www.sciencedirect.com/science/article/pii/S2666546822000738Reinforcement learningConstraintsMulti-energy systemsEnergy management system
spellingShingle	Glenn Ceusters Luis Ramirez Camargo Rüdiger Franke Ann Nowé Maarten Messagie Safe reinforcement learning for multi-energy management systems with known constraint functions Energy and AI Reinforcement learning Constraints Multi-energy systems Energy management system
title	Safe reinforcement learning for multi-energy management systems with known constraint functions
title_full	Safe reinforcement learning for multi-energy management systems with known constraint functions
title_fullStr	Safe reinforcement learning for multi-energy management systems with known constraint functions
title_full_unstemmed	Safe reinforcement learning for multi-energy management systems with known constraint functions
title_short	Safe reinforcement learning for multi-energy management systems with known constraint functions
title_sort	safe reinforcement learning for multi energy management systems with known constraint functions
topic	Reinforcement learning Constraints Multi-energy systems Energy management system
url	http://www.sciencedirect.com/science/article/pii/S2666546822000738
work_keys_str_mv	AT glennceusters safereinforcementlearningformultienergymanagementsystemswithknownconstraintfunctions AT luisramirezcamargo safereinforcementlearningformultienergymanagementsystemswithknownconstraintfunctions AT rudigerfranke safereinforcementlearningformultienergymanagementsystemswithknownconstraintfunctions AT annnowe safereinforcementlearningformultienergymanagementsystemswithknownconstraintfunctions AT maartenmessagie safereinforcementlearningformultienergymanagementsystemswithknownconstraintfunctions

Safe reinforcement learning for multi-energy management systems with known constraint functions

Similar Items