Temporal Consistency-Based Loss Function for Both Deep Q-Networks and Deep Deterministic Policy Gradients for Continuous Actions

Artificial intelligence (AI) techniques in power grid control and energy management in building automation require both deep Q-networks (DQNs) and deep deterministic policy gradients (DDPGs) in deep reinforcement learning (DRL) as off-policy algorithms. Most studies on improving the stability of DRL...

Full description

Bibliographic Details
Main Author: Chayoung Kim
Format: Article
Language:English
Published: MDPI AG 2021-12-01
Series:Symmetry
Subjects:
Online Access:https://www.mdpi.com/2073-8994/13/12/2411