Temporal Consistency-Based Loss Function for Both Deep Q-Networks and Deep Deterministic Policy Gradients for Continuous Actions
Artificial intelligence (AI) techniques in power grid control and energy management in building automation require both deep Q-networks (DQNs) and deep deterministic policy gradients (DDPGs) in deep reinforcement learning (DRL) as off-policy algorithms. Most studies on improving the stability of DRL...
Main Author: | Chayoung Kim |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2021-12-01
|
Series: | Symmetry |
Subjects: | |
Online Access: | https://www.mdpi.com/2073-8994/13/12/2411 |
Similar Items
-
DDRCN: Deep Deterministic Policy Gradient Recommendation Framework Fused with Deep Cross Networks
by: Tianhan Gao, et al.
Published: (2023-02-01) -
Generative Adversarial Inverse Reinforcement Learning With Deep Deterministic Policy Gradient
by: Ming Zhan, et al.
Published: (2023-01-01) -
A State-Compensated Deep Deterministic Policy Gradient Algorithm for UAV Trajectory Tracking
by: Jiying Wu, et al.
Published: (2022-06-01) -
Enhanced Deep Deterministic Policy Gradient Algorithm Using Grey Wolf Optimizer for Continuous Control Tasks
by: Ebrahim Hamid Hasan Sumiea, et al.
Published: (2023-01-01) -
Agent-Based Energy Sharing Mechanism Using Deep Deterministic Policy Gradient Algorithm
by: Yi Kuang, et al.
Published: (2020-09-01)