Distributed Off-Policy Temporal Difference Learning Using Primal-Dual Method

The goal of this paper is to provide theoretical analysis and additional insights on a distributed temporal-difference (TD)-learning algorithm for the multi-agent Markov decision processes (MDPs) via saddle-point viewpoints. The (single-agent) TD-learning is a reinforcement learning (RL) algorithm f...

Bibliographic Details
Main Authors:	Donghwan Lee, Do Wan Kim, Jianghai Hu
Format:	Article
Language:	English
Published:	IEEE 2022-01-01
Series:	IEEE Access
Subjects:	Reinforcement learning (RL); multi-agent systems; convergence; temporal difference (TD) learning; machine learning; primal-dual method
Online Access:	https://ieeexplore.ieee.org/document/9906992/

Similar Items

Learning to play against any mixture of opponents
by: Max Olan Smith, et al.
Published: (2023-07-01)

Dual-Layer Q-Learning Strategy for Energy Management of Battery Storage in Grid-Connected Microgrids
by: Khawaja Haider Ali, et al.
Published: (2023-01-01)

Relaxed Variable Metric Primal-Dual Fixed-Point Algorithm with Applications
by: Wenli Huang, et al.
Published: (2022-11-01)

Primal-Dual Method of Solving Convex Quadratic Programming Problems
by: V. Moraru
Published: (2000-10-01)

Primal-Dual Splitting Algorithms for Solving Structured Monotone Inclusion with Applications
by: Jinjian Chen, et al.
Published: (2021-12-01)

On the Number of Witnesses in the Miller–Rabin Primality Test
by: Shamil Talgatovich Ishmukhametov, et al.
Published: (2020-06-01)

Some operators in soft primal spaces
by: Ahmad Al-Omari, et al.
Published: (2024-03-01)

Primal Structure with Closure Operators and Their Applications
by: Ahmad Al-Omari, et al.
Published: (2023-12-01)

A Primal–Dual-Based Power Control Approach for Capacitated Edge Servers
by: Qinghui Zhang, et al.
Published: (2022-10-01)

On Primal Soft Topology
by: Tareq M. Al-shami, et al.
Published: (2023-05-01)

Feature Selection Method Using Multi-Agent Reinforcement Learning Based on Guide Agents
by: Minwoo Kim, et al.
Published: (2022-12-01)

Multi-Agent Reinforcement Learning Based Actuator Control for EV HVAC Systems
by: Sungho Joo, et al.
Published: (2023-01-01)

Fuzzy optimization of primal-dual pair using piecewise linear membership functions
by: Pandey D., et al.
Published: (2012-01-01)

Dense-Frequency Signal-Detection Based on the Primal–Dual Splitting Method
by: Jiaoyu Zheng, et al.
Published: (2022-07-01)

Predicting the Execution Time of the Primal and Dual Simplex Algorithms Using Artificial Neural Networks
by: Sophia Voulgaropoulou, et al.
Published: (2022-03-01)

A note on the primality of sums
by: Antonie Dinculescu
Published: (2022-08-01)

Primal-Dual Learning Based Risk-Averse Optimal Integrated Allocation of Hybrid Energy Generation Plants under Uncertainty
by: Xiao Zhao, et al.
Published: (2019-06-01)

Regularity and normality on primal spaces
by: Ahmad Al-Omari, et al.
Published: (2024-02-01)

Multi-Agent Reinforcement Learning Using Linear Fuzzy Model Applied to Cooperative Mobile Robots
by: David Luviano-Cruz, et al.
Published: (2018-10-01)

Deep Reinforcement Learning-Based Adaptive Controller for Trajectory Tracking and Altitude Control of an Aerial Robot
by: Ali Barzegar, et al.
Published: (2022-05-01)

An accelerated primal‐dual method for semi‐definite programming relaxation of optimal power flow
by: Zhan Shi, et al.
Published: (2023-12-01)

Long live the Liver King: right-wing carnivorism and the digital dissemination of primal rhetoric
by: S. Marek Muller, et al.
Published: (2024-03-01)

Complexity analysis of primal-dual algorithms for the semidefinite linear complementarity problem
by: Mohamed Achache, et al.
Published: (2011-08-01)

Multi-Agent Reinforcement Learning via Adaptive Kalman Temporal Difference and Successor Representation
by: Mohammad Salimibeni, et al.
Published: (2022-02-01)

Generalized primal topological spaces
by: Hanan Al-Saadi, et al.
Published: (2023-08-01)

Path Planning Algorithm for Dual-Arm Robot Based on Depth Deterministic Gradient Strategy Algorithm
by: Xiaomei Zhang, et al.
Published: (2023-10-01)

RLECN—A learning based dynamic threshold control of ECN
by: Shahzad, et al.
Published: (2023-12-01)

Reinforcement Learning for Energy-Storage Systems in Grid-Connected Microgrids: An Investigation of Online vs. Offline Implementation
by: Khawaja Haider Ali, et al.
Published: (2021-09-01)

Energy Efficient Power Allocation in Massive MIMO Based on Parameterized Deep DQN
by: Shruti Sharma, et al.
Published: (2023-11-01)

Comparação entre métodos numéricos para sistemas lineares na aplicação do método primal dual barreira logarítmica para dimensionamento de biodigestores rurais
by: João Pedro Mucheroni Covolan, et al.
Published: (2020-02-01)

Decentralized Primal-Dual Proximal Operator Algorithm for Constrained Nonsmooth Composite Optimization Problems over Networks
by: Liping Feng, et al.
Published: (2022-09-01)

The Implicit Constraints of the Primal Sketch
by: Grimson, W.E.L
Published: (2004)

Stochastic Ensemble Policy Transfer
by: CHANG Tian, ZHANG Zongzhang, YU Yang
Published: (2022-11-01)

Importance of prefrontal meta control in human-like reinforcement learning
by: Jee Hang Lee, et al.
Published: (2022-12-01)

General value functions for fault detection in multivariate time series data
by: Andy Wong, et al.
Published: (2024-03-01)

An Efficient Centralized Multi-Agent Reinforcement Learner for Cooperative Tasks
by: Dengyu Liao, et al.
Published: (2023-01-01)

Multi-USV Dynamic Navigation and Target Capture: A Guided Multi-Agent Reinforcement Learning Approach
by: Sulemana Nantogma, et al.
Published: (2023-03-01)

Sim-to-real via latent prediction: Transferring visual non-prehensile manipulation policies
by: Carlo Rizzardo, et al.
Published: (2023-01-01)

Action Valuation of On- and Off-Ball Soccer Players Based on Multi-Agent Deep Reinforcement Learning
by: Hiroshi Nakahara, et al.
Published: (2023-01-01)

A Collaborative Control Method of Dual-Arm Robots Based on Deep Reinforcement Learning
by: Luyu Liu, et al.
Published: (2021-02-01)