Weighted QMIX: Expanding monotonic value function factorisation for deep multi−agent reinforcement learning

Weighted QMIX: Expanding monotonic value function factorisation for deep multi−agent reinforcement learning

QMIX is a popular Q-learning algorithm for cooperative MARL in the centralised training and decentralised execution paradigm. In order to enable easy decentralisation, QMIX restricts the joint action Q-values it can represent to be a monotonic mixing of each agent’s utilities. However, this restrict...

Fuld beskrivelse

Bibliografiske detaljer
Main Authors:	Rashid, T, Farquhar, G, Peng, B, Whiteson, S
Format:	Conference item
Sprog:	English
Udgivet:	NeurIPS 2020

Lignende værker

QMIX: Monotonic value function factorisation for deep multi-agent reinforcement learning
af: Rashid, T, et al.
Udgivet: (2018)

Monotonic value function factorisation for deep multi-agent reinforcement learning
af: Rashid, T, et al.
Udgivet: (2020)

Exploration and value function factorisation in single and multi-agent reinforcement learning
af: Rashid, T
Udgivet: (2021)

Stabilising experience replay for deep multi-agent reinforcement learning
af: Foerster, J, et al.
Udgivet: (2017)

Bayesian action decoder for deep multi-agent reinforcement learning
af: Whiteson, S
Udgivet: (2019)

UneVEn: Universal value exploration for multi-agent reinforcement learning
af: Gupta, T, et al.
Udgivet: (2021)

Learning to communicate with Deep multi-agent reinforcement learning
af: Foerster, J, et al.
Udgivet: (2016)

Multi-agent common knowledge reinforcement learning
af: de Witt, C, et al.
Udgivet: (2019)

Regularized Softmax Deep Multi−Agent Q−Learning
af: Pan, L, et al.
Udgivet: (2022)

Analysing factorizations of action-value networks for cooperative multi-agent reinforcement learning
af: Castellini, J, et al.
Udgivet: (2021)

Deep reinforcement learning to multi-agent deep reinforcement learning
af: Samieiyeganeh, Mehdi, et al.
Udgivet: (2022)

Deep multi-agent reinforcement learning
af: Foerster, J
Udgivet: (2018)

Counterfactual multi−agent policy gradients
af: Foerster, J, et al.
Udgivet: (2018)

QMix: A Python package for simulating the quasiparticle tunneling currents in SIS junctions
af: Garrett, J, et al.
Udgivet: (2019)

TreeQN and ATreeC: differentiable tree planning for deep reinforcement learning
af: Farquhar, G, et al.
Udgivet: (2018)

Transient non−stationarity and generalisation in deep reinforcement learning
af: Igl, M, et al.
Udgivet: (2021)

Randomized entity-wise factorization for multi-agent reinforcement learning
af: Iqbal, S, et al.
Udgivet: (2021)

On Factorisation of Provenance Polynomials
af: Olteanu, D, et al.
Udgivet: (2011)

Factorisation in relational databases
af: Zavodny, J
Udgivet: (2014)

Coordination and communication in deep multi-agent reinforcement learning
af: Schroeder de Witt, CA
Udgivet: (2021)

Loading monotonicity of weighted premiums, and total positivity properties of weight functions
af: Richards, Donald, et al.
Udgivet: (2021)

From matrix factorisation to signal propagation in deep learning: algorithms and guarantees
af: Murray, M
Udgivet: (2021)

Factorising Proofs in Timed CSP
af: Davies, J, et al.
Udgivet: (1989)

Pushing forward matrix factorisations
af: Dyckerhoff, T, et al.
Udgivet: (2011)

Improving single and multi-agent deep reinforcement learning methods
af: Gupta, T
Udgivet: (2023)

MAVEN: Multi-Agent Variational Exploration
af: Mahajan, A, et al.
Udgivet: (2019)

Efficient and scalable methods for deep reinforcement learning
af: Farquhar, G
Udgivet: (2020)

Tesseract: tensorised actors for multi−agent reinforcement learning
af: Mahajan, A, et al.
Udgivet: (2021)

The value of information in monotone decixion problems
af: Athey, Susan, et al.
Udgivet: (2011)

Deep residual reinforcement learning
af: Zhang, S, et al.
Udgivet: (2020)

End-to-end deep reinforcement learning for multi-agent collaborative exploration
af: Chen, Zichen, et al.
Udgivet: (2021)

Multi-agent deep reinforcement learning for mix-mode runway sequencing
af: Shi, Limin, et al.
Udgivet: (2022)

The StarCraft Multi-Agent Challenge
af: Mikayel Samvelyan, et al.
Udgivet: (2019)

Deep decentralized multi-task multi-agent reinforcement learning under partial observability
af: How, Jonathan
Udgivet: (2021)

Factorisation of greedoid polynomials of rooted digraphs
af: Yow, Kai Siong, et al.
Udgivet: (2021)

The antitriangular factorisation of saddle point matrices
af: Pestana, J, et al.
Udgivet: (2013)

Monotone Equilibrium in Multi-Unit Auctions
af: McAdams, David
Udgivet: (2002)

Loaded DiCE: Trading off bias and variance in any-order score function gradient estimators for reinforcement learning
af: Farquhar, G, et al.
Udgivet: (2019)

Multi-agent deep reinforcement learning based multi-timescale voltage control for distribution system
af: Wang, Bingyu
Udgivet: (2022)

Forward jets in high energy factorisation at the lhc
af: Deák, M, et al.
Udgivet: (2009)