Weighted QMIX: Expanding monotonic value function factorisation for deep multi-agent reinforcement learning

QMIX is a popular Q-learning algorithm for cooperative MARL in the centralised training and decentralised execution paradigm. In order to enable easy decentralisation, QMIX restricts the joint action Q-values it can represent to be a monotonic mixing of each agent’s utilities. However, this restrict...

Bibliographic Details
Main Authors:	Rashid, T, Farquhar, G, Peng, B, Whiteson, S
Format:	Conference item
Language:	English
Published:	NeurIPS 2020

Similar Items

QMIX: Monotonic value function factorisation for deep multi-agent reinforcement learning
by: Rashid, T, et al.
Published: (2018)

Monotonic value function factorisation for deep multi-agent reinforcement learning
by: Rashid, T, et al.
Published: (2020)

Exploration and value function factorisation in single and multi-agent reinforcement learning
by: Rashid, T
Published: (2021)

Stabilising experience replay for deep multi-agent reinforcement learning
by: Foerster, J, et al.
Published: (2017)

Bayesian action decoder for deep multi-agent reinforcement learning
by: Whiteson, S
Published: (2019)

UneVEn: Universal value exploration for multi-agent reinforcement learning
by: Gupta, T, et al.
Published: (2021)

Learning to communicate with Deep multi-agent reinforcement learning
by: Foerster, J, et al.
Published: (2016)

Multi-agent common knowledge reinforcement learning
by: de Witt, C, et al.
Published: (2019)

Regularized Softmax Deep Multi-Agent Q-Learning
by: Pan, L, et al.
Published: (2022)

Analysing factorizations of action-value networks for cooperative multi-agent reinforcement learning
by: Castellini, J, et al.
Published: (2021)

Deep reinforcement learning to multi-agent deep reinforcement learning
by: Samieiyeganeh, Mehdi, et al.
Published: (2022)

Deep multi-agent reinforcement learning
by: Foerster, J
Published: (2018)

Counterfactual multi-agent policy gradients
by: Foerster, J, et al.
Published: (2018)

QMix: A Python package for simulating the quasiparticle tunneling currents in SIS junctions
by: Garrett, J, et al.
Published: (2019)

TreeQN and ATreeC: differentiable tree planning for deep reinforcement learning
by: Farquhar, G, et al.
Published: (2018)

Transient non-stationarity and generalisation in deep reinforcement learning
by: Igl, M, et al.
Published: (2021)

Randomized entity-wise factorization for multi-agent reinforcement learning
by: Iqbal, S, et al.
Published: (2021)

On Factorisation of Provenance Polynomials
by: Olteanu, D, et al.
Published: (2011)

Factorisation in relational databases
by: Zavodny, J
Published: (2014)

Coordination and communication in deep multi-agent reinforcement learning
by: Schroeder de Witt, CA
Published: (2021)

Loading monotonicity of weighted premiums, and total positivity properties of weight functions
by: Richards, Donald, et al.
Published: (2021)

From matrix factorisation to signal propagation in deep learning: algorithms and guarantees
by: Murray, M
Published: (2021)

Improving single and multi-agent deep reinforcement learning methods
by: Gupta, T
Published: (2023)

Factorising Proofs in Timed CSP
by: Davies, J, et al.
Published: (1989)

Pushing forward matrix factorisations
by: Dyckerhoff, T, et al.
Published: (2011)

Efficient and scalable methods for deep reinforcement learning
by: Farquhar, G
Published: (2020)

MAVEN: Multi-Agent Variational Exploration
by: Mahajan, A, et al.
Published: (2019)

Tesseract: tensorised actors for multi-agent reinforcement learning
by: Mahajan, A, et al.
Published: (2021)

The value of information in monotone decision problems
by: Athey, Susan, et al.
Published: (2011)

The StarCraft Multi-Agent Challenge
by: Mikayel Samvelyan, et al.
Published: (2019)

End-to-end deep reinforcement learning for multi-agent collaborative exploration
by: Chen, Zichen, et al.
Published: (2021)

Multi-agent deep reinforcement learning for mix-mode runway sequencing
by: Shi, Limin, et al.
Published: (2022)

Deep residual reinforcement learning
by: Zhang, S, et al.
Published: (2020)

Deep decentralized multi-task multi-agent reinforcement learning under partial observability
by: How, Jonathan
Published: (2021)

Factorisation of greedoid polynomials of rooted digraphs
by: Yow, Kai Siong, et al.
Published: (2021)

The antitriangular factorisation of saddle point matrices
by: Pestana, J, et al.
Published: (2013)

Monotone Equilibrium in Multi-Unit Auctions
by: McAdams, David
Published: (2002)

Loaded DiCE: Trading off bias and variance in any-order score function gradient estimators for reinforcement learning
by: Farquhar, G, et al.
Published: (2019)

Multi-agent deep reinforcement learning based multi-timescale voltage control for distribution system
by: Wang, Bingyu
Published: (2022)

Forward jets in high energy factorisation at the LHC
by: Deák, M, et al.
Published: (2009)