Markovian Restless Bandits and Index Policies: A Review

The restless multi-armed bandit problem is a paradigmatic modeling framework for optimal dynamic priority allocation in stochastic models of wide-ranging applications that has been widely investigated and applied since its inception in a seminal paper by Whittle in the late 1980s. The problem has ge...

Bibliographic Details
Main Author:	José Niño-Mora
Format:	Article
Language:	English
Published:	MDPI AG 2023-03-01
Series:	Mathematics
Subjects:	Markov decision processes; bandit problems; restless bandits; dynamic and stochastic resource allocation; index policies; online learning
Online Access:	https://www.mdpi.com/2227-7390/11/7/1639

Similar Items