Markovian Restless Bandits and Index Policies: A Review

The restless multi-armed bandit problem is a paradigmatic modeling framework for optimal dynamic priority allocation in stochastic models of wide-ranging applications that has been widely investigated and applied since its inception in a seminal paper by Whittle in the late 1980s. The problem has ge...

Full beskrivning

Bibliografiska uppgifter
Huvudupphovsman: José Niño-Mora
Materialtyp: Artikel
Språk:English
Publicerad: MDPI AG 2023-03-01
Serie:Mathematics
Ämnen:
Länkar:https://www.mdpi.com/2227-7390/11/7/1639