StreamingBandit: Experimenting with Bandit Policies
A large number of statistical decision problems in the social sciences and beyond can be framed as a (contextual) multi-armed bandit problem. However, it is notoriously hard to develop and evaluate policies that tackle these types of problems, and to use such policies in applied studies. To address...
Main Authors: | Jules Kruijswijk, Robin van Emden, Petri Parvinen, Maurits Kaptein |
---|---|
Format: | Article |
Language: | English |
Published: |
Foundation for Open Access Statistics
2020-08-01
|
Series: | Journal of Statistical Software |
Subjects: | |
Online Access: | https://www.jstatsoft.org/index.php/jss/article/view/2881 |
Similar Items
-
The Perils of Misspecified Priors and Optional Stopping in Multi-Armed Bandits
by: Markus Loecher
Published: (2021-07-01) -
A linear response bandit problem
by: Assaf Zeevi, et al.
Published: (2013-01-01) -
Signal detection models as contextual bandits
by: Thomas N. Sherratt, et al.
Published: (2023-06-01) -
Multi-armed linear bandits with latent biases
by: Kang, Qiyu, et al.
Published: (2024) -
Bandit Learning-Based Edge Caching for 360-Degree Video Streaming With Switching Cost
by: Zhendong Yu, et al.
Published: (2022-01-01)