Beyond UCB: Optimal and Efficient Contextual Bandits with Regression Oracles

A fundamental challenge in contextual bandits is to develop flexible, general-purpose algorithms with computational requirements no worse than classical supervised learning tasks such as classification and regression. Algorithms based on regression have shown promising empirical success, but theoret...


Bibliographic details
Main authors: Foster, Dylan J; Rakhlin, Alexander
Contributors: Statistics and Data Science Center (Massachusetts Institute of Technology)
Format: Article
Language: English
Published / Created: 2021
Online access: https://hdl.handle.net/1721.1/138306
