Learning to Optimize Under Non-Stationarity

© 2019 by the author(s). We introduce algorithms that achieve state-of-the-art dynamic regret bounds for non-stationary linear stochastic bandit setting. It captures natural applications such as dynamic pricing and ads allocation in a changing environment. We show how the difficulty posed by the non...

Full description

Bibliographic Details
Main Authors:	Cheung, Wang Chi, Simchi-Levi, David, Zhu, Ruihao
Other Authors:	Massachusetts Institute of Technology. Institute for Data, Systems, and Society
Format:	Article
Language:	English
Published:	Elsevier BV 2021
Online Access:	https://hdl.handle.net/1721.1/137064

Internet

https://hdl.handle.net/1721.1/137064

Learning to Optimize Under Non-Stationarity

Internet

Similar Items