Near-Optimal Model-Free Reinforcement Learning in Non-Stationary Episodic MDPs
Main Authors: | Mao, Weichao, Zhang, Kaiqing, Zhu, Ruihao, Simchi-Levi, David, Basar, Tamer |
---|---|
Other Authors: | Massachusetts Institute of Technology. Department of Civil and Environmental Engineering |
Format: | Article |
Language: | English |
Published: |
2023
|
Online Access: | https://hdl.handle.net/1721.1/148645 |
Similar Items
-
Reinforcement Learning for Non-Stationary Markov Decision Processes: The Blessing of (More) Optimism
by: Cheung, Wang Chi, et al.
Published: (2021) -
Learning to Optimize Under Non-Stationarity
by: Cheung, Wang Chi, et al.
Published: (2021) -
Hedging the Drift: Learning to Optimize Under Nonstationarity
by: Cheung, Wang Chi, et al.
Published: (2023) -
Transience in countable MDPs
by: Kiefer, SM, et al.
Published: (2021) -
Social Interactions as Recursive MDPs
by: Tejwani, Ravi, et al.
Published: (2022)