Near-Optimal Model-Free Reinforcement Learning in Non-Stationary Episodic MDPs

Bibliographic Details
Main Authors:	Mao, Weichao; Zhang, Kaiqing; Zhu, Ruihao; Simchi-Levi, David; Basar, Tamer
Other Authors:	Massachusetts Institute of Technology. Department of Civil and Environmental Engineering
Format:	Article
Language:	English
Published:	2023
Online Access:	https://hdl.handle.net/1721.1/148645

Similar Items

Reinforcement Learning for Non-Stationary Markov Decision Processes: The Blessing of (More) Optimism
by: Cheung, Wang Chi, et al.
Published: (2021)

Learning to Optimize Under Non-Stationarity
by: Cheung, Wang Chi, et al.
Published: (2021)

Hedging the Drift: Learning to Optimize Under Nonstationarity
by: Cheung, Wang Chi, et al.
Published: (2023)

Transience in countable MDPs
by: Kiefer, SM, et al.
Published: (2021)

Social Interactions as Recursive MDPs
by: Tejwani, Ravi, et al.
Published: (2022)

Meta Dynamic Pricing: Transfer Learning Across Experiments
by: Bastani, Hamsa, et al.
Published: (2023)

Towards Stable Reinforcement Learning in Non-Episodic Tasks
by: Karnik, Sathwik
Published: (2023)

Fast approximate hierarchical solution of MDPs
by: Barry, Jennifer L. (Jennifer Lynn)
Published: (2010)

Combining dynamic abstractions in large MDPs
by: Steinkraus, Kurt, et al.
Published: (2005)

Incorporating Rich Social Interactions Into MDPs
by: Tejwani, Ravi, et al.
Published: (2022)

Solving Dec-MDPs with options and intention recognition
by: Cruz, Gabriel, M. Eng. Massachusetts Institute of Technology
Published: (2016)

Adaptive Envelope MDPs for Relational Equivalence-based Planning
by: Gardiol, Natalia H., et al.
Published: (2008)

Batch-iFDD for representation expansion in large MDPs
by: Geramifard, Alborz, et al.
Published: (2015)

NP-Hardness of checking the unichain condition in average cost MDPs
by: Tsitsiklis, John N.
Published: (2012)

Optimized supply routing at Dell under non-stationary demand
by: Foreman, John William
Published: (2009)

Solving uncertain MDPs with objectives that are separable over instantiations of model uncertainty
by: Adulyasak, Yossiri, et al.
Published: (2018)

Exploring and Learning in Sparse Linear MDPs without Computationally Intractable Oracles
by: Golowich, Noah, et al.
Published: (2024)

Estimation of Stationary and Non-stationary Random Fields: Kriging in the Analysis of Orographic Precipitation
by: Chua, Siong Huat, et al.
Published: (2022)

Variational inference for non-stationary distributions
by: Mamikonyan, Arsen
Published: (2018)

Online Learning of Non-stationary Sequences
by: Monteleoni, Claire
Published: (2004)

Online learning of non-stationary sequences
by: Monteleoni, Claire E. (Claire Elizabeth), 1975-
Published: (2014)

Online Learning of Non-stationary Sequences
by: Monteleoni, Claire, et al.
Published: (2005)

Sampling Based Approaches for Minimizing Regret in Uncertain Markov Decision Processes (MDPs)
by: Ahmed, Asrar, et al.
Published: (2021)

Univariate generalized additive models for simulated stationary and non-stationary generalized Pareto distribution
by: Behzadi, Mostafa, et al.
Published: (2017)

Cluster analysis for non-stationary time series
by: Bing, Gao, et al.
Published: (2015)

Effective Learning in Non-Stationary Multiagent Environments
by: Kim, Dong Ki
Published: (2023)

Blind source separation for non-stationary mixing
by: Everson, R, et al.
Published: (2000)

Time-bounded mission planning in time-varying domains with semi-MDPS and Gaussian processes
by: Duckworth, P, et al.
Published: (2021)

Analyzing process flexibility: A distribution-free approach with partial expectations
by: Bidkhori, Hoda, et al.
Published: (2018)

Transient laws of non-stationary queueing systems and their applications
Published: (2003)

Transient laws of non-stationary queueing systems and their applications
Published: (2004)

Causal Inference: Heterogeneous Effects and Non-stationary Environments
by: Slavov, Stanislav
Published: (2022)

Optimal static pricing for a tree network
by: Caro, Felipe, et al.
Published: (2013)

Theory of a Stationary Current-Free Double Layer in a Collisionless Plasma
by: Ahedo, Eduardo, et al.
Published: (2010)

Bayesian extreme modeling for non-stationary air quality data
by: Mohd Amin, Nor Azrita, et al.
Published: (2013)

Handling non-stationary data streams under complex environments
by: Weng, Weiwei
Published: (2024)

Fluctuation relations in non-equilibrium stationary states of Ising models
by: Piscitelli, Antonio, et al.
Published: (2015)

Learning non-stationary SVBRDFs using GANs and differentiable rendering
by: Duinkharjav, Budmonde.
Published: (2019)

Learning to Schedule in Non-Stationary Wireless Networks With Unknown Statistics
by: Nguyen, Quang, et al.
Published: (2023)

Estimators for Persistent and Possibly Non-Stationary Data with Classical Properties
by: Gorodnichenko, Yuriy, et al.
Published: (2012)