Improving PAC exploration using the median of means

We present the first application of the median of means in a PAC exploration algorithm for MDPs. Using the median of means allows us to significantly reduce the dependence of our bounds on the range of values that the value function can take, while introducing a dependence on the (potentially much s...

Full description

Bibliographic Details
Main Authors:	Pazis, Jason, How, Jonathan P
Other Authors:	Massachusetts Institute of Technology. Aerospace Controls Laboratory
Format:	Article
Published:	Neural Information Processing Systems Foundation 2018
Online Access:	http://hdl.handle.net/1721.1/114290 https://orcid.org/0000-0001-8576-1930

Internet

http://hdl.handle.net/1721.1/114290
https://orcid.org/0000-0001-8576-1930

Improving PAC exploration using the median of means

Internet

Similar Items