Reinforcement learning with limited reinforcement: Using Bayes risk for active learning in POMDPs

Acting in domains where an agent must plan several steps ahead to achieve a goal can be a challenging task, especially if the agentʼs sensors provide only noisy or partial information. In this setting, Partially Observable Markov Decision Processes (POMDPs) provide a planning framework that optimall...

Full description

Bibliographic Details
Main Authors:	Pineau, Joelle, Doshi-Velez, Finale P, Roy, Nicholas
Other Authors:	Massachusetts Institute of Technology. Department of Aeronautics and Astronautics
Format:	Article
Language:	en_US
Published:	Elsevier 2017
Online Access:	http://hdl.handle.net/1721.1/108303 https://orcid.org/0000-0002-8293-0492

Internet

http://hdl.handle.net/1721.1/108303
https://orcid.org/0000-0002-8293-0492

Reinforcement learning with limited reinforcement: Using Bayes risk for active learning in POMDPs

Internet

Similar Items