Automatic shaping and decomposition of reward functions

This paper investigates the problem of automatically learning how to restructure the reward function of a Markov decision process so as to speed up reinforcement learning. We begin by describing a method that learns a shaped reward function given a set of state and temporal abstractions. Next, we cons...

Bibliographic Details
Main Author:	Marthi, Bhaskara
Other Authors:	Leslie Kaelbling
Published:	2007
Online Access:	http://hdl.handle.net/1721.1/35890

Similar Items