The Essential Dynamics Algorithm: Essential Results
This paper presents a novel algorithm for learning in a class of stochastic Markov decision processes (MDPs) with continuous state and action spaces that trades speed for accuracy. A transform of the stochastic MDP into a deterministic one is presented which captures the essence of the original...
Main Author: | Martin, Martin C. |
---|---|
Language: | en_US |
Published: |
2004
|
Subjects: | |
Online Access: | http://hdl.handle.net/1721.1/6718 |
Similar Items
-
Reinforcement Learning by Policy Search
by: Peshkin, Leonid
Published: (2004) -
A Structured Multiarmed Bandit Problem and the Greedy Policy
by: Rusmevichientong, Paat, et al.
Published: (2010) -
Multi depot dynamic vehicle routing problem with stochastic road capacity for emergency medical supply delivery in humanitarian logistics
by: Anuar, Wadi Khalid
Published: (2022) -
Towards Feature Selection In Actor-Critic Algorithms
by: Rohanimanesh, Khashayar, et al.
Published: (2007) -
Learning with Deictic Representation
by: Finney, Sarah, et al.
Published: (2004)