Efficient reinforcement learning via singular value decomposition, end-to-end model-based methods and reward shaping
Reinforcement learning (RL) provides a general framework for data-driven decision making. However, the very same generality that makes this approach applicable to a wide range of problems is also responsible for its well-known inefficiencies. In this thesis, we consider different properties which ar...
Main Author: | Gehring, Clement |
---|---|
Other Authors: | Kaelbling, Leslie Pack |
Format: | Thesis |
Published: |
Massachusetts Institute of Technology
2022
|
Online Access: | https://hdl.handle.net/1721.1/144562 |
Similar Items
-
Adaptive Reward Method for End-to-End Cooperation Based on Multi-agent Reinforcement Learning
by: SHI Dian-xi, ZHAO Chen-ran, ZHANG Yao-wen, YANG Shao-wu, ZHANG Yong-jun
Published: (2022-08-01) -
Automatic shaping and decomposition of reward functions
by: Marthi, Bhaskara
Published: (2007) -
Singular value decomposition /
by: Zulezzah Abd Halim, et al.
Published: (2010) -
Reward Shaping Based Federated Reinforcement Learning
by: Yiqiu Hu, et al.
Published: (2021-01-01) -
Learning to route efficiently with end-to-end feedback : the value of (identifiable) networked structure
by: Zhu, Ruihao, S. M. Massachusetts Institute of Technology
Published: (2019)