Exploration and value function factorisation in single and multi-agent reinforcement learning
<p>The ability to learn from data is crucial in developing satisfactory solutions to many complex problems. In particular, in the design of intelligent agents that exist and interact with a complex environment in the pursuit of some goal. In this thesis we investigate some bottlenecks that can...
Prif Awdur: | |
---|---|
Awduron Eraill: | |
Fformat: | Traethawd Ymchwil |
Iaith: | English |
Cyhoeddwyd: |
2021
|
Pynciau: |