Volume-weighted Bellman error method for adaptive meshing in approximate dynamic programming

Optimal control and reinforcement learning have an associated “value function” that must be suitably approximated. Value function approximation problems usually have different precision requirements in different regions of the state space. A uniform grid wastes resources in regions in which the...

Bibliographic Details
Main Authors: Leopoldo Armesto, Antonio Sala
Format: Article
Language: Spanish
Published: Universitat Politecnica de Valencia 2021-12-01
Series: Revista Iberoamericana de Automática e Informática Industrial RIAI
Subjects:
Online Access: https://polipapers.upv.es/index.php/RIAI/article/view/15698