Q-Learning with the Variable Box Method: A Case Study to Land a Solid Rocket

Some critical tasks require refined actions near the target, for instance, steering a car in a crowded parking lot or landing a rocket. These tasks are critical because failure to comply with the constraints near the target may lead to a fatal (unrecoverable) condition. Thus, a higher resolution act...

Full description

Bibliographic Details
Main Authors: Alejandro Tevera-Ruiz, Rodolfo Garcia-Rodriguez, Vicente Parra-Vega, Luis Enrique Ramos-Velasco
Format: Article
Language:English
Published: MDPI AG 2023-02-01
Series:Machines
Subjects:
Online Access:https://www.mdpi.com/2075-1702/11/2/214