Predicting before acting: improving policy quality by taking a vision of consequence

Deep reinforcement learning has achieved great success in many fields. However, the agent may get trapped during the exploration, lingering around feckless states that pull the agent away from optimal policies. Thus it's worth studying how to improve learning strategies by forecasting future st...

Full description

Bibliographic Details
Main Authors: Xiaoshu Zhou, Fei Zhu, Peiyao Zhao
Format: Article
Language:English
Published: Taylor & Francis Group 2022-12-01
Series:Connection Science
Subjects:
Online Access:http://dx.doi.org/10.1080/09540091.2022.2025765