Predicting before acting: improving policy quality by taking a vision of consequence

Deep reinforcement learning has achieved great success in many fields. However, the agent may get trapped during the exploration, lingering around feckless states that pull the agent away from optimal policies. Thus it's worth studying how to improve learning strategies by forecasting future st...

Full description

Bibliographic Details
Main Authors:	Xiaoshu Zhou, Fei Zhu, Peiyao Zhao
Format:	Article
Language:	English
Published:	Taylor & Francis Group 2022-12-01
Series:	Connection Science
Subjects:	reinforcement learning policy optimisation exploration belief representation prediction
Online Access:	http://dx.doi.org/10.1080/09540091.2022.2025765

Internet

http://dx.doi.org/10.1080/09540091.2022.2025765

Predicting before acting: improving policy quality by taking a vision of consequence

Internet

Similar Items