Transformer Decoder-Based Enhanced Exploration Method to Alleviate Initial Exploration Problems in Reinforcement Learning

In reinforcement learning, the epsilon (ε)-greedy strategy is commonly employed as an exploration technique This method, however, leads to extensive initial exploration and prolonged learning periods. Existing approaches to mitigate this issue involve constraining the exploration range using expert...

Full description

Bibliographic Details
Main Authors:	Dohyun Kyoung, Yunsick Sung
Format:	Article
Language:	English
Published:	MDPI AG 2023-08-01
Series:	Sensors
Subjects:	machine learning reinforcement learning pretraining exploration transformer-decoder
Online Access:	https://www.mdpi.com/1424-8220/23/17/7411

Internet

https://www.mdpi.com/1424-8220/23/17/7411

Transformer Decoder-Based Enhanced Exploration Method to Alleviate Initial Exploration Problems in Reinforcement Learning

Internet

Similar Items