Combined Constraint on Behavior Cloning and Discriminator in Offline Reinforcement Learning
In recent years, reinforcement learning (RL) has received a lot of attention because we can automatically learn optimal behavioral policies. However, since RL acquires the policy by repeatedly interacting with the environment, it is difficult to learn about realistic tasks. In recent years, there ha...
Main Authors: | Shunya Kidera, Kosuke Shintani, Toi Tsuneda, Satoshi Yamane |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2024-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/10418100/ |
Similar Items
-
Offline Meta-Reinforcement Learning with Contrastive Prediction
by: HAN Xu, WU Feng
Published: (2023-08-01) -
Importance-Weighted Variational Inference Model Estimation for Offline Bayesian Model-Based Reinforcement Learning
by: Toru Hishinuma, et al.
Published: (2023-01-01) -
Optimizing trajectories for highway driving with offline reinforcement learning
by: Branka Mirchevska, et al.
Published: (2023-05-01) -
Corrigendum: Optimizing trajectories for highway driving with offline reinforcement learning
by: Branka Mirchevska, et al.
Published: (2023-12-01) -
Pruning With Scaled Policy Constraints for Light-Weight Reinforcement Learning
by: Seongmin Park, et al.
Published: (2024-01-01)