VariBAD: variational bayes-adaptive deep RL via meta-learning

VariBAD: variational bayes-adaptive deep RL via meta-learning

Trading off exploration and exploitation in an unknown environment is key to maximising expected online return during learning. A Bayes-optimal policy, which does so optimally, conditions its actions not only on the environment state but also on the agent's uncertainty about the environment. Co...

Description complète

Détails bibliographiques
Auteur principal:	Whiteson, S
Format:	Journal article
Langue:	English
Publié:	Journal of Machine Learning Research 2021

Documents similaires

VariBAD: a very good method for Bayes-adaptive deep RL via meta-learning
par: Zintgraf, L, et autres
Publié: (2020)

Knowledge Transfer in Deep Reinforcement Learning via an RL-Specific GAN-Based Correspondence Function
par: Marko Ruman, et autres
Publié: (2024-01-01)

PharmRL: pharmacophore elucidation with deep geometric reinforcement learning
par: Rishal Aggarwal, et autres
Publié: (2024-12-01)

Fast Context Adaptation via Meta-Learning
par: Zintgraf, L, et autres
Publié: (2019)

Experience Replay Optimisation via ATSC and TSC for Performance Stability in Deep RL
par: Richard Sakyi Osei, et autres
Publié: (2023-02-01)

AC-RL: A Framework for Real-Time Control, Learning & Adaptation
par: Guha, Anubhav
Publié: (2023)

Learning to Utilize Curiosity: A New Approach of Automatic Curriculum Learning for Deep RL
par: Zeyang Lin, et autres
Publié: (2022-07-01)

RobustAnalog: Fast Variation-Aware Analog Circuit Design Via Multi-task RL
par: Shi, Wei, et autres
Publié: (2022)

Fiber Bundle Meta-learning Algorithm Based on Variational Bayes
par: LIU Yang, LI Fan-zhang
Publié: (2022-03-01)

Reinforcement learning (RL) based stock trading system via support vector machine
par: Ong, Zhi Yuan.
Publié: (2010)

GenFedRL: a general federated reinforcement learning framework for deep reinforcement learning agents
par: Biao JIN, et autres
Publié: (2023-06-01)

GenFedRL: a general federated reinforcement learning framework for deep reinforcement learning agents
par: Biao JIN, et autres
Publié: (2023-06-01)

Time-in-action RL
par: Jiangcheng Zhu, et autres
Publié: (2019-02-01)

RL-ANC: Reinforcement Learning-Based Adaptive Network Coding in the Ocean Mobile Internet of Things
par: Ying Zhang, et autres
Publié: (2024-06-01)

RL-AKF: An Adaptive Kalman Filter Navigation Algorithm Based on Reinforcement Learning for Ground Vehicles
par: Xile Gao, et autres
Publié: (2020-05-01)

Exploiting multiple abstractions in episodic RL via reward shaping
par: Cipollone, R, et autres
Publié: (2023)

Deep variational reinforcement learning for POMDPs
par: Igl, M, et autres
Publié: (2018)

rl4dtn: Q-Learning for Opportunistic Networks
par: Jorge Visca, et autres
Publié: (2022-11-01)

ACC-RL: Adaptive Congestion Control Based on Reinforcement Learning in Power Distribution Networks with Data Centers
par: Tairan Huang, et autres
Publié: (2023-07-01)

RL-SPIHT: Reinforcement Learning-Based Adaptive Selection of Compression Ratios for 1-D SPIHT Algorithm
par: Jin Shin, et autres
Publié: (2021-01-01)

RL-QPSO net: deep reinforcement learning-enhanced QPSO for efficient mobile robot path planning
par: Yang Jing, et autres
Publié: (2025-01-01)

iADA*-RL: Anytime Graph-Based Path Planning with Deep Reinforcement Learning for an Autonomous UAV
par: Aye Aye Maw, et autres
Publié: (2021-04-01)

Reflections of RL in The Virtual World
par: Andra Siibak
Publié: (2007-11-01)

Elimination of All Bad Local Minima in Deep Learning
par: Kawaguchi, Kenji, et autres
Publié: (2021)

Automation of digital crime investigation using Reinforcement Learning (RL)
par: Ghanem, Mohamed Chahine
Publié: (2023)

RL4CEP: reinforcement learning for updating CEP rules
par: Afef Mdhaffar, et autres
Publié: (2025-01-01)

CST-RL: Contrastive Spatio-Temporal Representations for Reinforcement Learning
par: Chi-Kai Ho, et autres
Publié: (2023-01-01)

ADAS-RL: Safety learning approach for stable autonomous driving
par: Dongsu Lee, et autres
Publié: (2022-09-01)

HLifeRL: A hierarchical lifelong reinforcement learning framework
par: Fan Ding, et autres
Publié: (2022-07-01)

RL-CWtrans Net: multimodal swimming coaching driven via robot vision
par: Guanlin Wang
Publié: (2024-08-01)

Improving Student Learning Outcomes Through the TaRL Learning Model on Discussion
par: Miftahunajah Aditiya Pratama
Publié: (2023-11-01)

Implementation of the TaRL Approach to Increase Student Learning Motivation in Physics Learning
par: Melinda Cahya Ningrum Ningrum, et autres
Publié: (2023-05-01)

Model-based RL in ATARI games
par: Akarapu, Bharadwaj
Publié: (2021)

Information asymmetry in KL-regularized RL
par: Galashov, A, et autres
Publié: (2018)

Model-Free RL or Action Sequences?
par: Adam Morris, et autres
Publié: (2019-12-01)

R.L. Moore : mathematician and teacher /
par: 236772 Parker, John
Publié: (2005)

Packet Size-Aware Broadcasting in VANETs With Fuzzy Logic and RL-Based Parameter Adaptation
par: Celimuge Wu, et autres
Publié: (2015-01-01)

RAMBO-RL: robust adversarial model-based offline reinforcement learning
par: Rigter, M, et autres
Publié: (2023)

SpaceRL — A reinforcement learning-based knowledge graph driver
par: Miguel Bermudo, et autres
Publié: (2025-05-01)

FleetRL: Realistic reinforcement learning environments for commercial vehicle fleets
par: Enzo Cording, et autres
Publié: (2024-05-01)