VariBAD: variational bayes-adaptive deep RL via meta-learning

VariBAD: variational bayes-adaptive deep RL via meta-learning

Trading off exploration and exploitation in an unknown environment is key to maximising expected online return during learning. A Bayes-optimal policy, which does so optimally, conditions its actions not only on the environment state but also on the agent's uncertainty about the environment. Co...

Descripció completa

Dades bibliogràfiques
Autor principal:	Whiteson, S
Format:	Journal article
Idioma:	English
Publicat:	Journal of Machine Learning Research 2021

Ítems similars

VariBAD: a very good method for Bayes-adaptive deep RL via meta-learning
per: Zintgraf, L, et al.
Publicat: (2020)

Knowledge Transfer in Deep Reinforcement Learning via an RL-Specific GAN-Based Correspondence Function
per: Marko Ruman, et al.
Publicat: (2024-01-01)

PharmRL: pharmacophore elucidation with deep geometric reinforcement learning
per: Rishal Aggarwal, et al.
Publicat: (2024-12-01)

Fast Context Adaptation via Meta-Learning
per: Zintgraf, L, et al.
Publicat: (2019)

Experience Replay Optimisation via ATSC and TSC for Performance Stability in Deep RL
per: Richard Sakyi Osei, et al.
Publicat: (2023-02-01)

AC-RL: A Framework for Real-Time Control, Learning & Adaptation
per: Guha, Anubhav
Publicat: (2023)

Learning to Utilize Curiosity: A New Approach of Automatic Curriculum Learning for Deep RL
per: Zeyang Lin, et al.
Publicat: (2022-07-01)

RobustAnalog: Fast Variation-Aware Analog Circuit Design Via Multi-task RL
per: Shi, Wei, et al.
Publicat: (2022)

Fiber Bundle Meta-learning Algorithm Based on Variational Bayes
per: LIU Yang, LI Fan-zhang
Publicat: (2022-03-01)

Reinforcement learning (RL) based stock trading system via support vector machine
per: Ong, Zhi Yuan.
Publicat: (2010)

GenFedRL: a general federated reinforcement learning framework for deep reinforcement learning agents
per: Biao JIN, et al.
Publicat: (2023-06-01)

GenFedRL: a general federated reinforcement learning framework for deep reinforcement learning agents
per: Biao JIN, et al.
Publicat: (2023-06-01)

Time-in-action RL
per: Jiangcheng Zhu, et al.
Publicat: (2019-02-01)

RL-ANC: Reinforcement Learning-Based Adaptive Network Coding in the Ocean Mobile Internet of Things
per: Ying Zhang, et al.
Publicat: (2024-06-01)

RL-AKF: An Adaptive Kalman Filter Navigation Algorithm Based on Reinforcement Learning for Ground Vehicles
per: Xile Gao, et al.
Publicat: (2020-05-01)

Exploiting multiple abstractions in episodic RL via reward shaping
per: Cipollone, R, et al.
Publicat: (2023)

Deep variational reinforcement learning for POMDPs
per: Igl, M, et al.
Publicat: (2018)

rl4dtn: Q-Learning for Opportunistic Networks
per: Jorge Visca, et al.
Publicat: (2022-11-01)

ACC-RL: Adaptive Congestion Control Based on Reinforcement Learning in Power Distribution Networks with Data Centers
per: Tairan Huang, et al.
Publicat: (2023-07-01)

RL-SPIHT: Reinforcement Learning-Based Adaptive Selection of Compression Ratios for 1-D SPIHT Algorithm
per: Jin Shin, et al.
Publicat: (2021-01-01)

RL-QPSO net: deep reinforcement learning-enhanced QPSO for efficient mobile robot path planning
per: Yang Jing, et al.
Publicat: (2025-01-01)

iADA*-RL: Anytime Graph-Based Path Planning with Deep Reinforcement Learning for an Autonomous UAV
per: Aye Aye Maw, et al.
Publicat: (2021-04-01)

Reflections of RL in The Virtual World
per: Andra Siibak
Publicat: (2007-11-01)

Elimination of All Bad Local Minima in Deep Learning
per: Kawaguchi, Kenji, et al.
Publicat: (2021)

Automation of digital crime investigation using Reinforcement Learning (RL)
per: Ghanem, Mohamed Chahine
Publicat: (2023)

RL4CEP: reinforcement learning for updating CEP rules
per: Afef Mdhaffar, et al.
Publicat: (2025-01-01)

CST-RL: Contrastive Spatio-Temporal Representations for Reinforcement Learning
per: Chi-Kai Ho, et al.
Publicat: (2023-01-01)

ADAS-RL: Safety learning approach for stable autonomous driving
per: Dongsu Lee, et al.
Publicat: (2022-09-01)

HLifeRL: A hierarchical lifelong reinforcement learning framework
per: Fan Ding, et al.
Publicat: (2022-07-01)

RL-CWtrans Net: multimodal swimming coaching driven via robot vision
per: Guanlin Wang
Publicat: (2024-08-01)

Improving Student Learning Outcomes Through the TaRL Learning Model on Discussion
per: Miftahunajah Aditiya Pratama
Publicat: (2023-11-01)

Implementation of the TaRL Approach to Increase Student Learning Motivation in Physics Learning
per: Melinda Cahya Ningrum Ningrum, et al.
Publicat: (2023-05-01)

Model-based RL in ATARI games
per: Akarapu, Bharadwaj
Publicat: (2021)

Information asymmetry in KL-regularized RL
per: Galashov, A, et al.
Publicat: (2018)

Model-Free RL or Action Sequences?
per: Adam Morris, et al.
Publicat: (2019-12-01)

R.L. Moore : mathematician and teacher /
per: 236772 Parker, John
Publicat: (2005)

Packet Size-Aware Broadcasting in VANETs With Fuzzy Logic and RL-Based Parameter Adaptation
per: Celimuge Wu, et al.
Publicat: (2015-01-01)

RAMBO-RL: robust adversarial model-based offline reinforcement learning
per: Rigter, M, et al.
Publicat: (2023)

SpaceRL — A reinforcement learning-based knowledge graph driver
per: Miguel Bermudo, et al.
Publicat: (2025-05-01)

FleetRL: Realistic reinforcement learning environments for commercial vehicle fleets
per: Enzo Cording, et al.
Publicat: (2024-05-01)