VariBAD: variational bayes-adaptive deep RL via meta-learning

VariBAD: variational bayes-adaptive deep RL via meta-learning

Trading off exploration and exploitation in an unknown environment is key to maximising expected online return during learning. A Bayes-optimal policy, which does so optimally, conditions its actions not only on the environment state but also on the agent's uncertainty about the environment. Co...

Descrición completa

Detalles Bibliográficos
Autor Principal:	Whiteson, S
Formato:	Journal article
Idioma:	English
Publicado:	Journal of Machine Learning Research 2021

Títulos similares

VariBAD: a very good method for Bayes-adaptive deep RL via meta-learning
por: Zintgraf, L, et al.
Publicado: (2020)

Knowledge Transfer in Deep Reinforcement Learning via an RL-Specific GAN-Based Correspondence Function
por: Marko Ruman, et al.
Publicado: (2024-01-01)

PharmRL: pharmacophore elucidation with deep geometric reinforcement learning
por: Rishal Aggarwal, et al.
Publicado: (2024-12-01)

Fast Context Adaptation via Meta-Learning
por: Zintgraf, L, et al.
Publicado: (2019)

Experience Replay Optimisation via ATSC and TSC for Performance Stability in Deep RL
por: Richard Sakyi Osei, et al.
Publicado: (2023-02-01)

AC-RL: A Framework for Real-Time Control, Learning & Adaptation
por: Guha, Anubhav
Publicado: (2023)

Learning to Utilize Curiosity: A New Approach of Automatic Curriculum Learning for Deep RL
por: Zeyang Lin, et al.
Publicado: (2022-07-01)

RobustAnalog: Fast Variation-Aware Analog Circuit Design Via Multi-task RL
por: Shi, Wei, et al.
Publicado: (2022)

Fiber Bundle Meta-learning Algorithm Based on Variational Bayes
por: LIU Yang, LI Fan-zhang
Publicado: (2022-03-01)

Reinforcement learning (RL) based stock trading system via support vector machine
por: Ong, Zhi Yuan.
Publicado: (2010)

GenFedRL: a general federated reinforcement learning framework for deep reinforcement learning agents
por: Biao JIN, et al.
Publicado: (2023-06-01)

GenFedRL: a general federated reinforcement learning framework for deep reinforcement learning agents
por: Biao JIN, et al.
Publicado: (2023-06-01)

Time-in-action RL
por: Jiangcheng Zhu, et al.
Publicado: (2019-02-01)

RL-ANC: Reinforcement Learning-Based Adaptive Network Coding in the Ocean Mobile Internet of Things
por: Ying Zhang, et al.
Publicado: (2024-06-01)

RL-AKF: An Adaptive Kalman Filter Navigation Algorithm Based on Reinforcement Learning for Ground Vehicles
por: Xile Gao, et al.
Publicado: (2020-05-01)

Exploiting multiple abstractions in episodic RL via reward shaping
por: Cipollone, R, et al.
Publicado: (2023)

Deep variational reinforcement learning for POMDPs
por: Igl, M, et al.
Publicado: (2018)

rl4dtn: Q-Learning for Opportunistic Networks
por: Jorge Visca, et al.
Publicado: (2022-11-01)

ACC-RL: Adaptive Congestion Control Based on Reinforcement Learning in Power Distribution Networks with Data Centers
por: Tairan Huang, et al.
Publicado: (2023-07-01)

RL-SPIHT: Reinforcement Learning-Based Adaptive Selection of Compression Ratios for 1-D SPIHT Algorithm
por: Jin Shin, et al.
Publicado: (2021-01-01)

RL-QPSO net: deep reinforcement learning-enhanced QPSO for efficient mobile robot path planning
por: Yang Jing, et al.
Publicado: (2025-01-01)

iADA*-RL: Anytime Graph-Based Path Planning with Deep Reinforcement Learning for an Autonomous UAV
por: Aye Aye Maw, et al.
Publicado: (2021-04-01)

Reflections of RL in The Virtual World
por: Andra Siibak
Publicado: (2007-11-01)

Elimination of All Bad Local Minima in Deep Learning
por: Kawaguchi, Kenji, et al.
Publicado: (2021)

Automation of digital crime investigation using Reinforcement Learning (RL)
por: Ghanem, Mohamed Chahine
Publicado: (2023)

RL4CEP: reinforcement learning for updating CEP rules
por: Afef Mdhaffar, et al.
Publicado: (2025-01-01)

CST-RL: Contrastive Spatio-Temporal Representations for Reinforcement Learning
por: Chi-Kai Ho, et al.
Publicado: (2023-01-01)

ADAS-RL: Safety learning approach for stable autonomous driving
por: Dongsu Lee, et al.
Publicado: (2022-09-01)

HLifeRL: A hierarchical lifelong reinforcement learning framework
por: Fan Ding, et al.
Publicado: (2022-07-01)

RL-CWtrans Net: multimodal swimming coaching driven via robot vision
por: Guanlin Wang
Publicado: (2024-08-01)

Improving Student Learning Outcomes Through the TaRL Learning Model on Discussion
por: Miftahunajah Aditiya Pratama
Publicado: (2023-11-01)

Implementation of the TaRL Approach to Increase Student Learning Motivation in Physics Learning
por: Melinda Cahya Ningrum Ningrum, et al.
Publicado: (2023-05-01)

Model-based RL in ATARI games
por: Akarapu, Bharadwaj
Publicado: (2021)

Information asymmetry in KL-regularized RL
por: Galashov, A, et al.
Publicado: (2018)

Model-Free RL or Action Sequences?
por: Adam Morris, et al.
Publicado: (2019-12-01)

R.L. Moore : mathematician and teacher /
por: 236772 Parker, John
Publicado: (2005)

Packet Size-Aware Broadcasting in VANETs With Fuzzy Logic and RL-Based Parameter Adaptation
por: Celimuge Wu, et al.
Publicado: (2015-01-01)

RAMBO-RL: robust adversarial model-based offline reinforcement learning
por: Rigter, M, et al.
Publicado: (2023)

SpaceRL — A reinforcement learning-based knowledge graph driver
por: Miguel Bermudo, et al.
Publicado: (2025-05-01)

FleetRL: Realistic reinforcement learning environments for commercial vehicle fleets
por: Enzo Cording, et al.
Publicado: (2024-05-01)