VariBAD: variational bayes-adaptive deep RL via meta-learning

VariBAD: variational bayes-adaptive deep RL via meta-learning

Trading off exploration and exploitation in an unknown environment is key to maximising expected online return during learning. A Bayes-optimal policy, which does so optimally, conditions its actions not only on the environment state but also on the agent's uncertainty about the environment. Co...

Volledige beschrijving

Bibliografische gegevens
Hoofdauteur:	Whiteson, S
Formaat:	Journal article
Taal:	English
Gepubliceerd in:	Journal of Machine Learning Research 2021

Gelijkaardige items

VariBAD: a very good method for Bayes-adaptive deep RL via meta-learning
door: Zintgraf, L, et al.
Gepubliceerd in: (2020)

Knowledge Transfer in Deep Reinforcement Learning via an RL-Specific GAN-Based Correspondence Function
door: Marko Ruman, et al.
Gepubliceerd in: (2024-01-01)

PharmRL: pharmacophore elucidation with deep geometric reinforcement learning
door: Rishal Aggarwal, et al.
Gepubliceerd in: (2024-12-01)

Fast Context Adaptation via Meta-Learning
door: Zintgraf, L, et al.
Gepubliceerd in: (2019)

Experience Replay Optimisation via ATSC and TSC for Performance Stability in Deep RL
door: Richard Sakyi Osei, et al.
Gepubliceerd in: (2023-02-01)

AC-RL: A Framework for Real-Time Control, Learning & Adaptation
door: Guha, Anubhav
Gepubliceerd in: (2023)

Learning to Utilize Curiosity: A New Approach of Automatic Curriculum Learning for Deep RL
door: Zeyang Lin, et al.
Gepubliceerd in: (2022-07-01)

RobustAnalog: Fast Variation-Aware Analog Circuit Design Via Multi-task RL
door: Shi, Wei, et al.
Gepubliceerd in: (2022)

Fiber Bundle Meta-learning Algorithm Based on Variational Bayes
door: LIU Yang, LI Fan-zhang
Gepubliceerd in: (2022-03-01)

Reinforcement learning (RL) based stock trading system via support vector machine
door: Ong, Zhi Yuan.
Gepubliceerd in: (2010)

GenFedRL: a general federated reinforcement learning framework for deep reinforcement learning agents
door: Biao JIN, et al.
Gepubliceerd in: (2023-06-01)

GenFedRL: a general federated reinforcement learning framework for deep reinforcement learning agents
door: Biao JIN, et al.
Gepubliceerd in: (2023-06-01)

Time-in-action RL
door: Jiangcheng Zhu, et al.
Gepubliceerd in: (2019-02-01)

RL-ANC: Reinforcement Learning-Based Adaptive Network Coding in the Ocean Mobile Internet of Things
door: Ying Zhang, et al.
Gepubliceerd in: (2024-06-01)

RL-AKF: An Adaptive Kalman Filter Navigation Algorithm Based on Reinforcement Learning for Ground Vehicles
door: Xile Gao, et al.
Gepubliceerd in: (2020-05-01)

Exploiting multiple abstractions in episodic RL via reward shaping
door: Cipollone, R, et al.
Gepubliceerd in: (2023)

Deep variational reinforcement learning for POMDPs
door: Igl, M, et al.
Gepubliceerd in: (2018)

rl4dtn: Q-Learning for Opportunistic Networks
door: Jorge Visca, et al.
Gepubliceerd in: (2022-11-01)

ACC-RL: Adaptive Congestion Control Based on Reinforcement Learning in Power Distribution Networks with Data Centers
door: Tairan Huang, et al.
Gepubliceerd in: (2023-07-01)

RL-SPIHT: Reinforcement Learning-Based Adaptive Selection of Compression Ratios for 1-D SPIHT Algorithm
door: Jin Shin, et al.
Gepubliceerd in: (2021-01-01)

RL-QPSO net: deep reinforcement learning-enhanced QPSO for efficient mobile robot path planning
door: Yang Jing, et al.
Gepubliceerd in: (2025-01-01)

iADA*-RL: Anytime Graph-Based Path Planning with Deep Reinforcement Learning for an Autonomous UAV
door: Aye Aye Maw, et al.
Gepubliceerd in: (2021-04-01)

Reflections of RL in The Virtual World
door: Andra Siibak
Gepubliceerd in: (2007-11-01)

Elimination of All Bad Local Minima in Deep Learning
door: Kawaguchi, Kenji, et al.
Gepubliceerd in: (2021)

Automation of digital crime investigation using Reinforcement Learning (RL)
door: Ghanem, Mohamed Chahine
Gepubliceerd in: (2023)

RL4CEP: reinforcement learning for updating CEP rules
door: Afef Mdhaffar, et al.
Gepubliceerd in: (2025-01-01)

CST-RL: Contrastive Spatio-Temporal Representations for Reinforcement Learning
door: Chi-Kai Ho, et al.
Gepubliceerd in: (2023-01-01)

ADAS-RL: Safety learning approach for stable autonomous driving
door: Dongsu Lee, et al.
Gepubliceerd in: (2022-09-01)

HLifeRL: A hierarchical lifelong reinforcement learning framework
door: Fan Ding, et al.
Gepubliceerd in: (2022-07-01)

RL-CWtrans Net: multimodal swimming coaching driven via robot vision
door: Guanlin Wang
Gepubliceerd in: (2024-08-01)

Improving Student Learning Outcomes Through the TaRL Learning Model on Discussion
door: Miftahunajah Aditiya Pratama
Gepubliceerd in: (2023-11-01)

Implementation of the TaRL Approach to Increase Student Learning Motivation in Physics Learning
door: Melinda Cahya Ningrum Ningrum, et al.
Gepubliceerd in: (2023-05-01)

Model-based RL in ATARI games
door: Akarapu, Bharadwaj
Gepubliceerd in: (2021)

Information asymmetry in KL-regularized RL
door: Galashov, A, et al.
Gepubliceerd in: (2018)

Model-Free RL or Action Sequences?
door: Adam Morris, et al.
Gepubliceerd in: (2019-12-01)

R.L. Moore : mathematician and teacher /
door: 236772 Parker, John
Gepubliceerd in: (2005)

Packet Size-Aware Broadcasting in VANETs With Fuzzy Logic and RL-Based Parameter Adaptation
door: Celimuge Wu, et al.
Gepubliceerd in: (2015-01-01)

RAMBO-RL: robust adversarial model-based offline reinforcement learning
door: Rigter, M, et al.
Gepubliceerd in: (2023)

SpaceRL — A reinforcement learning-based knowledge graph driver
door: Miguel Bermudo, et al.
Gepubliceerd in: (2025-05-01)

FleetRL: Realistic reinforcement learning environments for commercial vehicle fleets
door: Enzo Cording, et al.
Gepubliceerd in: (2024-05-01)