VariBAD: variational bayes-adaptive deep RL via meta-learning

VariBAD: variational bayes-adaptive deep RL via meta-learning

Trading off exploration and exploitation in an unknown environment is key to maximising expected online return during learning. A Bayes-optimal policy, which does so optimally, conditions its actions not only on the environment state but also on the agent's uncertainty about the environment. Co...

Full beskrivning

Bibliografiska uppgifter
Huvudupphovsman:	Whiteson, S
Materialtyp:	Journal article
Språk:	English
Publicerad:	Journal of Machine Learning Research 2021

Liknande verk

VariBAD: a very good method for Bayes-adaptive deep RL via meta-learning
av: Zintgraf, L, et al.
Publicerad: (2020)

Knowledge Transfer in Deep Reinforcement Learning via an RL-Specific GAN-Based Correspondence Function
av: Marko Ruman, et al.
Publicerad: (2024-01-01)

PharmRL: pharmacophore elucidation with deep geometric reinforcement learning
av: Rishal Aggarwal, et al.
Publicerad: (2024-12-01)

Fast Context Adaptation via Meta-Learning
av: Zintgraf, L, et al.
Publicerad: (2019)

Experience Replay Optimisation via ATSC and TSC for Performance Stability in Deep RL
av: Richard Sakyi Osei, et al.
Publicerad: (2023-02-01)

AC-RL: A Framework for Real-Time Control, Learning & Adaptation
av: Guha, Anubhav
Publicerad: (2023)

Learning to Utilize Curiosity: A New Approach of Automatic Curriculum Learning for Deep RL
av: Zeyang Lin, et al.
Publicerad: (2022-07-01)

RobustAnalog: Fast Variation-Aware Analog Circuit Design Via Multi-task RL
av: Shi, Wei, et al.
Publicerad: (2022)

Fiber Bundle Meta-learning Algorithm Based on Variational Bayes
av: LIU Yang, LI Fan-zhang
Publicerad: (2022-03-01)

Reinforcement learning (RL) based stock trading system via support vector machine
av: Ong, Zhi Yuan.
Publicerad: (2010)

GenFedRL: a general federated reinforcement learning framework for deep reinforcement learning agents
av: Biao JIN, et al.
Publicerad: (2023-06-01)

GenFedRL: a general federated reinforcement learning framework for deep reinforcement learning agents
av: Biao JIN, et al.
Publicerad: (2023-06-01)

Time-in-action RL
av: Jiangcheng Zhu, et al.
Publicerad: (2019-02-01)

RL-ANC: Reinforcement Learning-Based Adaptive Network Coding in the Ocean Mobile Internet of Things
av: Ying Zhang, et al.
Publicerad: (2024-06-01)

RL-AKF: An Adaptive Kalman Filter Navigation Algorithm Based on Reinforcement Learning for Ground Vehicles
av: Xile Gao, et al.
Publicerad: (2020-05-01)

Exploiting multiple abstractions in episodic RL via reward shaping
av: Cipollone, R, et al.
Publicerad: (2023)

Deep variational reinforcement learning for POMDPs
av: Igl, M, et al.
Publicerad: (2018)

rl4dtn: Q-Learning for Opportunistic Networks
av: Jorge Visca, et al.
Publicerad: (2022-11-01)

ACC-RL: Adaptive Congestion Control Based on Reinforcement Learning in Power Distribution Networks with Data Centers
av: Tairan Huang, et al.
Publicerad: (2023-07-01)

RL-SPIHT: Reinforcement Learning-Based Adaptive Selection of Compression Ratios for 1-D SPIHT Algorithm
av: Jin Shin, et al.
Publicerad: (2021-01-01)

RL-QPSO net: deep reinforcement learning-enhanced QPSO for efficient mobile robot path planning
av: Yang Jing, et al.
Publicerad: (2025-01-01)

iADA*-RL: Anytime Graph-Based Path Planning with Deep Reinforcement Learning for an Autonomous UAV
av: Aye Aye Maw, et al.
Publicerad: (2021-04-01)

Reflections of RL in The Virtual World
av: Andra Siibak
Publicerad: (2007-11-01)

Elimination of All Bad Local Minima in Deep Learning
av: Kawaguchi, Kenji, et al.
Publicerad: (2021)

Automation of digital crime investigation using Reinforcement Learning (RL)
av: Ghanem, Mohamed Chahine
Publicerad: (2023)

RL4CEP: reinforcement learning for updating CEP rules
av: Afef Mdhaffar, et al.
Publicerad: (2025-01-01)

CST-RL: Contrastive Spatio-Temporal Representations for Reinforcement Learning
av: Chi-Kai Ho, et al.
Publicerad: (2023-01-01)

ADAS-RL: Safety learning approach for stable autonomous driving
av: Dongsu Lee, et al.
Publicerad: (2022-09-01)

HLifeRL: A hierarchical lifelong reinforcement learning framework
av: Fan Ding, et al.
Publicerad: (2022-07-01)

RL-CWtrans Net: multimodal swimming coaching driven via robot vision
av: Guanlin Wang
Publicerad: (2024-08-01)

Improving Student Learning Outcomes Through the TaRL Learning Model on Discussion
av: Miftahunajah Aditiya Pratama
Publicerad: (2023-11-01)

Implementation of the TaRL Approach to Increase Student Learning Motivation in Physics Learning
av: Melinda Cahya Ningrum Ningrum, et al.
Publicerad: (2023-05-01)

Model-based RL in ATARI games
av: Akarapu, Bharadwaj
Publicerad: (2021)

Information asymmetry in KL-regularized RL
av: Galashov, A, et al.
Publicerad: (2018)

Model-Free RL or Action Sequences?
av: Adam Morris, et al.
Publicerad: (2019-12-01)

R.L. Moore : mathematician and teacher /
av: 236772 Parker, John
Publicerad: (2005)

Packet Size-Aware Broadcasting in VANETs With Fuzzy Logic and RL-Based Parameter Adaptation
av: Celimuge Wu, et al.
Publicerad: (2015-01-01)

RAMBO-RL: robust adversarial model-based offline reinforcement learning
av: Rigter, M, et al.
Publicerad: (2023)

SpaceRL — A reinforcement learning-based knowledge graph driver
av: Miguel Bermudo, et al.
Publicerad: (2025-05-01)

FleetRL: Realistic reinforcement learning environments for commercial vehicle fleets
av: Enzo Cording, et al.
Publicerad: (2024-05-01)