VariBAD: variational bayes-adaptive deep RL via meta-learning

VariBAD: variational bayes-adaptive deep RL via meta-learning

Trading off exploration and exploitation in an unknown environment is key to maximising expected online return during learning. A Bayes-optimal policy, which does so optimally, conditions its actions not only on the environment state but also on the agent's uncertainty about the environment. Co...

Ausführliche Beschreibung

Bibliographische Detailangaben
1. Verfasser:	Whiteson, S
Format:	Journal article
Sprache:	English
Veröffentlicht:	Journal of Machine Learning Research 2021

Ähnliche Einträge

VariBAD: a very good method for Bayes-adaptive deep RL via meta-learning
von: Zintgraf, L, et al.
Veröffentlicht: (2020)

Knowledge Transfer in Deep Reinforcement Learning via an RL-Specific GAN-Based Correspondence Function
von: Marko Ruman, et al.
Veröffentlicht: (2024-01-01)

PharmRL: pharmacophore elucidation with deep geometric reinforcement learning
von: Rishal Aggarwal, et al.
Veröffentlicht: (2024-12-01)

Fast Context Adaptation via Meta-Learning
von: Zintgraf, L, et al.
Veröffentlicht: (2019)

Experience Replay Optimisation via ATSC and TSC for Performance Stability in Deep RL
von: Richard Sakyi Osei, et al.
Veröffentlicht: (2023-02-01)

AC-RL: A Framework for Real-Time Control, Learning & Adaptation
von: Guha, Anubhav
Veröffentlicht: (2023)

Learning to Utilize Curiosity: A New Approach of Automatic Curriculum Learning for Deep RL
von: Zeyang Lin, et al.
Veröffentlicht: (2022-07-01)

RobustAnalog: Fast Variation-Aware Analog Circuit Design Via Multi-task RL
von: Shi, Wei, et al.
Veröffentlicht: (2022)

Fiber Bundle Meta-learning Algorithm Based on Variational Bayes
von: LIU Yang, LI Fan-zhang
Veröffentlicht: (2022-03-01)

Reinforcement learning (RL) based stock trading system via support vector machine
von: Ong, Zhi Yuan.
Veröffentlicht: (2010)

GenFedRL: a general federated reinforcement learning framework for deep reinforcement learning agents
von: Biao JIN, et al.
Veröffentlicht: (2023-06-01)

GenFedRL: a general federated reinforcement learning framework for deep reinforcement learning agents
von: Biao JIN, et al.
Veröffentlicht: (2023-06-01)

Time-in-action RL
von: Jiangcheng Zhu, et al.
Veröffentlicht: (2019-02-01)

RL-ANC: Reinforcement Learning-Based Adaptive Network Coding in the Ocean Mobile Internet of Things
von: Ying Zhang, et al.
Veröffentlicht: (2024-06-01)

RL-AKF: An Adaptive Kalman Filter Navigation Algorithm Based on Reinforcement Learning for Ground Vehicles
von: Xile Gao, et al.
Veröffentlicht: (2020-05-01)

Exploiting multiple abstractions in episodic RL via reward shaping
von: Cipollone, R, et al.
Veröffentlicht: (2023)

Deep variational reinforcement learning for POMDPs
von: Igl, M, et al.
Veröffentlicht: (2018)

rl4dtn: Q-Learning for Opportunistic Networks
von: Jorge Visca, et al.
Veröffentlicht: (2022-11-01)

ACC-RL: Adaptive Congestion Control Based on Reinforcement Learning in Power Distribution Networks with Data Centers
von: Tairan Huang, et al.
Veröffentlicht: (2023-07-01)

RL-SPIHT: Reinforcement Learning-Based Adaptive Selection of Compression Ratios for 1-D SPIHT Algorithm
von: Jin Shin, et al.
Veröffentlicht: (2021-01-01)

RL-QPSO net: deep reinforcement learning-enhanced QPSO for efficient mobile robot path planning
von: Yang Jing, et al.
Veröffentlicht: (2025-01-01)

iADA*-RL: Anytime Graph-Based Path Planning with Deep Reinforcement Learning for an Autonomous UAV
von: Aye Aye Maw, et al.
Veröffentlicht: (2021-04-01)

Reflections of RL in The Virtual World
von: Andra Siibak
Veröffentlicht: (2007-11-01)

Elimination of All Bad Local Minima in Deep Learning
von: Kawaguchi, Kenji, et al.
Veröffentlicht: (2021)

Automation of digital crime investigation using Reinforcement Learning (RL)
von: Ghanem, Mohamed Chahine
Veröffentlicht: (2023)

RL4CEP: reinforcement learning for updating CEP rules
von: Afef Mdhaffar, et al.
Veröffentlicht: (2025-01-01)

CST-RL: Contrastive Spatio-Temporal Representations for Reinforcement Learning
von: Chi-Kai Ho, et al.
Veröffentlicht: (2023-01-01)

ADAS-RL: Safety learning approach for stable autonomous driving
von: Dongsu Lee, et al.
Veröffentlicht: (2022-09-01)

HLifeRL: A hierarchical lifelong reinforcement learning framework
von: Fan Ding, et al.
Veröffentlicht: (2022-07-01)

RL-CWtrans Net: multimodal swimming coaching driven via robot vision
von: Guanlin Wang
Veröffentlicht: (2024-08-01)

Improving Student Learning Outcomes Through the TaRL Learning Model on Discussion
von: Miftahunajah Aditiya Pratama
Veröffentlicht: (2023-11-01)

Implementation of the TaRL Approach to Increase Student Learning Motivation in Physics Learning
von: Melinda Cahya Ningrum Ningrum, et al.
Veröffentlicht: (2023-05-01)

Model-based RL in ATARI games
von: Akarapu, Bharadwaj
Veröffentlicht: (2021)

Information asymmetry in KL-regularized RL
von: Galashov, A, et al.
Veröffentlicht: (2018)

Model-Free RL or Action Sequences?
von: Adam Morris, et al.
Veröffentlicht: (2019-12-01)

R.L. Moore : mathematician and teacher /
von: 236772 Parker, John
Veröffentlicht: (2005)

Packet Size-Aware Broadcasting in VANETs With Fuzzy Logic and RL-Based Parameter Adaptation
von: Celimuge Wu, et al.
Veröffentlicht: (2015-01-01)

RAMBO-RL: robust adversarial model-based offline reinforcement learning
von: Rigter, M, et al.
Veröffentlicht: (2023)

SpaceRL — A reinforcement learning-based knowledge graph driver
von: Miguel Bermudo, et al.
Veröffentlicht: (2025-05-01)

FleetRL: Realistic reinforcement learning environments for commercial vehicle fleets
von: Enzo Cording, et al.
Veröffentlicht: (2024-05-01)