VariBAD: variational Bayes-adaptive deep RL via meta-learning

Trading off exploration and exploitation in an unknown environment is key to maximising expected online return during learning. A Bayes-optimal policy, which does so optimally, conditions its actions not only on the environment state but also on the agent's uncertainty about the environment. Co...

Bibliographic Details
Main author:	Whiteson, S
Format:	Journal article
Language:	English
Published:	Journal of Machine Learning Research 2021

Similar items

VariBAD: a very good method for Bayes-adaptive deep RL via meta-learning
by: Zintgraf, L, et al.
Published: (2020)

Knowledge Transfer in Deep Reinforcement Learning via an RL-Specific GAN-Based Correspondence Function
by: Marko Ruman, et al.
Published: (2024-01-01)

PharmRL: pharmacophore elucidation with deep geometric reinforcement learning
by: Rishal Aggarwal, et al.
Published: (2024-12-01)

Fast Context Adaptation via Meta-Learning
by: Zintgraf, L, et al.
Published: (2019)

Experience Replay Optimisation via ATSC and TSC for Performance Stability in Deep RL
by: Richard Sakyi Osei, et al.
Published: (2023-02-01)

AC-RL: A Framework for Real-Time Control, Learning & Adaptation
by: Guha, Anubhav
Published: (2023)

Learning to Utilize Curiosity: A New Approach of Automatic Curriculum Learning for Deep RL
by: Zeyang Lin, et al.
Published: (2022-07-01)

RobustAnalog: Fast Variation-Aware Analog Circuit Design Via Multi-task RL
by: Shi, Wei, et al.
Published: (2022)

Fiber Bundle Meta-learning Algorithm Based on Variational Bayes
by: LIU Yang, LI Fan-zhang
Published: (2022-03-01)

Reinforcement learning (RL) based stock trading system via support vector machine
by: Ong, Zhi Yuan.
Published: (2010)

GenFedRL: a general federated reinforcement learning framework for deep reinforcement learning agents
by: Biao JIN, et al.
Published: (2023-06-01)

Time-in-action RL
by: Jiangcheng Zhu, et al.
Published: (2019-02-01)

RL-ANC: Reinforcement Learning-Based Adaptive Network Coding in the Ocean Mobile Internet of Things
by: Ying Zhang, et al.
Published: (2024-06-01)

RL-AKF: An Adaptive Kalman Filter Navigation Algorithm Based on Reinforcement Learning for Ground Vehicles
by: Xile Gao, et al.
Published: (2020-05-01)

Exploiting multiple abstractions in episodic RL via reward shaping
by: Cipollone, R, et al.
Published: (2023)

Deep variational reinforcement learning for POMDPs
by: Igl, M, et al.
Published: (2018)

rl4dtn: Q-Learning for Opportunistic Networks
by: Jorge Visca, et al.
Published: (2022-11-01)

ACC-RL: Adaptive Congestion Control Based on Reinforcement Learning in Power Distribution Networks with Data Centers
by: Tairan Huang, et al.
Published: (2023-07-01)

RL-SPIHT: Reinforcement Learning-Based Adaptive Selection of Compression Ratios for 1-D SPIHT Algorithm
by: Jin Shin, et al.
Published: (2021-01-01)

RL-QPSO net: deep reinforcement learning-enhanced QPSO for efficient mobile robot path planning
by: Yang Jing, et al.
Published: (2025-01-01)

iADA*-RL: Anytime Graph-Based Path Planning with Deep Reinforcement Learning for an Autonomous UAV
by: Aye Aye Maw, et al.
Published: (2021-04-01)

Reflections of RL in The Virtual World
by: Andra Siibak
Published: (2007-11-01)

Elimination of All Bad Local Minima in Deep Learning
by: Kawaguchi, Kenji, et al.
Published: (2021)

Automation of digital crime investigation using Reinforcement Learning (RL)
by: Ghanem, Mohamed Chahine
Published: (2023)

RL4CEP: reinforcement learning for updating CEP rules
by: Afef Mdhaffar, et al.
Published: (2025-01-01)

CST-RL: Contrastive Spatio-Temporal Representations for Reinforcement Learning
by: Chi-Kai Ho, et al.
Published: (2023-01-01)

ADAS-RL: Safety learning approach for stable autonomous driving
by: Dongsu Lee, et al.
Published: (2022-09-01)

HLifeRL: A hierarchical lifelong reinforcement learning framework
by: Fan Ding, et al.
Published: (2022-07-01)

RL-CWtrans Net: multimodal swimming coaching driven via robot vision
by: Guanlin Wang
Published: (2024-08-01)

Improving Student Learning Outcomes Through the TaRL Learning Model on Discussion
by: Miftahunajah Aditiya Pratama
Published: (2023-11-01)

Implementation of the TaRL Approach to Increase Student Learning Motivation in Physics Learning
by: Melinda Cahya Ningrum Ningrum, et al.
Published: (2023-05-01)

Model-based RL in ATARI games
by: Akarapu, Bharadwaj
Published: (2021)

Information asymmetry in KL-regularized RL
by: Galashov, A, et al.
Published: (2018)

Model-Free RL or Action Sequences?
by: Adam Morris, et al.
Published: (2019-12-01)

R.L. Moore: mathematician and teacher
by: Parker, John
Published: (2005)

Packet Size-Aware Broadcasting in VANETs With Fuzzy Logic and RL-Based Parameter Adaptation
by: Celimuge Wu, et al.
Published: (2015-01-01)

RAMBO-RL: robust adversarial model-based offline reinforcement learning
by: Rigter, M, et al.
Published: (2023)

SpaceRL — A reinforcement learning-based knowledge graph driver
by: Miguel Bermudo, et al.
Published: (2025-05-01)

FleetRL: Realistic reinforcement learning environments for commercial vehicle fleets
by: Enzo Cording, et al.
Published: (2024-05-01)