VariBAD: variational bayes-adaptive deep RL via meta-learning

VariBAD: variational bayes-adaptive deep RL via meta-learning

Trading off exploration and exploitation in an unknown environment is key to maximising expected online return during learning. A Bayes-optimal policy, which does so optimally, conditions its actions not only on the environment state but also on the agent's uncertainty about the environment. Co...

ver descrição completa

Detalhes bibliográficos
Autor principal:	Whiteson, S
Formato:	Journal article
Idioma:	English
Publicado em:	Journal of Machine Learning Research 2021

Registos relacionados

VariBAD: a very good method for Bayes-adaptive deep RL via meta-learning
Por: Zintgraf, L, et al.
Publicado em: (2020)

Knowledge Transfer in Deep Reinforcement Learning via an RL-Specific GAN-Based Correspondence Function
Por: Marko Ruman, et al.
Publicado em: (2024-01-01)

PharmRL: pharmacophore elucidation with deep geometric reinforcement learning
Por: Rishal Aggarwal, et al.
Publicado em: (2024-12-01)

Fast Context Adaptation via Meta-Learning
Por: Zintgraf, L, et al.
Publicado em: (2019)

Experience Replay Optimisation via ATSC and TSC for Performance Stability in Deep RL
Por: Richard Sakyi Osei, et al.
Publicado em: (2023-02-01)

AC-RL: A Framework for Real-Time Control, Learning & Adaptation
Por: Guha, Anubhav
Publicado em: (2023)

Learning to Utilize Curiosity: A New Approach of Automatic Curriculum Learning for Deep RL
Por: Zeyang Lin, et al.
Publicado em: (2022-07-01)

RobustAnalog: Fast Variation-Aware Analog Circuit Design Via Multi-task RL
Por: Shi, Wei, et al.
Publicado em: (2022)

Fiber Bundle Meta-learning Algorithm Based on Variational Bayes
Por: LIU Yang, LI Fan-zhang
Publicado em: (2022-03-01)

Reinforcement learning (RL) based stock trading system via support vector machine
Por: Ong, Zhi Yuan.
Publicado em: (2010)

GenFedRL: a general federated reinforcement learning framework for deep reinforcement learning agents
Por: Biao JIN, et al.
Publicado em: (2023-06-01)

GenFedRL: a general federated reinforcement learning framework for deep reinforcement learning agents
Por: Biao JIN, et al.
Publicado em: (2023-06-01)

Time-in-action RL
Por: Jiangcheng Zhu, et al.
Publicado em: (2019-02-01)

RL-ANC: Reinforcement Learning-Based Adaptive Network Coding in the Ocean Mobile Internet of Things
Por: Ying Zhang, et al.
Publicado em: (2024-06-01)

RL-AKF: An Adaptive Kalman Filter Navigation Algorithm Based on Reinforcement Learning for Ground Vehicles
Por: Xile Gao, et al.
Publicado em: (2020-05-01)

Exploiting multiple abstractions in episodic RL via reward shaping
Por: Cipollone, R, et al.
Publicado em: (2023)

Deep variational reinforcement learning for POMDPs
Por: Igl, M, et al.
Publicado em: (2018)

rl4dtn: Q-Learning for Opportunistic Networks
Por: Jorge Visca, et al.
Publicado em: (2022-11-01)

ACC-RL: Adaptive Congestion Control Based on Reinforcement Learning in Power Distribution Networks with Data Centers
Por: Tairan Huang, et al.
Publicado em: (2023-07-01)

RL-SPIHT: Reinforcement Learning-Based Adaptive Selection of Compression Ratios for 1-D SPIHT Algorithm
Por: Jin Shin, et al.
Publicado em: (2021-01-01)

RL-QPSO net: deep reinforcement learning-enhanced QPSO for efficient mobile robot path planning
Por: Yang Jing, et al.
Publicado em: (2025-01-01)

iADA*-RL: Anytime Graph-Based Path Planning with Deep Reinforcement Learning for an Autonomous UAV
Por: Aye Aye Maw, et al.
Publicado em: (2021-04-01)

Reflections of RL in The Virtual World
Por: Andra Siibak
Publicado em: (2007-11-01)

Elimination of All Bad Local Minima in Deep Learning
Por: Kawaguchi, Kenji, et al.
Publicado em: (2021)

Automation of digital crime investigation using Reinforcement Learning (RL)
Por: Ghanem, Mohamed Chahine
Publicado em: (2023)

RL4CEP: reinforcement learning for updating CEP rules
Por: Afef Mdhaffar, et al.
Publicado em: (2025-01-01)

CST-RL: Contrastive Spatio-Temporal Representations for Reinforcement Learning
Por: Chi-Kai Ho, et al.
Publicado em: (2023-01-01)

ADAS-RL: Safety learning approach for stable autonomous driving
Por: Dongsu Lee, et al.
Publicado em: (2022-09-01)

HLifeRL: A hierarchical lifelong reinforcement learning framework
Por: Fan Ding, et al.
Publicado em: (2022-07-01)

RL-CWtrans Net: multimodal swimming coaching driven via robot vision
Por: Guanlin Wang
Publicado em: (2024-08-01)

Improving Student Learning Outcomes Through the TaRL Learning Model on Discussion
Por: Miftahunajah Aditiya Pratama
Publicado em: (2023-11-01)

Implementation of the TaRL Approach to Increase Student Learning Motivation in Physics Learning
Por: Melinda Cahya Ningrum Ningrum, et al.
Publicado em: (2023-05-01)

Model-based RL in ATARI games
Por: Akarapu, Bharadwaj
Publicado em: (2021)

Information asymmetry in KL-regularized RL
Por: Galashov, A, et al.
Publicado em: (2018)

Model-Free RL or Action Sequences?
Por: Adam Morris, et al.
Publicado em: (2019-12-01)

R.L. Moore : mathematician and teacher /
Por: 236772 Parker, John
Publicado em: (2005)

Packet Size-Aware Broadcasting in VANETs With Fuzzy Logic and RL-Based Parameter Adaptation
Por: Celimuge Wu, et al.
Publicado em: (2015-01-01)

RAMBO-RL: robust adversarial model-based offline reinforcement learning
Por: Rigter, M, et al.
Publicado em: (2023)

SpaceRL — A reinforcement learning-based knowledge graph driver
Por: Miguel Bermudo, et al.
Publicado em: (2025-05-01)

FleetRL: Realistic reinforcement learning environments for commercial vehicle fleets
Por: Enzo Cording, et al.
Publicado em: (2024-05-01)