VariBAD: variational bayes-adaptive deep RL via meta-learning

VariBAD: variational bayes-adaptive deep RL via meta-learning

Trading off exploration and exploitation in an unknown environment is key to maximising expected online return during learning. A Bayes-optimal policy, which does so optimally, conditions its actions not only on the environment state but also on the agent's uncertainty about the environment. Co...

Szczegółowa specyfikacja

Opis bibliograficzny
1. autor:	Whiteson, S
Format:	Journal article
Język:	English
Wydane:	Journal of Machine Learning Research 2021

Podobne zapisy

VariBAD: a very good method for Bayes-adaptive deep RL via meta-learning
od: Zintgraf, L, i wsp.
Wydane: (2020)

Knowledge Transfer in Deep Reinforcement Learning via an RL-Specific GAN-Based Correspondence Function
od: Marko Ruman, i wsp.
Wydane: (2024-01-01)

PharmRL: pharmacophore elucidation with deep geometric reinforcement learning
od: Rishal Aggarwal, i wsp.
Wydane: (2024-12-01)

Fast Context Adaptation via Meta-Learning
od: Zintgraf, L, i wsp.
Wydane: (2019)

Experience Replay Optimisation via ATSC and TSC for Performance Stability in Deep RL
od: Richard Sakyi Osei, i wsp.
Wydane: (2023-02-01)

AC-RL: A Framework for Real-Time Control, Learning & Adaptation
od: Guha, Anubhav
Wydane: (2023)

Learning to Utilize Curiosity: A New Approach of Automatic Curriculum Learning for Deep RL
od: Zeyang Lin, i wsp.
Wydane: (2022-07-01)

RobustAnalog: Fast Variation-Aware Analog Circuit Design Via Multi-task RL
od: Shi, Wei, i wsp.
Wydane: (2022)

Fiber Bundle Meta-learning Algorithm Based on Variational Bayes
od: LIU Yang, LI Fan-zhang
Wydane: (2022-03-01)

Reinforcement learning (RL) based stock trading system via support vector machine
od: Ong, Zhi Yuan.
Wydane: (2010)

GenFedRL: a general federated reinforcement learning framework for deep reinforcement learning agents
od: Biao JIN, i wsp.
Wydane: (2023-06-01)

GenFedRL: a general federated reinforcement learning framework for deep reinforcement learning agents
od: Biao JIN, i wsp.
Wydane: (2023-06-01)

Time-in-action RL
od: Jiangcheng Zhu, i wsp.
Wydane: (2019-02-01)

RL-ANC: Reinforcement Learning-Based Adaptive Network Coding in the Ocean Mobile Internet of Things
od: Ying Zhang, i wsp.
Wydane: (2024-06-01)

RL-AKF: An Adaptive Kalman Filter Navigation Algorithm Based on Reinforcement Learning for Ground Vehicles
od: Xile Gao, i wsp.
Wydane: (2020-05-01)

Exploiting multiple abstractions in episodic RL via reward shaping
od: Cipollone, R, i wsp.
Wydane: (2023)

Deep variational reinforcement learning for POMDPs
od: Igl, M, i wsp.
Wydane: (2018)

rl4dtn: Q-Learning for Opportunistic Networks
od: Jorge Visca, i wsp.
Wydane: (2022-11-01)

ACC-RL: Adaptive Congestion Control Based on Reinforcement Learning in Power Distribution Networks with Data Centers
od: Tairan Huang, i wsp.
Wydane: (2023-07-01)

RL-SPIHT: Reinforcement Learning-Based Adaptive Selection of Compression Ratios for 1-D SPIHT Algorithm
od: Jin Shin, i wsp.
Wydane: (2021-01-01)

RL-QPSO net: deep reinforcement learning-enhanced QPSO for efficient mobile robot path planning
od: Yang Jing, i wsp.
Wydane: (2025-01-01)

iADA*-RL: Anytime Graph-Based Path Planning with Deep Reinforcement Learning for an Autonomous UAV
od: Aye Aye Maw, i wsp.
Wydane: (2021-04-01)

Reflections of RL in The Virtual World
od: Andra Siibak
Wydane: (2007-11-01)

Elimination of All Bad Local Minima in Deep Learning
od: Kawaguchi, Kenji, i wsp.
Wydane: (2021)

Automation of digital crime investigation using Reinforcement Learning (RL)
od: Ghanem, Mohamed Chahine
Wydane: (2023)

RL4CEP: reinforcement learning for updating CEP rules
od: Afef Mdhaffar, i wsp.
Wydane: (2025-01-01)

CST-RL: Contrastive Spatio-Temporal Representations for Reinforcement Learning
od: Chi-Kai Ho, i wsp.
Wydane: (2023-01-01)

ADAS-RL: Safety learning approach for stable autonomous driving
od: Dongsu Lee, i wsp.
Wydane: (2022-09-01)

HLifeRL: A hierarchical lifelong reinforcement learning framework
od: Fan Ding, i wsp.
Wydane: (2022-07-01)

RL-CWtrans Net: multimodal swimming coaching driven via robot vision
od: Guanlin Wang
Wydane: (2024-08-01)

Improving Student Learning Outcomes Through the TaRL Learning Model on Discussion
od: Miftahunajah Aditiya Pratama
Wydane: (2023-11-01)

Implementation of the TaRL Approach to Increase Student Learning Motivation in Physics Learning
od: Melinda Cahya Ningrum Ningrum, i wsp.
Wydane: (2023-05-01)

Model-based RL in ATARI games
od: Akarapu, Bharadwaj
Wydane: (2021)

Information asymmetry in KL-regularized RL
od: Galashov, A, i wsp.
Wydane: (2018)

Model-Free RL or Action Sequences?
od: Adam Morris, i wsp.
Wydane: (2019-12-01)

R.L. Moore : mathematician and teacher /
od: 236772 Parker, John
Wydane: (2005)

Packet Size-Aware Broadcasting in VANETs With Fuzzy Logic and RL-Based Parameter Adaptation
od: Celimuge Wu, i wsp.
Wydane: (2015-01-01)

RAMBO-RL: robust adversarial model-based offline reinforcement learning
od: Rigter, M, i wsp.
Wydane: (2023)

SpaceRL — A reinforcement learning-based knowledge graph driver
od: Miguel Bermudo, i wsp.
Wydane: (2025-05-01)

FleetRL: Realistic reinforcement learning environments for commercial vehicle fleets
od: Enzo Cording, i wsp.
Wydane: (2024-05-01)