VariBAD: variational bayes-adaptive deep RL via meta-learning

VariBAD: variational bayes-adaptive deep RL via meta-learning

Trading off exploration and exploitation in an unknown environment is key to maximising expected online return during learning. A Bayes-optimal policy, which does so optimally, conditions its actions not only on the environment state but also on the agent's uncertainty about the environment. Co...

Cur síos iomlán

Sonraí bibleagrafaíochta
Príomhchruthaitheoir:	Whiteson, S
Formáid:	Journal article
Teanga:	English
Foilsithe / Cruthaithe:	Journal of Machine Learning Research 2021

Míreanna comhchosúla

VariBAD: a very good method for Bayes-adaptive deep RL via meta-learning
de réir: Zintgraf, L, et al.
Foilsithe / Cruthaithe: (2020)

Knowledge Transfer in Deep Reinforcement Learning via an RL-Specific GAN-Based Correspondence Function
de réir: Marko Ruman, et al.
Foilsithe / Cruthaithe: (2024-01-01)

PharmRL: pharmacophore elucidation with deep geometric reinforcement learning
de réir: Rishal Aggarwal, et al.
Foilsithe / Cruthaithe: (2024-12-01)

Fast Context Adaptation via Meta-Learning
de réir: Zintgraf, L, et al.
Foilsithe / Cruthaithe: (2019)

Experience Replay Optimisation via ATSC and TSC for Performance Stability in Deep RL
de réir: Richard Sakyi Osei, et al.
Foilsithe / Cruthaithe: (2023-02-01)

AC-RL: A Framework for Real-Time Control, Learning & Adaptation
de réir: Guha, Anubhav
Foilsithe / Cruthaithe: (2023)

Learning to Utilize Curiosity: A New Approach of Automatic Curriculum Learning for Deep RL
de réir: Zeyang Lin, et al.
Foilsithe / Cruthaithe: (2022-07-01)

RobustAnalog: Fast Variation-Aware Analog Circuit Design Via Multi-task RL
de réir: Shi, Wei, et al.
Foilsithe / Cruthaithe: (2022)

Fiber Bundle Meta-learning Algorithm Based on Variational Bayes
de réir: LIU Yang, LI Fan-zhang
Foilsithe / Cruthaithe: (2022-03-01)

Reinforcement learning (RL) based stock trading system via support vector machine
de réir: Ong, Zhi Yuan.
Foilsithe / Cruthaithe: (2010)

GenFedRL: a general federated reinforcement learning framework for deep reinforcement learning agents
de réir: Biao JIN, et al.
Foilsithe / Cruthaithe: (2023-06-01)

GenFedRL: a general federated reinforcement learning framework for deep reinforcement learning agents
de réir: Biao JIN, et al.
Foilsithe / Cruthaithe: (2023-06-01)

Time-in-action RL
de réir: Jiangcheng Zhu, et al.
Foilsithe / Cruthaithe: (2019-02-01)

RL-ANC: Reinforcement Learning-Based Adaptive Network Coding in the Ocean Mobile Internet of Things
de réir: Ying Zhang, et al.
Foilsithe / Cruthaithe: (2024-06-01)

RL-AKF: An Adaptive Kalman Filter Navigation Algorithm Based on Reinforcement Learning for Ground Vehicles
de réir: Xile Gao, et al.
Foilsithe / Cruthaithe: (2020-05-01)

Exploiting multiple abstractions in episodic RL via reward shaping
de réir: Cipollone, R, et al.
Foilsithe / Cruthaithe: (2023)

Deep variational reinforcement learning for POMDPs
de réir: Igl, M, et al.
Foilsithe / Cruthaithe: (2018)

rl4dtn: Q-Learning for Opportunistic Networks
de réir: Jorge Visca, et al.
Foilsithe / Cruthaithe: (2022-11-01)

ACC-RL: Adaptive Congestion Control Based on Reinforcement Learning in Power Distribution Networks with Data Centers
de réir: Tairan Huang, et al.
Foilsithe / Cruthaithe: (2023-07-01)

RL-SPIHT: Reinforcement Learning-Based Adaptive Selection of Compression Ratios for 1-D SPIHT Algorithm
de réir: Jin Shin, et al.
Foilsithe / Cruthaithe: (2021-01-01)

RL-QPSO net: deep reinforcement learning-enhanced QPSO for efficient mobile robot path planning
de réir: Yang Jing, et al.
Foilsithe / Cruthaithe: (2025-01-01)

iADA*-RL: Anytime Graph-Based Path Planning with Deep Reinforcement Learning for an Autonomous UAV
de réir: Aye Aye Maw, et al.
Foilsithe / Cruthaithe: (2021-04-01)

Reflections of RL in The Virtual World
de réir: Andra Siibak
Foilsithe / Cruthaithe: (2007-11-01)

Elimination of All Bad Local Minima in Deep Learning
de réir: Kawaguchi, Kenji, et al.
Foilsithe / Cruthaithe: (2021)

Automation of digital crime investigation using Reinforcement Learning (RL)
de réir: Ghanem, Mohamed Chahine
Foilsithe / Cruthaithe: (2023)

RL4CEP: reinforcement learning for updating CEP rules
de réir: Afef Mdhaffar, et al.
Foilsithe / Cruthaithe: (2025-01-01)

CST-RL: Contrastive Spatio-Temporal Representations for Reinforcement Learning
de réir: Chi-Kai Ho, et al.
Foilsithe / Cruthaithe: (2023-01-01)

ADAS-RL: Safety learning approach for stable autonomous driving
de réir: Dongsu Lee, et al.
Foilsithe / Cruthaithe: (2022-09-01)

HLifeRL: A hierarchical lifelong reinforcement learning framework
de réir: Fan Ding, et al.
Foilsithe / Cruthaithe: (2022-07-01)

RL-CWtrans Net: multimodal swimming coaching driven via robot vision
de réir: Guanlin Wang
Foilsithe / Cruthaithe: (2024-08-01)

Improving Student Learning Outcomes Through the TaRL Learning Model on Discussion
de réir: Miftahunajah Aditiya Pratama
Foilsithe / Cruthaithe: (2023-11-01)

Implementation of the TaRL Approach to Increase Student Learning Motivation in Physics Learning
de réir: Melinda Cahya Ningrum Ningrum, et al.
Foilsithe / Cruthaithe: (2023-05-01)

Model-based RL in ATARI games
de réir: Akarapu, Bharadwaj
Foilsithe / Cruthaithe: (2021)

Information asymmetry in KL-regularized RL
de réir: Galashov, A, et al.
Foilsithe / Cruthaithe: (2018)

Model-Free RL or Action Sequences?
de réir: Adam Morris, et al.
Foilsithe / Cruthaithe: (2019-12-01)

R.L. Moore : mathematician and teacher /
de réir: 236772 Parker, John
Foilsithe / Cruthaithe: (2005)

Packet Size-Aware Broadcasting in VANETs With Fuzzy Logic and RL-Based Parameter Adaptation
de réir: Celimuge Wu, et al.
Foilsithe / Cruthaithe: (2015-01-01)

RAMBO-RL: robust adversarial model-based offline reinforcement learning
de réir: Rigter, M, et al.
Foilsithe / Cruthaithe: (2023)

SpaceRL — A reinforcement learning-based knowledge graph driver
de réir: Miguel Bermudo, et al.
Foilsithe / Cruthaithe: (2025-05-01)

FleetRL: Realistic reinforcement learning environments for commercial vehicle fleets
de réir: Enzo Cording, et al.
Foilsithe / Cruthaithe: (2024-05-01)