Deep variational reinforcement learning for POMDPs

Deep variational reinforcement learning for POMDPs

Many real-world sequential decision making problems are partially observable by nature, and the environment model is typically unknown. Consequently, there is great need for reinforcement learning methods that can tackle such problems given only a stream of incomplete and noisy observations. In this...

Full description

Bibliographic Details
Main Authors:	Igl, M, Zintgraf, L, Le, T, Wood, F, Whiteson, S
Format:	Conference item
Published:	Journal of Machine Learning Research 2018

Similar Items

Exploration in approximate hyper-state space for meta reinforcement learning
by: Zintgraf, L, et al.
Published: (2021)

Reinforcement learning with limited reinforcement: Using Bayes risk for active learning in POMDPs
by: Pineau, Joelle, et al.
Published: (2017)

VariBAD: a very good method for Bayes-adaptive deep RL via meta-learning
by: Zintgraf, L, et al.
Published: (2020)

Multi-Agent Active Perception Based on Reinforcement Learning and POMDP
by: Tarik Selimovic, et al.
Published: (2024-01-01)

TreeQN and ATreeC: differentiable tree planning for deep reinforcement learning
by: Farquhar, G, et al.
Published: (2018)

Transient non−stationarity and generalisation in deep reinforcement learning
by: Igl, M, et al.
Published: (2021)

Incremental Clustering and Expansion for Faster Optimal Planning in Dec-POMDPs
by: Oliehoek, Frans A., et al.
Published: (2013)

Stick-breaking policy learning in Dec-POMDPs
by: Amato, Christopher, et al.
Published: (2016)

Inductive biases and generalisation for deep reinforcement learning
by: Igl, M
Published: (2021)

Fast adaptation via meta reinforcement learning
by: Zintgraf, L
Published: (2022)

An online algorithm for constrained POMDPs
by: Undurti, Aditya, et al.
Published: (2011)

Improved Deep Recurrent Q-Network of POMDPs for Automated Penetration Testing
by: Yue Zhang, et al.
Published: (2022-10-01)

Monte-Carlo planning in large POMDPs
by: Silver, David, et al.
Published: (2015)

Planning with Macro-Actions in Decentralized POMDPs
by: Amato, Christopher, et al.
Published: (2016)

RAO*: an Algorithm for Chance-Constrained POMDP’s
by: Santana, Pedro, et al.
Published: (2016)

Safe POMDP online planning via shielding
by: Sheng, S, et al.
Published: (2024)

Modeling and Planning with Macro-Actions in Decentralized POMDPs
by: Amato, Christopher, et al.
Published: (2021)

Sampling-based algorithms for continuous-time POMDPs
by: Chaudhari, Pratik Anil, et al.
Published: (2013)

Trust oriented decision making via POMDPs
by: Aravazhi Irissappane, Athirai
Published: (2016)

Policy Evaluation in Decentralized POMDPs With Belief Sharing
by: Mert Kayaalp, et al.
Published: (2023-01-01)

DGA domain detection and botnet prevention using Q-learning for POMDP
by: Y. V. Bubnov, et al.
Published: (2021-03-01)

Policy Improvement for POMDPs Using Normalized Importance Sampling
by: Shelton, Christian R.
Published: (2004)

Spatial and Temporal Abstractions in POMDPs Applied to Robot Navigation
by: Theocharous, Georgios, et al.
Published: (2005)

A POMDP Approach to Map Victims in Disaster Scenarios
by: Pedro Gabriel Villani, et al.
Published: (2024-11-01)

Spectrum Access Algoritbm Based on POMDP Model in CVANET
by: Xuefei Zhang, et al.
Published: (2014-09-01)

Spectrum Access Algoritbm Based on POMDP Model in CVANET
by: Xuefei Zhang, et al.
Published: (2014-09-01)

Bottom-up learning of hierarchical models in a class of deterministic POMDP environments
by: Itoh Hideaki, et al.
Published: (2015-09-01)

Deep residual reinforcement learning
by: Zhang, S, et al.
Published: (2020)

Efficient POMDP Forward Search by Predicting the Posterior Belief Distribution
by: Roy, Nicholas, et al.
Published: (2009)

Interference Coordination Based on POMDP in Multi-Cell OFDMA System
by: Qiang Wei, et al.
Published: (2013-04-01)

Cognitive radio auto-adaptive sensing algorithm based on POMDP
by: Rui-chen XU, et al.
Published: (2013-06-01)

Interference Coordination Based on POMDP in Multi-Cell OFDMA System
by: Qiang Wei, et al.
Published: (2013-04-01)

Cognitive radio auto-adaptive sensing algorithm based on POMDP
by: Rui-chen XU, et al.
Published: (2013-06-01)

Point-Based Policy Transformation: Adapting Policy to Changing POMDP Models
by: Kurniawati, Hanna, et al.
Published: (2019)

Recent Advances in Deep Reinforcement Learning Applications for Solving Partially Observable Markov Decision Processes (POMDP) Problems Part 2—Applications in Transportation, Industries, Communications and Networking and More Topics
by: Xuanchen Xiang, et al.
Published: (2021-10-01)

Recent Advances in Deep Reinforcement Learning Applications for Solving Partially Observable Markov Decision Processes (POMDP) Problems: Part 1—Fundamentals and Applications in Games, Robotics and Natural Language Processing
by: Xuanchen Xiang, et al.
Published: (2021-07-01)

CAR-DESPOT: causally-informed online POMDP planning for robots in confounded environments
by: Cannizzaro, R, et al.
Published: (2023)

DualSMC: Tunneling Differentiable Filtering and Planning under Continuous POMDPs
by: Wang, Yunbo, et al.
Published: (2021)

Personalized Cotesting Policies for Cervical Cancer Screening: A POMDP Approach
by: Malek Ebadi, et al.
Published: (2021-03-01)

A POMDP Framework for Coordinated Guidance of Autonomous UAVs for Multitarget Tracking
Published: (2009-03-01)