Deep variational reinforcement learning for POMDPs

Many real-world sequential decision making problems are partially observable by nature, and the environment model is typically unknown. Consequently, there is great need for reinforcement learning methods that can tackle such problems given only a stream of incomplete and noisy observations. In this...

Bibliographic details
Main authors:	Igl, M, Zintgraf, L, Le, T, Wood, F, Whiteson, S
Format:	Conference item
Published:	Journal of Machine Learning Research 2018

Similar items

Exploration in approximate hyper-state space for meta reinforcement learning
by: Zintgraf, L, et al.
Published: (2021)

Reinforcement learning with limited reinforcement: Using Bayes risk for active learning in POMDPs
by: Pineau, Joelle, et al.
Published: (2017)

VariBAD: a very good method for Bayes-adaptive deep RL via meta-learning
by: Zintgraf, L, et al.
Published: (2020)

Multi-Agent Active Perception Based on Reinforcement Learning and POMDP
by: Tarik Selimovic, et al.
Published: (2024-01-01)

TreeQN and ATreeC: differentiable tree planning for deep reinforcement learning
by: Farquhar, G, et al.
Published: (2018)

Transient non-stationarity and generalisation in deep reinforcement learning
by: Igl, M, et al.
Published: (2021)

Incremental Clustering and Expansion for Faster Optimal Planning in Dec-POMDPs
by: Oliehoek, Frans A., et al.
Published: (2013)

Stick-breaking policy learning in Dec-POMDPs
by: Amato, Christopher, et al.
Published: (2016)

Inductive biases and generalisation for deep reinforcement learning
by: Igl, M
Published: (2021)

Fast adaptation via meta reinforcement learning
by: Zintgraf, L
Published: (2022)

An online algorithm for constrained POMDPs
by: Undurti, Aditya, et al.
Published: (2011)

Improved Deep Recurrent Q-Network of POMDPs for Automated Penetration Testing
by: Yue Zhang, et al.
Published: (2022-10-01)

Monte-Carlo planning in large POMDPs
by: Silver, David, et al.
Published: (2015)

Planning with Macro-Actions in Decentralized POMDPs
by: Amato, Christopher, et al.
Published: (2016)

RAO*: an Algorithm for Chance-Constrained POMDP’s
by: Santana, Pedro, et al.
Published: (2016)

Safe POMDP online planning via shielding
by: Sheng, S, et al.
Published: (2024)

Modeling and Planning with Macro-Actions in Decentralized POMDPs
by: Amato, Christopher, et al.
Published: (2021)

Sampling-based algorithms for continuous-time POMDPs
by: Chaudhari, Pratik Anil, et al.
Published: (2013)

Trust oriented decision making via POMDPs
by: Aravazhi Irissappane, Athirai
Published: (2016)

Policy Evaluation in Decentralized POMDPs With Belief Sharing
by: Mert Kayaalp, et al.
Published: (2023-01-01)

DGA domain detection and botnet prevention using Q-learning for POMDP
by: Y. V. Bubnov, et al.
Published: (2021-03-01)

Policy Improvement for POMDPs Using Normalized Importance Sampling
by: Shelton, Christian R.
Published: (2004)

Spatial and Temporal Abstractions in POMDPs Applied to Robot Navigation
by: Theocharous, Georgios, et al.
Published: (2005)

A POMDP Approach to Map Victims in Disaster Scenarios
by: Pedro Gabriel Villani, et al.
Published: (2024-11-01)

Spectrum Access Algorithm Based on POMDP Model in CVANET
by: Xuefei Zhang, et al.
Published: (2014-09-01)

Bottom-up learning of hierarchical models in a class of deterministic POMDP environments
by: Itoh Hideaki, et al.
Published: (2015-09-01)

Deep residual reinforcement learning
by: Zhang, S, et al.
Published: (2020)

Efficient POMDP Forward Search by Predicting the Posterior Belief Distribution
by: Roy, Nicholas, et al.
Published: (2009)

Interference Coordination Based on POMDP in Multi-Cell OFDMA System
by: Qiang Wei, et al.
Published: (2013-04-01)

Cognitive radio auto-adaptive sensing algorithm based on POMDP
by: Rui-chen XU, et al.
Published: (2013-06-01)

Point-Based Policy Transformation: Adapting Policy to Changing POMDP Models
by: Kurniawati, Hanna, et al.
Published: (2019)

Recent Advances in Deep Reinforcement Learning Applications for Solving Partially Observable Markov Decision Processes (POMDP) Problems Part 2—Applications in Transportation, Industries, Communications and Networking and More Topics
by: Xuanchen Xiang, et al.
Published: (2021-10-01)

Recent Advances in Deep Reinforcement Learning Applications for Solving Partially Observable Markov Decision Processes (POMDP) Problems: Part 1—Fundamentals and Applications in Games, Robotics and Natural Language Processing
by: Xuanchen Xiang, et al.
Published: (2021-07-01)

CAR-DESPOT: causally-informed online POMDP planning for robots in confounded environments
by: Cannizzaro, R, et al.
Published: (2023)

DualSMC: Tunneling Differentiable Filtering and Planning under Continuous POMDPs
by: Wang, Yunbo, et al.
Published: (2021)

Personalized Cotesting Policies for Cervical Cancer Screening: A POMDP Approach
by: Malek Ebadi, et al.
Published: (2021-03-01)

A POMDP Framework for Coordinated Guidance of Autonomous UAVs for Multitarget Tracking
Published: (2009-03-01)