Efficient POMDP Forward Search by Predicting the Posterior Belief Distribution

Efficient POMDP Forward Search by Predicting the Posterior Belief Distribution

Online, forward-search techniques have demonstrated promising results for solving problems in partially observable environments. These techniques depend on the ability to efficiently search and evaluate the set of beliefs reachable from the current belief. However, enumerating or sampling action-obs...

সম্পূর্ণ বিবরণ

গ্রন্থ-পঞ্জীর বিবরন
প্রধান লেখক:	Roy, Nicholas, He, Ruijie
অন্যান্য লেখক:	Nicholas Roy
প্রকাশিত:	2009
অনলাইন ব্যবহার করুন:	http://hdl.handle.net/1721.1/46820

অনুরূপ উপাদানগুলি

An online algorithm for constrained POMDPs
অনুযায়ী: Undurti, Aditya, অন্যান্য
প্রকাশিত: (2011)

Reinforcement learning with limited reinforcement: Using Bayes risk for active learning in POMDPs
অনুযায়ী: Pineau, Joelle, অন্যান্য
প্রকাশিত: (2017)

Monte-Carlo planning in large POMDPs
অনুযায়ী: Silver, David, অন্যান্য
প্রকাশিত: (2015)

Planning with Macro-Actions in Decentralized POMDPs
অনুযায়ী: Amato, Christopher, অন্যান্য
প্রকাশিত: (2016)

RAO*: an Algorithm for Chance-Constrained POMDP’s
অনুযায়ী: Santana, Pedro, অন্যান্য
প্রকাশিত: (2016)

Deep variational reinforcement learning for POMDPs
অনুযায়ী: Igl, M, অন্যান্য
প্রকাশিত: (2018)

Point-Based Policy Transformation: Adapting Policy to Changing POMDP Models
অনুযায়ী: Kurniawati, Hanna, অন্যান্য
প্রকাশিত: (2019)

Modeling and Planning with Macro-Actions in Decentralized POMDPs
অনুযায়ী: Amato, Christopher, অন্যান্য
প্রকাশিত: (2021)

Sampling-based algorithms for continuous-time POMDPs
অনুযায়ী: Chaudhari, Pratik Anil, অন্যান্য
প্রকাশিত: (2013)

Stick-breaking policy learning in Dec-POMDPs
অনুযায়ী: Amato, Christopher, অন্যান্য
প্রকাশিত: (2016)

Safe POMDP online planning via shielding
অনুযায়ী: Sheng, S, অন্যান্য
প্রকাশিত: (2024)

Trust oriented decision making via POMDPs
অনুযায়ী: Aravazhi Irissappane, Athirai
প্রকাশিত: (2016)

Policy Improvement for POMDPs Using Normalized Importance Sampling
অনুযায়ী: Shelton, Christian R.
প্রকাশিত: (2004)

Spatial and Temporal Abstractions in POMDPs Applied to Robot Navigation
অনুযায়ী: Theocharous, Georgios, অন্যান্য
প্রকাশিত: (2005)

A POMDP Approach to Map Victims in Disaster Scenarios
অনুযায়ী: Pedro Gabriel Villani, অন্যান্য
প্রকাশিত: (2024-11-01)

Spectrum Access Algoritbm Based on POMDP Model in CVANET
অনুযায়ী: Xuefei Zhang, অন্যান্য
প্রকাশিত: (2014-09-01)

Spectrum Access Algoritbm Based on POMDP Model in CVANET
অনুযায়ী: Xuefei Zhang, অন্যান্য
প্রকাশিত: (2014-09-01)

Safe POMDP online planning among dynamic agents via adaptive conformal prediction
অনুযায়ী: Sheng, S, অন্যান্য
প্রকাশিত: (2024)

The Belief Roadmap: Efficient Planning in Belief Space by Factoring the Covariance
অনুযায়ী: Roy, Nicholas, অন্যান্য
প্রকাশিত: (2010)

Incremental Clustering and Expansion for Faster Optimal Planning in Dec-POMDPs
অনুযায়ী: Oliehoek, Frans A., অন্যান্য
প্রকাশিত: (2013)

Interference Coordination Based on POMDP in Multi-Cell OFDMA System
অনুযায়ী: Qiang Wei, অন্যান্য
প্রকাশিত: (2013-04-01)

Cognitive radio auto-adaptive sensing algorithm based on POMDP
অনুযায়ী: Rui-chen XU, অন্যান্য
প্রকাশিত: (2013-06-01)

Interference Coordination Based on POMDP in Multi-Cell OFDMA System
অনুযায়ী: Qiang Wei, অন্যান্য
প্রকাশিত: (2013-04-01)

Cognitive radio auto-adaptive sensing algorithm based on POMDP
অনুযায়ী: Rui-chen XU, অন্যান্য
প্রকাশিত: (2013-06-01)

Multi-Agent Active Perception Based on Reinforcement Learning and POMDP
অনুযায়ী: Tarik Selimovic, অন্যান্য
প্রকাশিত: (2024-01-01)

DualSMC: Tunneling Differentiable Filtering and Planning under Continuous POMDPs
অনুযায়ী: Wang, Yunbo, অন্যান্য
প্রকাশিত: (2021)

DGA domain detection and botnet prevention using Q-learning for POMDP
অনুযায়ী: Y. V. Bubnov, অন্যান্য
প্রকাশিত: (2021-03-01)

Graph-based Cross Entropy method for solving multi-robot decentralized POMDPs
অনুযায়ী: Agha-mohammadi, Ali-akbar, অন্যান্য
প্রকাশিত: (2016)

CAR-DESPOT: causally-informed online POMDP planning for robots in confounded environments
অনুযায়ী: Cannizzaro, R, অন্যান্য
প্রকাশিত: (2023)

POMDP-based probabilistic decision making for path planning in wheeled mobile robot
অনুযায়ী: Shripad V. Deshpande, অন্যান্য
প্রকাশিত: (2024-01-01)

Efficient Planning under Uncertainty with Macro-actions
অনুযায়ী: He, Ruijie, অন্যান্য
প্রকাশিত: (2011)

Optimal Joint Defense and Monitoring for Networks Security under Uncertainty: A POMDP-Based Approach
অনুযায়ী: Armita Kazeminajafabadi, অন্যান্য
প্রকাশিত: (2024-01-01)

Optimización Bayesiana no miope POMDP para procesos con restricciones de operación y presupuesto finito
অনুযায়ী: José Luis Pitarch, অন্যান্য
প্রকাশিত: (2024-07-01)

Efﬁcient Planning under Uncertainty for a Target-Tracking Micro-Aerial Vehicle
অনুযায়ী: He, Ruijie, অন্যান্য
প্রকাশিত: (2010)

In search of god : the language and logic of belief /
অনুযায়ী: 197843 Kolak, Daniel
প্রকাশিত: (1994)

Backward-forward search for manipulation planning
অনুযায়ী: Garrett, Caelan Reed, অন্যান্য
প্রকাশিত: (2017)

Exploring multivariate data with the forward search /
অনুযায়ী: Atkinson, A. C. (Anthony Curtis), অন্যান্য
প্রকাশিত: (2004)

When to monitor or control: Informed invasive species management using a partially observable Markov decision process (POMDP) framework
অনুযায়ী: Thomas K. Waring, অন্যান্য
প্রকাশিত: (2024-09-01)

Discussion: The forward search: Theory and data analysis.
অনুযায়ী: Johansen, S, অন্যান্য
প্রকাশিত: (2010)

Searching for saturation in forward dijet production at the LHC
অনুযায়ী: A. van Hameren, অন্যান্য
প্রকাশিত: (2023-10-01)