Probabilistic reach-avoid for Bayesian neural networks
Model-based reinforcement learning seeks to simultaneously learn the dynamics of an unknown stochastic environment and synthesise an optimal policy for acting in it. Ensuring the safety and robustness of sequential decisions made through a policy in such an environment is a key challenge for policie...
Main Authors: | , , , , , |
---|---|
Format: | Journal article |
Language: | English |
Published: | Elsevier, 2024 |