Invariant causal prediction for block MDPs

Invariant causal prediction for block MDPs

Generalization across environments is critical to the successful application of reinforcement learning (RL) algorithms to real-world challenges. In this work we propose a method for learning state abstractions which generalize to novel observation distributions in the multi-environment RL setting. W...

সম্পূর্ণ বিবরণ

গ্রন্থ-পঞ্জীর বিবরন
প্রধান লেখক:	Zhang, A, Lyle, C, Sodhani, S, Filos, A, Kwiatkowska, M, Pineau, J, Gal, Y, Precup, D
বিন্যাস:	Conference item
ভাষা:	English
প্রকাশিত:	Proceedings of Machine Learning Research 2020

অনুরূপ উপাদানগুলি

Markov decision processes in artificial intelligence : MDPs, beyond MDPs and applications /
অনুযায়ী: Sigaud, Olivier, অন্যান্য
প্রকাশিত: (2010)

Transience in countable MDPs
অনুযায়ী: Kiefer, SM, অন্যান্য
প্রকাশিত: (2021)

Parity objectives in countable MDPs
অনুযায়ী: Kiefer, S, অন্যান্য
প্রকাশিত: (2017)

Büchi objectives in countable MDPs
অনুযায়ী: Kiefer, S, অন্যান্য
প্রকাশিত: (2019)

Social Interactions as Recursive MDPs
অনুযায়ী: Tejwani, Ravi, অন্যান্য
প্রকাশিত: (2022)

Planning with hidden parameter polynomial MDPs
অনুযায়ী: Costen, C, অন্যান্য
প্রকাশিত: (2023)

Incorporating Rich Social Interactions Into MDPs
অনুযায়ী: Tejwani, Ravi, অন্যান্য
প্রকাশিত: (2022)

Fast approximate hierarchical solution of MDPs
অনুযায়ী: Barry, Jennifer L. (Jennifer Lynn)
প্রকাশিত: (2010)

Combining dynamic abstractions in large MDPs
অনুযায়ী: Steinkraus, Kurt, অন্যান্য
প্রকাশিত: (2005)

Strategy complexity of parity objectives in countable MDPs
অনুযায়ী: Kiefer, S, অন্যান্য
প্রকাশিত: (2020)

Solving Dec-MDPs with options and intention recognition
অনুযায়ী: Cruz, Gabriel, M. Eng. Massachusetts Institute of Technology
প্রকাশিত: (2016)

Planning for risk-aversion and expected value in MDPs
অনুযায়ী: Rigter, M, অন্যান্য
প্রকাশিত: (2022)

Debugging of Markov Decision Processes (MDPs) Models
অনুযায়ী: Hichem Debbi
প্রকাশিত: (2016-08-01)

SEA-PARAM: Exploring Schedulers in Parametric MDPs
অনুযায়ী: Sebastian Arming, অন্যান্য
প্রকাশিত: (2017-07-01)

Batch-iFDD for representation expansion in large MDPs
অনুযায়ী: Geramifard, Alborz, অন্যান্য
প্রকাশিত: (2015)

Mixed observability MDPs for shared autonomy with uncertain human behaviour
অনুযায়ী: Costen, C, অন্যান্য
প্রকাশিত: (2021)

Adaptive Envelope MDPs for Relational Equivalence-based Planning
অনুযায়ী: Gardiol, Natalia H., অন্যান্য
প্রকাশিত: (2008)

Decomposition Methods for Solving Finite-Horizon Large MDPs
অনুযায়ী: Bouchra el Akraoui, অন্যান্য
প্রকাশিত: (2022-01-01)

Improving Probabilistic Bisimulation for MDPs Using Machine Learning
অনুযায়ী: Mohammadsadegh Mohagheghi, অন্যান্য
প্রকাশিত: (2024-06-01)

Solving Finite-Horizon Discounted Non-Stationary MDPS
অনুযায়ী: Bouchra El Akraoui, অন্যান্য
প্রকাশিত: (2023-06-01)

Probabilistic Bisimulations for PCTL Model Checking of Interval MDPs
অনুযায়ী: Vahid Hashemi, অন্যান্য
প্রকাশিত: (2014-03-01)

NP-Hardness of checking the unichain condition in average cost MDPs
অনুযায়ী: Tsitsiklis, John N.
প্রকাশিত: (2012)

Invariant Causal Prediction for Nonlinear Models
অনুযায়ী: Heinze-Deml Christina, অন্যান্য
প্রকাশিত: (2018-09-01)

Effects of Mitochondrial-Derived Peptides (MDPs) on Mitochondrial and Cellular Health in AMD
অনুযায়ী: Sonali Nashine, অন্যান্য
প্রকাশিত: (2020-04-01)

Exploring and Learning in Sparse Linear MDPs without Computationally Intractable Oracles
অনুযায়ী: Golowich, Noah, অন্যান্য
প্রকাশিত: (2024)

Solving uncertain MDPs with objectives that are separable over instantiations of model uncertainty
অনুযায়ী: Adulyasak, Yossiri, অন্যান্য
প্রকাশিত: (2018)

Planning large systems with MDPs: case study of inland waterways supervision
অনুযায়ী: Guillaume DESQUESNES, অন্যান্য
প্রকাশিত: (2016-12-01)

Near-Optimal Model-Free Reinforcement Learning in Non-Stationary Episodic MDPs
অনুযায়ী: Mao, Weichao, অন্যান্য
প্রকাশিত: (2023)

Provable guarantees on the robustness of decision rules to causal interventions
অনুযায়ী: Wang, B, অন্যান্য
প্রকাশিত: (2021)

Sampling Based Approaches for Minimizing Regret in Uncertain Markov Decision Processes (MDPs)
অনুযায়ী: Ahmed, Asrar, অন্যান্য
প্রকাশিত: (2021)

Finite Sample Analysis of Minmax Variant of Offline Reinforcement Learning for General MDPs
অনুযায়ী: Jayanth Reddy Regatti, অন্যান্য
প্রকাশিত: (2022-01-01)

A MDPs-Based Dynamic Path Planning in Unknown Environments for Hopping Locomotion
অনুযায়ী: Kosuke Sakamoto, অন্যান্য
প্রকাশিত: (2023-01-01)

Time-bounded mission planning in time-varying domains with semi-MDPS and Gaussian processes
অনুযায়ী: Duckworth, P, অন্যান্য
প্রকাশিত: (2021)

Strategy Complexity of Point Payoff, Mean Payoff and Total Payoff Objectives in Countable MDPs
অনুযায়ী: Richard Mayr, অন্যান্য
প্রকাশিত: (2023-03-01)

Context/Resource-Aware Mission Planning Based on BNs and Concurrent MDPs for Autonomous UAVs
অনুযায়ী: Chabha Hireche, অন্যান্য
প্রকাশিত: (2018-12-01)

Measuring Causal Invariance Formally
অনুযায়ী: Pierrick Bourrat
প্রকাশিত: (2021-05-01)

Evaluation of the Emission of Formaldehyde from Wood-Based Panels (MDFs and MDPs) in Brazil After Use
অনুযায়ী: José Carlos Cardozo, অন্যান্য
প্রকাশিত: (2023-03-01)

Characterization of MdpS: an in-depth analysis of a MUC5B-degrading protease from Streptococcus oralis
অনুযায়ী: Fredrik Leo, অন্যান্য
প্রকাশিত: (2024-01-01)

Invariance-based causal prediction to identify the direct causes of suicidal behavior
অনুযায়ী: Austin V. Goddard, অন্যান্য
প্রকাশিত: (2022-11-01)

Tangle blocks in the theory of link invariants
অনুযায়ী: A. Mironov, অন্যান্য
প্রকাশিত: (2018-09-01)